Skip to content

Tools and data to work with data from the oireachtas api a bit easier

License

Notifications You must be signed in to change notification settings

RobertLucey/oireachtas-data

Repository files navigation

Oireachtas Data

Tools and data to work with data from the oireachtas api a bit easier.

Parses data not given through the api but is included in the PDF of the minutes. For some reason a lot of debate sections are forbidden.

Installation

pip install oireachtas_data

Dependencies

Poppler - sudo apt install libpoppler-cpp-dev PDFToHTML - sudo apt install pdftohtml

Usage

From the repo after running make setup to download and parse all the debates (or as much as you like) run make pull_debates. Once you have enough downloaded you can run make load_debates where you will be dropped into a debugger with a variable debates which you can work with

Caveats

The API often denies access to resources so content is parsed from the PDF of the minutes. This can make it so metadata isn't included everywhere and sometimes (though rarely) the parsing of speech segments and the associated speaker isn't correct

About

Tools and data to work with data from the oireachtas api a bit easier

Topics

Resources

License

Stars

Watchers

Forks

Packages

No packages published