You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
A Ruff based GitHub workflow was added to check the code formatting and lint the codebase on each pull request (#109).
The _update_files directory was renamed update_files as these files are used in non-internal manners now (#57).
A common function has been created to map Wikidata ids to noun genders (#69).
The project now is installed locally for development and command line usage, so usages of sys.path have been removed from files (#122).
The directory structure has been dramatically streamlined and includes folders for future projects where language data could come from other sources like Wiktionary (#139).
Translation files are moved to their own directory.
The extract_transform directory has been removed and all files within it have been moved one level up.
The languages directory has been renamed language_data_extraction.
All files within wikidata/_resources have been moved to the resources directory.
The gender and case annotations for data formatting have now been commonly defined.
All language directory formatted_data files have been now moved to the scribe_data_json_export directory to prepare for outputs being required to be directed to a directory outside of the package.
Path computing has been refactored throughout the codebase, and unneeded functions for data transfers have been removed.