Releases: rosettatype/hyperglot
Releases · rosettatype/hyperglot
0.7.0 & 0.7.1
0.7.1 (07.10.2024)
- FIX: Fixed
hyperglot-data
error
0.7.0 (07.10.2024)
- DATA: All language yaml documents now have their
contributors
listed, some havereviewers
listed - DATA: *Massive improvement of language
sources
with proper source citations where possible - DATA: Added
punctuation
,numerals
andcurrency
attributes to orthographies - checking for these attributes will be added in the next update! - DATA: Added
lib/hyperglot/extra_data/default.yaml
to include inheritable defaults per script - DATA: Refined
jpn
,ryu
andain
Katakana orthographies - FEATURE: Orthography attributes can inherit from other languages with
<iso>
syntax, see README - TWEAK: Improved loading time for repeat access by saving parsed language data cache file
- TWEAK: Orthographies can no longer have an
inherit
attribute - TWEAK: Improved loading speed for repeat queries and indivudal language queries
- TWEAK: Refactored
Languages
,Language
andOrthography
object instantiation to always return parsed and defaulted nested objects - TWEAK: Removed the
--speakers
and--autonym
CLI options - TWEAK: Removed the
--comparison
CLI option (seeexamples
instead) - TWEAK: Removed the
--languages
CLI option, usehyperglot-info LanguageName/ISO
instead - TWEAK: Removed the
--strict_iso
CLI option; use the python library to access this option, particularlyLanguage.get_name(script, strict_iso=True)
0.6.4
- FIX: Fixed an issue where trying to log missing shaping glyphs would crash in
FontChecker
- FIX: Improved mark shaping detection to interpret ccmp substitutions of base + mark as correctly shaping (thanks @arialcrime)
- TWEAK: Cleaned up
hyperglot.language.Language
class and added attribute properties for dict properties with computed defaults (as opposed to writing defaults for missing attributes) as well as more code annotation - TWEAK:
hyperglot.orthography.Orthography
object hasscript_iso
attribute returning the mapped ISO 15924 script tag - DATA: Added
lib/extra_data/script-names.yaml
with a list of all current Hyperglot scripts and a mapping to their ISO 15924 code equivalent - DATA: Added di/tri-graphs to Czech and Hungarian orthographies and fixed their order
- DATA: Added Squamish (
squ
) (thanks @justinpenner) - DATA: Unified "Geʽez" script with reversed comma, as opposed to previous mixed use of "Ge'ez/Fidel" and "Ge'ez"
- DATA: Amended spelling "Tai Viet" script in title case to match other script names
- DATA: Corrected spelling of "Bamum" script and language (instead of less used "Bamun" used in Hyperglot)
- DATA: Use "Coptic" instead of "Coptic/Numbian" script name
- DATA: Use "Burmese" script for language "Mon"
- DATA: Use "Baybayin" script name instead of "Tagalog (Baybayin, Alibata)"
- DATA: Fixed Toki Pona (
tok
) file name - TWEAK: Make sure
Orthography.base_chars
andOrthography.aux_chars
return no duplicates for decomposed character sequences - TWEAK: Define
Languages
,Language
andOrthography
as module top level exports for easier importing, e.g. now:from hyperglot import Language
0.6.3
- FIX: Set correct default values for
Language.status
andOrthography.preferred_as_group
and provide validation and tests for these. - TWEAK: Deprecated plain list
SUPPORTLEVELS, VALIDITYLEVELS, STATUSES, ORTHOGRAPHY_STATUSES
and replaced them withSupportLevel, LanguageValidity, LanguageStatus, OrthographyStatus
enums throughout the code base. The deprecated values will be removed in the next minor version. - TESTS: Added simple tox config for running test on all supported minor python versions
0.6.2
0.6.1
0.6.0
This update introduces some initial joining and mark attachment checks to the hyperglot CLI. Parts of the python library have been restructured to accommodate this, so some object and method signatures have changed.
- DATA: Refined Klingon (
tlh
) orthography and added a draft version of Toki Pona (tok
) - FEATURE: Implemented shaping checks for mark positioning when required by unencoded base + mark combinations or
--decompose
- FEATURE: Implemented shaping checks for connecting scripts to detect presence of required positional forms
- FEATURE: Implemented
hyperglot-report
command with same options ashyperglot
and additional--report-missing n
,--report-marks n
and--report-joining n
— or--report-all n
to toggle all aforementioned — parameters/flags for outputting languages almost supported by the font - TWEAK: Support checking is now done via
hyperglot.checker
objects for cleaner separation between language data and checking fonts - TWEAK: Various python APIs and objects changed and refactored
- TWEAK: Bumped required python version to 3.8.0
0.5.3
- DATA : Added Tlingit
tli
language data (thanks @jcrippen) - DATA: Fixed inconsistent note about
Ŋ
in various languages (thanks @moyogo) - TWEAK: Improved
hyperglot-validate
to spot lookalike characters in the wrong script, e.g.a
(Latin U+0061) vsа
(Cyrillic U+0430) - TWEAK: Explicitly ignore non-yaml files (e.g. operating system or other) in the data when parsing
- TWEAK: Improved
hyperglot-validate
command to better catch yaml issues (thanks for reporting @jcrippen)
0.5.2
- DATA: Removed orthography status
deprecated
and usinghistorical
for those instances - DATA: Added Ethiopic languages
awn
,byn
,gez
,har
,sgw
,tig
,xan
and updatedtir
(thanks @dyacob and @NeilSureshPatel) - DATA: Added Avestan
- DATA: Corrections to
jbo
(thanks @berrymot) - DATA: Updated
sco
primary orthography (thanks @moyogo) - DATA: Some fixes to
kkj
orthography (thanks @moyogo) - DATA: Small note added to
Dagbani
(thanks @clauseggers and @moyogo) - DATA: Fix to Shan (
shn
) containing some stray Latin characters - FIX: Fix issue with file name conflicts on Windows systems