Skip to content

Commit

Permalink
Modify languages file
Browse files Browse the repository at this point in the history
  • Loading branch information
michaelnmmeyer committed Apr 19, 2024
1 parent f057caa commit 9a9ee03
Show file tree
Hide file tree
Showing 3 changed files with 59 additions and 56 deletions.
67 changes: 30 additions & 37 deletions DHARMA_languages.tsv
Original file line number Diff line number Diff line change
@@ -1,37 +1,30 @@
Id Print_Name Inverted_Name
und Undetermined Undetermined
ara Arabic Arabic
ban Modern Balinese Balinese, Modern
oldbalinese Old Balinese Balinese, Old
bya Batak Batak
mya Modern Burmese Burmese, Modern
obr Old Burmese Burmese, Old
cja Modern Cham (of Cambodia) Cham, Modern (of Cambodia)
cjm Modern Cham (of Phanrang) Cham, Modern (of Phanrang)
ocm Old Cham Cham, Old
nld Dutch Dutch
eng English English
fra French French
deu German German
ind Indonesian Indonesian
jpn Japanese Japanese
jav Modern Javanese Javanese, Modern
kaw Old Javanese Javanese, Old
kan Kannada Kannada
xhm Middle Khmer Khmer, Middle
khm Modern Khmer Khmer, Modern
okz Old Khmer Khmer, Old
zlm Modern Malay (Bahasa Malaysia) Malay, Modern (Bahasa Malaysia)
omy Old Malay Malay, Old
omx Old Mon Mon, Old
pli Pali Pali
pra Prakrit Prakrit
pyx Pyu Pyu
und Undetermined Undetermined
ara Arabic Arabic
ban Modern Balinese Balinese, Modern
bya Batak Batak
mya Modern Burmese Burmese, Modern
obr Old Burmese Burmese, Old
cja Modern Cham (of Cambodia) Cham, Modern (of Cambodia)
ori Oriya Oriya
Id Print_Name Inverted_Name source
ara Arabic Arabic
ban Modern Balinese Balinese, Modern
bya Batak Batak
cja Modern Cham (of Cambodia) Cham, Modern (of Cambodia)
cjm Modern Cham (of Phanrang) Cham, Modern (of Phanrang)
deu German German false
eng English English false
fra French French false
ind Indonesian Indonesian
jav Modern Javanese Javanese, Modern
jpn Japanese Japanese
kan Kannada Kannada
kaw Old Javanese Javanese, Old
khm Modern Khmer Khmer, Modern
mya Modern Burmese Burmese, Modern
nld Dutch Dutch false
obr Old Burmese Burmese, Old
ocm Old Cham Cham, Old
okz Old Khmer Khmer, Old
oldbalinese Old Balinese Balinese, Old
omx Old Mon Mon, Old
omy Old Malay Malay, Old
ori Oriya Oriya
pli Pali Pali
pra Prakrit Prakrit
pyx Pyu Pyu
und Undetermined Undetermined
xhm Middle Khmer Khmer, Middle
zlm Modern Malay (Bahasa Malaysia) Malay, Modern (Bahasa Malaysia)
29 changes: 29 additions & 0 deletions DHARMA_languages_readme.md
Original file line number Diff line number Diff line change
@@ -0,0 +1,29 @@
# DHARMA languages list

The file `DHARMA_languages.tsv` is a table that enumerates languages used in the
project. The corresponding display is at https://dharmalekha.info/languages. The
table is in the same format as the ISO 639-3 table here:
https://iso639-3.sil.org/sites/iso639-3/files/downloads/iso-639-3_Name_Index.tab,
but includes an extra column.

Columns are:

* `Id`. A three-letter language codes from ISO 639-3 or ISO 639-5. It is OK to
invent a new language code if needed, but in this case you must use a string
longer than three characters, to prevent collisions with ISO codes, and you
must not use hyphens, because they have a special meaning in language codes.
* `Print_Name`. Example: "Old Cham". The name you use here overrides the one
from the ISO standard. For instance, for the language code `kaw`, we use
"Old Javanese" as `Print_Name` instead of the default "Kawi".
* `Inverted_Name`. Like `Print_Name`, but used when sorting names, in
particular. Example: "Cham, Old". Note the use of capitals. The value you
provide here overrides the one from the ISO standard, as for `Print_Name`.
* `source`. This is a DHARMA-specific field, which should contain a boolean
value viz. `true` or `false`. If this column is empty, `true` is assumed.
Languages that have `source` set to `true` are treated as source languages by
the DHARMA application, per contrast with translation languages (all the
others: English, etc.) Source languages are displayed in texts' metadata and
in data aggregations, while the others are not. This field is needed
because translation languages do appear in our texts' `div[@type='edition']`
(`head` elements are in English, for instance). Thus, we cannot determine
automatically which languages are source languages and which are not.
19 changes: 0 additions & 19 deletions DHARMA_languages_readme.txt

This file was deleted.

0 comments on commit 9a9ee03

Please sign in to comment.