-
Notifications
You must be signed in to change notification settings - Fork 3
Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
- Loading branch information
1 parent
f057caa
commit 9a9ee03
Showing
3 changed files
with
59 additions
and
56 deletions.
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -1,37 +1,30 @@ | ||
Id Print_Name Inverted_Name | ||
und Undetermined Undetermined | ||
ara Arabic Arabic | ||
ban Modern Balinese Balinese, Modern | ||
oldbalinese Old Balinese Balinese, Old | ||
bya Batak Batak | ||
mya Modern Burmese Burmese, Modern | ||
obr Old Burmese Burmese, Old | ||
cja Modern Cham (of Cambodia) Cham, Modern (of Cambodia) | ||
cjm Modern Cham (of Phanrang) Cham, Modern (of Phanrang) | ||
ocm Old Cham Cham, Old | ||
nld Dutch Dutch | ||
eng English English | ||
fra French French | ||
deu German German | ||
ind Indonesian Indonesian | ||
jpn Japanese Japanese | ||
jav Modern Javanese Javanese, Modern | ||
kaw Old Javanese Javanese, Old | ||
kan Kannada Kannada | ||
xhm Middle Khmer Khmer, Middle | ||
khm Modern Khmer Khmer, Modern | ||
okz Old Khmer Khmer, Old | ||
zlm Modern Malay (Bahasa Malaysia) Malay, Modern (Bahasa Malaysia) | ||
omy Old Malay Malay, Old | ||
omx Old Mon Mon, Old | ||
pli Pali Pali | ||
pra Prakrit Prakrit | ||
pyx Pyu Pyu | ||
und Undetermined Undetermined | ||
ara Arabic Arabic | ||
ban Modern Balinese Balinese, Modern | ||
bya Batak Batak | ||
mya Modern Burmese Burmese, Modern | ||
obr Old Burmese Burmese, Old | ||
cja Modern Cham (of Cambodia) Cham, Modern (of Cambodia) | ||
ori Oriya Oriya | ||
Id Print_Name Inverted_Name source | ||
ara Arabic Arabic | ||
ban Modern Balinese Balinese, Modern | ||
bya Batak Batak | ||
cja Modern Cham (of Cambodia) Cham, Modern (of Cambodia) | ||
cjm Modern Cham (of Phanrang) Cham, Modern (of Phanrang) | ||
deu German German false | ||
eng English English false | ||
fra French French false | ||
ind Indonesian Indonesian | ||
jav Modern Javanese Javanese, Modern | ||
jpn Japanese Japanese | ||
kan Kannada Kannada | ||
kaw Old Javanese Javanese, Old | ||
khm Modern Khmer Khmer, Modern | ||
mya Modern Burmese Burmese, Modern | ||
nld Dutch Dutch false | ||
obr Old Burmese Burmese, Old | ||
ocm Old Cham Cham, Old | ||
okz Old Khmer Khmer, Old | ||
oldbalinese Old Balinese Balinese, Old | ||
omx Old Mon Mon, Old | ||
omy Old Malay Malay, Old | ||
ori Oriya Oriya | ||
pli Pali Pali | ||
pra Prakrit Prakrit | ||
pyx Pyu Pyu | ||
und Undetermined Undetermined | ||
xhm Middle Khmer Khmer, Middle | ||
zlm Modern Malay (Bahasa Malaysia) Malay, Modern (Bahasa Malaysia) |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,29 @@ | ||
# DHARMA languages list | ||
|
||
The file `DHARMA_languages.tsv` is a table that enumerates languages used in the | ||
project. The corresponding display is at https://dharmalekha.info/languages. The | ||
table is in the same format as the ISO 639-3 table here: | ||
https://iso639-3.sil.org/sites/iso639-3/files/downloads/iso-639-3_Name_Index.tab, | ||
but includes an extra column. | ||
|
||
Columns are: | ||
|
||
* `Id`. A three-letter language codes from ISO 639-3 or ISO 639-5. It is OK to | ||
invent a new language code if needed, but in this case you must use a string | ||
longer than three characters, to prevent collisions with ISO codes, and you | ||
must not use hyphens, because they have a special meaning in language codes. | ||
* `Print_Name`. Example: "Old Cham". The name you use here overrides the one | ||
from the ISO standard. For instance, for the language code `kaw`, we use | ||
"Old Javanese" as `Print_Name` instead of the default "Kawi". | ||
* `Inverted_Name`. Like `Print_Name`, but used when sorting names, in | ||
particular. Example: "Cham, Old". Note the use of capitals. The value you | ||
provide here overrides the one from the ISO standard, as for `Print_Name`. | ||
* `source`. This is a DHARMA-specific field, which should contain a boolean | ||
value viz. `true` or `false`. If this column is empty, `true` is assumed. | ||
Languages that have `source` set to `true` are treated as source languages by | ||
the DHARMA application, per contrast with translation languages (all the | ||
others: English, etc.) Source languages are displayed in texts' metadata and | ||
in data aggregations, while the others are not. This field is needed | ||
because translation languages do appear in our texts' `div[@type='edition']` | ||
(`head` elements are in English, for instance). Thus, we cannot determine | ||
automatically which languages are source languages and which are not. |
This file was deleted.
Oops, something went wrong.