This repository makes available the source texts of stories from the African Storybook Project in Markdown format.
Each folder in the repository represents a language, identified by its ISO 639-1 or ISO 639-3 code. Source translations into each language are stored in the appropriate folders.
All of these source texts have been extracted from the pdfs available on the African Storybook Project website. The markdown files in this repo provide data for many other projects, for example the ASP Image Bank Explorer and Global-ASP Translator, as well as making possible the creation of bilingual storybooks in many languages.
Corresponding images for the stories in this repository can be found in the ASP Image Bank.
The extracted source text of all stories has been provided here in Markdown format. See here for specific details about the format used.
A sequence of two hashes ##
on a separate line indicates a page break.
Editing of the story content has been kept to a minimum and for the most part the stories are presented as they are. Corrections other than obvious errors of orthography or traces of the conversion process should be directed to the African Storybook Project website directly. (An easy way to do this is to simply version the story in question and use the online tools provided to modify and edit the story.)
The African Storybook Project currently provides stories in 62 different African languages. This repository attempts to provide the source text for all of these stories in machine- and human-readable Markdown format.
Below is a key to the languages covered by this repository and their ISO 639-1/3 codes.
ISO code | Language Name |
---|---|
ach | Acholi |
adh | Dhopadhola |
af | Afrikaans |
alz | Alur |
am | Amharic |
anu | Anywak |
bem | IciBemba |
bxk | Lubukusu |
cce | Cicopi |
dag | Dagbani/Dagbanli |
de | German |
dga | Dagaare |
dje | Zarma |
ee | Ewe |
en | English |
ff | Fulfulde |
fr | French |
gaa | Ga |
gjn | Gonja |
gur | Gurene |
guz | Ekegusii |
ha | Hausa |
ha_ne | Hausa (Niger) |
hz | Otjiherero |
kam | Kikamba |
kdj | Ngakarimojong |
keo | Kakwa |
ki | Gikuyu |
kj | Oshikwanyama |
kln | Kalenjin |
koo | Lukhonzo |
kpz | Sabinyi |
kr | Kanuri |
kwn | Rukwangali |
laj | Leblango |
lgg | Lugbarati |
lgg-official | Lugbarati (Official) |
lg | Luganda |
lko | Olukhayo |
loz | Lozi |
lsm | Lusamia |
luc | Aringati |
luo | Dholuo |
lwg | Oluwanga |
mas | Maa |
mer | Kimîîru |
mhi | Ma'di |
mhw | Thimbukushu |
myx | Lumasaaba |
naq | Khoekhoegowab |
ng | Oshindonga |
nle | Olunyala |
nr | isiNdebele |
nso | Sepedi |
nuj | Lunyole |
ny | Chichewa |
nyn | Runyankore |
nyu | Chinyungwe |
om | Afan Oromo |
pt | Portuguese |
rw | Kinyarwanda |
saq | Samburu |
so | Somali |
ss | siSwati |
st | Sesotho |
sw | Kiswahili |
teo | Ateso |
ti | Tigrinya |
tn | Setswana |
toh | Gitonga |
toi | Tonga |
ts | Xitsonga |
tsc | Xitswa |
ttj | Rutooro |
tuv | Ng'aturkana |
ve | Tshivenda |
xh | isiXhosa |
xog | Lusoga |
yo | Yoruba |
zne | Zande |
zu | isiZulu |
All stories in this repository are Creative Commons licensed (either CC-BY or CC-BY-NC). The specific license for each story is indicated both in the file itself (in the metadata section), and in the corresponding README.md
file for that language. Links to the original stories on the ASP website can also be found in the README.md
files.