Here you can find the files used to develop the Amharic Corpus.
scrapy_project contains a program for crawling Amharic sites.
pos_tagger contains programs for POS tagging, including several clustering and classification algorithms and hybrid models.
The Amharic Corpus can be found at web-corpora.
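
When crawling Amharic sites, one common step is to keep only pages whose text is mostly written in Ethiopic script. The sketch below illustrates that idea. It is not taken from scrapy_project: the function names and the 0.5 threshold are hypothetical. Note that Ethiopic script is also used for other languages (for example Tigrinya), so this check only narrows candidates and does not identify Amharic by itself.

```python
# Hypothetical helper, not part of scrapy_project: a rough script-based filter.

def ethiopic_ratio(text: str) -> float:
    """Return the share of alphabetic characters in the Ethiopic block (U+1200..U+137F)."""
    letters = [ch for ch in text if ch.isalpha()]
    if not letters:
        return 0.0
    ethiopic = sum(1 for ch in letters if "\u1200" <= ch <= "\u137f")
    return ethiopic / len(letters)


def looks_amharic(text: str, threshold: float = 0.5) -> bool:
    """Assumed heuristic: treat text as a candidate if most of its letters are Ethiopic."""
    return ethiopic_ratio(text) >= threshold
```

A crawler could call looks_amharic on the extracted page text and skip pages that return False.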