Skip to content

Latest commit

 

History

History
executable file
·
42 lines (32 loc) · 1.06 KB

README.md

File metadata and controls

executable file
·
42 lines (32 loc) · 1.06 KB

Open NLP

POS tagging and Named-entity recognizing

Distribution

Binary distribution can be downloaded here (JRE 1.7 or later required, Unix or Windows only)

Usage

Please find usage guide in the README

Building

Prerequisites:

  • JDK 1.7 or later
  • Maven 3.3.9 or later

Building program:

$ cd java/nlp
$ mvn clean package

NLTK

Prequisites

POS Tagging

POS tagging with predefined training and test data:

$ cd python
$ python tagger.py ../data/pos-tagging/Indonesian_Manually_Tagged_Corpus_ID.tsv ../data/pos-tagging/Wikipedia.txt

POS tagging by splitting training data to training and test data:

$ cd python
$ python tagger.py ../data/pos-tagging/Indonesian_Manually_Tagged_Corpus_ID.tsv 1000 sentences.tag