Named entity recognizer for neuronal cells, based on UIMA Ruta rules.
TODO how to apply NER to simple text
NeuroNER uses Ruta (site, documentation), a workbench to develop and test NLP rules.
- Install RUTA plugin
  - Help > Install New Software...
    - Work with: http://www.apache.org/dist/uima/eclipse-update-site/
      - Add...
      - Name: RUTA, OK
      - Select Apache UIMA Eclipse tooling and runtime support and Apache UIMA Ruta
      - Next, Next, Finish
- Open Ruta
  - Window > Open Perspective > Other...
    - Select: UIMA Ruta
- Install neuroNER
  - File > Import
    - General > Existing Projects into Workspace, Next
    - Select archive file
    - Browse... to your neuroNER folder
    - Finish
    - The project should appear in your Script Browser on the left
- Now let's apply the rules on some text.
  - Open script script/neuroNER/NeuroNER.ruta. This is where rules are defined. Check the Ruta documentation to understand the syntax.
    - Right click, Debug As, UIMA Ruta
    - Open the Console view to see progress, and wait until it shows <terminated>
- View results
  - Open files in output/
    - Select Type System NeuroNERTypeSystem.xml
    - Click on Eclipse View Annotation Browser View (tabbed window on the right)
    - Select checkboxes to reveal the annotations we added.
- Inspect results
  - This lets you see which rules have been applied to which sentence.
  - Window > Open Perspective > Ruta Explain
  - See what rules have been applied
    - Click on Eclipse View Applied Rules
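
The annotations shown in the Annotation Browser View can also be listed outside Eclipse. The following is a minimal sketch, not part of neuroNER itself. It assumes the files written to output/ are standard UIMA XMI serializations, in which the document text sits in the sofaString attribute of the cas:Sofa element and every annotation element carries begin and end attributes. The function names read_annotations and count_by_type are made up for this illustration.

```python
import sys
import xml.etree.ElementTree as ET
from collections import Counter

# Namespace used by UIMA XMI files for built-in CAS elements (assumption:
# the output files follow the standard UIMA XMI layout).
CAS_NS = "http:///uima/cas.ecore"


def read_annotations(xmi_source):
    """Return (type_name, begin, end, covered_text) tuples from an XMI file.

    xmi_source may be a file path or a file-like object.
    """
    root = ET.parse(xmi_source).getroot()

    # The annotated document text is stored on the cas:Sofa element.
    sofa = root.find(f"{{{CAS_NS}}}Sofa")
    text = sofa.get("sofaString", "") if sofa is not None else ""

    annotations = []
    for elem in root:
        begin, end = elem.get("begin"), elem.get("end")
        if begin is None or end is None:
            continue  # skip non-annotation elements such as Sofa or View
        type_name = elem.tag.split("}")[-1]  # drop the XML namespace prefix
        b, e = int(begin), int(end)
        # Caveat: UIMA offsets count UTF-16 code units, so the slice can be
        # off for text containing characters outside the Basic Multilingual Plane.
        annotations.append((type_name, b, e, text[b:e]))
    return annotations


def count_by_type(annotations):
    """Count how many annotations of each type were found."""
    return Counter(name for name, _, _, _ in annotations)


if __name__ == "__main__" and len(sys.argv) > 1:
    # Usage: python list_annotations.py <path to an XMI file in output/>
    found = read_annotations(sys.argv[1])
    for name, begin, end, covered in found:
        print(f"{name}\t{begin}-{end}\t{covered}")
    print(count_by_type(found))
```

The type names printed depend on what NeuroNERTypeSystem.xml declares, and the listing will also include Ruta's own basic annotations if they were kept in the output.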