-
Notifications
You must be signed in to change notification settings - Fork 20
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
MEO Node ID Error #189
Comments
MEO may not support node IDs with underscores based on Neha's EGFR dataset testing and my initial inspection of its Vertex class: https://github.com/agitter/meo/blob/master/src/alg/Vertex.java#L45-L50 |
Doing something similar that domino does would help with fixing this issue Ex.replacing the "" with a "-" and then in post processing, replacing the "-" with "" |
The relevant DOMINO code is: Lines 215 to 231 in a26f4d0
I don't believe replacing "_" with "-" will work because then "A_B" and "A-B" would be the same. Those IDs are unlikely to both be in the network, but they could be. We'll need to replace "_" with another string that is very unlikely to appear in real protein names. If MEO supports Unicode, using some combination of strange Unicode characters would be safe. |
If a node is in the format [name]_[info], MEO ignores the [info] part and only uses the [name] during reconstructions.
This happened when doing parameter tuning egfr dataset with MEO, the node names were missing the “_HUMAN” suffix. In the gold standard dataset, nodes such as EGF_HUMAN and EDGR_HUMAN are expected, but in the MEO output, they appear without the suffix. This error led to zero matching nodes to the gold standard nodes during evaluation.
input prize file
raw pathway:
The text was updated successfully, but these errors were encountered: