-
Notifications
You must be signed in to change notification settings - Fork 1
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Update genome-preprocess #26
base: master
Are you sure you want to change the base?
Conversation
Thank you for the PR. I got the following errors when processing GFF3: ftp://ftp.ebi.ac.uk/pub/databases/gencode/Gencode_mouse/release_M26/gencode.vM26.annotation.gff3.gz
I got no errors when trying to process corresponding GTF: ftp://ftp.ebi.ac.uk/pub/databases/gencode/Gencode_mouse/release_M26/gencode.vM26.annotation.gtf.gz
My question is 1) is the GFF3 corrupted or is it a converter problem; 2) is the conversion of GTF fine or some problems are hidden and slip through @haxiomic ? |
hey @yunhailuo, I did some investigation and found a few rows in that gff3 file had trailing semi-colons. I've improved handing this, if you reinstall the node modules it should no longer produce errors (you will need to delete package-lock before doing so). See VALIS-software/Genomics-Formats#1 the good news is the error doesn't affect the output – so it was still generating the correct valis files |
Thank you very much for the quick fix. Tested and no errors now. I'm going to move forward using GTF. Any extra comment on the difference between GFF3 and GTF (in terms of the visual on Valis)? |
GTF is the same thing as GFF3 but it's version 2 whereas GFF3 is version 3 GFF3 adds a few extra features and can contain more information so I'd recommend using GFF3 where you can. They both use the same parser in this case, the error you were getting is because I only applied the more loose semi-colon parsing for GTF, so the fix was just to use it for GFF3 too |
Thanks, George. Will use GFF3 whenever possible then. |
efdae02
to
2f50742
Compare
Only minor changes (most of the commit is generated code update)
Source change: https://github.com/ENCODE-DCC/valis-hpgv/compare/master...haxiomic:update-genome-tool?expand=1#diff-7a7c40168d698faaf4c64c7dd4e091b0b9a234498b48938c2f42b3642a0265f1