Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

plink(bed-bim-fam) to treemix #19

Open
josephresearcher opened this issue Nov 17, 2020 · 5 comments
Open

plink(bed-bim-fam) to treemix #19

josephresearcher opened this issue Nov 17, 2020 · 5 comments

Comments

@josephresearcher
Copy link

Hi
I Need Convert plink(bed-bim-fam) to treemix input
With This Command:
glactools bplink2acf --fai human_g1k_v37.fasta.fai afghan | glactools acf2treemix - | gzip > treemix.gz

but error:

Cannot write to /dev/stdout
Error: GlacParser tried to read 4 bytes but got 0

Data:
https://evolbio.ut.ee/afghan/

@grenaud
Copy link
Owner

grenaud commented Nov 17, 2020

Hello! I managed to replicate the error using:
glactools bplink2acf --fai human_g1k_v37.fasta.fai afghan > /dev/null

bplink2acf: WARNING: The reference allele between the EPO/FASTA (G) was not found in the the .bim file (found: C,T) at chr:pos 1:5732473 at line 1 rs771071 0 5732473 T C

So I looked at the USCS browser and it shows that rs771071 is a reference G morphing to A. Around this snp is GGC.

samtools faidx seems to agree:

1:5732472-5732474
GGC

Are you sure it was v37? not 38 for the bim file?

@josephresearcher
Copy link
Author

thank you
Dear Renaud
I tried several files, unfortunately the problem persists

@grenaud
Copy link
Owner

grenaud commented Nov 17, 2020

This is not a bug, it is a safeguard feature to make sure that the consistency between the reference and the bim file is respected. How is it possible that this line is in your bim file with the wrong reference allele? It should be a G.

@josephresearcher
Copy link
Author

I mentioned the link of these files for you
What do you think is the solution?
https://evolbio.ut.ee/
https://evolbio.ut.ee/afghan/

@grenaud
Copy link
Owner

grenaud commented Nov 18, 2020

The solution is to try to understand why there is a SNP without the reference allele being present in your .bim file and where the bases do not match UCSC. Go over the steps you used to generate the .bed, .bim and fam files and try to track the issue.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants