You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Hi! I would be grateful if you can answer my question.
I am trying to use the cellSNP line tool on a large bam file (1.2 T) generated by SMARTseq.
I aim to count all the possible variants in a specific gene.
Do you think I should use the mode 2? Can the analysis be limited to a specific gene?
I'm trying this command:
Cellsnp-lite does not have an option for genotyping a specific region (gene) now. As your BAM is indeed large, mode 2a could be very slow and you may try mode 2b + 1a instead, i.e., calling first in a bulk manner by mode 2b followed by genotyping in mode 1a.
Alternatively, you may also run mode 1a directly if you can prepare a VCF listing every position of that gene, i.e., specify the CHROM, POS, REF as it is and set the ALT as "." in the VCF file (or set both REF and ALT as ".", and then use -f option to extract the reference allele from the FASTA file). Cellsnp-lite would assign the allele with the highest UMI/read counts except REF as the ALT.
Generally, the mode 2b + 1a is recommended. Please check the example of each mode in the manual.
Is your merged BAM file from SMART-seq3? As far as I know, SMART-seq2 will produce one BAM file per cell. The empty output could be due to the unmatched cell tag. The defualt cell tag of cellsnp-lite is "CB", which is missing in the three BAM records you shared. You may set proper cell tag with --cellTAG option, based on your experimental settings.
Hi! I would be grateful if you can answer my question.
I am trying to use the cellSNP line tool on a large bam file (1.2 T) generated by SMARTseq.
I aim to count all the possible variants in a specific gene.
Do you think I should use the mode 2? Can the analysis be limited to a specific gene?
I'm trying this command:
cellsnp-lite -s myfile.bam -b barcodes.txt -O ./results --chrom 3 --minMAF 0.1 --minCOUNT 100 -p 20 -f chr3_genome.fa
But the vcf in output is empty.
here the header of my bam file:
I really appreciate any help you can provide.
Best
The text was updated successfully, but these errors were encountered: