-
Notifications
You must be signed in to change notification settings - Fork 62
Amplicon Mode in VarDict
VarDict has a built-in capability of calling variants for PCR-based target enrichment (amplicon). The goals of amplicon based variant calling are:
-
Identify and filter out variants that showed amplicon-bias, that is, if a variant is targeted by more than one amplicon, it has to be detectable in all amplicons. Otherwise, it's determined to be amplicon-bias and filtered out.
-
Softly discounting PCR primers. No pre-process is needed to remove or soft clip PCR primers. Given the design, VarDict will automatically clip PCR primers.
-
Avoid mis-paired PCR products, especially when there're mismatches in primers.
VarDict's amplicon mode is automatically triggered when an 8-column BED file is supplied. Below is an example:
chr1 46713345 46713502 RAD54L 0 . 46713363 46713481 chr1 46713418 46713578 RAD54L 0 . 46713435 46713561
The 8 columns are:
-
chr (required)
-
PCR start (including primer) (required)
-
PCR end (including primer) (required)
-
Gene info (can be .)
-
Score (can be .)
-
Strand (can be .)
-
Insert start (excluding primer) (required)
-
Insert end (excluding primer) (required)
If you split the BED file into smaller ones in order to run VarDict in parallel, please ensure that any overlapping PCR amplicons are present in same BED file. If the gene info column is present, you can use splitBed.pl script in the repository to split BED files, which ensures amplicons from the same gene are in the same BED file, assuming amplicons don't overlap if they're from different genes.
The script var2vcf_valid.pl script will automatically recognize the amplicon mode based on the output and no special parameter is needed.