MAPseq-HMP_reanalysis

This repository contains code used to reanalyse Human Microbiome Project data to benchmark MAPseq.

Raw data from the HMP is pre-processed and matched to samples and sequencing subregions in the script prepare.raw_data.pl. This generates one folder per sample for subsequent sample-wise (parallel) processing. It also generates global mapping tables.

The script submit.filter_chimeras.sh then calls the script filter.chimeric_reads.sh for each sample in parallel to remove chimeras.

The script submit.align.hmp_samples.sh calls the script align.hmp_samples.sh for each sample in parallel to run INFERNAL. Afterwards, the script make_alignments.global.sh stitches together individual per-sample alignments and de-replicates and de-noises them a bit. This script also defines the "consensus" lists of sequences to be used downstream by clustering methods – keeping only those sequences which are non-chimeric and align satisfactorily to the target subregion.

The scripts "make_otus.*.sh" then call the different mapping/clustering tools to generate OTU sets and translate them into R-readable formats (OTU tables etc.).

Finally, the script hmp.benchmark.R contains all the code for R-based analyses, as detailed in the manuscript.

(Raw) results are available in the folder results.

Name		Name	Last commit message	Last commit date
Latest commit History 4 Commits
R		R
bash		bash
perl		perl
results		results
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

MAPseq-HMP_reanalysis

About

Releases

Packages

Languages

License

meringlab/MAPseq-HMP_reanalysis

Folders and files

Latest commit

History

Repository files navigation

MAPseq-HMP_reanalysis

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages