The following FASTQ files are subsets of the original SRA sequences, chosen so that the resulting assemblies contain prokaryotic, eukaryotic, and viral genomes.
id_sample | source_sra | format | type | num_seqs | min_len | avg_len | max_len | GC(%) |
---|---|---|---|---|---|---|---|---|
S1 | SRR17458614 | FASTQ | DNA | 1474203 | 75 | 150.5 | 151 | 46.91 |
S2 | SRR17458615 | FASTQ | DNA | 1638398 | 75 | 150.5 | 151 | 46.92 |
S3 | SRR17458630 | FASTQ | DNA | 2389989 | 75 | 150.4 | 151 | 56.38 |
S4 | SRR17458638 | FASTQ | DNA | 3142566 | 75 | 150.5 | 151 | 46.34 |
Also included are the following:
- Metagenomic assemblies generated with metaSPAdes, along with sorted BAM files from Bowtie2
- Genomes, gene models, etc.
- Taxonomy classifications at the genome and genome cluster level
- Annotations for genes and protein clusters
- Biosynthetic gene clusters
- Clusters for genomes and proteins
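
The per-sample statistics in the table above (`num_seqs`, `min_len`, `avg_len`, `max_len`, `GC(%)`) can be recomputed as a sanity check after downloading the FASTQ files. The following is a minimal sketch in plain Python, not necessarily the tool used to produce the table. It assumes standard four-line FASTQ records (no wrapped sequences), treats files ending in `.gz` as gzip-compressed, and computes GC content as the share of G and C among all bases; the function name `fastq_stats` is hypothetical.

```python
import gzip
from pathlib import Path


def fastq_stats(path):
    """Summarize a FASTQ file with four lines per record (an assumption)."""
    path = Path(path)
    # Assumption: gzip-compressed files carry a .gz suffix.
    opener = gzip.open if path.suffix == ".gz" else open

    num_seqs = 0
    total_len = 0
    gc_count = 0
    min_len = None
    max_len = None

    with opener(path, "rt") as handle:
        for i, line in enumerate(handle):
            # The sequence is the second line of each four-line record.
            if i % 4 != 1:
                continue
            seq = line.strip().upper()
            n = len(seq)
            num_seqs += 1
            total_len += n
            gc_count += seq.count("G") + seq.count("C")
            min_len = n if min_len is None else min(min_len, n)
            max_len = n if max_len is None else max(max_len, n)

    return {
        "num_seqs": num_seqs,
        "min_len": min_len or 0,
        "avg_len": round(total_len / num_seqs, 1) if num_seqs else 0.0,
        "max_len": max_len or 0,
        # GC as a percentage of all bases across the file.
        "gc_percent": round(100 * gc_count / total_len, 2) if total_len else 0.0,
    }
```

For example, calling `fastq_stats("S1.fastq.gz")` (a hypothetical file name) returns a dictionary whose values can be compared with the S1 row of the table. Small differences in rounding or in how ambiguous bases are counted are possible if the table was produced by a different tool.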