Skip to content

gis-rpd/rpd-rnaseq

Repository files navigation

rpd-rnaseq

Introduction

rpd-rnaseq is a bioinformatics best-practice analysis pipeline for bulk RNAseq analyis and was developed by the Research Pipeline Development Team at the Genome Institute of Singapore (GIS) based on a request by the Cancer Therapeutics & Stratified Oncology 6.

The pipeline is built using Nextflow, a workflow tool to run tasks across multiple compute infrastructures in a very portable manner. It further comes with Conda, Docker and Singularity support, making software installation easy and results reproducible.

The pipeline itself is mostly based on the GIS STAR-RSEM pipeline (2017-10) and nf-core/rnaseq. The implementation tries to stay in line with nf-core practices and recommendations.

Pipeline Steps

Step Main program/s
Trimmming, combining of read-pairs per sample and QC Trim Galore, FastQC
Mapping STAR
Visualization files (bigWigs) Samtools, deepTools (bamCoverage)
RNASeq Metrics CollectRnaSeqMetrics (Picard)
RNASeq QC RSeQC
Gene and isoform expression RSEM
QC report MultiQC

Documentation

Documentation about the pipeline can be found in the docs/ directory:

  1. Installation
  2. Usage
  3. Output and results