Setup RNA-Seq Pipeline from scratch: fastq (reads) to counts | Step-by-Step Tutorial

Bioinformagician 116,051 views 2 years ago

This is a detailed workflow tutorial on how to process bulk RNA-Seq reads (fastq) and generate a counts matrix that can be used for various downstream analyses. In this video, I walk through setting up a pipeline in bash (Linux) and perform the steps to process RNA-Seq data, including:
• Quality control (FastQC)
• Trimming (Trimmomatic)
• Alignment (HISAT2)
• Quantification (featureCounts)
In addition, I talk about the run times, memory requirements and accuracies of various aligners. I hope you find this video helpful! Leave your thoughts in the comment section below!
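
As a rough illustration of how the four steps listed above chain together, here is a minimal sketch for a single-end sample. It is not the script linked below. Every path, the sample name, the Trimmomatic jar location, the thread count, the TRAILING:10 threshold and the strandedness settings (--rna-strandness R with -s 2) are assumptions made for the example and must be adapted to your own data and library type. By default it only prints the commands; set DRY_RUN=0 to actually run them, which requires FastQC, Trimmomatic, HISAT2, samtools and featureCounts to be installed.

```shell
#!/usr/bin/env bash
# Sketch of the four steps: QC -> trimming -> alignment -> quantification.
# All names and paths below are hypothetical placeholders.
set -eu

DRY_RUN="${DRY_RUN:-1}"                    # 1 = only print commands, 0 = run them
SAMPLE="demo"                              # hypothetical sample name
READS="data/${SAMPLE}.fastq"               # hypothetical single-end reads
INDEX="index/grch38/genome"                # hypothetical HISAT2 index prefix
GTF="annotation/genes.gtf"                 # hypothetical gene annotation
TRIMMOMATIC_JAR="trimmomatic-0.39.jar"     # hypothetical jar location
THREADS=4

# Print each command; execute it only when DRY_RUN is not 1.
run() {
    printf '+ %s\n' "$*"
    if [ "$DRY_RUN" != "1" ]; then
        "$@"
    fi
}

rnaseq_pipeline() {
    run mkdir -p results/fastqc results/trimmed results/aligned results/counts

    # 1. Quality control
    run fastqc "$READS" -o results/fastqc

    # 2. Trimming: drop low-quality bases from the 3' end (threshold is an assumption)
    run java -jar "$TRIMMOMATIC_JAR" SE -threads "$THREADS" -phred33 \
        "$READS" "results/trimmed/${SAMPLE}_trimmed.fastq" TRAILING:10

    # 3. Alignment with a splice-aware aligner, then coordinate-sort to BAM
    run hisat2 -q -p "$THREADS" --rna-strandness R -x "$INDEX" \
        -U "results/trimmed/${SAMPLE}_trimmed.fastq" -S "results/aligned/${SAMPLE}.sam"
    run samtools sort -@ "$THREADS" -o "results/aligned/${SAMPLE}.bam" \
        "results/aligned/${SAMPLE}.sam"

    # 4. Quantification: -s 2 assumes a reverse-stranded library
    run featureCounts -T "$THREADS" -s 2 -a "$GTF" \
        -o "results/counts/${SAMPLE}_counts.txt" "results/aligned/${SAMPLE}.bam"
}

rnaseq_pipeline
```

Printing the commands first is a cheap way to check paths and flags before committing to a long alignment run. If you are unsure about the library's strandedness, check it first (see the Strandedness links below) rather than copying the values used here.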


Link to code:
https://github.com/kpatel427/YouTubeTutorials/blob/main/RNASeqpipeline.sh

Link to data:
▸ https://drive.google.com/file/d/1DGHjbhcRy_zTm6H9C_AUpkzBML-JhtA3/view?usp=sharing

Linux Basics
▸ https://ubuntu.com/tutorials/command-line-for-beginners#1-overview
▸ https://xie186.github.io/Novice2Expert4Bioinformatics/install-bioinformatics-software-in-linux.html
▸ https://hackr.io/blog/basic-linux-commands

To Trim or to not Trim?
▸ https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7671312/

Strandedness
▸ https://bmcgenomics.biomedcentral.com/articles/10.1186/s12864-015-1876-7
▸ https://chipster.csc.fi/manual/library-type-summary.html
▸ http://rseqc.sourceforge.net/#infer-experiment-py


Chapters:
0:00 Intro
1:11 Applications of RNA-Seq data
2:41 Schematic detailed workflow
3:49 What are splice-aware aligners?
5:59 Workflow for this tutorial
6:31 Comparison of run times, memory usage and accuracies of various aligners
8:27 Which aligner should I choose?
9:05 Prerequisites to build this pipeline (things that will not be covered in this video)
9:40 Set-up before building the pipeline
10:37 Some good practices while building a pipeline
11:21 Quality control: FastQC
15:27 To trim or to not trim?
15:58 Trimming reads: Trimmomatic
19:18 Align reads: HISAT2
24:20 Read quantification: featureCounts


Show your support and encouragement by buying me a coffee:
https://www.buymeacoffee.com/bioinformagic


To get in touch:
Website: https://bioinformagician.org/
Github: https://github.com/kpatel427
Email: khushbu_p@hotmail.com

#bioinformagician #bioinformatics #rnaseq #omics #rnastar #ensembl #affymetrix #microarray #affy #ncbi #genomics #beginners #tutorial #howto #research #biology #GEO #ngs
