Quality control of RNA-seq experiments

Xing Li, Asha Nair, Shengqin Wang, Liguo Wang

Research output: Chapter in Book/Report/Conference proceedingChapter

Abstract

Direct sequencing of the complementary DNA (cDNA) using high-throughput sequencing technologies (RNA-seq) is widely used and allows for more comprehensive understanding of the transcriptome than microarray. In theory, RNA-seq should be able to precisely identify and quantify all RNA species, small or large, at low or high abundance. However, RNA-seq is a complicated, multistep process involving reverse transcription, amplification, fragmentation, purification, adaptor ligation, and sequencing. Improper operations at any of these steps could make biased or even unusable data. Additionally, RNA-seq intrinsic biases (such as GC bias and nucleotide composition bias) and transcriptome complexity can also make data imperfect. Therefore, comprehensive quality assessment is the first and most critical step for all downstream analyses and results interpretation. This chapter discusses the most widely used quality control metrics including sequence quality, sequencing depth, reads duplication rates (clonal reads), alignment quality, nucleotide composition bias, PCR bias, GC bias, rRNA and mitochondria contamination, coverage uniformity, etc.

Original languageEnglish (US)
Title of host publicationRNA Bioinformatics
PublisherSpringer New York
Pages137-146
Number of pages10
ISBN (Electronic)9781493922918
ISBN (Print)9781493922901
DOIs
StatePublished - Jan 10 2015

    Fingerprint

Keywords

  • High-throughput sequencing
  • Next-generation sequencing
  • Quality control
  • RNA-seq

ASJC Scopus subject areas

  • Medicine(all)
  • Biochemistry, Genetics and Molecular Biology(all)

Cite this

Li, X., Nair, A., Wang, S., & Wang, L. (2015). Quality control of RNA-seq experiments. In RNA Bioinformatics (pp. 137-146). Springer New York. https://doi.org/10.1007/978-1-4939-2291-8_8