PUBLICATION

Normalization of RNA-sequencing data from samples with varying mRNA levels

Authors
Aanes, H., Winata, C., Moen, L.F., Østrup, O., Mathavan, S., Collas, P., Rognes, T., and Aleström, P.
ID
ZDB-PUB-140429-1
Date
2014
Source
PLoS One   9(2): e89158 (Journal)
Registered Authors
Aleström, Peter, Collas, Philippe, Mathavan, S., Moen, Lars F., Winata, Cecilia Lanny
Keywords
none
Datasets
GEO:GSE22830
MeSH Terms
  • Animals
  • Base Sequence/genetics*
  • Gene Expression/genetics*
  • Gene Expression Profiling/methods*
  • Polymerase Chain Reaction/methods
  • RNA, Messenger/genetics*
  • Sequence Analysis, RNA/methods*
  • Zebrafish/genetics
PubMed
24586560 Full text @ PLoS One
Abstract

Methods for normalization of RNA-sequencing gene expression data commonly assume equal total expression between compared samples. In contrast, scenarios of global gene expression shifts are many and increasing. Here we compare the performance of three normalization methods when polyA+ RNA content fluctuates significantly during zebrafish early developmental stages. As a benchmark we have used reverse transcription-quantitative PCR. The results show that reads per kilobase per million (RPKM) and trimmed mean of M-values (TMM) normalization systematically leads to biased gene expression estimates. Biological scaling normalization (BSN), designed to handle differences in total expression, showed improved accuracy compared to the two other methods in estimating transcript level dynamics. The results have implications for past and future studies using RNA-sequencing on samples with different levels of total or polyA+ RNA.

Genes / Markers
Figures
Show all Figures
Expression
Phenotype
Mutations / Transgenics
Human Disease / Model
Sequence Targeting Reagents
Fish
Antibodies
Orthology
Engineered Foreign Genes
Mapping