Scater: pre-processing, quality control, normalization and visualization of single-cell RNA-seq data in R

DJ McCarthy, KR Campbell, ATL Lun, QF Wills - Bioinformatics, 2017 - academic.oup.com
Bioinformatics, 2017academic.oup.com
Motivation Single-cell RNA sequencing (scRNA-seq) is increasingly used to study gene
expression at the level of individual cells. However, preparing raw sequence data for further
analysis is not a straightforward process. Biases, artifacts and other sources of unwanted
variation are present in the data, requiring substantial time and effort to be spent on pre-
processing, quality control (QC) and normalization. Results We have developed the
R/Bioconductor package scater to facilitate rigorous pre-processing, quality control …
Motivation
Single-cell RNA sequencing (scRNA-seq) is increasingly used to study gene expression at the level of individual cells. However, preparing raw sequence data for further analysis is not a straightforward process. Biases, artifacts and other sources of unwanted variation are present in the data, requiring substantial time and effort to be spent on pre-processing, quality control (QC) and normalization.
Results
We have developed the R/Bioconductor package scater to facilitate rigorous pre-processing, quality control, normalization and visualization of scRNA-seq data. The package provides a convenient, flexible workflow to process raw sequencing reads into a high-quality expression dataset ready for downstream analysis. scater provides a rich suite of plotting tools for single-cell data and a flexible data structure that is compatible with existing tools and can be used as infrastructure for future software development.
Availability and Implementation
The open-source code, along with installation instructions, vignettes and case studies, is available through Bioconductor at http://bioconductor.org/packages/scater.
Supplementary information
Supplementary data are available at Bioinformatics online.
Oxford University Press