Integration, exploration, and analysis of high‐dimensional single‐cell cytometry data using Spectre

TM Ashhurst, F Marsh‐Wakefield, GH Putri… - Cytometry Part …, 2022 - Wiley Online Library
Cytometry Part A, 2022Wiley Online Library
As the size and complexity of high‐dimensional (HD) cytometry data continue to expand,
comprehensive, scalable, and methodical computational analysis approaches are essential.
Yet, contemporary clustering and dimensionality reduction tools alone are insufficient to
analyze or reproduce analyses across large numbers of samples, batches, or experiments.
Moreover, approaches that allow for the integration of data across batches or experiments
are not well incorporated into computational toolkits to allow for streamlined workflows. Here …
Abstract
As the size and complexity of high‐dimensional (HD) cytometry data continue to expand, comprehensive, scalable, and methodical computational analysis approaches are essential. Yet, contemporary clustering and dimensionality reduction tools alone are insufficient to analyze or reproduce analyses across large numbers of samples, batches, or experiments. Moreover, approaches that allow for the integration of data across batches or experiments are not well incorporated into computational toolkits to allow for streamlined workflows. Here we present Spectre, an R package that enables comprehensive end‐to‐end integration and analysis of HD cytometry data from different batches or experiments. Spectre streamlines the analytical stages of raw data pre‐processing, batch alignment, data integration, clustering, dimensionality reduction, visualization, and population labelling, as well as quantitative and statistical analysis. Critically, the fundamental data structures used within Spectre, along with the implementation of machine learning classifiers, allow for the scalable analysis of very large HD datasets, generated by flow cytometry, mass cytometry, or spectral cytometry. Using open and flexible data structures, Spectre can also be used to analyze data generated by single‐cell RNA sequencing or HD imaging technologies, such as Imaging Mass Cytometry. The simple, clear, and modular design of analysis workflows allow these tools to be used by bioinformaticians and laboratory scientists alike. Spectre is available as an R package or Docker container. R code is available on Github (https://github.com/immunedynamics/spectre).
Wiley Online Library