The genome sequence archive family: toward explosive data growth and diverse data types

T Chen, X Chen, S Zhang, J Zhu, B Tang, A Wang, L Dong, Z Zhang, C Yu, Y Sun, L Chi…
Genomics, Proteomics and Bioinformatics, 2021 - academic.oup.com
Abstract
The Genome Sequence Archive (GSA) is a data repository for archiving raw sequence data, providing data storage and sharing services for scientific communities worldwide. In response to explosive data growth and increasingly diverse data types, here we present the GSA family, which expands GSA into a set of raw data archiving resources with different purposes: GSA (https://ngdc.cncb.ac.cn/gsa/), GSA for Human (GSA-Human, https://ngdc.cncb.ac.cn/gsa-human/), and Open Archive for Miscellaneous Data (OMIX, https://ngdc.cncb.ac.cn/omix/). Compared with the 2017 version, GSA has been significantly updated in its data model, online functionalities, and web interfaces. GSA-Human, a new partner of GSA, is a data repository specialized in human genetics-related data with controlled access and security. OMIX, a critical complement to the two resources above, is an open archive for miscellaneous data. Together, these resources form a family dedicated to archiving rapidly growing data of diverse types, accepting data submissions from all over the world, and providing free open access to all publicly available data in support of worldwide research activities.
Oxford University Press