The UNITE database for molecular identification of fungi: handling dark taxa and parallel taxonomic classifications

RH Nilsson, KH Larsson, AFS Taylor… - Nucleic Acids Research, 2019 - academic.oup.com
Abstract
UNITE (https://unite.ut.ee/) is a web-based database and sequence management environment for the molecular identification of fungi. It targets the formal fungal barcode—the nuclear ribosomal internal transcribed spacer (ITS) region—and offers all ∼1 000 000 public fungal ITS sequences for reference. These are clustered into ∼459 000 species hypotheses and assigned digital object identifiers (DOIs) to promote unambiguous reference across studies. In-house and web-based third-party sequence curation and annotation have resulted in more than 275 000 improvements to the data over the past 15 years. UNITE serves as a data provider for a range of metabarcoding software pipelines and regularly exchanges data with all major fungal sequence databases and other community resources. Recent improvements include redesigned handling of unclassifiable species hypotheses, integration with the taxonomic backbone of the Global Biodiversity Information Facility, and support for an unlimited number of parallel taxonomic classification systems.
Oxford University Press