BACKGROUND. Novel rapid diagnostics for active tuberculosis (TB) are required to overcome the time delays and inadequate sensitivity of current microbiological tests that are critically dependent on sampling the site of disease. Multiparametric blood transcriptomic signatures of TB have been described as potential diagnostic tests. We sought to identify the best transcript candidates as host biomarkers for active TB, extend the evaluation of their specificity by comparison with other infectious diseases, and to test their performance in both pulmonary and extrapulmonary TB.
METHODS. Support vector machine learning, combined with feature selection, was applied to new and previously published blood transcriptional profiles in order to identify the minimal TB‑specific transcriptional signature shared by multiple patient cohorts including pulmonary and extrapulmonary TB, and individuals with and without HIV-1 coinfection.
RESULTS. We identified and validated elevated blood basic leucine zipper transcription factor 2 (BATF2) transcript levels as a single sensitive biomarker that discriminated active pulmonary and extrapulmonary TB from healthy individuals, with receiver operating characteristic (ROC) area under the curve (AUC) scores of 0.93 to 0.99 in multiple cohorts of HIV-1–negative individuals, and 0.85 in HIV-1–infected individuals. In addition, we identified and validated a potentially novel 4-gene signature comprising CD177, haptoglobin, immunoglobin J chain, and galectin 10 that discriminated active pulmonary and extrapulmonary TB from other febrile infections, giving ROC AUCs of 0.94 to 1.
CONCLUSIONS. Elevated blood BATF2 transcript levels provide a sensitive biomarker that discriminates active TB from healthy individuals, and a potentially novel 4-gene transcriptional signature differentiates between active TB and other infectious diseases in individuals presenting with fever.
FUNDING. MRC, Wellcome Trust, Rosetrees Trust, British Lung Foundation, NIHR.
Jennifer K Roe, Niclas Thomas, Eliza Gil, Katharine Best, Evdokia Tsaliki, Stephen Morris‑Jones, Sian Stafford, Nandi Simpson, Karolina D Witt, Benjamin Chain, Robert F Miller, Adrian Martineau, Mahdad Noursadeghi
(A) Statistically significant (P < 0.05 by Mann Whitney U test with false discovery rate of < 5%) greater than 2‑fold differences in transcript abundance in genome‑wide blood transcriptional profiles of patients with active TB compared with postrecovery samples in the AdjuVIT cohort. (B) Gene ontology analysis of the genes in A that were expressed at higher levels in AdjuVIT active TB samples than in the postrecovery samples. (C) Comparison of significant blood gene expression differences in active TB and different healthy states from 3 different studies: TB after recovery (AdjuVIT), healthy volunteers (Berry), or people with latent TB (Bloom).