Relating chemical structure to cellular response: an integrative analysis of gene expression, bioactivity, and structural data across 11,000 compounds

B Chen, P Greenside, H Paik, M Sirota… - CPT …, 2015 - Wiley Online Library
CPT: pharmacometrics & systems pharmacology, 2015Wiley Online Library
A central premise in systems pharmacology is that structurally similar compounds have
similar cellular responses; however, this principle often does not hold. One of the most
widely used measures of cellular response is gene expression. By integrating gene
expression data from Library of Integrated Network‐based Cellular Signatures (LINCS) with
chemical structure and bioactivity data from PubChem, we performed a large‐scale
correlation analysis of chemical structures and gene expression profiles of over 11,000 …
A central premise in systems pharmacology is that structurally similar compounds have similar cellular responses; however, this principle often does not hold. One of the most widely used measures of cellular response is gene expression. By integrating gene expression data from Library of Integrated Network‐based Cellular Signatures (LINCS) with chemical structure and bioactivity data from PubChem, we performed a large‐scale correlation analysis of chemical structures and gene expression profiles of over 11,000 compounds taking into account confounding factors such as biological conditions (e.g., cell line, dose) and bioactivities. We found that structurally similar compounds do indeed yield similar gene expression profiles. There is an ∼20% chance that two structurally similar compounds (Tanimoto Coefficient ≥ 0.85) share significantly similar gene expression profiles. Regardless of structural similarity, two compounds tend to share similar gene expression profiles in a cell line when they are administrated at a higher dose or when the cell line is sensitive to both compounds.
Wiley Online Library