Go to The Journal of Clinical Investigation
  • About
  • Editors
  • Consulting Editors
  • For authors
  • Publication ethics
  • Publication alerts by email
  • Transfers
  • Advertising
  • Job board
  • Contact
  • Physician-Scientist Development
  • Current issue
  • Past issues
  • By specialty
    • COVID-19
    • Cardiology
    • Immunology
    • Metabolism
    • Nephrology
    • Oncology
    • Pulmonology
    • All ...
  • Videos
  • Collections
    • In-Press Preview
    • Resource and Technical Advances
    • Clinical Research and Public Health
    • Research Letters
    • Editorials
    • Perspectives
    • Physician-Scientist Development
    • Reviews
    • Top read articles

  • Current issue
  • Past issues
  • Specialties
  • In-Press Preview
  • Resource and Technical Advances
  • Clinical Research and Public Health
  • Research Letters
  • Editorials
  • Perspectives
  • Physician-Scientist Development
  • Reviews
  • Top read articles
  • About
  • Editors
  • Consulting Editors
  • For authors
  • Publication ethics
  • Publication alerts by email
  • Transfers
  • Advertising
  • Job board
  • Contact
Top
  • View PDF
  • Download citation information
  • Send a comment
  • Terms of use
  • Standard abbreviations
  • Need help? Email the journal
  • Top
  • Abstract
  • Introduction
  • Results
  • Discussion
  • Methods
  • Author contributions
  • Supplemental material
  • Acknowledgments
  • Footnotes
  • References
  • Version history
  • Article usage
  • Citations to this article
Advertisement

Research ArticleDermatologyGenetics Open Access | 10.1172/jci.insight.172956

Large-scale functional inference for skin-expressing lncRNAs using expression and sequence information

Matthew T. Patrick,1 Sutharzan Sreeskandarajan,1,2 Alanna Shefler,1 Rachael Wasikowski,1 Mrinal K. Sarkar,1 Jiahan Chen,1,3 Tingting Qin,4 Allison C. Billi,1 J. Michelle Kahlenberg,1,5 Errol Prens,6 Alain Hovnanian,7 Stephan Weidinger,8 James T. Elder,1,9 Chao-Chung Kuo,10 Johann E. Gudjonsson,1 and Lam C. Tsoi1,4,11

1Department of Dermatology, Michigan Medicine, University of Michigan, Ann Arbor, Michigan, USA.

2Center for Autoimmune Genomics and Etiology (CAGE), Cincinnati Children’s Hospital Medical Center, Cincinnati, Ohio, USA.

3College of Sciences, North Carolina State University, Raleigh, North Carolina, USA.

4Department of Computational Medicine & Bioinformatics and

5Division of Rheumatology, Department of Internal Medicine, University of Michigan, Ann Arbor, Michigan, USA.

6Department of Dermatology, Erasmus University Medical Center, Rotterdam, Netherlands.

7Laboratory of Genetic Skin Diseases, Imagine Institute, Paris, France.

8Department of Dermatology and Allergy, University Medical Center Schleswig-Holstein, Kiel, Germany.

9Ann Arbor Veterans Affairs Hospital, Ann Arbor, Michigan, USA.

10Institute for Computational Genomics, Joint Research Center for Computational Biomedicine, RWTH Aachen University, Aachen, Germany.

11Department of Biostatistics, University of Michigan, Ann Arbor, Michigan, USA.

Address correspondence to: Lam C. Tsoi, University of Michigan Department of Dermatology, 7421 Medical Science Building I 1301 E. Catherine St., Ann Arbor, Michigan 48109-5609, USA. Phone: 734.647.8070; Email: alextsoi@med.umich.edu.

Find articles by Patrick, M. in: PubMed | Google Scholar |

1Department of Dermatology, Michigan Medicine, University of Michigan, Ann Arbor, Michigan, USA.

2Center for Autoimmune Genomics and Etiology (CAGE), Cincinnati Children’s Hospital Medical Center, Cincinnati, Ohio, USA.

3College of Sciences, North Carolina State University, Raleigh, North Carolina, USA.

4Department of Computational Medicine & Bioinformatics and

5Division of Rheumatology, Department of Internal Medicine, University of Michigan, Ann Arbor, Michigan, USA.

6Department of Dermatology, Erasmus University Medical Center, Rotterdam, Netherlands.

7Laboratory of Genetic Skin Diseases, Imagine Institute, Paris, France.

8Department of Dermatology and Allergy, University Medical Center Schleswig-Holstein, Kiel, Germany.

9Ann Arbor Veterans Affairs Hospital, Ann Arbor, Michigan, USA.

10Institute for Computational Genomics, Joint Research Center for Computational Biomedicine, RWTH Aachen University, Aachen, Germany.

11Department of Biostatistics, University of Michigan, Ann Arbor, Michigan, USA.

Address correspondence to: Lam C. Tsoi, University of Michigan Department of Dermatology, 7421 Medical Science Building I 1301 E. Catherine St., Ann Arbor, Michigan 48109-5609, USA. Phone: 734.647.8070; Email: alextsoi@med.umich.edu.

Find articles by Sreeskandarajan, S. in: PubMed | Google Scholar

1Department of Dermatology, Michigan Medicine, University of Michigan, Ann Arbor, Michigan, USA.

2Center for Autoimmune Genomics and Etiology (CAGE), Cincinnati Children’s Hospital Medical Center, Cincinnati, Ohio, USA.

3College of Sciences, North Carolina State University, Raleigh, North Carolina, USA.

4Department of Computational Medicine & Bioinformatics and

5Division of Rheumatology, Department of Internal Medicine, University of Michigan, Ann Arbor, Michigan, USA.

6Department of Dermatology, Erasmus University Medical Center, Rotterdam, Netherlands.

7Laboratory of Genetic Skin Diseases, Imagine Institute, Paris, France.

8Department of Dermatology and Allergy, University Medical Center Schleswig-Holstein, Kiel, Germany.

9Ann Arbor Veterans Affairs Hospital, Ann Arbor, Michigan, USA.

10Institute for Computational Genomics, Joint Research Center for Computational Biomedicine, RWTH Aachen University, Aachen, Germany.

11Department of Biostatistics, University of Michigan, Ann Arbor, Michigan, USA.

Address correspondence to: Lam C. Tsoi, University of Michigan Department of Dermatology, 7421 Medical Science Building I 1301 E. Catherine St., Ann Arbor, Michigan 48109-5609, USA. Phone: 734.647.8070; Email: alextsoi@med.umich.edu.

Find articles by Shefler, A. in: PubMed | Google Scholar

1Department of Dermatology, Michigan Medicine, University of Michigan, Ann Arbor, Michigan, USA.

2Center for Autoimmune Genomics and Etiology (CAGE), Cincinnati Children’s Hospital Medical Center, Cincinnati, Ohio, USA.

3College of Sciences, North Carolina State University, Raleigh, North Carolina, USA.

4Department of Computational Medicine & Bioinformatics and

5Division of Rheumatology, Department of Internal Medicine, University of Michigan, Ann Arbor, Michigan, USA.

6Department of Dermatology, Erasmus University Medical Center, Rotterdam, Netherlands.

7Laboratory of Genetic Skin Diseases, Imagine Institute, Paris, France.

8Department of Dermatology and Allergy, University Medical Center Schleswig-Holstein, Kiel, Germany.

9Ann Arbor Veterans Affairs Hospital, Ann Arbor, Michigan, USA.

10Institute for Computational Genomics, Joint Research Center for Computational Biomedicine, RWTH Aachen University, Aachen, Germany.

11Department of Biostatistics, University of Michigan, Ann Arbor, Michigan, USA.

Address correspondence to: Lam C. Tsoi, University of Michigan Department of Dermatology, 7421 Medical Science Building I 1301 E. Catherine St., Ann Arbor, Michigan 48109-5609, USA. Phone: 734.647.8070; Email: alextsoi@med.umich.edu.

Find articles by Wasikowski, R. in: PubMed | Google Scholar |

1Department of Dermatology, Michigan Medicine, University of Michigan, Ann Arbor, Michigan, USA.

2Center for Autoimmune Genomics and Etiology (CAGE), Cincinnati Children’s Hospital Medical Center, Cincinnati, Ohio, USA.

3College of Sciences, North Carolina State University, Raleigh, North Carolina, USA.

4Department of Computational Medicine & Bioinformatics and

5Division of Rheumatology, Department of Internal Medicine, University of Michigan, Ann Arbor, Michigan, USA.

6Department of Dermatology, Erasmus University Medical Center, Rotterdam, Netherlands.

7Laboratory of Genetic Skin Diseases, Imagine Institute, Paris, France.

8Department of Dermatology and Allergy, University Medical Center Schleswig-Holstein, Kiel, Germany.

9Ann Arbor Veterans Affairs Hospital, Ann Arbor, Michigan, USA.

10Institute for Computational Genomics, Joint Research Center for Computational Biomedicine, RWTH Aachen University, Aachen, Germany.

11Department of Biostatistics, University of Michigan, Ann Arbor, Michigan, USA.

Address correspondence to: Lam C. Tsoi, University of Michigan Department of Dermatology, 7421 Medical Science Building I 1301 E. Catherine St., Ann Arbor, Michigan 48109-5609, USA. Phone: 734.647.8070; Email: alextsoi@med.umich.edu.

Find articles by Sarkar, M. in: PubMed | Google Scholar |

1Department of Dermatology, Michigan Medicine, University of Michigan, Ann Arbor, Michigan, USA.

2Center for Autoimmune Genomics and Etiology (CAGE), Cincinnati Children’s Hospital Medical Center, Cincinnati, Ohio, USA.

3College of Sciences, North Carolina State University, Raleigh, North Carolina, USA.

4Department of Computational Medicine & Bioinformatics and

5Division of Rheumatology, Department of Internal Medicine, University of Michigan, Ann Arbor, Michigan, USA.

6Department of Dermatology, Erasmus University Medical Center, Rotterdam, Netherlands.

7Laboratory of Genetic Skin Diseases, Imagine Institute, Paris, France.

8Department of Dermatology and Allergy, University Medical Center Schleswig-Holstein, Kiel, Germany.

9Ann Arbor Veterans Affairs Hospital, Ann Arbor, Michigan, USA.

10Institute for Computational Genomics, Joint Research Center for Computational Biomedicine, RWTH Aachen University, Aachen, Germany.

11Department of Biostatistics, University of Michigan, Ann Arbor, Michigan, USA.

Address correspondence to: Lam C. Tsoi, University of Michigan Department of Dermatology, 7421 Medical Science Building I 1301 E. Catherine St., Ann Arbor, Michigan 48109-5609, USA. Phone: 734.647.8070; Email: alextsoi@med.umich.edu.

Find articles by Chen, J. in: PubMed | Google Scholar

1Department of Dermatology, Michigan Medicine, University of Michigan, Ann Arbor, Michigan, USA.

2Center for Autoimmune Genomics and Etiology (CAGE), Cincinnati Children’s Hospital Medical Center, Cincinnati, Ohio, USA.

3College of Sciences, North Carolina State University, Raleigh, North Carolina, USA.

4Department of Computational Medicine & Bioinformatics and

5Division of Rheumatology, Department of Internal Medicine, University of Michigan, Ann Arbor, Michigan, USA.

6Department of Dermatology, Erasmus University Medical Center, Rotterdam, Netherlands.

7Laboratory of Genetic Skin Diseases, Imagine Institute, Paris, France.

8Department of Dermatology and Allergy, University Medical Center Schleswig-Holstein, Kiel, Germany.

9Ann Arbor Veterans Affairs Hospital, Ann Arbor, Michigan, USA.

10Institute for Computational Genomics, Joint Research Center for Computational Biomedicine, RWTH Aachen University, Aachen, Germany.

11Department of Biostatistics, University of Michigan, Ann Arbor, Michigan, USA.

Address correspondence to: Lam C. Tsoi, University of Michigan Department of Dermatology, 7421 Medical Science Building I 1301 E. Catherine St., Ann Arbor, Michigan 48109-5609, USA. Phone: 734.647.8070; Email: alextsoi@med.umich.edu.

Find articles by Qin, T. in: PubMed | Google Scholar

1Department of Dermatology, Michigan Medicine, University of Michigan, Ann Arbor, Michigan, USA.

2Center for Autoimmune Genomics and Etiology (CAGE), Cincinnati Children’s Hospital Medical Center, Cincinnati, Ohio, USA.

3College of Sciences, North Carolina State University, Raleigh, North Carolina, USA.

4Department of Computational Medicine & Bioinformatics and

5Division of Rheumatology, Department of Internal Medicine, University of Michigan, Ann Arbor, Michigan, USA.

6Department of Dermatology, Erasmus University Medical Center, Rotterdam, Netherlands.

7Laboratory of Genetic Skin Diseases, Imagine Institute, Paris, France.

8Department of Dermatology and Allergy, University Medical Center Schleswig-Holstein, Kiel, Germany.

9Ann Arbor Veterans Affairs Hospital, Ann Arbor, Michigan, USA.

10Institute for Computational Genomics, Joint Research Center for Computational Biomedicine, RWTH Aachen University, Aachen, Germany.

11Department of Biostatistics, University of Michigan, Ann Arbor, Michigan, USA.

Address correspondence to: Lam C. Tsoi, University of Michigan Department of Dermatology, 7421 Medical Science Building I 1301 E. Catherine St., Ann Arbor, Michigan 48109-5609, USA. Phone: 734.647.8070; Email: alextsoi@med.umich.edu.

Find articles by Billi, A. in: PubMed | Google Scholar |

1Department of Dermatology, Michigan Medicine, University of Michigan, Ann Arbor, Michigan, USA.

2Center for Autoimmune Genomics and Etiology (CAGE), Cincinnati Children’s Hospital Medical Center, Cincinnati, Ohio, USA.

3College of Sciences, North Carolina State University, Raleigh, North Carolina, USA.

4Department of Computational Medicine & Bioinformatics and

5Division of Rheumatology, Department of Internal Medicine, University of Michigan, Ann Arbor, Michigan, USA.

6Department of Dermatology, Erasmus University Medical Center, Rotterdam, Netherlands.

7Laboratory of Genetic Skin Diseases, Imagine Institute, Paris, France.

8Department of Dermatology and Allergy, University Medical Center Schleswig-Holstein, Kiel, Germany.

9Ann Arbor Veterans Affairs Hospital, Ann Arbor, Michigan, USA.

10Institute for Computational Genomics, Joint Research Center for Computational Biomedicine, RWTH Aachen University, Aachen, Germany.

11Department of Biostatistics, University of Michigan, Ann Arbor, Michigan, USA.

Address correspondence to: Lam C. Tsoi, University of Michigan Department of Dermatology, 7421 Medical Science Building I 1301 E. Catherine St., Ann Arbor, Michigan 48109-5609, USA. Phone: 734.647.8070; Email: alextsoi@med.umich.edu.

Find articles by Kahlenberg, J. in: PubMed | Google Scholar |

1Department of Dermatology, Michigan Medicine, University of Michigan, Ann Arbor, Michigan, USA.

2Center for Autoimmune Genomics and Etiology (CAGE), Cincinnati Children’s Hospital Medical Center, Cincinnati, Ohio, USA.

3College of Sciences, North Carolina State University, Raleigh, North Carolina, USA.

4Department of Computational Medicine & Bioinformatics and

5Division of Rheumatology, Department of Internal Medicine, University of Michigan, Ann Arbor, Michigan, USA.

6Department of Dermatology, Erasmus University Medical Center, Rotterdam, Netherlands.

7Laboratory of Genetic Skin Diseases, Imagine Institute, Paris, France.

8Department of Dermatology and Allergy, University Medical Center Schleswig-Holstein, Kiel, Germany.

9Ann Arbor Veterans Affairs Hospital, Ann Arbor, Michigan, USA.

10Institute for Computational Genomics, Joint Research Center for Computational Biomedicine, RWTH Aachen University, Aachen, Germany.

11Department of Biostatistics, University of Michigan, Ann Arbor, Michigan, USA.

Address correspondence to: Lam C. Tsoi, University of Michigan Department of Dermatology, 7421 Medical Science Building I 1301 E. Catherine St., Ann Arbor, Michigan 48109-5609, USA. Phone: 734.647.8070; Email: alextsoi@med.umich.edu.

Find articles by Prens, E. in: PubMed | Google Scholar |

1Department of Dermatology, Michigan Medicine, University of Michigan, Ann Arbor, Michigan, USA.

2Center for Autoimmune Genomics and Etiology (CAGE), Cincinnati Children’s Hospital Medical Center, Cincinnati, Ohio, USA.

3College of Sciences, North Carolina State University, Raleigh, North Carolina, USA.

4Department of Computational Medicine & Bioinformatics and

5Division of Rheumatology, Department of Internal Medicine, University of Michigan, Ann Arbor, Michigan, USA.

6Department of Dermatology, Erasmus University Medical Center, Rotterdam, Netherlands.

7Laboratory of Genetic Skin Diseases, Imagine Institute, Paris, France.

8Department of Dermatology and Allergy, University Medical Center Schleswig-Holstein, Kiel, Germany.

9Ann Arbor Veterans Affairs Hospital, Ann Arbor, Michigan, USA.

10Institute for Computational Genomics, Joint Research Center for Computational Biomedicine, RWTH Aachen University, Aachen, Germany.

11Department of Biostatistics, University of Michigan, Ann Arbor, Michigan, USA.

Address correspondence to: Lam C. Tsoi, University of Michigan Department of Dermatology, 7421 Medical Science Building I 1301 E. Catherine St., Ann Arbor, Michigan 48109-5609, USA. Phone: 734.647.8070; Email: alextsoi@med.umich.edu.

Find articles by Hovnanian, A. in: PubMed | Google Scholar |

1Department of Dermatology, Michigan Medicine, University of Michigan, Ann Arbor, Michigan, USA.

2Center for Autoimmune Genomics and Etiology (CAGE), Cincinnati Children’s Hospital Medical Center, Cincinnati, Ohio, USA.

3College of Sciences, North Carolina State University, Raleigh, North Carolina, USA.

4Department of Computational Medicine & Bioinformatics and

5Division of Rheumatology, Department of Internal Medicine, University of Michigan, Ann Arbor, Michigan, USA.

6Department of Dermatology, Erasmus University Medical Center, Rotterdam, Netherlands.

7Laboratory of Genetic Skin Diseases, Imagine Institute, Paris, France.

8Department of Dermatology and Allergy, University Medical Center Schleswig-Holstein, Kiel, Germany.

9Ann Arbor Veterans Affairs Hospital, Ann Arbor, Michigan, USA.

10Institute for Computational Genomics, Joint Research Center for Computational Biomedicine, RWTH Aachen University, Aachen, Germany.

11Department of Biostatistics, University of Michigan, Ann Arbor, Michigan, USA.

Address correspondence to: Lam C. Tsoi, University of Michigan Department of Dermatology, 7421 Medical Science Building I 1301 E. Catherine St., Ann Arbor, Michigan 48109-5609, USA. Phone: 734.647.8070; Email: alextsoi@med.umich.edu.

Find articles by Weidinger, S. in: PubMed | Google Scholar |

1Department of Dermatology, Michigan Medicine, University of Michigan, Ann Arbor, Michigan, USA.

2Center for Autoimmune Genomics and Etiology (CAGE), Cincinnati Children’s Hospital Medical Center, Cincinnati, Ohio, USA.

3College of Sciences, North Carolina State University, Raleigh, North Carolina, USA.

4Department of Computational Medicine & Bioinformatics and

5Division of Rheumatology, Department of Internal Medicine, University of Michigan, Ann Arbor, Michigan, USA.

6Department of Dermatology, Erasmus University Medical Center, Rotterdam, Netherlands.

7Laboratory of Genetic Skin Diseases, Imagine Institute, Paris, France.

8Department of Dermatology and Allergy, University Medical Center Schleswig-Holstein, Kiel, Germany.

9Ann Arbor Veterans Affairs Hospital, Ann Arbor, Michigan, USA.

10Institute for Computational Genomics, Joint Research Center for Computational Biomedicine, RWTH Aachen University, Aachen, Germany.

11Department of Biostatistics, University of Michigan, Ann Arbor, Michigan, USA.

Address correspondence to: Lam C. Tsoi, University of Michigan Department of Dermatology, 7421 Medical Science Building I 1301 E. Catherine St., Ann Arbor, Michigan 48109-5609, USA. Phone: 734.647.8070; Email: alextsoi@med.umich.edu.

Find articles by Elder, J. in: PubMed | Google Scholar |

1Department of Dermatology, Michigan Medicine, University of Michigan, Ann Arbor, Michigan, USA.

2Center for Autoimmune Genomics and Etiology (CAGE), Cincinnati Children’s Hospital Medical Center, Cincinnati, Ohio, USA.

3College of Sciences, North Carolina State University, Raleigh, North Carolina, USA.

4Department of Computational Medicine & Bioinformatics and

5Division of Rheumatology, Department of Internal Medicine, University of Michigan, Ann Arbor, Michigan, USA.

6Department of Dermatology, Erasmus University Medical Center, Rotterdam, Netherlands.

7Laboratory of Genetic Skin Diseases, Imagine Institute, Paris, France.

8Department of Dermatology and Allergy, University Medical Center Schleswig-Holstein, Kiel, Germany.

9Ann Arbor Veterans Affairs Hospital, Ann Arbor, Michigan, USA.

10Institute for Computational Genomics, Joint Research Center for Computational Biomedicine, RWTH Aachen University, Aachen, Germany.

11Department of Biostatistics, University of Michigan, Ann Arbor, Michigan, USA.

Address correspondence to: Lam C. Tsoi, University of Michigan Department of Dermatology, 7421 Medical Science Building I 1301 E. Catherine St., Ann Arbor, Michigan 48109-5609, USA. Phone: 734.647.8070; Email: alextsoi@med.umich.edu.

Find articles by Kuo, C. in: PubMed | Google Scholar |

1Department of Dermatology, Michigan Medicine, University of Michigan, Ann Arbor, Michigan, USA.

2Center for Autoimmune Genomics and Etiology (CAGE), Cincinnati Children’s Hospital Medical Center, Cincinnati, Ohio, USA.

3College of Sciences, North Carolina State University, Raleigh, North Carolina, USA.

4Department of Computational Medicine & Bioinformatics and

5Division of Rheumatology, Department of Internal Medicine, University of Michigan, Ann Arbor, Michigan, USA.

6Department of Dermatology, Erasmus University Medical Center, Rotterdam, Netherlands.

7Laboratory of Genetic Skin Diseases, Imagine Institute, Paris, France.

8Department of Dermatology and Allergy, University Medical Center Schleswig-Holstein, Kiel, Germany.

9Ann Arbor Veterans Affairs Hospital, Ann Arbor, Michigan, USA.

10Institute for Computational Genomics, Joint Research Center for Computational Biomedicine, RWTH Aachen University, Aachen, Germany.

11Department of Biostatistics, University of Michigan, Ann Arbor, Michigan, USA.

Address correspondence to: Lam C. Tsoi, University of Michigan Department of Dermatology, 7421 Medical Science Building I 1301 E. Catherine St., Ann Arbor, Michigan 48109-5609, USA. Phone: 734.647.8070; Email: alextsoi@med.umich.edu.

Find articles by Gudjonsson, J. in: PubMed | Google Scholar |

1Department of Dermatology, Michigan Medicine, University of Michigan, Ann Arbor, Michigan, USA.

2Center for Autoimmune Genomics and Etiology (CAGE), Cincinnati Children’s Hospital Medical Center, Cincinnati, Ohio, USA.

3College of Sciences, North Carolina State University, Raleigh, North Carolina, USA.

4Department of Computational Medicine & Bioinformatics and

5Division of Rheumatology, Department of Internal Medicine, University of Michigan, Ann Arbor, Michigan, USA.

6Department of Dermatology, Erasmus University Medical Center, Rotterdam, Netherlands.

7Laboratory of Genetic Skin Diseases, Imagine Institute, Paris, France.

8Department of Dermatology and Allergy, University Medical Center Schleswig-Holstein, Kiel, Germany.

9Ann Arbor Veterans Affairs Hospital, Ann Arbor, Michigan, USA.

10Institute for Computational Genomics, Joint Research Center for Computational Biomedicine, RWTH Aachen University, Aachen, Germany.

11Department of Biostatistics, University of Michigan, Ann Arbor, Michigan, USA.

Address correspondence to: Lam C. Tsoi, University of Michigan Department of Dermatology, 7421 Medical Science Building I 1301 E. Catherine St., Ann Arbor, Michigan 48109-5609, USA. Phone: 734.647.8070; Email: alextsoi@med.umich.edu.

Find articles by Tsoi, L. in: PubMed | Google Scholar |

Published December 22, 2023 - More info

Published in Volume 8, Issue 24 on December 22, 2023
JCI Insight. 2023;8(24):e172956. https://doi.org/10.1172/jci.insight.172956.
© 2023 Patrick et al. This work is licensed under the Creative Commons Attribution 4.0 International License. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.
Published December 22, 2023 - Version history
Received: June 8, 2023; Accepted: November 8, 2023
View PDF
Abstract

Long noncoding RNAs (lncRNAs) regulate the expression of protein-coding genes and have been shown to play important roles in inflammatory skin diseases. However, we still have limited understanding of the functional impact of lncRNAs in skin, partly due to their tissue specificity and lower expression levels compared with protein-coding genes. We compiled a comprehensive list of 18,517 lncRNAs from different sources and studied their expression profiles in 834 RNA-Seq samples from multiple inflammatory skin conditions and cytokine-stimulated keratinocytes. Applying a balanced random forest to predict involvement in biological functions, we achieved a median AUROC of 0.79 in 10-fold cross-validation, identifying significant DNA binding domains (DBDs) for 39 lncRNAs. G18244, a skin-expressing lncRNA predicted for IL-4/IL-13 signaling in keratinocytes, was highly correlated in expression with F13A1, a protein-coding gene involved in macrophage regulation, and we further identified a significant DBD in F13A1 for G18244. Reflecting clinical implications, AC090198.1 (predicted for IL-17 pathway) and AC005332.6 (predicted for IFN-γ pathway) had significant negative correlation with the SCORAD metric for atopic dermatitis. We also utilized single-cell RNA and spatial sequencing data to validate cell type specificity. Our research demonstrates lncRNAs have important immunological roles and can help prioritize their impact on inflammatory skin diseases.

Introduction

Less than 2% of the human genome is translated into proteins (1); however, 8%–15% is estimated to be functional (2, 3), and results from ENCODE suggest as much as 80% of the genome can have biological activity (4). Advances in high-resolution sequencing technologies have enabled deep molecular profiling, identifying elements that were not previously revealed due to their cell type specificity and low-expressing nature. Long noncoding RNA (lncRNA; 200 or more nucleotides) is arguably the gene category that has expanded the most in the genomic era, from 9,640 in 2012 (5) to 19,928 in 2022 (6). lncRNAs are of particular interest (7) for their ability to regulate gene expression (8) and alter splicing control (9) by binding to target DNA; there is also some evidence that lncRNAs may act as a scaffold to recruit proteins (10, 11). Therefore, lncRNAs have been shown to affect the regulation of different biological mechanisms; however, few lncRNAs have been functionally characterized (12), and much remains to be discovered regarding their role in health and disease.

Specific lncRNAs have been found to operate in different skin conditions (13), such as ANRIL for melanoma (14), HOTAIR for systemic sclerosis (15), TINCR for burn injuries (16) and epidermal differentiation more generally (17), MALAT1 for squamous cell carcinoma (18), and WAKMAR1 for wound healing (19). Our previous study (20) revealed that many skin-expressing lncRNAs — including lncRNAs G2608 and G25746 — are dysregulated in lesional skin of patients with psoriasis. Intriguingly, compared with previously annotated lncRNAs, a greater proportion of the 1,080 potentially novel skin-expressing lncRNAs we identified were differentially expressed in psoriasis. lncRNAs are highly tissue specific (21), and this can be beneficial for precision medicine. For example, drug-response models trained using lncRNAs outperformed protein-coding models for several medications (22). However, tissue specificity — along with unique spatiotemporal dynamics (23), lower overall expression, and poor sequence conservation (24) (compared with protein-coding transcripts) — can make the biological roles of lncRNAs difficult to determine.

Various approaches have been applied to predict lncRNA biological functions. Proximity to coding genes (e.g, direct sense-antisense pairing; refs. 25, 26) can reveal the function of some lncRNAs. Sequence motifs have also been used to associate with subcellular localization or protein binding (27). For example, XIST contains a repeat E motif, without which it is unable to interact with CIZ1 for enrichment in the nucleus (28). However, this approach requires careful analysis to understand how specific motifs may influence the function of each lncRNA. Automated techniques are increasingly being developed that apply machine learning to gene coexpression (29) or generic sequence features such as k-mer profiles (30, 31). It is also possible to predict lncRNA disease associations through matrix completion of similarity based on disease-gene and disease-ontology pairs (32). However, all these approaches depend on the quality and quantity of existing data, which is particularly limited for skin conditions and cutaneous biology.

In our study, we apply machine-learning techniques to a large collection of RNA-Seq data sets from different in vivo and in vitro experiments to predict lncRNAs for specific cytokine pathways; we then utilize the sequence information to refine the regulatory effects. Our study provides a comprehensive catalog for lncRNA genomics and highlights the potential roles of lncRNAs in inflammatory skin diseases.

Results

lncRNA are differentially expressed in inflammatory skin diseases. Figure 1 provides an overview of the workflow for our approach. We profiled gene expression in 434 transcriptomes from 4 inflammatory skin disease RNA-Seq cohorts (atopic dermatitis [AD], psoriasis, Netherton syndrome, and hidradenitis suppurativa [HS]), revealing 11,082 lncRNAs and 18,022 protein-coding genes to be expressed in at least 1 skin condition (Table 1). Of these, 12,547 genes — including 4,163 (33%) lncRNAs — were significantly (FDR ≤ 0.05) differentially expressed in at least 1 condition (Supplemental Tables 1–4; supplemental material available online with this article; https://doi.org/10.1172/jci.insight.172956DS1). Interestingly, for all 4 diseases, a higher proportion of the skin-expressing lncRNAs was differentially expressed compared with protein-coding genes, suggesting that cellular and molecular changes at the tissue level might have a higher regulatory effect on genes with high cell type specificity, and their overall expressions can be affected by cellular composition changes (33, 34). Importantly, we identified a core set of 51 lncRNAs that were dysregulated in the same direction (14 upregulated and 37 downregulated) for all 4 skin diseases (Supplemental Figures 2 and 3).

Overview of our approach.Figure 1

Overview of our approach. Workflow describing the different data sources and steps in our prediction and analysis of lncRNA functions.

Table 1

Differential expression results

Gene expression was further evaluated in cytokine-stimulated (IFN-α, IFN-γ, IL-4, IL-13, IL-17, IL-17 + TNF, and TNF) keratinocytes and unstimulated keratinocytes (50 samples of each condition). Of the 4,543 lncRNA and 15,406 protein-coding genes expressed (Supplemental Table 5), 2,179 genes — including 429 (20%) lncRNAs — were significantly (FDR ≤ 0.05) differentially expressed in at least 1 stimulation (Supplemental Tables 6–12). By contrast with the skin conditions, the proportions of skin-expressing lncRNAs that are differentially expressed were consistently lower than those for protein-coding genes for all 7 cytokine stimulations. For each pair of cytokines and skin conditions, we counted the number of shared differentially expressed lncRNA (Supplemental Table 13) and found a total of 17 pairs to be significantly enriched after correcting for multiple testing through Bonferroni adjustment, including IFN-γ in psoriasis (P = 6.1 × 10–18, OR = 3.37) and IL-13 (P = 1.1 × 10–5, OR = 63.90) in AD.

Of the 4,163 lncRNAs (5%) differentially expressed in a skin condition, 217 were differentially expressed in a cytokine simulation, including 5 of the 51 lncRNAs (10%) that have the same direction of effect in all 4 skin conditions. Interestingly, these 5 lncRNAs were more likely to be upregulated rather than downregulated in the skin conditions (P = 0.018, OR = 13.1, Fisher enrichment test). We also calculated the distance of each of the 51 lncRNAs to H3K27ac marks for active enhancers (35) and the nearest protein-coding gene expressed in a cytokine stimulation (Supplemental Table 14); however, no significant differences were found comparing lncRNAs dysregulated/not dysregulated in at least 1 cytokine condition.

Predicting cytokine pathways in skin for lncRNAs. To predict the potential functions of the lncRNAs, we compiled multiple gene sets (Supplemental Table 15) relevant to pathways of cytokine response from 5 different sources: BioCarta (36), Reactome (37), NCI (38), GO (39), and KEGG (40). Aggregating these gene sets provides 135 protein-coding genes for the IFN-α pathway, 239 for IFN-γ, 120 for IL-17, 195 for IL-4/IL-13, and 330 for TNF. We selected these genes as the positive set to predict the involvement of lncRNAs for each cytokine signaling pathway, using the aforementioned RNA-Seq samples from inflammatory skin diseases and cytokine stimulations as training data. An additional 129 samples, available from the same data sets (taken, for example, from nonlesional sites), were included such that the training data had 834 samples in total.

Machine-learning prediction was conducted using 10-fold cross-validation to help reduce the risk of overfitting. We benchmarked 10 different classifiers on the cytokine pathways using the protein-coding genes (Figure 2A and Supplemental Figure 4) and found they gave consistently satisfactory performance, with 8 of 10 classifiers having an area under the receiver operating characteristic (AUROC) greater than 0.6 for all 5 pathways. The highest-performing classifier overall was random forest, which had a median AUROC (across the pathways) of 0.75. We further found that, when setting the number of cases and controls in each tree of the random forest to be equal (i.e., balanced), this significantly increased the median AUROC to 0.79 in cross-validation (paired Wilcoxon P = 0.031). When dividing the data into 70% training and 30% testing, the balanced random forest achieved 0.80 AUROC for IFN-α, 0.79 for IFN-γ, 0.79 for IL-4/IL-13, 0.71 for IL-17, and 0.74 for TNF.

Results of lncRNA function prediction.Figure 2

Results of lncRNA function prediction. (A) Benchmarking the performance of classifiers across different cytokine pathways on the prediction task, and measuring according to the AUROC on protein-coding genes. (B) The F-measure curve for protein-coding genes in each cytokine pathway, as a harmonic mean of precision and recall, is used to set the threshold for lncRNA prediction. (C) Heatmap of correlations between the top 3 lncRNA for each cytokine pathway and the most correlated protein-coding genes. *P < 1e-10; **P ≤ 1e-100. (D) Enrichment of lncRNA predicted for different cytokines in k-mer clusters from SEEKR analysis. *nominally significant (P < 0.05); ***significant after Bonferroni (p_adj < 0.05).

Applying the balanced random forest classifier to predict which lncRNAs participate in each cytokine pathway, we averaged the prediction scores for every gene over 100 trials (Supplemental Table 16). AL445490.1, the lncRNA with the highest prediction score for IFN-α (mean score = 0.931), is antisense and positioned intronic and upstream of IFN-α–inducible protein 6 (IFI6) and was previously found to be highly expressed in an inflammatory subtype of breast cancer (41). Indeed, we found the expression of AL445490.1 to be highly correlated with IFI6 (Spearman r = 0.79, P = 1.5 × 10–175) across our keratinocyte and skin disease samples. The top predicted lncRNA for other cytokines are differentially expressed and/or play roles in different kinds of cancer: AC016027.1 for TNF (mean score = 0.81 of 1) in colorectal cancer (42, 43); AC005332.6 for IFN-γ (mean score = 0.94) in pancreatic cancer (44); and AC008074.2 for IL-17 (mean score = 0.87) in bladder cancer (45). Meanwhile, the highest prediction for the IL-4/IL-13 pathway, G18244 (mean score = 0.94), was identified as a skin-expressing lncRNA in our previous study (20). Interestingly, single-cell RNA-Seq (scRNA-Seq) analysis on 17 skin samples (46) demonstrates that AC016027.1 and AC008074.2 are more expressed in keratinocytes, while G18244 has greater expression in endothelial cells (Supplemental Figure 5–7).

We set a prediction score threshold for each cytokine pathway by maximizing the F-statistic (Figure 2B). The F-statistic represents the harmonic mean of precision (the proportion of genes predicted to be involved in a pathway that actually are involved) and recall (the proportion of genes involved in the pathway that are successfully predicted to be involved), thus providing an optimal trade-off between these 2 competing goals. We calculated the maximum F-statistic for each cytokine pathway using the protein-coding genes and set the threshold 1 percentage point lower (to account for statistical error); we then included all lncRNAs with mean prediction scores greater than this in the final set of lncRNAs predicted for each cytokine (Supplemental Table 17). Table 2 presents a summary of these results and shows that we have predicted 27 lncRNAs for IFN-α, 148 for IFN-γ, 683 for IL-4/IL-13, 25 for IL-17, and 35 for TNF, with 165 lncRNAs being predicted for more than 1 pathway and 721 lncRNAs overall (Supplemental Figure 8). We compared the predicted lncRNAs with those differentially expressed for each cytokine (Supplemental Table 18) and found significant enrichment (after Bonferroni correction) among lncRNAs predicted for IFN-α (P = 5.5 × 10–8, OR = 750.8), IFN-γ (P = 3.80 × 10–21, OR = 14.2), and IL-17 (P = 4.0 × 10–3, OR = 383.5).

Table 2

Summary of machine-learning prediction results for each cytokine

Exploring a possible further refinement, we aggregated interaction scores (from the RNAInter v4.0 database) between lncRNAs and the coding genes in each pathway; we then correlated these with the prediction scores from our machine-learning framework. Interestingly, the correlation for all 5 cytokine pathways — IFN-α (r = 0.26, P = 3.2 × 10–182), IFN-γ (r = 0.24, P = 2.5 × 10–153), IL-17 (r = 0.33, P = 3.7 × 10–308), IL-4/IL-13 (r = 0.33, P = 5.0 × 10–296), and TNF (r = 0.27, P = 5.4 × 10–191) — was strong, although it is difficult to evaluate the effect on predictions without a gold standard for lncRNA functions.

Assessing the roles of predicted lncRNAs. Figure 2C presents the top 3 lncRNAs predicted for each cytokine pathway, along with the protein-coding genes that have the highest correlation in gene expression with each one, across the different samples in the training data set. For the IL-17 pathway, 2 of the top 3 lncRNAs, AC090198.1 and TRA3IP2-AS1, have the highest correlation with the protein-coding gene PIK3R1, so there are only 2 protein-coding genes shown. As expected, lncRNAs tend to have a higher correlation with protein-coding genes in their predicted pathways (Figure 2C), and when extending to all predicted lncRNAs, we found they had consistently and significantly (after Bonferroni correction) higher mean correlation with protein-coding genes connected with the cytokines they are predicted for compared with those from other cytokines (Supplemental Table 19). Interestingly, in the IL-4/IL-13 pathway, AC005288.1 has a strong positive correlation with GRB2 and a weaker negative correlation with the other 2 protein-coding genes (F13A1 and COL1A2), while for lncRNAs G18244 and DNM3OS, it is the opposite. IL-4 and IL-13 antagonize IFN-γ and TNF via GATA3 and STAT6 (47); accordingly, we found AC005288.1 to be negatively correlated with GATA3 and STAT6, while G18244 and DNM3OS were positively correlated. There is also some overlap in the correlations between the different cytokine pathways, with lncRNAs predicted for IFN-α also being correlated with protein-coding genes involved in TNF and vice versa. To evaluate the clinical implication of our analysis, we studied whether the top-prediction results would be biomarkers for disease severity. Interestingly, we observed multiple instances where the expressions of lncRNAs in AD lesional skin are significantly correlated with SCORAD, a clinical tool used to assess extent and severity (48), including AC090198.1 (P = –0.48; P = 1 × 10–4) and AC005332.6 (P = –0.42; P = 1 × 10–4).

Another lncRNA of interest, LINC02541, predicted by our machine-learning approach to be involved in the IFN-γ pathway (mean score = 0.822), was significantly negatively correlated (r = –0.628, P = 9.5 × 10–93) with STAT3, a gene known to be involved in the pathogenesis of psoriasis (49), among other inflammatory skin diseases. Using scRNA-Seq, Figure 3, A and B, we show that LINC02541 is specifically expressed in keratinocytes, and we found it to be significantly downregulated (P = 1.6 × 10–20, FC = 0.48) in keratinocytes upon stimulation with IFN-γ (Figure 3C), while STAT3 was significantly upregulated (P = 2.1 × 10–38, FC = 3.31). We used spatial transcriptomic data to further assay the expression profile of LINC02541 in skin, highlighting that its expression is mostly in the basal/differentiated compartments in epidermis (Figure 3, D and E). Furthermore, LINC02541 was consistently downregulated in psoriasis (P = 2.6 × 10–20, FC = 0.54), AD (P = 4.0 × 10–10, FC = 0.54), and Netherton syndrome (P = 1.0 × 10–3, FC = 0.55), while STAT3 was upregulated in all 3 diseases. These results suggest a potential regulatory role in skin inflammation for LINC02541 through STAT3-mediated IFN-γ signaling and demonstrate the utility of our approach to reveal previously unknown roles of lncRNAs.

Investigation of LINC02541.Figure 3

Investigation of LINC02541. (A) UMAP plot of scRNA-Seq analysis, with color scale representing normalized gene expression in each cell. (B) Dot plot of scRNA-Seq analysis, with color scale representing average expression and size representing percentage of cells in which the gene is expressed. (C) Coverage plot comparing read counts for a keratinocyte sample stimulated with IFN-γ against control, illustrating downregulation upon cytokine stimulation. (D) Spatial plot of keratinized keratinocyte predicted proportions shows the approximate location of keratinocytes within the tissue sample. (E) Histology image, confirming LINC02541 (indicated in yellow) is expressed mostly by the keratinocytes.

The highest correlation we observed was between one of our skin-expressed lncRNAs, G18244, and the protein-coding gene F13A1 (r = 0.978, P < 5 × 10–324) in the IL-4/IL-13 pathway. F13A1 encodes subunit A of factor XIII, which plays important roles in skin pathophysiology (50). F13A1 is involved in macrophage regulation (51, 52), which can be stimulated by IL-4 (53, 54). To further investigate the link between G18244 and F13A1, we applied the Triplex Domain Finder (TDF) from the Regulatory Genomics Toolbox (RGT) (55), which tests for DNA binding domains (DBDs) between lncRNAs and DNA promoter regions. Applying this approach to G18244, we identified a significant DBD (OR = 171.8, P = 4 × 10–2) that encompasses the promoter for F13A1. Significantly, when screening putative DBD for G18244 against promoters for 500 randomly selected protein-coding genes, there were no significant DBDs. We expanded our RGT-TDF analysis to the lncRNAs predicted for each pathway (under the F-statistic thresholds), using the protein-coding genes from each respective cytokine pathway for the target promoters, and we found significant DBDs for 38 lncRNAs predicted for the IL-4/IL-13 cytokine pathway, 2 for IFN-γ, and 1 for TNF, with 39 unique lncRNAs in total (Supplemental Table 20). Five of the lncRNAs predicted for IL-4/IL-13 (AC022098.1, AL139423.1, KCNQ1OT1, LINC00861, and MEG3) had multiple significant DBDs (Figure 4).

Circos plot showing RGT-TDF predictions for IL-4/IL-13.Figure 4

Circos plot showing RGT-TDF predictions for IL-4/IL-13. The outer circle indicates the different lncRNA predicted by our machine-learning approach and the protein-coding genes annotated as being involved in the pathway. Inside are bar plots showing the –log10 P values from the differential expression for atopic dermatitis (red), IL-4 stimulation (blue), and IL-13 stimulation (green). The innermost part of the plot shows the links between the lncRNAs and the protein-coding genes for the predicted DNA binding domains (DBDs), color-coded to distinguish the different lncRNAs.

Inspired by previous work (30), which suggests that short motif sequences of lncRNAs (k-mers) may provide clues for their function, we counted the number of different 6-mers for each lncRNA in our combined catalog. Then, we performed community detection using SEEKR (30), an efficient tool that performs hierarchical clustering based on the Pearson correlation metric. SEEKR identified 6 distinct clusters of lncRNAs (Figure 2D), one of which (cluster 2) is enriched among lncRNAs predicted for all 5 cytokine pathways. By contrast, cluster 3 appears to be specific to TNF and IL-4/IL-13, while cluster 4 was more enriched among lncRNAs predicted to be involved in IL-17 and IFN-γ. This supports previous research claims that sequence motifs in lncRNAs can affect their function, confirms the distinctiveness of lncRNAs predicted by our approach, and suggests some grouping among lncRNAs involved in different cytokine pathways.

Discussion

Previous work has demonstrated that lncRNAs can play important roles in inflammatory skin diseases (20); however, little is known regarding the specific immune functions of these genes. By combining multiple gene catalogs with our own skin-expressing transcripts from the largest available data set (>200 RNA-Seq lesional and normal skin samples), we increased the number of lncRNAs to investigate by 30%. Other sources of potentially novel lncRNA annotations are available; however, many of these overlap with our data set (e.g., >99% for LNCipedia; ref. 56); we decided to focus primarily on established and skin-expressing lncRNAs in this study; we also acknowledge that newer versions of GENCODE are available since the time of our analysis. Interestingly, we found that a higher proportion of skin-expressing lncRNAs was significantly differentially expressed for skin diseases compared with protein-coding genes, whereas, for cytokine stimulations in keratinocytes, the converse was true. This simultaneously highlights the importance of lncRNAs for skin diseases and the difficulties in interpreting their function. lncRNAs are especially difficult to analyze because they tend to have lower expression levels than protein-coding genes, and their expression patterns are highly context specific. We countered these difficulties by preparing a large data set of RNA-Seq gene expression (834 samples in total) across 4 skin diseases and 7 cytokine stimulations; we then applied advanced machine-learning techniques to predict lncRNA functions, using promoter binding analysis and sequence motifs (k-mers) to interpret the results.

We applied a machine-learning approach to predict lncRNAs involved in pathways of response to IFN-α, IFN-γ, IL-4/IL-13, IL-17 and TNF, using protein coding genes from these pathways as training data, since lncRNAs’ expression has been found to correlate positively with their coding targets (57–59). The highest-performing classifier in our benchmarking analysis across the different cytokines was random forest, and we further improved its performance (from 0.75 to 0.79 AUROC) by requiring all the trees in the classifier to be balanced — i.e., having an equal number of cases and controls. This helps account for bias in the proportion of cases and controls, with far fewer protein-coding genes being annotated as involved in a cytokine pathway than not. In addition to splitting the data 70/30 for training and testing, we confirmed the robustness of our approach and addressed the potential risk of overfitting by using 10-fold cross-validation, repeating the classification analysis 100 times to obtain an average prediction score for each lncRNA in each cytokine pathway. We also observed consistently high performance for the different classifiers, with 8 of 10 having AUROC > 0.6 for all 5 pathways. Nevertheless, there may be measurement errors due to the methodology employed, and the outcome cannot be considered a ground truth.

Since RNA-Seq samples from cytokine stimulations were used in the training data, we tried excluding them to test whether they could give an unfair advantage to predictions for cytokine pathways (Supplemental Table 21); however, the new prediction scores were highly correlated (r > 0.8 for every cytokine) with those that include these samples, suggesting that inclusion of the cytokine stimulations did not overly bias the results. Furthermore, we trained the balanced random forest classifier using 200-dimension gene embeddings created from 984 data sets in GEO (60) and compared the prediction results with our own for the 1,166 lncRNAs available in that data set. Although the Pearson correlations were significant for IFN-γ, IL-4/IL-13, and TNF (but not IFN-α or IL-17), they were much smaller in size (r < 0.3 for every cytokine) and smaller also than those for the protein-coding genes (which were all significant), highlighting the tissue specificity of lncRNAs and the utility of our own large skin/keratinocyte RNA-Seq data set.

The lack of previous research into lncRNAs in skin diseases can be a limitation on interpreting the findings of our study. Most previous studies have focused on specific lncRNAs (15, 19), so when researching the highest predicted lncRNAs for each cytokine, literature was limited to reports of putative roles in cancer. These previous studies confirm that lncRNAs can have an immune role, but their involvement in skin biology is a potentially novel finding. Some of the lncRNAs we identified (e.g., G18244) are new to our knowledge, but we can understand their mechanisms of action through correlation with protein-coding genes (for example, F13A1) and provide further confirmation through promoter testing using RGT-TDF analysis. In addition, k-mer clustering and correlation analysis was used to help reveal patterns and provide insight into the role of sequence motifs. Overall, our study provides a large body of functional predictions and research findings, which can be used to guide future research into the roles of lncRNAs in inflammatory skin diseases.

Methods

Merged gene catalog. We compiled a comprehensive gene catalog, including 18,517 lncRNAs, by combining data from GENCODE v29 (6) with Broad’s Human Body Map (21) and the skin-expressing transcripts identified in our previous study (20). This increased the number of lncRNAs by 31% compared with using GENCODE alone (Supplemental Figure 1), bringing it closer to the 20,110 protein-coding genes. Our gene catalog also includes 14,881 pseudogenes; 7,517 short noncoding RNAs; and 1,424 other genes, which we group together into the “other” category.

Gene transfer format (GTF) files were merged across the 3 data sets using our custom software, GTFmerge (https://github.com/CutaneousBioinf/GTF; commit ID 44a323b), which is written in C++ for efficiency. GTFmerge only adds genes that are nonoverlapping, by checking whether any of the exons from the target GTF overlap with an exon from the reference GTF. First, we set GENCODE as the reference and the Human Body Map as the target; then, we set the merged GTF as the reference and our skin-expressing transcripts as the target, so as to create a new merged GTF comprising all 3 sources.

RNA-Seq data processing. RNA-Seq samples were processed, as described in their respective papers, for AD (61), psoriasis (20), Netherton syndrome (62), and HS (34), using STAR for alignment and HTSeq to count the number of reads for each gene in our merged gene catalog. In addition to the lesional/control samples used in differential expression analysis (Table 1), additional samples were included in machine learning: 6 acute AD, 27 nonlesional AD, 28 psoriasis lesional, and 27 psoriasis nonlesional from ref. 61; 28 nonlesional psoriasis from ref. 20; and 13 nonlesional Netherton from ref. 62. Differential expression analysis was performed using limma (63). We compared lesional skin with control samples from healthy patients and included all available covariates (sex for AD; sex and age for Netherton syndrome and HS). Genes were considered differentially expressed if they have |logFC| ≥ 1 and FDR ≤ 0.05. Single-cell and spatial data were extracted from a previous study (64) and analysis performed to reveal the cell and spatial specificity of particular lncRNAs. Spatial deconvolution of cell types was performed using conditional autoregressive-based deconvolution (65).

Machine learning. We performed supervised machine learning for functional prediction, using 10-fold cross-validation on a range of different classifiers (rpart, naiveBayes, logreg, ksvm, ctree, gbm, plsdaCaret, lda, sda, and randomForest) through the MLR package in R, using default parameters for each. We performed 5 binary classification tasks, 1 for each cytokine pathway (aggregated from gene sets; Supplemental Table 15), with genes as the instances and disease/cytokine stimulation expression profiles (samples) as the features. Each task was trained using the expression of all genes on all samples, so the size of the training set was the same for each task: 20,110 protein-coding genes, with 834 samples. For the balanced random forest, we sampled all genes involved in the pathway and an equal number not involved. Protein-coding genes annotated as belonging to each cytokine in the aggregated pathways were marked as positive cases, and the other protein-coding genes were marked as negative controls for the purposes of classification; hence, our approach can be considered a form of positive-unlabeled (PU) classification. Expression data were inverse normalized for use in machine learning. The resulting models were then applied to predict lncRNA for each cytokine pathway, using the maximum F-statistic to set thresholds on the prediction score for each cytokine.

Promoter testing and k-mer clustering. We applied the TDF from the RGT (55) using the hg19 reference genome and the default input parameters (promoter length: 1,000; minimum length of triplex: 20; triplex-forming promoter cut-off: 0.1; tolerated number of consecutive errors: 2). The k-mer clustering tool SEEKR (30) was applied using k-mers of length 6 and the default threshold for the adjacency matrix (0.13). We then calculated enrichments of lncRNAs predicted for different cytokine pathways using Fisher’s exact test.

Statistics. We performed various statistical tests, including Spearman/Pearson correlation, Fisher enrichment, and Wilcoxon rank-sum tests. Where appropriate, Bonferroni adjustment was used to correct for multiple tests.

Study approval. Since our study focused on the analysis and utilization of previously collected data, no prior approval was required.

Data availability. The GENCODE v29 GTF can be downloaded from the GENCODE website (https://www.gencodegenes.org/human/release_29.html). The additional lncRNA GTF has been deposited in GEO (GSE63979), as has the AD (GSE224783), psoriasis (GSE63979), Netherton syndrome (GSE164285), and HS (GSE154773) expression data. Values for all data points in graphs are reported in the Supporting Data Values file.

Author contributions

Design of research studies was contributed by MTP and LCT. Data were acquired by MKS, JEG, and LCT. Data were analyzed by MTP, SS, AS, and RW. The original draft was written by MTP and LCT. Review and editing were contributed by MTP, SS, AS, RW, MKS, JC, TQ, ACB, JMK, EP, AH, SW, JTE, CCK, JEG, and LCT.

Supplemental material

View Supplemental data

View Supplemental tables 1-18

View Supporting data values

Acknowledgments

This work was supported by the Dermatology Foundation (MTP), National Psoriasis Foundation (LCT, MTP, and JEG), and awards from the NIH (K01AR072129 to LCT; P30AR075043 to LCT, MTP, and JEG; UC2 AR081033 to LCT and JEG; and R01AR042742, R01AR050511, R01AR054966, R01AR063611, and R01AR065183 to JTE).

Address correspondence to: Lam C. Tsoi, University of Michigan Department of Dermatology, 7421 Medical Science Building I 1301 E. Catherine St., Ann Arbor, Michigan 48109-5609, USA. Phone: 734.647.8070; Email: alextsoi@med.umich.edu.

Footnotes

Conflict of interest: JEG has served as a consultant to AbbVie, Eli Lilly, Almirall, Celgene, BMS, Janssen, Prometheus, TimberPharma, Galderma, Novatis, MiRagen, and AnaptysBio and has received research support from AbbVie, SunPharma, Eli Lilly, Kyowa Kirin, Almirall, Celgene, BMS, Janssen, Prometheus, and TimberPharma. LCT has received support from Novartis, Galderma, and Janssen.

Copyright: © 2023, Patrick et al. This is an open access article published under the terms of the Creative Commons Attribution 4.0 International License.

Reference information: JCI Insight. 2023;8(24):e172956.https://doi.org/10.1172/jci.insight.172956.

References
  1. Djebali S, et al. Landscape of transcription in human cells. Nature. 2012;489(7414):101–108.
    View this article via: CrossRef PubMed Google Scholar
  2. Rands CM, et al. 8.2% of the human genome is constrained: variation in rates of turnover across functional element classes in the human lineage. PLoS Genet. 2014;10(7):e1004525.
    View this article via: CrossRef PubMed Google Scholar
  3. Graur D. An upper limit on the functional fraction of the human genome. Genome Biol Evol. 2017;9(7):1880–1885.
    View this article via: CrossRef PubMed Google Scholar
  4. Consortium EP. An integrated encyclopedia of DNA elements in the human genome. Nature. 2012;489(7414):57–74.
    View this article via: CrossRef PubMed Google Scholar
  5. Harrow J, et al. GENCODE: the reference human genome annotation for the ENCODE project. Genome Res. 2012;22(9):1760–1774.
    View this article via: CrossRef PubMed Google Scholar
  6. Frankish A, et al. GENCODE: reference annotation for the human and mouse genomes in 2023. Nucleic Acids Res. 2019;47(d1):D766–D949.
    View this article via: CrossRef PubMed Google Scholar
  7. Kapranov P, et al. RNA maps reveal new RNA classes and a possible function for pervasive transcription. Science. 2007;316(5830):1484–1488.
    View this article via: CrossRef PubMed Google Scholar
  8. Chen YG, et al. Gene regulation in the immune system by long noncoding RNAs. Nat Immunol. 2017;18(9):962–972.
    View this article via: CrossRef PubMed Google Scholar
  9. Tripathi V, et al. The nuclear-retained noncoding RNA MALAT1 regulates alternative splicing by modulating SR splicing factor phosphorylation. Mol Cell. 2010;39(6):925–938.
    View this article via: CrossRef PubMed Google Scholar
  10. Sun TT, et al. lncRNA GClnc1 promotes gastric carcinogenesis and may act as a modular scaffold of WDR5 and KAT2A complexes to specify the histone modification pattern. Cancer Discov. 2016;6(7):784–801.
    View this article via: CrossRef PubMed Google Scholar
  11. Andric V, et al. A scaffold lncRNA shapes the mitosis to meiosis switch. Nat Commun. 2021;12(1):770.
    View this article via: CrossRef PubMed Google Scholar
  12. Moore JBt, Uchida S. Functional characterization of long noncoding RNAs. Curr Opin Cardiol. 2020;35(3):199–206.
    View this article via: CrossRef PubMed Google Scholar
  13. Shefler A, et al. Skin-expressing lncRNAs in inflammatory responses. Front Genet. 2022;13(835740):835740.
    View this article via: CrossRef PubMed Google Scholar
  14. Xu S, et al. ANRIL lncRNA triggers efficient therapeutic efficacy by reprogramming the aberrant INK4-hub in melanoma. Cancer Lett. 2016;381(1):41–48.
    View this article via: CrossRef PubMed Google Scholar
  15. Wasson CW, et al. Long non-coding RNA HOTAIR drives EZH2-dependent myofibroblast activation in systemic sclerosis through miRNA 34a-dependent activation of NOTCH. Ann Rheum Dis. 2020;79(4):507–517.
    View this article via: CrossRef PubMed Google Scholar
  16. Qin G, et al. LincRNA TINCR facilitates excessive proliferation and inflammation in post-burn skin fibroblasts by directly binding with SND1 protein and inducing SND1-mediated TGF-β1 expression. Biochem Biophys Res Commun. 2019;509(4):903–910.
    View this article via: CrossRef PubMed Google Scholar
  17. Kretz M, et al. Control of somatic tissue differentiation by the long non-coding RNA TINCR. Nature. 2013;493(7431):231–235.
    View this article via: CrossRef PubMed Google Scholar
  18. Zhang Y, et al. MALAT1-KTN1-EGFR regulatory axis promotes the development of cutaneous squamous cell carcinoma. Cell Death Differ. 2019;26(10):2061–2073.
    View this article via: CrossRef PubMed Google Scholar
  19. Li D, et al. Human skin long noncoding RNA WAKMAR1 regulates wound healing by enhancing keratinocyte migration. Proc Natl Acad Sci U S A. 2019;116(19):9443–9452.
    View this article via: CrossRef PubMed Google Scholar
  20. Tsoi LC, et al. Analysis of long non-coding RNAs highlights tissue-specific expression patterns and epigenetic profiles in normal and psoriatic skin. Genome Biol. 2015;16(1):24.
    View this article via: CrossRef PubMed Google Scholar
  21. Cabili MN, et al. Integrative annotation of human large intergenic noncoding RNAs reveals global properties and specific subclasses. Genes Dev. 2011;25(18):1915–1927.
    View this article via: CrossRef PubMed Google Scholar
  22. Wang Y, et al. Systematic identification of non-coding pharmacogenomic landscape in cancer. Nat Commun. 2018;9(1):3192.
    View this article via: CrossRef PubMed Google Scholar
  23. Zhang XQ, et al. Spatial-temporal transcriptional dynamics of long non-coding RNAs in human brain. Hum Mol Genet. 2017;26(16):3202–3211.
    View this article via: CrossRef PubMed Google Scholar
  24. Washietl S, et al. Evolutionary dynamics and tissue specificity of human long noncoding RNAs in six mammals. Genome Res. 2014;24(4):616–628.
    View this article via: CrossRef PubMed Google Scholar
  25. Faghihi MA, et al. Expression of a noncoding RNA is elevated in Alzheimer’s disease and drives rapid feed-forward regulation of beta-secretase. Nat Med. 2008;14(7):723–730.
    View this article via: CrossRef PubMed Google Scholar
  26. Carrieri C, et al. Long non-coding antisense RNA controls Uchl1 translation through an embedded SINEB2 repeat. Nature. 2012;491(7424):454–457.
    View this article via: CrossRef PubMed Google Scholar
  27. Carlevaro-Fita J, et al. Ancient exapted transposable elements promote nuclear enrichment of human long noncoding RNAs. Genome Res. 2019;29(2):208–222.
    View this article via: CrossRef PubMed Google Scholar
  28. Sunwoo H, et al. Repeat E anchors Xist RNA to the inactive X chromosomal compartment through CDKN1A-interacting protein (CIZ1). Proc Natl Acad Sci U S A. 2017;114(40):10654–10659.
    View this article via: CrossRef PubMed Google Scholar
  29. Guo X, et al. Long non-coding RNAs function annotation: a global prediction method based on bi-colored networks. Nucleic Acids Res. 2013;41(2):e35.
    View this article via: CrossRef PubMed Google Scholar
  30. Kirk JM, et al. Functional classification of long non-coding RNAs by k-mer content. Nat Genet. 2018;50(10):1474–1482.
    View this article via: CrossRef PubMed Google Scholar
  31. Hu H, et al. HLPI-Ensemble: prediction of human lncRNA-protein interactions based on ensemble strategy. RNA Biol. 2018;15(6):797–806.
    View this article via: CrossRef PubMed Google Scholar
  32. Lu C, et al. Prediction of lncRNA-disease associations based on inductive matrix completion. Bioinformatics. 2018;34(19):3357–3364.
    View this article via: CrossRef PubMed Google Scholar
  33. Li B, et al. Transcriptome analysis of psoriasis in a large case-control sample: RNA-Seq provides insights into disease mechanisms. J Invest Dermatol. 2014;134(7):1828–1838.
    View this article via: CrossRef PubMed Google Scholar
  34. Gudjonsson JE, et al. Contribution of plasma cells and B cells to hidradenitis suppurativa pathogenesis. JCI Insight. 2020;5(19):139930.
    View this article via: JCI Insight CrossRef PubMed Google Scholar
  35. Farh KK, et al. Genetic and epigenetic fine mapping of causal autoimmune disease variants. Nature. 2015;518(7539):337–343.
    View this article via: CrossRef PubMed Google Scholar
  36. Nishimura D. BioCarta - A View From the Web. https://www.liebertpub.com/doi/abs/10.1089/152791601750294344 Accessed November 9, 2023.
  37. Croft D, et al. The Reactome pathway knowledgebase. Nucleic Acids Res. 2014;42(database issue):D472–D477.
    View this article via: CrossRef PubMed Google Scholar
  38. Schaefer CF, et al. PID: the pathway interaction database. Nucleic Acids Res. 2009;37(database issue):D674–D679.
    View this article via: CrossRef PubMed Google Scholar
  39. Gene Ontology Consortium. Gene ontology consortium: going forward. Nucleic Acids Res. 2015;43(database issue):D1049–D1056.
    View this article via: CrossRef PubMed Google Scholar
  40. Kanehisa M, et al. New approach for understanding genome variations in KEGG. Nucleic Acids Res. 2019;47(d1):D590–D595.
    View this article via: CrossRef PubMed Google Scholar
  41. Mathias C, et al. Unraveling immune-related lncRNAs in breast cancer molecular subtypes. Front Oncol. 2021;11:692170.
    View this article via: CrossRef PubMed Google Scholar
  42. Lin Y, et al. Identification of an immune-related nine-lncRNA signature predictive of overall survival in colon cancer. Front Genet. 2020;11:318.
    View this article via: CrossRef PubMed Google Scholar
  43. Li W, et al. The construction and comprehensive prognostic analysis of the lncRNA-associated competitive endogenous RNAs network in colorectal cancer. Front Genet. 2020;11:583.
    View this article via: CrossRef PubMed Google Scholar
  44. Chen G, et al. Comprehensive analysis of autophagy-associated lncRNAs reveal potential prognostic prediction in pancreatic cancer. Front Oncol. 2021;11:596573.
    View this article via: CrossRef PubMed Google Scholar
  45. Cui Y, et al. Identification of a nomogram from ferroptosis-related long noncoding RNAs signature to analyze overall survival in patients with bladder cancer. J Oncol. 2021;2021:8533464.
    View this article via: CrossRef PubMed Google Scholar
  46. Ma F, et al. Single cell and spatial sequencing define processes by which keratinocytes and fibroblasts amplify inflammatory responses in psoriasis. Nat Commun. 2023;14(1):3455.
    View this article via: CrossRef PubMed Google Scholar
  47. Ouyang W, et al. Inhibition of Th1 development mediated by GATA-3 through an IL-4-independent mechanism. Immunity. 1998;9(5):745–755.
    View this article via: CrossRef PubMed Google Scholar
  48. Kunz B, et al. Clinical validation and guidelines for the SCORAD index: consensus report of the European Task Force on Atopic Dermatitis. Dermatology. 1997;195(1):10–19.
    View this article via: CrossRef PubMed Google Scholar
  49. Tsoi LC, et al. Identification of 15 new psoriasis susceptibility loci highlights the role of innate immunity. Nat Genet. 2012;44(12):1341–1348.
    View this article via: CrossRef PubMed Google Scholar
  50. Paragh L, Törőcsik D. Factor XIII subunit A in the skin: applications in diagnosis and treatment. Biomed Res Int. 2017;2017:3571861.
    View this article via: CrossRef PubMed Google Scholar
  51. Cerio R, et al. Characterization of factor XIIIa positive dermal dendritic cells in normal and inflamed skin. Br J Dermatol. 1989;121(4):421–431.
    View this article via: CrossRef PubMed Google Scholar
  52. Ochoa MT, et al. “Dermal dendritic cells” comprise two distinct populations: CD1+ dendritic cells and CD209+ macrophages. J Invest Dermatol. 2008;128(9):2225–2231.
    View this article via: CrossRef PubMed Google Scholar
  53. Namgaladze D, et al. Polarization of human macrophages by interleukin-4 does not require ATP-citrate lyase. Front Immunol. 2018;9:2858.
    View this article via: CrossRef PubMed Google Scholar
  54. Moratal C, et al. IL-1β- and IL-4-polarized macrophages have opposite effects on adipogenesis of intramuscular fibro-adipogenic progenitors in humans. Sci Rep. 2018;8(1):17005.
    View this article via: CrossRef PubMed Google Scholar
  55. Kuo CC, et al. Detection of RNA-DNA binding sites in long noncoding RNAs. Nucleic Acids Res. 2019;47(6):e32.
    View this article via: CrossRef PubMed Google Scholar
  56. Volders PJ, et al. LNCipedia 5: towards a reference set of human long non-coding RNAs. Nucleic Acids Res. 2019;47(d1):D135–D139.
    View this article via: CrossRef PubMed Google Scholar
  57. Ning Q, et al. The evolution and expression pattern of human overlapping lncRNA and protein-coding gene pairs. Sci Rep. 2017;7:42775.
    View this article via: CrossRef PubMed Google Scholar
  58. Derrien T, et al. The GENCODE v7 catalog of human long noncoding RNAs: analysis of their gene structure, evolution, and expression. Genome Res. 2012;22(9):1775–1789.
    View this article via: CrossRef PubMed Google Scholar
  59. Wang F, et al. Coregulatory long non-coding RNA and protein-coding genes in serum starved cells. Biochim Biophys Acta Gene Regul Mech. 2019;1862(1):84–95.
    View this article via: CrossRef PubMed Google Scholar
  60. Du J, et al. Gene2vec: distributed representation of genes based on co-expression. BMC Genomics. 2019;20(suppl 1):82.
    View this article via: CrossRef PubMed Google Scholar
  61. Tsoi LC, et al. Progression of acute-to-chronic atopic dermatitis is associated with quantitative rather than qualitative changes in cytokine responses. J Allergy Clin Immunol. 2020;145(5):1406–1415.
    View this article via: CrossRef PubMed Google Scholar
  62. Barbieux C, et al. Netherton syndrome subtypes share IL-17/IL-36 signature with distinct IFN-α and allergic responses. J Allergy Clin Immunol. 2022;149(4):1358–1372.
    View this article via: CrossRef PubMed Google Scholar
  63. Ritchie ME, et al. Limma powers differential expression analyses for RNA-Sequencing and microarray studies. Nucleic Acids Res. 2015;43(7):e47.
    View this article via: CrossRef PubMed Google Scholar
  64. Ma F, et al. Single cell and spatial sequencing define processes by which keratinocytes and fibroblasts amplify inflammatory responses in psoriasis. Nat Commun. 2023;14(1):3455.
    View this article via: CrossRef PubMed Google Scholar
  65. Ma Y, Zhou X. Spatially informed cell-type deconvolution for spatial transcriptomics. Nat Biotechnol. 2022;40(9):1349–1359.
    View this article via: CrossRef PubMed Google Scholar
Version history
  • Version 1 (December 22, 2023): Electronic publication

Article tools

  • View PDF
  • Download citation information
  • Send a comment
  • Terms of use
  • Standard abbreviations
  • Need help? Email the journal

Metrics

  • Article usage
  • Citations to this article

Go to

  • Top
  • Abstract
  • Introduction
  • Results
  • Discussion
  • Methods
  • Author contributions
  • Supplemental material
  • Acknowledgments
  • Footnotes
  • References
  • Version history
Advertisement
Advertisement

Copyright © 2025 American Society for Clinical Investigation
ISSN 2379-3708

Sign up for email alerts