Supplementary material: Interpretable machine learning identifies paediatric Systemic Lupus Erythematosus subtypes based on gene expression data

SND-ID: 2024-340. Version: 1. DOI: https://doi.org/10.57804/wnev-6d20

Citation

Creator/Principal investigator(s)

Sara A. Yones - Uppsala University, Department of Cell and Molecular Biology, Computational Biology and Bioinformatics. Science for Life Laboratory, SciLifeLab

Alva Annett - Uppsala University, Department of Cell and Molecular Biology. Science for Life Laboratory, SciLifeLab

Patricia Stoll - ETH Zurich, Department of Biosystems Science and Engineering

Klev Diamanti - Uppsala University, Department of Immunology, Genetics and Pathology. Science for Life Laboratory, SciLifeLab orcid

Linda Holmfeldt - Uppsala University, Department of Immunology, Genetics and Pathology, Experimental and Clinical Oncology. Science for Life Laboratory, SciLifeLab orcid

... Show more..

Sara A. Yones - Uppsala University, Department of Cell and Molecular Biology, Computational Biology and Bioinformatics. Science for Life Laboratory, SciLifeLab

Alva Annett - Uppsala University, Department of Cell and Molecular Biology. Science for Life Laboratory, SciLifeLab

Patricia Stoll - ETH Zurich, Department of Biosystems Science and Engineering

Klev Diamanti - Uppsala University, Department of Immunology, Genetics and Pathology. Science for Life Laboratory, SciLifeLab orcid

Linda Holmfeldt - Uppsala University, Department of Immunology, Genetics and Pathology, Experimental and Clinical Oncology. Science for Life Laboratory, SciLifeLab orcid

Fredrik Barrenäs - Uppsala University, Department of Cell and Molecular Biology, Computational Biology and Bioinformatics. Science for Life Laboratory, SciLifeLab

Jennifer Meadows - Uppsala University, Department of Medical Biochemistry and Microbiology. Science for Life Laboratory, SciLifeLab

Jan Komorowski - Uppsala University / Washington National Primate Research Center / The Institute of Computer Science, Polish Academy of Sciences, Department of Cell and Molecular Biology, Computational Biology and Bioinformatics. Science for Life Laboratory, SciLifeLab. Swedish Collegium for Advanced Study (SCAS) orcid

Show less..

Research principal

Uppsala University rorId

Description

Supplementary tables for manuscript "Interpretable machine learning identifies paediatric Systemic Lupus Erythematosus subtypes based on gene expression data".

Transcriptomic analyses are commonly used to identify differentially expressed genes between patients and controls, or within individuals across disease courses. These methods, whilst effective, cannot encompass the combinatorial effects of genes driving disease. We applied rule-based machine learning (RBML) models and rule networks (RN) to an existing paediatric Systemic Lupus Erythematosus (SLE) blood expression dataset, with the goal of developing gene networks to separate low and high disease activity (DA1 and DA3). The resultant model had an 81% accuracy to distinguish between DA1 and DA3, with unsupervised hierarchical clustering revealing additional subgroups indicative of the immune axis involved or state of disease flare. These subgroups correlated with clinical variables, suggesting that the gene sets identified may further the understanding of gene networks that act in concert to drive disease progression. This included role

... Show more..
Supplementary tables for manuscript "Interpretable machine learning identifies paediatric Systemic Lupus Erythematosus subtypes based on gene expression data".

Transcriptomic analyses are commonly used to identify differentially expressed genes between patients and controls, or within individuals across disease courses. These methods, whilst effective, cannot encompass the combinatorial effects of genes driving disease. We applied rule-based machine learning (RBML) models and rule networks (RN) to an existing paediatric Systemic Lupus Erythematosus (SLE) blood expression dataset, with the goal of developing gene networks to separate low and high disease activity (DA1 and DA3). The resultant model had an 81% accuracy to distinguish between DA1 and DA3, with unsupervised hierarchical clustering revealing additional subgroups indicative of the immune axis involved or state of disease flare. These subgroups correlated with clinical variables, suggesting that the gene sets identified may further the understanding of gene networks that act in concert to drive disease progression. This included roles for genes i) induced by interferons (IFI35 and OTOF), ii) key to SLE cell types (KLRB1 encoding CD161), or iii) with roles in autophagy and NF-κB pathway responses (CKAP4). As demonstrated here, RBML approaches have the potential to reveal novel gene patterns from within a heterogeneous disease, facilitating patient clinical and therapeutic stratification.

The dataset was originally published in DiVA and moved to SND in 2024. Show less..

Data contains personal data

No

Language

Method and outcome
Data collection
Geographic coverage
Administrative information

Identifiers

Topic and keywords

Research area

Bioinformatics (computational biology) (Standard för svensk indelning av forskningsämnen 2011)

Rheumatology and autoimmunity (Standard för svensk indelning av forskningsämnen 2011)

Publications
Published: 2024-06-24