Distributed Biomarker Discovery for Immuno-Oncology

This article has 0 evaluations Published on
Read the full article Related papers
This article on Sciety

Abstract

Background: Identifying robust predictive biomarkers for immuno-oncology (IO) therapy response remains challenging due to the complexity of tumor host interactions and the limited availability of public datasets. While achieving clinical relevance requires biomarkers that generalize across diverse datasets, privacy concerns surrounding sensitive clinical and molecular data further restrict data sharing, impeding progress in biomarker discovery. Materials and Methods: We developed a scalable, cloud-based distributed pipeline, implemented as an in-silico simulation of federated learning, to enable privacy-preserving integration of clinical and molecular data across institutions. This approach supports the development of genomic prediction algorithms without sharing sensitive patient data. Using curated gene expression profiles, we evaluated the association of tumor microenvironment and IO gene expression signatures with clinical outcomes across pan-cancer, cancer-specific, and treatment-specific settings, and developed a multivariable distributed model to predict IO response. Results: We implemented an integrated meta-analysis pipeline using harmonized data across 18 datasets comprising 967 patients and seven cancer types treated with PD-1/PD-L1, CTLA-4, or combination immunotherapies. This approach identified biomarkers specific to lung cancer and CTLA-4 therapy previously unreported in pan-cancer and pan-IO analyses, highlighting the value of context-specific associations. Similar gene expression signatures were associated with progression-free survival and response in both pan-cancer and melanoma datasets, with notable overlap between dual checkpoint blockade and PD-1/PD-L1 therapies. Distributed multivariable XGBoost models outperformed the average of locally trained models, indicating improved generalizability. Conclusion: We present a new distributed pipeline for biomarker discovery in immuno-oncology that preserves patient privacy through federated analysis and secure data handling. This study highlights the importance of larger, diverse datasets to refine cancer- and treatment-specific biomarkers, paving the way for more precise and personalized IO therapies.

Related articles

Related articles are currently not available for this article.