Bibliographic record · Lookup and access
Article

Machine learning-based proteogenomic data modeling identifies circulating plasma biomarkers for early detection of lung cancer

Marcela A. Johnson et al. · Nature Portfolio · 2026

Open access available

Access to the resource

Source: DOAJ (Directory of Open Access Journals)
The resource is identified as open access; whether the link leads directly to the full text has not been automatically confirmed.

Abstract

Background: Genetic aberrations are among the critical driving factors of lung cancer. Importantly, the impact of genetic variations on proteomic dysregulations with the goal of characterizing potential diagnostic biomarkers at the population-level requires additional investigation. Modeling such proteogenomic interactions is crucial in understanding early-stage biological disruptions to inform biomarker discovery, successful clinical trials, and developing effective therapeutics.

Methods: We investigated two complementary aspects of lung cancer risk. First, we performed a genome-wide association study of lung cancer using population-scale datasets, then examined whether lung cancer risk-associated variants influence plasma protein levels using the UK Biobank Pharma Proteomics Project data. Second, we identified plasma proteomic dysregulations in presymptomatic and symptomatic patients with the objective of pinpointing diagnostic biomarkers through leveraging machine learning methods.

Results: Using the identified proteins, machine learning models achieved median cross-validated AUCs of 0.85–0.88 (0–4 years before diagnosis [YBD]), 0.81–0.84 (5–9 YBD), and 0.80–0.86 (0–9 YBD). Performing survival analyses within the 5–9 YBD group, elevated levels of eight proteins, such as CALCB, PLAUR, and CD74, were found to significantly associate with lower survival. We identified 22 disease-associated proteins, of which 14 have been previously implicated in lung cancer, including CEACAM5, CXCL17, GDF15, WFDC2 along with 8 novel proteins. These proteins were enriched in pathways related to cytokine signaling, interleukin regulation, neutrophil degranulation, and lung fibrosis.

Conclusions: While these findings do not establish mechanistic causality, they highlight proteomic alterations reflecting systemic changes preceding the diagnosis. Our study contributes to understanding genome–proteome relationships in lung cancer and identifies circulating proteins warranting further investigation as potential early biomarkers for screening and risk stratification.

How to cite

APA 7

Johnson, M. A., et al. (2026). Machine learning-based proteogenomic data modeling identifies circulating plasma biomarkers for early detection of lung cancer. https://doi.org/10.1038/s43856-026-01500-1

MLA

Johnson, Marcela A., et al. "Machine learning-based proteogenomic data modeling identifies circulating plasma biomarkers for early detection of lung cancer." 2026. https://doi.org/10.1038/s43856-026-01500-1.

Chicago

Johnson, Marcela A., et al. 2026. "Machine learning-based proteogenomic data modeling identifies circulating plasma biomarkers for early detection of lung cancer." https://doi.org/10.1038/s43856-026-01500-1.

Harvard

Johnson, M. A. et al. 2026, Machine learning-based proteogenomic data modeling identifies circulating plasma biomarkers for early detection of lung cancer, Nature Portfolio, available at: https://doi.org/10.1038/s43856-026-01500-1 [Accessed 25 Jun. 2026].

Resource details

Title: Machine learning-based proteogenomic data modeling identifies circulating plasma biomarkers for early detection of lung cancer
Author / contributors: Marcela A. Johnson et al.
Publisher: Nature Portfolio
Year of publication: 2026
ISSN: 2730-664X
Language: English (eng)