← Volver a resultados
Ficha bibliográfica · Consulta y acceso
Artículo

Identification of tuberculosis transmission hotspots in urban China using surveillance data: a machine learning approach based on genomic and spatial analysis

Yixiao Lu et al · BMC · 2026

Material complementario disponible
Lectura rápida. Revisá los datos básicos del recurso y luego accedé al contenido desde el botón principal. En esta ficha solo se muestra la información necesaria para identificar la obra, citarla y abrirla.

Acceso al recurso

Entrá al contenido desde la opción principal o elegí otra fuente disponible.

Acceso principal

Material complementario disponible

El enlace apunta a material asociado, anexos, tablas, datos o página complementaria. No se marca como libro/texto completo.
Abrir material

Resumen

Descripción general del contenido del recurso.

Abstract Background Tuberculosis (TB) remains a critical global public health issue, particularly in rapidly urbanizing regions where population density and migration facilitate transmission. While whole-genome sequencing (WGS) has revolutionized our understanding of TB transmission dynamics, its high cost and technical complexity limit widespread application in resource-limited settings. To address these limitations, we developed a machine learning (ML) framework that integrates routinely collected surveillance data-demographic, clinical, and spatial variables-to predict recent TB transmission hotspots without relying solely on WGS. Methods We trained six ML models using sequenced TB cases (n = 1,442) from Songjiang District, Shanghai, to classify cases into recent transmission clusters (≤ 12 SNPs) versus non-clustered cases. Individual-level data (e.g., age, sex, treatment history) and contextual variables (e.g., population density, land use) were incorporated. Model performance was evaluated using 10-fold cross-validation and an independent test set. Spatial analysis, including Getis-Ord Gi* statistics, was employed to identify and compare notification rate hotspots with predicted transmission hotspots. Results Among the six ML models tested, CATBoost achieved the highest predictive performance (AUC of 0.83 in cross-validation) and maintained robustness on the independent test set. Spatial analysis revealed significant disparities: only 12% of high-notification areas overlapped with recent transmission hotspots, highlighting the limitations of traditional surveillance strategies. Key predictors of recent transmission included population density, industrial land use, and migrant proportion. Notably, our approach identified three recent transmission hotspots that would have been missed if relying solely on sequenced cases. Conclusions Our framework provides a less resource-intensive alternative to WGS-dependent approaches for identifying TB transmission hotspots, validated in Songjiang District and potentially adaptable to other urban settings. By leveraging routinely collected surveillance data, this model enables targeted screening and optimized resource allocation. Its flexible design allows adaptation to other urban settings and retraining as new data becomes available, supporting its potential application in resource-limited contexts.

Cómo citar

Elegí el formato que necesitás y copiá la referencia al portapapeles.

APA 7

al, Y. L. E. (2026). Identification of tuberculosis transmission hotspots in urban China using surveillance data: a machine learning approach based on genomic and spatial analysis. https://doi.org/10.1186/s12879-026-13018-x

MLA

al, Yixiao Lu et. "Identification of tuberculosis transmission hotspots in urban China using surveillance data: a machine learning approach based on genomic and spatial analysis." 2026. https://doi.org/10.1186/s12879-026-13018-x.

Chicago

al, Yixiao Lu et. 2026. "Identification of tuberculosis transmission hotspots in urban China using surveillance data: a machine learning approach based on genomic and spatial analysis.". https://doi.org/10.1186/s12879-026-13018-x.

Harvard

al, Y. L. E. 2026, Identification of tuberculosis transmission hotspots in urban China using surveillance data: a machine learning approach based on genomic and spatial analysis, BMC, available at: https://doi.org/10.1186/s12879-026-13018-x [Accessed 30 Jun. 2026].

Compartir e imprimir

Guardá la ficha, copiá su enlace permanente o imprimila como PDF.

Exportar referencia

Si usás un gestor bibliográfico, podés exportar el registro en los formatos más comunes.

Detalles del recurso

Información bibliográfica útil para confirmar que se trata del material correcto.

Título
Identification of tuberculosis transmission hotspots in urban China using surveillance data: a machine learning approach based on genomic and spatial analysis
Autor / colaboradores
Yixiao Lu et al
Editorial
BMC
Año de publicación
2026
ISSN
1471-2334
ISSN
1471-2334
Idioma
eng

Materias

Explorá otros recursos relacionados a partir de estas materias.

Copiado