← Volver a resultados
Ficha bibliográfica · Consulta y acceso
Artículo de revista

Federated Primitive-Preserving Audio Transformers for Non-Identifiable Infant Cry Classification

Geofrey Owino et al · IEEE · 2026

Acceso abierto disponible
Lectura rápida. Revisá los datos básicos del recurso y luego accedé al contenido desde el botón principal. En esta ficha solo se muestra la información necesaria para identificar la obra, citarla y abrirla.
Publicación seriada

3PS-RAN: A Real-Time Framework for Securing the O-RAN RACH Against DDoS Attacks Toward NextG

Esta publicación seriada contiene 172 contenidos relacionados.

Acceso al recurso

Entrá al contenido desde la opción principal o elegí otra fuente disponible.

Acceso principal

Acceso abierto disponible

Recurso identificado como acceso abierto, sin confirmar automáticamente si es texto completo directo.
Abrir recurso

Resumen

Descripción general del contenido del recurso.

Infant cry paralinguistics provides a non-invasive signal for early assessment of physiological and affective states. Yet, most learning approaches implicitly assume that the mapping between acoustic patterns and semantic labels is globally identifiable. In practice, this assumption often fails. Cry realizations vary substantially across infants due to physiological, developmental, and contextual factors, thereby increasing the risk of identity-related information leakage in federated aggregation. In this work, we formalize federated infant cry classification as a non-identifiable learning problem in federated settings. We show that standard federated objectives can become poorly aligned under this setting, even when average accuracy appears competitive. To address this limitation, we propose a Primitive-Preserving Audio Transformer (PPAT) that decouples shared representation learning from client-specific decision boundaries. Federation is restricted to transferable paralinguistic primitives, while semantic ambiguity is resolved locally. Across realistic federated evaluations with subject-level partitioning, the proposed approach improves worst-infant Macro F1 by up to 7 percentage points over strong federated baselines, while achieving per-class recall above 81% across all cry categories. Under the evaluated setting, the method lowers infant identity probe accuracy by more than 18%, substantially reduces observed negative transfer during training, and yields improved calibration and lower selective risk across coverage levels. Class-wise analysis reveals structured and acoustically plausible confusion patterns rather than diffuse errors. Beyond infant cry analysis, this work highlights a broader principle for decentralized learning in human-centered domains. When task semantics are not globally identifiable, robustness, stability, and reduced leakage risk may require rethinking what is shared across clients rather than how aggregation is performed.

Cómo citar

Elegí el formato que necesitás y copiá la referencia al portapapeles.

APA 7

al, G. O. E. (2026). Federated Primitive-Preserving Audio Transformers for Non-Identifiable Infant Cry Classification. https://doi.org/10.1109/ACCESS.2026.3686735

MLA

al, Geofrey Owino et. "Federated Primitive-Preserving Audio Transformers for Non-Identifiable Infant Cry Classification." 2026. https://doi.org/10.1109/ACCESS.2026.3686735.

Chicago

al, Geofrey Owino et. 2026. "Federated Primitive-Preserving Audio Transformers for Non-Identifiable Infant Cry Classification.". https://doi.org/10.1109/ACCESS.2026.3686735.

Harvard

al, G. O. E. 2026, Federated Primitive-Preserving Audio Transformers for Non-Identifiable Infant Cry Classification, IEEE, available at: https://doi.org/10.1109/ACCESS.2026.3686735 [Accessed 28 Jun. 2026].

Compartir e imprimir

Guardá la ficha, copiá su enlace permanente o imprimila como PDF.

Exportar referencia

Si usás un gestor bibliográfico, podés exportar el registro en los formatos más comunes.

Detalles del recurso

Información bibliográfica útil para confirmar que se trata del material correcto.

Título
Federated Primitive-Preserving Audio Transformers for Non-Identifiable Infant Cry Classification
Autor / colaboradores
Geofrey Owino et al
Editorial
IEEE
Año de publicación
2026
ISSN
2169-3536
ISSN
2169-3536
Idioma
eng

Materias

Explorá otros recursos relacionados a partir de estas materias.

Copiado