← Volver a resultados
Ficha bibliográfica · Consulta y acceso
Artículo de revista

LongSemAnnotator: A Longformer Framework for Column Type Annotation

Tiago Santos et al · IEEE · 2026

Material complementario disponible
Lectura rápida. Revisá los datos básicos del recurso y luego accedé al contenido desde el botón principal. En esta ficha solo se muestra la información necesaria para identificar la obra, citarla y abrirla.
Publicación seriada

3PS-RAN: A Real-Time Framework for Securing the O-RAN RACH Against DDoS Attacks Toward NextG

Esta publicación seriada contiene 172 contenidos relacionados.

Acceso al recurso

Entrá al contenido desde la opción principal o elegí otra fuente disponible.

Acceso principal

Material complementario disponible

El enlace apunta a material asociado, anexos, tablas, datos o página complementaria. No se marca como libro/texto completo.
Abrir material

Resumen

Descripción general del contenido del recurso.

Column type annotation (CTA) is a critical task in semantic table interpretation (STI), where meaningful semantic labels are automatically assigned to table columns. This process enriches tabular data with essential semantic context, making it fundamental for efficient data integration, cataloging, governance, and the development of natural language database interfaces. Many existing methods struggle with large tables and integrating context from multiple tables. To address these challenges, we introduce a novel approach to improve the accuracy and scalability of CTA. Our framework employs sentence transformers to generate semantic embeddings for columns, enabling the identification of similar columns across different tables. We then utilize a Longformer-based model, which can handle long input sequences, to incorporate inter-table context into the annotation process. Evaluated on the SOTAB benchmark, our approach achieves competitive overall performance, reaching 82.86 micro-F1 when trained on the full training split and 79.13 micro-F1 with the small training split. We also investigate the model’s robustness on test sets containing missing values, format inconsistencies, and edge cases. Experimental results show that incorporating inter-table context and using the Longformer model significantly improves CTA accuracy compared to the best models in the benchmark. Detailed error analysis identifies semantic overlap between similar label types as the primary remaining challenge, with retrieval quality directly constraining annotation accuracy. However, challenges remain with columns exhibiting high semantic overlap and those requiring intra-table context for disambiguation.

Cómo citar

Elegí el formato que necesitás y copiá la referencia al portapapeles.

APA 7

al, T. S. E. (2026). LongSemAnnotator: A Longformer Framework for Column Type Annotation. https://doi.org/10.1109/ACCESS.2026.3686891

MLA

al, Tiago Santos et. "LongSemAnnotator: A Longformer Framework for Column Type Annotation." 2026. https://doi.org/10.1109/ACCESS.2026.3686891.

Chicago

al, Tiago Santos et. 2026. "LongSemAnnotator: A Longformer Framework for Column Type Annotation.". https://doi.org/10.1109/ACCESS.2026.3686891.

Harvard

al, T. S. E. 2026, LongSemAnnotator: A Longformer Framework for Column Type Annotation, IEEE, available at: https://doi.org/10.1109/ACCESS.2026.3686891 [Accessed 29 Jun. 2026].

Compartir e imprimir

Guardá la ficha, copiá su enlace permanente o imprimila como PDF.

Exportar referencia

Si usás un gestor bibliográfico, podés exportar el registro en los formatos más comunes.

Detalles del recurso

Información bibliográfica útil para confirmar que se trata del material correcto.

Título
LongSemAnnotator: A Longformer Framework for Column Type Annotation
Autor / colaboradores
Tiago Santos et al
Editorial
IEEE
Año de publicación
2026
ISSN
2169-3536
ISSN
2169-3536
Idioma
eng

Materias

Explorá otros recursos relacionados a partir de estas materias.

Copiado