ICB-UMA at #SMM4H–HeaRD 2026: Hybrid Clinical Entity Projection for MultiClinAI: Adaptive Candidate Windows, XGBoost, and LLM Refinement
Alvaro Rey-Blanes, Sara Giménez-Gómez, Francisco J. Veredas, Francisco J. Moreno-Barea
Abstract
This paper presents our submission to the MultiClinAI Shared Task (Gallego-Donoso et al., 2026) on cross-lingual clinical entity annotation projection from Spanish to English. Our system transfers expert annotations for Diseases, Symptoms and Procedures entities. The approach integrates three core components: adaptive candidate window generation, an XGBoost classifier leveraging surface and semantic features, and an LLM-based post-processing stage to resolve complex misalignments. Our highest-performing run ranked 3rd on the official leaderboard, achieving strict F1 scores of 0.737, 0.549, and 0.538 for Diseases, Symptoms and Procedures, respectively. These results show that combining supervised candidate scoring with targeted LLM refinement provides a robust strategy for clinical entity projection.
- Anthology ID:
- 2026.smm4h-1.21
- Volume:
- Proceedings of the 11th Social Media Mining for Health Research and Applications (SMM4H-HeaRD 2026) Workshop and Shared Tasks
- Month:
- July
- Year:
- 2026
- Address:
- San Diego, United States
- Editors:
- Guillermo Lopez-Garcia, Graciela Gonzalez-Hernandez
- Venues:
- SMM4H | WS
- Publisher:
- Association for Computational Linguistics
- Pages:
- 127–132
- URL:
- https://preview.aclanthology.org/ingest-acl-workshops/2026.smm4h-1.21/
- Cite (ACL):
- Alvaro Rey-Blanes, Sara Giménez-Gómez, Francisco J. Veredas, and Francisco J. Moreno-Barea. 2026. ICB-UMA at #SMM4H–HeaRD 2026: Hybrid Clinical Entity Projection for MultiClinAI: Adaptive Candidate Windows, XGBoost, and LLM Refinement. In Proceedings of the 11th Social Media Mining for Health Research and Applications (SMM4H-HeaRD 2026) Workshop and Shared Tasks, pages 127–132, San Diego, United States. Association for Computational Linguistics.
- Cite (Informal):
- ICB-UMA at #SMM4H–HeaRD 2026: Hybrid Clinical Entity Projection for MultiClinAI: Adaptive Candidate Windows, XGBoost, and LLM Refinement (Rey-Blanes et al., SMM4H 2026)
- PDF:
- https://preview.aclanthology.org/ingest-acl-workshops/2026.smm4h-1.21.pdf