pAtChWoRK: Patching the Pieces of Public Procurement Documents

Lorena Calvo-Bartolomé, Saúl Blanco Fortes, Erick Cedeño, Jerónimo Arenas-García


Abstract
Public procurement data is legally open, yet practically locked inside thousands of unstructured PDFs and inconsistent portal metadata. pAtChWoRK starts with these fragmented, unstructured sources then leverages a hybrid pipeline (traditional NLP with LLM-based technologies) to restructure this information into a navigable knowledge base. Specifically, pAtChWoRK corrects manual classification errors, extracts complex unstructured fields such as award and solvency criteria and tenders’ objectives, and assists users in easily navigating the tender landscape. This unified process enables more effective handling of the transparency bottlenecks that hinder competition and oversight in public administration. A user study with practitioners across different procurement
Anthology ID:
2026.acl-demo.55
Volume:
Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (Volume 3: System Demonstrations)
Month:
July
Year:
2026
Address:
San Diego, California, United States
Editors:
Greg Durrett, Ping Jian
Venue:
ACL
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
556–565
Language:
URL:
https://preview.aclanthology.org/ingest-acl/2026.acl-demo.55/
DOI:
Bibkey:
Cite (ACL):
Lorena Calvo-Bartolomé, Saúl Blanco Fortes, Erick Cedeño, and Jerónimo Arenas-García. 2026. pAtChWoRK: Patching the Pieces of Public Procurement Documents. In Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (Volume 3: System Demonstrations), pages 556–565, San Diego, California, United States. Association for Computational Linguistics.
Cite (Informal):
pAtChWoRK: Patching the Pieces of Public Procurement Documents (Calvo-Bartolomé et al., ACL 2026)
Copy Citation:
PDF:
https://preview.aclanthology.org/ingest-acl/2026.acl-demo.55.pdf