pAtChWoRK: Patching the Pieces of Public Procurement Documents
Lorena Calvo-Bartolomé, Saúl Blanco Fortes, Erick Cedeño, Jerónimo Arenas-García
Abstract
Public procurement data is legally open, yet practically locked inside thousands of unstructured PDFs and inconsistent portal metadata. pAtChWoRK starts with these fragmented, unstructured sources then leverages a hybrid pipeline (traditional NLP with LLM-based technologies) to restructure this information into a navigable knowledge base. Specifically, pAtChWoRK corrects manual classification errors, extracts complex unstructured fields such as award and solvency criteria and tenders’ objectives, and assists users in easily navigating the tender landscape. This unified process enables more effective handling of the transparency bottlenecks that hinder competition and oversight in public administration. A user study with practitioners across different procurement- Anthology ID:
- 2026.acl-demo.55
- Volume:
- Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (Volume 3: System Demonstrations)
- Month:
- July
- Year:
- 2026
- Address:
- San Diego, California, United States
- Editors:
- Greg Durrett, Ping Jian
- Venue:
- ACL
- SIG:
- Publisher:
- Association for Computational Linguistics
- Note:
- Pages:
- 556–565
- Language:
- URL:
- https://preview.aclanthology.org/ingest-acl/2026.acl-demo.55/
- DOI:
- Cite (ACL):
- Lorena Calvo-Bartolomé, Saúl Blanco Fortes, Erick Cedeño, and Jerónimo Arenas-García. 2026. pAtChWoRK: Patching the Pieces of Public Procurement Documents. In Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (Volume 3: System Demonstrations), pages 556–565, San Diego, California, United States. Association for Computational Linguistics.
- Cite (Informal):
- pAtChWoRK: Patching the Pieces of Public Procurement Documents (Calvo-Bartolomé et al., ACL 2026)
- PDF:
- https://preview.aclanthology.org/ingest-acl/2026.acl-demo.55.pdf