Jona Neumeier
2025
AttributeForge: An Agentic LLM Framework for Automated Product Schema Modeling
Yunhan Huang
|
Klevis Ramo
|
Andrea Iovine
|
Melvin Monteiro
|
Sedat Gokalp
|
Arjun Bakshi
|
Hasan Turalic
|
Arsh Kumar
|
Jona Neumeier
|
Ripley Yates
|
Rejaul Monir
|
Simon Hartmann
|
Tushar Manglik
|
Mohamed Yakout
Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing: Industry Track
Effective product schema modeling is fundamental to e-commerce success, enabling accurate product discovery and superior customer experience. However, traditional manual schema modeling processes are severely bottlenecked, producing only tens of attributes per month, which is insufficient for modern e-commerce platforms managing thousands of product types. This paper introduces AttributeForge, the first framework to automate end-to-end product schema modeling using Large Language Models (LLMs). Our key innovation lies in orchestrating 43 specialized LLM agents through strategic workflow patterns to handle the complex interdependencies in schema generation. The framework incorporates two novel components: MC2-Eval, a comprehensive validation system that assesses schemas against technical, business, and customer experience requirements; and AutoFix, an intelligent mechanism that automatically corrects modeling defects through iterative refinement. Deployed in production, AttributeForge achieves an 88× increase in modeling throughput while delivering superior quality: a 59.83% Good-to-Good (G2G) conversion rate compared to 37.50% for manual approaches. This significant improvement in both speed and quality enables e-commerce platforms to rapidly adapt their product schemas to evolving market needs.
Search
Fix author
Co-authors
- Arjun Bakshi 1
- Sedat Gokalp 1
- Simon Hartmann 1
- Yunhan Huang 1
- Andrea Iovine 1
- show all...