Tigran T. Tchrakian
2026
FactCorrector: A Graph-Inspired Approach to Long-Form Factuality Correction of Large Language Models
Javier Carnerero-Cano | Massimiliano Pronesti | Radu Marinescu | Tigran T. Tchrakian | James Barry | Jasmina Gajcin | Yufang Hou | Alessandra Pascale | Elizabeth M. Daly
Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
Javier Carnerero-Cano | Massimiliano Pronesti | Radu Marinescu | Tigran T. Tchrakian | James Barry | Jasmina Gajcin | Yufang Hou | Alessandra Pascale | Elizabeth M. Daly
Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
Large language models (LLMs) are widely used in knowledge-intensive applications but often generate factually incorrect responses. A promising approach to rectify these flaws is correcting LLMs using feedback. Therefore, in this paper, we introduce FactCorrector, a new post-hoc correction method that adapts across domains without retraining and leverages structured feedback about the factuality of the original response to generate a correction. To support rigorous evaluations of factuality correction methods, we also develop the VELI5 benchmark, a novel dataset containing systematically injected factual errors and ground-truth corrections. Experiments on VELI5 and several popular long-form factuality datasets show that the FactCorrector approach significantly improves factual precision while preserving relevance, outperforming strong baselines. We release our code at https://ibm.biz/factcorrector.
2025
FactReasoner: A Probabilistic Approach to Long-Form Factuality Assessment for Large Language Models
Radu Marinescu | Debarun Bhattacharjya | Junkyu Lee | Tigran T. Tchrakian | Javier Carnerero-Cano | Yufang Hou | Elizabeth M. Daly | Alessandra Pascale
Findings of the Association for Computational Linguistics: EMNLP 2025
Radu Marinescu | Debarun Bhattacharjya | Junkyu Lee | Tigran T. Tchrakian | Javier Carnerero-Cano | Yufang Hou | Elizabeth M. Daly | Alessandra Pascale
Findings of the Association for Computational Linguistics: EMNLP 2025
Large language models (LLMs) have achieved remarkable success in generative tasks, yet they often fall short in ensuring the factual accuracy of their outputs thus limiting their reliability in real-world applications where correctness is critical. In this paper, we present FactReasoner, a novel neuro-symbolic based factuality assessment framework that employs probabilistic reasoning to evaluate the truthfulness of long-form generated responses. FactReasoner decomposes a response into atomic units, retrieves relevant contextual information from external knowledge sources, and models the logical relationships (e.g., entailment, contradiction) between these units and their contexts using probabilistic encodings. It then estimates the posterior probability that each atomic unit is supported by the retrieved evidence. Our experiments on both labeled and unlabeled benchmark datasets demonstrate that FactReasoner often outperforms state-of-the-art prompt-based methods in terms of factual precision and recall.