Decoupling Task-Solving and Output Formatting in LLM Generation

Haikang Deng; Po-Nien Kung; Nanyun Peng

Decoupling Task-Solving and Output Formatting in LLM Generation

Abstract

Large language models (LLMs) are increasingly adept at solving complex problems, such as mathematical reasoning and automatic evaluation. However, performance often degrades when prompts intertwine task instructions with rigid formatting requirements. This entanglement creates competing goals for the model, hindering its reasoning capabilities. To address this, we introduce Deco-G, a decoding framework that explicitly decouples format adherence from problem solving. Deco-G delegates format adherence to a separate Format Estimation Module (FEM), which performs probabilistic lookahead to estimate future format compliance rate and reweighs token probabilities, allowing the LLM to focus solely on task resolution. To make this approach both practical and efficient, we introduce three key innovations: instruction-aware distillation, a flexible trie-building algorithm, and HMM state pruning. Experiments across mathematical reasoning, event argument extraction, and LLM-as-a-judge demonstrate that Deco-G constantly gains over prompting or structured generation baselines, with guaranteed format compliance.

Anthology ID:: 2026.acl-long.764
Volume:: Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
Month:: July
Year:: 2026
Address:: San Diego, California, United States
Editors:: Maria Liakata, Viviane P. Moreira, Jiajun Zhang, David Jurgens
Venue:: ACL
SIG:
Publisher:: Association for Computational Linguistics
Note:
Pages:: 16764–16781
Language:
URL:: https://preview.aclanthology.org/ingest-acl/2026.acl-long.764/
DOI:
Bibkey:
Cite (ACL):: Haikang Deng, Po-Nien Kung, and Nanyun Peng. 2026. Decoupling Task-Solving and Output Formatting in LLM Generation. In Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pages 16764–16781, San Diego, California, United States. Association for Computational Linguistics.
Cite (Informal):: Decoupling Task-Solving and Output Formatting in LLM Generation (Deng et al., ACL 2026)
Copy Citation:
PDF:: https://preview.aclanthology.org/ingest-acl/2026.acl-long.764.pdf
Checklist:: 2026.acl-long.764.checklist.pdf

PDF Cite Search Checklist Fix data