@inproceedings{lv-etal-2024-urg,
    title = "{URG}: A Unified Ranking and Generation Method for Ensembling Language Models",
    author = "Lv, Bo  and
      Tang, Chen  and
      Zhang, Yanan  and
      Liu, Xin  and
      Luo, Ping  and
      Yu, Yue",
    editor = "Ku, Lun-Wei  and
      Martins, Andre  and
      Srikumar, Vivek",
    booktitle = "Findings of the Association for Computational Linguistics: ACL 2024",
    month = aug,
    year = "2024",
    address = "Bangkok, Thailand",
    publisher = "Association for Computational Linguistics",
    url = "https://preview.aclanthology.org/ingest-emnlp/2024.findings-acl.261/",
    doi = "10.18653/v1/2024.findings-acl.261",
    pages = "4421--4434",
    abstract = "Prior research endeavors of the ensemble Large Language Models (LLMs) achieved great success by employing an individual language model (LM) rank before the text generation. However, the use of an individual LM ranker faces two primary challenges: (1) The time-intensive nature of the ranking process, stemming from the comparisons between models; (2) The issue of error propagation arising from the separate ranking and generation models within the framework. In order to overcome these challenges, we propose a novel ensemble framework, namely Unified Ranking and Generation (URG). URG represents an end-to-end framework that jointly ranks the outputs of LLMs and generates fine-grained fusion results, via utilizing a dedicated cross-attention-based module and noise mitigation training against irrelevant information stemming from bad ranking results. Through extensive experimentation and evaluation, we demonstrate the efficiency and effectiveness of our framework in both the ranking and generation tasks. With the close coordination of the ranking and generation modules, our end-to-end framework achieves the state-of-the-art (SOTA) performance on these tasks, and exhibits substantial enhancements to any of the ensembled models."
}Markdown (Informal)
[URG: A Unified Ranking and Generation Method for Ensembling Language Models](https://preview.aclanthology.org/ingest-emnlp/2024.findings-acl.261/) (Lv et al., Findings 2024)
ACL