@inproceedings{korkmaz-del-rio-chanona-2024-integrating,
    title = "Integrating Table Representations into Large Language Models for Improved Scholarly Document Comprehension",
    author = "Korkmaz, Buse Sibel  and
      Del Rio Chanona, Antonio",
    editor = "Ghosal, Tirthankar  and
      Singh, Amanpreet  and
      Waard, Anita  and
      Mayr, Philipp  and
      Naik, Aakanksha  and
      Weller, Orion  and
      Lee, Yoonjoo  and
      Shen, Shannon  and
      Qin, Yanxia",
    booktitle = "Proceedings of the Fourth Workshop on Scholarly Document Processing (SDP 2024)",
    month = aug,
    year = "2024",
    address = "Bangkok, Thailand",
    publisher = "Association for Computational Linguistics",
    url = "https://preview.aclanthology.org/ingest-emnlp/2024.sdp-1.28/",
    pages = "293--306",
    abstract = "We address the challenge of interpreting and reasoning over scientific tables with Large Language Models (LLMs), a crucial aspect of scholarly documents. Despite significant progress in natural language processing, the integration of tabular data into scientific LLMs remains limited. We propose an innovative approach leveraging intermediate task pre-training on table question-answering datasets, followed by model adaptation to comprehend tables in computer science literature. Our findings reveal that incorporating table understanding substantially improves the performance of LLMs on scientific literature understanding tasks, which we showcase in peer-review score prediction. This improvement underscores the importance of utilizing tabular data in the training of scientific language models. The code and models are publicly available at [this link](https://github.com/buseskorkmaz/Integrating-Table-Representations-into-LLMs)."
}Markdown (Informal)
[Integrating Table Representations into Large Language Models for Improved Scholarly Document Comprehension](https://preview.aclanthology.org/ingest-emnlp/2024.sdp-1.28/) (Korkmaz & Del Rio Chanona, sdp 2024)
ACL