Large Human Language Models: A Need and the Challenges
Nikita Soni, H. Schwartz, João Sedoc, Niranjan Balasubramanian
Abstract
As research in human-centered NLP advances, there is a growing recognition of the importance of incorporating human and social factors into NLP models. At the same time, our NLP systems have become heavily reliant on LLMs, most of which do not model authors. To build NLP systems that can truly understand human language, we must better integrate human contexts into LLMs. This brings to the fore a range of design considerations and challenges in terms of what human aspects to capture, how to represent them, and what modeling strategies to pursue. To address these, we advocate for three positions toward creating large human language models (LHLMs) using concepts from psychological and behavioral sciences: First, LM training should include the human context. Second, LHLMs should recognize that people are more than their group(s). Third, LHLMs should be able to account for the dynamic and temporally-dependent nature of the human context. We refer to relevant advances and present open challenges that need to be addressed and their possible solutions in realizing these goals.- Anthology ID:
- 2024.naacl-long.477
- Volume:
- Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers)
- Month:
- June
- Year:
- 2024
- Address:
- Mexico City, Mexico
- Editors:
- Kevin Duh, Helena Gomez, Steven Bethard
- Venue:
- NAACL
- SIG:
- Publisher:
- Association for Computational Linguistics
- Note:
- Pages:
- 8631–8646
- Language:
- URL:
- https://aclanthology.org/2024.naacl-long.477
- DOI:
- 10.18653/v1/2024.naacl-long.477
- Cite (ACL):
- Nikita Soni, H. Schwartz, João Sedoc, and Niranjan Balasubramanian. 2024. Large Human Language Models: A Need and the Challenges. In Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers), pages 8631–8646, Mexico City, Mexico. Association for Computational Linguistics.
- Cite (Informal):
- Large Human Language Models: A Need and the Challenges (Soni et al., NAACL 2024)
- PDF:
- https://preview.aclanthology.org/ingest-bitext-workshop/2024.naacl-long.477.pdf