FINKRX: Establishing Best Practices for Korean Financial NLP

Guijin Son, Hyunwoo Ko, Hanearl Jung, Chami Hwang


Abstract
In this work, we present the first open leaderboard for evaluating Korean large language models focused on finance. Operated for abouteight weeks, the leaderboard evaluated 1,119 submissions on a closed benchmark covering five MCQA categories: finance and accounting, stock price prediction, domestic company analysis, financial markets, and financial agent tasks and one open-ended qa task. Building on insights from these evaluations, we release an open instruction dataset of 80k instances and summarize widely used training strategies observed among top-performing models. Finally, we introduce FINKRX, a fully open and transparent LLM built using these best practices. We hope our contributions help advance the development of better and safer financial LLMs for Korean and other languages.
Anthology ID:
2025.acl-industry.81
Volume:
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 6: Industry Track)
Month:
July
Year:
2025
Address:
Vienna, Austria
Editors:
Georg Rehm, Yunyao Li
Venue:
ACL
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
1161–1174
Language:
URL:
https://preview.aclanthology.org/landing_page/2025.acl-industry.81/
DOI:
Bibkey:
Cite (ACL):
Guijin Son, Hyunwoo Ko, Hanearl Jung, and Chami Hwang. 2025. FINKRX: Establishing Best Practices for Korean Financial NLP. In Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 6: Industry Track), pages 1161–1174, Vienna, Austria. Association for Computational Linguistics.
Cite (Informal):
FINKRX: Establishing Best Practices for Korean Financial NLP (Son et al., ACL 2025)
Copy Citation:
PDF:
https://preview.aclanthology.org/landing_page/2025.acl-industry.81.pdf