Hanearl Jung
2024
KRX Bench: Automating Financial Benchmark Creation via Large Language Models
Guijin Son
|
Hyunjun Jeon
|
Chami Hwang
|
Hanearl Jung
Proceedings of the Joint Workshop of the 7th Financial Technology and Natural Language Processing, the 5th Knowledge Discovery from Unstructured Data in Financial Services, and the 4th Workshop on Economics and Natural Language Processing
In this work, we introduce KRX-Bench, an automated pipeline for creating financial benchmarks via GPT-4. To demonstrate the effectiveness of the pipeline, we create KRX-Bench-POC, a benchmark assessing the knowledge of LLMs in real-world companies. This dataset comprises 1,002 questions, each focusing on companies across the U.S., Japanese, and Korean stock markets. We make our pipeline and dataset publicly available and integrate the evaluation code into EleutherAI’s Language Model Evaluation Harness.
2023
Beyond Classification: Financial Reasoning in State-of-the-Art Language Models
Guijin Son
|
Hanearl Jung
|
Moonjeong Hahm
|
Keonju Na
|
Sol Jin
Proceedings of the Fifth Workshop on Financial Technology and Natural Language Processing and the Second Multimodal AI For Financial Forecasting
Search
Co-authors
- Guijin Son 2
- Moonjeong Hahm 1
- Keonju Na 1
- Sol Jin 1
- Hyunjun Jeon 1
- show all...