Gabor Proszeky

Also published as: Gábor Prószéky, Gabor Prbszeky


2025

pdf bib
OpenHuEval: Evaluating Large Language Model on Hungarian Specifics
Haote Yang | Xingjian Wei | Jiang Wu | Noémi Ligeti-Nagy | Jiaxing Sun | Yinfan Wang | Győző Zijian Yang | Junyuan Gao | Jingchao Wang | Bowen Jiang | Shasha Wang | Nanjun Yu | Zihao Zhang | Shixin Hong | Hongwei Liu | Wei Li | Songyang Zhang | Dahua Lin | Lijun Wu | Gábor Prószéky | Conghui He
Findings of the Association for Computational Linguistics: ACL 2025

We introduce OpenHuEval, the first benchmark for LLMs focusing on the Hungarian language and specifics. OpenHuEval is constructed from a vast collection of Hungarian-specific materials sourced from multiple origins. In the construction, we incorporated the latest design principles for evaluating LLMs, such as using real user queries from the internet, emphasizing the assessment of LLMs’ generative capabilities, and employing LLM-as-judge to enhance the multidimensionality and accuracy of evaluations. Ultimately, OpenHuEval encompasses eight Hungarian-specific dimensions, featuring five tasks and 3953 questions. Consequently, OpenHuEval provides the comprehensive, in-depth, and scientifically accurate assessment of LLM performance in the context of the Hungarian language and its specifics. We evaluated current mainstream LLMs, including both traditional LLMs and recently developed Large Reasoning Models. The results demonstrate the significant necessity for evaluation and model optimization tailored to the Hungarian language and specifics. We also established the framework for analyzing the thinking processes of LRMs with OpenHuEval, revealing intrinsic patterns and mechanisms of these models in non-English languages, with Hungarian serving as a representative example. We will release OpenHuEval at https://github.com/opendatalab/OpenHuEval .

2014

pdf bib
Almost fifty years after the (first?) ALPAC report
Gábor Prószéky
Proceedings of Translating and the Computer 36

2011

pdf bib
Endangered Uralic Languages and Language Technologies
Gábor Prószéky
Proceedings of the Workshop on Language Technologies for Digital Humanities and Cultural Heritage

2008

pdf bib
The MetaMorpho Translation System
Attila Novák | László Tihanyi | Gábor Prószéky
Proceedings of the Third Workshop on Statistical Machine Translation

2005

pdf bib
An approach to machine translation via the rule-to-rule hypothesis
Gábor Prószéky
Proceedings of the 10th EAMT Conference: Practical applications of machine translation

2004

pdf bib
Moose: a robust high-performance parser and generator
Gábor Prószéky | László Tihanyi | Gábor Ugray
Proceedings of the 9th EAMT Workshop: Broadening horizons of machine translation and its applications

2003

pdf bib
Annotated Hungarian National Corpus
Zoltán Alexin | János Csirik | Tibor Gyimóthy | Károly Bibok | Csaba Hatvani | Gábor Prószéky | László Tihanyi
10th Conference of the European Chapter of the Association for Computational Linguistics

2002

pdf bib
MetaMorpho: A Pattern-Based Machine Translation System
Gábor Prószéky
Proceedings of Translating and the Computer 24

pdf bib
Recognition Assistance - Treating Errors in Texts Acquired from Various Recognition Processes
Gábor Prószéky | Mátyás Naszódi | Balázs Kis
COLING 2002: The 17th International Conference on Computational Linguistics: Project Notes

pdf bib
Context-Sensitive Electronic Dictionaries
Gábor Prószéky | Balázs Kis
COLING 2002: The 17th International Conference on Computational Linguistics: Project Notes

pdf bib
Automatism and User Interaction: Building a Hungarian WordNet
Gábor Prószéky | Márton Miháltz
Proceedings of the Third International Conference on Language Resources and Evaluation (LREC’02)

1999

pdf bib
Experience from translation of EU documents
Gábor Prószéky
EAMT Workshop: EU and the new languages

pdf bib
A Unification-based Approach to Morpho-syntactic Parsing of Agglutinative and Other (Highly) Inflectional Languages
Gabor Proszeky | Balazs Kis
Proceedings of the 37th Annual Meeting of the Association for Computational Linguistics

1998

pdf bib
An Intelligent Multi-Dictionary Environment
Gabor Proszeky
COLING 1998 Volume 2: The 17th International Conference on Computational Linguistics

pdf bib
An Intelligent Multi-Dictionary Environment
Gabor Prbszeky
36th Annual Meeting of the Association for Computational Linguistics and 17th International Conference on Computational Linguistics, Volume 2

1997

pdf bib
Reading more into Foreign Languages
John Nerbonne | Lauri Karttunen | Elena Paskaleva | Gabor Proszeky | Tiit Roosmaa
Fifth Conference on Applied Natural Language Processing

1996

pdf bib
Morphological Analyzer as Syntactic Parser
Gábor Prószéky
COLING 1996 Volume 2: The 16th International Conference on Computational Linguistics

1994

pdf bib
Industrial Applications of Unification Morphology
Gabor Proszeky
Fourth Conference on Applied Natural Language Processing

pdf bib
Humor-Based Applications
Gabor Proszeky | Miklos Pal | Laszlo Tihanyi
COLING 1994 Volume 2: The 15th International Conference on Computational Linguistics

1993

pdf bib
Helyette: Inflectional Thesaurus for Agglutinative Languages
Gabor Proszeky | Laszlo Tihanyi
Sixth Conference of the European Chapter of the Association for Computational Linguistics

1986

pdf bib
Processing Clinical Narratives in Hungarian
Gabor Proszeky
Coling 1986 Volume 1: The 11th International Conference on Computational Linguistics