ULSAna: Universal Language Semantic Analyzer

Ondřej Pražák, Miloslav Konopik


Abstract
We present a live cross-lingual system capable of producing shallow semantic annotations of natural language sentences for 51 languages at this time. The domain of the input sentences is in principle unconstrained. The system uses single training data (in English) for all the languages. The resulting semantic annotations are therefore consistent across different languages. We use CoNLL Semantic Role Labeling training data and Universal dependencies as the basis for the system. The system is publicly available and supports processing data in batches; therefore, it can be easily used by the community for the following research tasks.
Anthology ID:
R19-1112
Volume:
Proceedings of the International Conference on Recent Advances in Natural Language Processing (RANLP 2019)
Month:
September
Year:
2019
Address:
Varna, Bulgaria
Venue:
RANLP
SIG:
Publisher:
INCOMA Ltd.
Note:
Pages:
967–972
Language:
URL:
https://aclanthology.org/R19-1112
DOI:
10.26615/978-954-452-056-4_112
Bibkey:
Cite (ACL):
Ondřej Pražák and Miloslav Konopik. 2019. ULSAna: Universal Language Semantic Analyzer. In Proceedings of the International Conference on Recent Advances in Natural Language Processing (RANLP 2019), pages 967–972, Varna, Bulgaria. INCOMA Ltd..
Cite (Informal):
ULSAna: Universal Language Semantic Analyzer (Pražák & Konopik, RANLP 2019)
Copy Citation:
PDF:
https://preview.aclanthology.org/ingestion-script-update/R19-1112.pdf