Sudachi: a Japanese Tokenizer for Business

Kazuma Takaoka, Sorami Hisamoto, Noriko Kawahara, Miho Sakamoto, Yoshitaka Uchida, Yuji Matsumoto


Anthology ID:
L18-1355
Volume:
Proceedings of the Eleventh International Conference on Language Resources and Evaluation (LREC 2018)
Month:
May
Year:
2018
Address:
Miyazaki, Japan
Editors:
Nicoletta Calzolari, Khalid Choukri, Christopher Cieri, Thierry Declerck, Sara Goggi, Koiti Hasida, Hitoshi Isahara, Bente Maegaard, Joseph Mariani, Hélène Mazo, Asuncion Moreno, Jan Odijk, Stelios Piperidis, Takenobu Tokunaga
Venue:
LREC
SIG:
Publisher:
European Language Resources Association (ELRA)
Note:
Pages:
Language:
URL:
https://aclanthology.org/L18-1355
DOI:
Bibkey:
Cite (ACL):
Kazuma Takaoka, Sorami Hisamoto, Noriko Kawahara, Miho Sakamoto, Yoshitaka Uchida, and Yuji Matsumoto. 2018. Sudachi: a Japanese Tokenizer for Business. In Proceedings of the Eleventh International Conference on Language Resources and Evaluation (LREC 2018), Miyazaki, Japan. European Language Resources Association (ELRA).
Cite (Informal):
Sudachi: a Japanese Tokenizer for Business (Takaoka et al., LREC 2018)
Copy Citation:
PDF:
https://preview.aclanthology.org/naacl-24-ws-corrections/L18-1355.pdf
Code
 WorksApplications/Sudachi