Arup K. Nath
2024
Assessing Assamese Suffix Productivity: A Probabilistic Study in Resource-Limited Contexts
Pinky Moni Gayan
|
Arup K. Nath
Proceedings of the 21st International Conference on Natural Language Processing (ICON)
Numerous digitally advanced global languages have been studied under the light of morphological productivity; however, Assamese and other Indo-Aryan languages are still understudied in this field, though it is a widely discussed area of morphology. The purpose of this paper is to demonstrate the productivity of 15 suffixes replicated by a few measuring methods in a manually prepared sample. The obtained values are used in the later section to group the suffixes into different clusters based on their similar productivity rate in clustering in R. By determining the general productivity rate of the suffixes from the total productivity rates of all the methods, it demonstrates how clustering in R may be used as an empirical and visual tool for grouping similarly productive suffixes. The paper also reports about the paucity of language resources as well as tools in the language and how bridging this gap could have resulted in more precise, seamless results in a notably shorter amount of time.