Candy Angulo
2025
Universal Dependencies for Amahuaca
Candy Angulo | Pilar Valenzuela | Roberto Zariquiey
Proceedings of the Eight Workshop on the Use of Computational Methods in the Study of Endangered Languages
This paper presents the creation of a Universal Dependencies (UD) treebank for Amahuaca (Peru), the first UD treebank within the Headwaters subbranch of the Panoan family, which is spoken mostly in Peru and Brazil. While the UD guidelines provided a general framework for our annotations, language-specific decisions were necessary due to the rich morphology of the Amahuaca language. The paper also describes specific constructions to initiate a discussion on several general UD annotation guidelines, particularly those concerning clitics and morpheme-level dependencies.
2022
UniMorph 4.0: Universal Morphology
Khuyagbaatar Batsuren | Omer Goldman | Salam Khalifa | Nizar Habash | Witold Kieraś | Gábor Bella | Brian Leonard | Garrett Nicolai | Kyle Gorman | Yustinus Ghanggo Ate | Maria Ryskina | Sabrina Mielke | Elena Budianskaya | Charbel El-Khaissi | Tiago Pimentel | Michael Gasser | William Abbott Lane | Mohit Raj | Matt Coler | Jaime Rafael Montoya Samame | Delio Siticonatzi Camaiteri | Esaú Zumaeta Rojas | Didier López Francis | Arturo Oncevay | Juan López Bautista | Gema Celeste Silva Villegas | Lucas Torroba Hennigen | Adam Ek | David Guriel | Peter Dirix | Jean-Philippe Bernardy | Andrey Scherbakov | Aziyana Bayyr-ool | Antonios Anastasopoulos | Roberto Zariquiey | Karina Sheifer | Sofya Ganieva | Hilaria Cruz | Ritván Karahóǧa | Stella Markantonatou | George Pavlidis | Matvey Plugaryov | Elena Klyachko | Ali Salehi | Candy Angulo | Jatayu Baxi | Andrew Krizhanovsky | Natalia Krizhanovskaya | Elizabeth Salesky | Clara Vania | Sardana Ivanova | Jennifer White | Rowan Hall Maudslay | Josef Valvoda | Ran Zmigrod | Paula Czarnowska | Irene Nikkarinen | Aelita Salchak | Brijesh Bhatt | Christopher Straughn | Zoey Liu | Jonathan North Washington | Yuval Pinter | Duygu Ataman | Marcin Wolinski | Totok Suhardijanto | Anna Yablonskaya | Niklas Stoehr | Hossep Dolatian | Zahroh Nuriah | Shyam Ratan | Francis M. Tyers | Edoardo M. Ponti | Grant Aiton | Aryaman Arora | Richard J. Hatcher | Ritesh Kumar | Jeremiah Young | Daria Rodionova | Anastasia Yemelina | Taras Andrushko | Igor Marchenko | Polina Mashkovtseva | Alexandra Serova | Emily Prud’hommeaux | Maria Nepomniashchaya | Fausto Giunchiglia | Eleanor Chodroff | Mans Hulden | Miikka Silfverberg | Arya D. McCarthy | David Yarowsky | Ryan Cotterell | Reut Tsarfaty | Ekaterina Vylomova
Proceedings of the Thirteenth Language Resources and Evaluation Conference
The Universal Morphology (UniMorph) project is a collaborative effort providing broad-coverage instantiated normalized morphological inflection tables for hundreds of diverse world languages. The project comprises two major thrusts: a language-independent feature schema for rich morphological annotation, and a type-level resource of annotated data in diverse languages realizing that schema. This paper presents the expansions and improvements on several fronts that were made in the last couple of years (since McCarthy et al. (2020)). Collaborative efforts by numerous linguists have added 66 new languages, including 24 endangered languages. We have implemented several improvements to the extraction pipeline to tackle some issues, e.g., missing gender and macrons information. We have amended the schema to use a hierarchical structure that is needed for morphological phenomena like multiple-argument agreement and case stacking, while adding some missing morphological features to make the schema more inclusive. In light of the last UniMorph release, we also augmented the database with morpheme segmentation for 16 languages. Lastly, this new release makes a push towards inclusion of derivational morphology in UniMorph by enriching the data and annotation schema with instances representing derivational processes from MorphyNet.
2018
Toward Universal Dependencies for Shipibo-Konibo
Alonso Vasquez | Renzo Ego Aguirre | Candy Angulo | John Miller | Claudia Villanueva | Željko Agić | Roberto Zariquiey | Arturo Oncevay
Proceedings of the Second Workshop on Universal Dependencies (UDW 2018)
We present an initial version of the Universal Dependencies (UD) treebank for Shipibo-Konibo, the first South American, Amazonian, Panoan and Peruvian language with a resource built under UD. We describe the linguistic aspects of how the tagset was defined and the treebank was annotated; in addition we present our specific treatment of linguistic units called clitics. Although the treebank is still under development, it allowed us to perform a typological comparison against Spanish, the predominant language in Peru, and dependency syntax parsing experiments in both monolingual and cross-lingual approaches.
Co-authors
- Roberto Zariquiey 3
- Arturo Oncevay 2
- Željko Agić 1
- Grant Aiton 1
- Antonios Anastasopoulos 1
- Taras Andrushko 1
- Aryaman Arora 1
- Duygu Ataman 1
- Yustinus Ghanggo Ate 1
- Khuyagbaatar Batsuren 1
- Jatayu Baxi 1
- Aziyana Bayyr-ool 1
- Gábor Bella 1
- Jean-Philippe Bernardy 1
- Brijesh Bhatt 1
- Elena Budianskaya 1
- Delio Siticonatzi Camaiteri 1
- Eleanor Chodroff 1
- Matt Coler 1
- Ryan Cotterell 1
- Hilaria Cruz 1
- Paula Czarnowska 1
- Peter Dirix 1
- Hossep Dolatian 1
- Renzo Ego Aguirre 1
- Adam Ek 1
- Charbel El-Khaissi 1
- Sofya Ganieva 1
- Michael Gasser 1
- Fausto Giunchiglia 1
- Omer Goldman 1
- Kyle Gorman 1
- David Guriel 1
- Nizar Habash 1
- Richard J. Hatcher 1
- Mans Hulden 1
- Sardana Ivanova 1
- Ritván Karahóǧa 1
- Salam Khalifa 1
- Witold Kieraś 1
- Elena Klyachko 1
- Natalia Krizhanovskaya 1
- Andrew Krizhanovsky 1
- Ritesh Kumar 1
- William Abbott Lane 1
- Brian Leonard 1
- Zoey Liu 1
- Juan López Bautista 1
- Didier López Francis 1
- Igor Marchenko 1
- Stella Markantonatou 1
- Polina Mashkovtseva 1
- Rowan Hall Maudslay 1
- Arya D. McCarthy 1
- Sabrina J. Mielke 1
- John Miller 1
- Maria Nepomniashchaya 1
- Garrett Nicolai 1
- Irene Nikkarinen 1
- Zahroh Nuriah 1
- George Pavlidis 1
- Tiago Pimentel 1
- Yuval Pinter 1
- Matvey Plugaryov 1
- Edoardo M. Ponti 1
- Emily Prud’hommeaux 1
- Mohit Raj 1
- Shyam Ratan 1
- Daria Rodionova 1
- Esaú Zumaeta Rojas 1
- Maria Ryskina 1
- Aelita Salchak 1
- Ali Salehi 1
- Elizabeth Salesky 1
- Jaime Rafael Montoya Samame 1
- Andrey Scherbakov 1
- Alexandra Serova 1
- Karina Sheifer 1
- Miikka Silfverberg 1
- Niklas Stoehr 1
- Christopher Straughn 1
- Totok Suhardijanto 1
- Lucas Torroba Hennigen 1
- Reut Tsarfaty 1
- Francis Tyers 1
- Pilar Valenzuela 1
- Josef Valvoda 1
- Clara Vania 1
- Alonso Vasquez 1
- Claudia Villanueva 1
- Gema Celeste Silva Villegas 1
- Ekaterina Vylomova 1
- Jonathan Washington 1
- Jennifer White 1
- Marcin Woliński 1
- Anna Yablonskaya 1
- David Yarowsky 1
- Anastasia Yemelina 1
- Jeremiah Young 1
- Ran Zmigrod 1