Marcelo Finger
2024
Analysing and Validating Language Complexity Metrics Across South American Indigenous Languages
Felipe Serras | Miguel Carpi | Matheus Branco | Marcelo Finger
Proceedings of the Workshop on Cognitive Modeling and Computational Linguistics
Felipe Serras | Miguel Carpi | Matheus Branco | Marcelo Finger
Proceedings of the Workshop on Cognitive Modeling and Computational Linguistics
Language complexity is an emerging concept critical for NLP and for quantitative and cognitive approaches to linguistics. In this work, we evaluate the behavior of a set of compression-based language complexity metrics when applied to a large set of native South American languages. Our goal is to validate the desirable properties of such metrics against a more diverse set of languages, guaranteeing the universality of the techniques developed on the basis of this type of theoretical artifact. Our analysis confirmed with statistical confidence most propositions about the metrics studied, affirming their robustness, despite showing less stability than when the same metrics were applied to Indo-European languages. We also observed that the trade-off between morphological and syntactic complexities is strongly related to language phylogeny.
Exploring Computational Discernibility of Discourse Domains in Brazilian Portuguese within the Carolina Corpus
Felipe Ribas Serras | Mariana Sturzeneker | Miguel de Mello Carpi | Mayara Feliciano Palma | Maria Clara Ramos Morales Crespo | Aline Silva Costa | Vanessa Martins Do Monte | Cristiane Namiuti | Maria Clara Paixão de Souza | Marcelo Finger
Proceedings of the 16th International Conference on Computational Processing of Portuguese - Vol. 1
Felipe Ribas Serras | Mariana Sturzeneker | Miguel de Mello Carpi | Mayara Feliciano Palma | Maria Clara Ramos Morales Crespo | Aline Silva Costa | Vanessa Martins Do Monte | Cristiane Namiuti | Maria Clara Paixão de Souza | Marcelo Finger
Proceedings of the 16th International Conference on Computational Processing of Portuguese - Vol. 1
2021
Deep Learning against COVID-19: Respiratory Insufficiency Detection in Brazilian Portuguese Speech
Edresson Casanova | Lucas Gris | Augusto Camargo | Daniel da Silva | Murilo Gazzola | Ester Sabino | Anna Levin | Arnaldo Candido Jr | Sandra Aluisio | Marcelo Finger
Findings of the Association for Computational Linguistics: ACL-IJCNLP 2021
Edresson Casanova | Lucas Gris | Augusto Camargo | Daniel da Silva | Murilo Gazzola | Ester Sabino | Anna Levin | Arnaldo Candido Jr | Sandra Aluisio | Marcelo Finger
Findings of the Association for Computational Linguistics: ACL-IJCNLP 2021
Audio MFCC-gram Transformers for respiratory insufficiency detection in COVID-19
Marcelo Gauy | Marcelo Finger
Proceedings of the 13th Brazilian Symposium in Information and Human Language Technology
Marcelo Gauy | Marcelo Finger
Proceedings of the 13th Brazilian Symposium in Information and Human Language Technology
verBERT: Automating Brazilian Case Law Document Multi-label Categorization Using BERT
Felipe Serras | Marcelo Finger
Proceedings of the 13th Brazilian Symposium in Information and Human Language Technology
Felipe Serras | Marcelo Finger
Proceedings of the 13th Brazilian Symposium in Information and Human Language Technology
2019
A logical-based corpus for cross-lingual evaluation
Felipe Salvatore | Marcelo Finger | Roberto Hirata Jr
Proceedings of the 2nd Workshop on Deep Learning Approaches for Low-Resource NLP (DeepLo 2019)
Felipe Salvatore | Marcelo Finger | Roberto Hirata Jr
Proceedings of the 2nd Workshop on Deep Learning Approaches for Low-Resource NLP (DeepLo 2019)
At present, different deep learning models are presenting high accuracy on popular inference datasets such as SNLI, MNLI, and SciTail. However, there are different indicators that those datasets can be exploited by using some simple linguistic patterns. This fact poses difficulties to our understanding of the actual capacity of machine learning models to solve the complex task of textual inference. We propose a new set of syntactic tasks focused on contradiction detection that require specific capacities over linguistic logical forms such as: Boolean coordination, quantifiers, definite description, and counting operators. We evaluate two kinds of deep learning models that implicitly exploit language structure: recurrent models and the Transformer network BERT. We show that although BERT is clearly more efficient to generalize over most logical forms, there is space for improvement when dealing with counting operators. Since the syntactic tasks can be implemented in different languages, we show a successful case of cross-lingual transfer learning between English and Portuguese.
2013
Improving CoGrOO: the Brazilian Portuguese Grammar Checker
William D. Colen M. Silva | Marcelo Finger
Proceedings of the 9th Brazilian Symposium in Information and Human Language Technology
William D. Colen M. Silva | Marcelo Finger
Proceedings of the 9th Brazilian Symposium in Information and Human Language Technology
2011
Resolução da Heterogeneidade na Identificação de Pacientes (Resolution of Heterogeneity in the Identification of Patients) [in Portuguese]
Fábio Filocomo | Marcelo Finger | Diogo F. C. Patrão
Proceedings of the 8th Brazilian Symposium in Information and Human Language Technology
Fábio Filocomo | Marcelo Finger | Diogo F. C. Patrão
Proceedings of the 8th Brazilian Symposium in Information and Human Language Technology
2010
Search
Fix author
Co-authors
- Felipe Serras 2
- Sandra Aluísio 1
- Matheus Branco 1
- Augusto Camargo 1
- Arnaldo Candido, Jr. 1
- Miguel Carpi 1
- Edresson Casanova 1
- Aline Silva Costa 1
- Maria Clara Ramos Morales Crespo 1
- Fábio Filocomo 1
- Marcelo Gauy 1
- Murilo Gazzola 1
- Lucas Gris 1
- Roberto Hirata Jr 1
- Fabio Kepler 1
- Anna Levin 1
- Vanessa Martins Do Monte 1
- Cristiane Namiuti 1
- Mayara Feliciano Palma 1
- Diogo F. C. Patrão 1
- Ester Sabino 1
- Felipe Salvatore 1
- Felipe Ribas Serras 1
- William D. Colen M. Silva 1
- Mariana Sturzeneker 1
- Daniel da Silva 1
- Miguel de Mello Carpi 1
- Maria Clara Paixão de Souza 1