International Conference on Language Resources and Evaluation (2018)
- Venue:
- LREC
up
Proceedings of the Eleventh International Conference on Language Resources and Evaluation (LREC 2018)
Proceedings of the Eleventh International Conference on Language Resources and Evaluation (LREC 2018)
Nicoletta Calzolari | Khalid Choukri | Christopher Cieri | Thierry Declerck | Sara Goggi | Koiti Hasida | Hitoshi Isahara | Bente Maegaard | Joseph Mariani | Hélène Mazo | Asuncion Moreno | Jan Odijk | Stelios Piperidis | Takenobu Tokunaga
Nicoletta Calzolari | Khalid Choukri | Christopher Cieri | Thierry Declerck | Sara Goggi | Koiti Hasida | Hitoshi Isahara | Bente Maegaard | Joseph Mariani | Hélène Mazo | Asuncion Moreno | Jan Odijk | Stelios Piperidis | Takenobu Tokunaga
Augmenting Librispeech with French Translations: A Multimodal Corpus for Direct Speech Translation Evaluation
Ali Can Kocabiyikoglu | Laurent Besacier | Olivier Kraif
Ali Can Kocabiyikoglu | Laurent Besacier | Olivier Kraif
Evaluating Domain Adaptation for Machine Translation Across Scenarios
Thierry Etchegoyhen | Anna Fernández Torné | Andoni Azpeitia | Eva Martínez Garcia | Anna Matamala
Thierry Etchegoyhen | Anna Fernández Torné | Andoni Azpeitia | Eva Martínez Garcia | Anna Matamala
Upping the Ante: Towards a Better Benchmark for Chinese-to-English Machine Translation
Christian Hadiwinoto | Hwee Tou Ng
Christian Hadiwinoto | Hwee Tou Ng
ESCAPE: a Large-scale Synthetic Corpus for Automatic Post-Editing
Matteo Negri | Marco Turchi | Rajen Chatterjee | Nicola Bertoldi
Matteo Negri | Marco Turchi | Rajen Chatterjee | Nicola Bertoldi
Evaluating Machine Translation Performance on Chinese Idioms with a Blacklist Method
Yutong Shao | Rico Sennrich | Bonnie Webber | Federico Fancellu
Yutong Shao | Rico Sennrich | Bonnie Webber | Federico Fancellu
Cross-Lingual Generation and Evaluation of a Wide-Coverage Lexical Semantic Resource
Attila Novák | Borbála Novák
Attila Novák | Borbála Novák
Advances in Pre-Training Distributed Word Representations
Tomas Mikolov | Edouard Grave | Piotr Bojanowski | Christian Puhrsch | Armand Joulin
Tomas Mikolov | Edouard Grave | Piotr Bojanowski | Christian Puhrsch | Armand Joulin
Integrating Generative Lexicon Event Structures into VerbNet
Susan Windisch Brown | James Pustejovsky | Annie Zaenen | Martha Palmer
Susan Windisch Brown | James Pustejovsky | Annie Zaenen | Martha Palmer
Multi-layer Annotation of the Rigveda
Oliver Hellwig | Heinrich Hettrich | Ashutosh Modi | Manfred Pinkal
Oliver Hellwig | Heinrich Hettrich | Ashutosh Modi | Manfred Pinkal
The Natural Stories Corpus
Richard Futrell | Edward Gibson | Harry J. Tily | Idan Blank | Anastasia Vishnevetsky | Steven Piantadosi | Evelina Fedorenko
Richard Futrell | Edward Gibson | Harry J. Tily | Idan Blank | Anastasia Vishnevetsky | Steven Piantadosi | Evelina Fedorenko
Semi-automatic Korean FrameNet Annotation over KAIST Treebank
Younggyun Hahm | Jiseong Kim | Sunggoo Kwon | Key-Sun Choi
Younggyun Hahm | Jiseong Kim | Sunggoo Kwon | Key-Sun Choi
Handling Normalization Issues for Part-of-Speech Tagging of Online Conversational Text
Géraldine Damnati | Jeremy Auguste | Alexis Nasr | Delphine Charlet | Johannes Heinecke | Frédéric Béchet
Géraldine Damnati | Jeremy Auguste | Alexis Nasr | Delphine Charlet | Johannes Heinecke | Frédéric Béchet
Multi-Dialect Arabic POS Tagging: A CRF Approach
Kareem Darwish | Hamdy Mubarak | Ahmed Abdelali | Mohamed Eldesouki | Younes Samih | Randah Alharbi | Mohammed Attia | Walid Magdy | Laura Kallmeyer
Kareem Darwish | Hamdy Mubarak | Ahmed Abdelali | Mohamed Eldesouki | Younes Samih | Randah Alharbi | Mohammed Attia | Walid Magdy | Laura Kallmeyer
A Corpus for Modeling Word Importance in Spoken Dialogue Transcripts
Sushant Kafle | Matt Huenerfauth
Sushant Kafle | Matt Huenerfauth
Dialogue Structure Annotation for Multi-Floor Interaction
David Traum | Cassidy Henry | Stephanie Lukin | Ron Artstein | Felix Gervits | Kimberly Pollard | Claire Bonial | Su Lei | Clare Voss | Matthew Marge | Cory Hayes | Susan Hill
David Traum | Cassidy Henry | Stephanie Lukin | Ron Artstein | Felix Gervits | Kimberly Pollard | Claire Bonial | Su Lei | Clare Voss | Matthew Marge | Cory Hayes | Susan Hill
Effects of Gender Stereotypes on Trust and Likability in Spoken Human-Robot Interaction
Matthias Kraus | Johannes Kraus | Martin Baumann | Wolfgang Minker
Matthias Kraus | Johannes Kraus | Martin Baumann | Wolfgang Minker
A Multimodal Corpus for Mutual Gaze and Joint Attention in Multiparty Situated Interaction
Dimosthenis Kontogiorgos | Vanya Avramova | Simon Alexanderson | Patrik Jonell | Catharine Oertel | Jonas Beskow | Gabriel Skantze | Joakim Gustafson
Dimosthenis Kontogiorgos | Vanya Avramova | Simon Alexanderson | Patrik Jonell | Catharine Oertel | Jonas Beskow | Gabriel Skantze | Joakim Gustafson
Improving Dialogue Act Classification for Spontaneous Arabic Speech and Instant Messages at Utterance Level
AbdelRahim Elmadany | Sherif Abdou | Mervat Gheith
AbdelRahim Elmadany | Sherif Abdou | Mervat Gheith
Data Management Plan (DMP) for Language Data under the New General Da-ta Protection Regulation (GDPR)
Pawel Kamocki | Valérie Mapelli | Khalid Choukri
Pawel Kamocki | Valérie Mapelli | Khalid Choukri
We Are Depleting Our Research Subject as We Are Investigating It: In Language Technology, more Replication and Diversity Are Needed
António Branco
António Branco
Lessons Learned: On the Challenges of Migrating a Research Data Repository from a Research Institution to a University Library.
Thorsten Trippel | Claus Zinn
Thorsten Trippel | Claus Zinn
Introducing NIEUW: Novel Incentives and Workflows for Eliciting Linguistic Data
Christopher Cieri | James Fiumara | Mark Liberman | Chris Callison-Burch | Jonathan Wright
Christopher Cieri | James Fiumara | Mark Liberman | Chris Callison-Burch | Jonathan Wright
Three Dimensions of Reproducibility in Natural Language Processing
K. Bretonnel Cohen | Jingbo Xia | Pierre Zweigenbaum | Tiffany Callahan | Orin Hargraves | Foster Goss | Nancy Ide | Aurélie Névéol | Cyril Grouin | Lawrence E. Hunter
K. Bretonnel Cohen | Jingbo Xia | Pierre Zweigenbaum | Tiffany Callahan | Orin Hargraves | Foster Goss | Nancy Ide | Aurélie Névéol | Cyril Grouin | Lawrence E. Hunter
Representation Mapping: A Novel Approach to Generate High-Quality Multi-Lingual Emotion Lexicons
Sven Buechel | Udo Hahn
Sven Buechel | Udo Hahn
Unfolding the External Behavior and Inner Affective State of Teammates through Ensemble Learning: Experimental Evidence from a Dyadic Team Corpus
Aggeliki Vlachostergiou | Mark Dennison | Catherine Neubauer | Stefan Scherer | Peter Khooshabeh | Andre Harrison
Aggeliki Vlachostergiou | Mark Dennison | Catherine Neubauer | Stefan Scherer | Peter Khooshabeh | Andre Harrison
Understanding Emotions: A Dataset of Tweets to Study Interactions between Affect Categories
Saif Mohammad | Svetlana Kiritchenko
Saif Mohammad | Svetlana Kiritchenko
When ACE met KBP: End-to-End Evaluation of Knowledge Base Population with Component-level Annotation
Bonan Min | Marjorie Freedman | Roger Bock | Ralph Weischedel
Bonan Min | Marjorie Freedman | Roger Bock | Ralph Weischedel
Simple Large-scale Relation Extraction from Unstructured Text
Christos Christodoulopoulos | Arpit Mittal
Christos Christodoulopoulos | Arpit Mittal
Comparing Pretrained Multilingual Word Embeddings on an Ontology Alignment Task
Dagmar Gromann | Thierry Declerck
Dagmar Gromann | Thierry Declerck
A Large Resource of Patterns for Verbal Paraphrases
Octavian Popescu | Ngoc Phuoc An Vo | Vadim Sheinin
Octavian Popescu | Ngoc Phuoc An Vo | Vadim Sheinin
A Recorded Debating Dataset
Shachar Mirkin | Michal Jacovi | Tamar Lavee | Hong-Kwang Kuo | Samuel Thomas | Leslie Sager | Lili Kotlerman | Elad Venezian | Noam Slonim
Shachar Mirkin | Michal Jacovi | Tamar Lavee | Hong-Kwang Kuo | Samuel Thomas | Leslie Sager | Lili Kotlerman | Elad Venezian | Noam Slonim
Building a Corpus from Handwritten Picture Postcards: Transcription, Annotation and Part-of-Speech Tagging
Kyoko Sugisaki | Nicolas Wiedmer | Heiko Hausendorf
Kyoko Sugisaki | Nicolas Wiedmer | Heiko Hausendorf
A Lexical Tool for Academic Writing in Spanish based on Expert and Novice Corpora
Marcos García Salido | Marcos García | Milka Villayandre-Llamazares | Margarita Alonso-Ramos
Marcos García Salido | Marcos García | Milka Villayandre-Llamazares | Margarita Alonso-Ramos
Framing Named Entity Linking Error Types
Adrian Braşoveanu | Giuseppe Rizzo | Philipp Kuntschik | Albert Weichselbraun | Lyndon J.B. Nixon
Adrian Braşoveanu | Giuseppe Rizzo | Philipp Kuntschik | Albert Weichselbraun | Lyndon J.B. Nixon
A FrameNet for Cancer Information in Clinical Narratives: Schema and Annotation
Kirk Roberts | Yuqi Si | Anshul Gandhi | Elmer Bernstam
Kirk Roberts | Yuqi Si | Anshul Gandhi | Elmer Bernstam
A New Corpus to Support Text Mining for the Curation of Metabolites in the ChEBI Database
Matthew Shardlow | Nhung Nguyen | Gareth Owen | Claire O’Donovan | Andrew Leach | John McNaught | Steve Turner | Sophia Ananiadou
Matthew Shardlow | Nhung Nguyen | Gareth Owen | Claire O’Donovan | Andrew Leach | John McNaught | Steve Turner | Sophia Ananiadou
Parallel Corpora for the Biomedical Domain
Aurélie Névéol | Antonio Jimeno Yepes | Mariana Neves | Karin Verspoor
Aurélie Névéol | Antonio Jimeno Yepes | Mariana Neves | Karin Verspoor
Medical Entity Corpus with PICO elements and Sentiment Analysis
Markus Zlabinger | Linda Andersson | Allan Hanbury | Michael Andersson | Vanessa Quasnik | Jon Brassey
Markus Zlabinger | Linda Andersson | Allan Hanbury | Michael Andersson | Vanessa Quasnik | Jon Brassey
A Large Automatically-Acquired All-Words List of Multiword Expressions Scored for Compositionality
Will Roberts | Markus Egg
Will Roberts | Markus Egg
A Hybrid Approach for Automatic Extraction of Bilingual Multiword Expressions from Parallel Corpora
Nasredine Semmar
Nasredine Semmar
No more beating about the bush : A Step towards Idiom Handling for Indian Language NLP
Ruchit Agrawal | Vighnesh Chenthil Kumar | Vigneshwaran Muralidharan | Dipti Sharma
Ruchit Agrawal | Vighnesh Chenthil Kumar | Vigneshwaran Muralidharan | Dipti Sharma
Sentence Level Temporality Detection using an Implicit Time-sensed Resource
Sabyasachi Kamila | Asif Ekbal | Pushpak Bhattacharyya
Sabyasachi Kamila | Asif Ekbal | Pushpak Bhattacharyya
Comprehensive Annotation of Various Types of Temporal Information on the Time Axis
Tomohiro Sakaguchi | Daisuke Kawahara | Sadao Kurohashi
Tomohiro Sakaguchi | Daisuke Kawahara | Sadao Kurohashi
Systems’ Agreements and Disagreements in Temporal Processing: An Extensive Error Analysis of the TempEval-3 Task
Tommaso Caselli | Roser Morante
Tommaso Caselli | Roser Morante
Annotating Temporally-Anchored Spatial Knowledge by Leveraging Syntactic Dependencies
Alakananda Vempala | Eduardo Blanco
Alakananda Vempala | Eduardo Blanco
SW4ALL: a CEFR Classified and Aligned Corpus for Language Learning
Rodrigo Wilkens | Leonardo Zilio | Cédrick Fairon
Rodrigo Wilkens | Leonardo Zilio | Cédrick Fairon
Towards a Diagnosis of Textual Difficulties for Children with Dyslexia
Solen Quiniou | Béatrice Daille
Solen Quiniou | Béatrice Daille
Deep Neural Networks for Coreference Resolution for Polish
Bartłomiej Nitoń | Paweł Morawiecki | Maciej Ogrodniczuk
Bartłomiej Nitoń | Paweł Morawiecki | Maciej Ogrodniczuk
SzegedKoref: A Hungarian Coreference Corpus
Veronika Vincze | Klára Hegedűs | Alex Sliz-Nagy | Richárd Farkas
Veronika Vincze | Klára Hegedűs | Alex Sliz-Nagy | Richárd Farkas
Sanaphor++: Combining Deep Neural Networks with Semantics for Coreference Resolution
Julien Plu | Roman Prokofyev | Alberto Tonon | Philippe Cudré-Mauroux | Djellel Eddine Difallah | Raphaël Troncy | Giuseppe Rizzo
Julien Plu | Roman Prokofyev | Alberto Tonon | Philippe Cudré-Mauroux | Djellel Eddine Difallah | Raphaël Troncy | Giuseppe Rizzo
ANCOR-AS: Enriching the ANCOR Corpus with Syntactic Annotations
Loïc Grobol | Isabelle Tellier | Éric de la Clergerie | Marco Dinarelli | Frédéric Landragin
Loïc Grobol | Isabelle Tellier | Éric de la Clergerie | Marco Dinarelli | Frédéric Landragin
ParCorFull: a Parallel Corpus Annotated with Full Coreference
Ekaterina Lapshinova-Koltunski | Christian Hardmeier | Pauline Krielke
Ekaterina Lapshinova-Koltunski | Christian Hardmeier | Pauline Krielke
An Application for Building a Polish Telephone Speech Corpus
Bartosz Ziółko | Piotr Żelasko | Ireneusz Gawlik | Tomasz Pędzimąż | Tomasz Jadczyk
Bartosz Ziółko | Piotr Żelasko | Ireneusz Gawlik | Tomasz Pędzimąż | Tomasz Jadczyk
CPJD Corpus: Crowdsourced Parallel Speech Corpus of Japanese Dialects
Shinnosuke Takamichi | Hiroshi Saruwatari
Shinnosuke Takamichi | Hiroshi Saruwatari
Korean L2 Vocabulary Prediction: Can a Large Annotated Corpus be Used to Train Better Models for Predicting Unknown Words?
Kevin Yancey | Yves Lepage
Kevin Yancey | Yves Lepage
Crowdsourcing-based Annotation of the Accounting Registers of the Italian Comedy
Adeline Granet | Benjamin Hervy | Geoffrey Roman-Jimenez | Marouane Hachicha | Emmanuel Morin | Harold Mouchère | Solen Quiniou | Guillaume Raschia | Françoise Rubellin | Christian Viard-Gaudin
Adeline Granet | Benjamin Hervy | Geoffrey Roman-Jimenez | Marouane Hachicha | Emmanuel Morin | Harold Mouchère | Solen Quiniou | Guillaume Raschia | Françoise Rubellin | Christian Viard-Gaudin
FEIDEGGER: A Multi-modal Corpus of Fashion Images and Descriptions in German
Leonidas Lefakis | Alan Akbik | Roland Vollgraf
Leonidas Lefakis | Alan Akbik | Roland Vollgraf
Toward a Lightweight Solution for Less-resourced Languages: Creating a POS Tagger for Alsatian Using Voluntary Crowdsourcing
Alice Millour | Karën Fort
Alice Millour | Karën Fort
Crowdsourced Corpus of Sentence Simplification with Core Vocabulary
Akihiro Katsuta | Kazuhide Yamamoto
Akihiro Katsuta | Kazuhide Yamamoto
A Multilingual Wikified Data Set of Educational Material
Iris Hendrickx | Eirini Takoulidou | Thanasis Naskos | Katia Lida Kermanidis | Vilelmini Sosoni | Hugo de Vos | Maria Stasimioti | Menno van Zaanen | Panayota Georgakopoulou | Valia Kordoni | Maja Popovic | Markus Egg | Antal van den Bosch
Iris Hendrickx | Eirini Takoulidou | Thanasis Naskos | Katia Lida Kermanidis | Vilelmini Sosoni | Hugo de Vos | Maria Stasimioti | Menno van Zaanen | Panayota Georgakopoulou | Valia Kordoni | Maja Popovic | Markus Egg | Antal van den Bosch
Translation Crowdsourcing: Creating a Multilingual Corpus of Online Educational Content
Vilelmini Sosoni | Katia Lida Kermanidis | Maria Stasimioti | Thanasis Naskos | Eirini Takoulidou | Menno van Zaanen | Sheila Castilho | Panayota Georgakopoulou | Valia Kordoni | Markus Egg
Vilelmini Sosoni | Katia Lida Kermanidis | Maria Stasimioti | Thanasis Naskos | Eirini Takoulidou | Menno van Zaanen | Sheila Castilho | Panayota Georgakopoulou | Valia Kordoni | Markus Egg
Building an English Vocabulary Knowledge Dataset of Japanese English-as-a-Second-Language Learners Using Crowdsourcing
Yo Ehara
Yo Ehara
The UIR Uncertainty Corpus for Chinese: Annotating Chinese Microblog Corpus for Uncertainty Identification from Social Media
Binyang Li | Jun Xiang | Le Chen | Xu Han | Xiaoyan Yu | Ruifeng Xu | Tengjiao Wang | Kam-fai Wong
Binyang Li | Jun Xiang | Le Chen | Xu Han | Xiaoyan Yu | Ruifeng Xu | Tengjiao Wang | Kam-fai Wong
EventWiki: A Knowledge Base of Major Events
Tao Ge | Lei Cui | Baobao Chang | Zhifang Sui | Furu Wei | Ming Zhou
Tao Ge | Lei Cui | Baobao Chang | Zhifang Sui | Furu Wei | Ming Zhou
Annotating Spin in Biomedical Scientific Publications : the case of Random Controlled Trials (RCTs)
Anna Koroleva | Patrick Paroubek
Anna Koroleva | Patrick Paroubek
Visualization of the occurrence trend of infectious diseases using Twitter
Ryusei Matsumoto | Minoru Yoshida | Kazuyuki Matsumoto | Hironobu Matsuda | Kenji Kita
Ryusei Matsumoto | Minoru Yoshida | Kazuyuki Matsumoto | Hironobu Matsuda | Kenji Kita
Towards a Gold Standard Corpus for Variable Detection and Linking in Social Science Publications
Andrea Zielinski | Peter Mutschke
Andrea Zielinski | Peter Mutschke
KRAUTS: A German Temporally Annotated News Corpus
Jannik Strötgen | Anne-Lyse Minard | Lukas Lange | Manuela Speranza | Bernardo Magnini
Jannik Strötgen | Anne-Lyse Minard | Lukas Lange | Manuela Speranza | Bernardo Magnini
CogCompNLP: Your Swiss Army Knife for NLP
Daniel Khashabi | Mark Sammons | Ben Zhou | Tom Redman | Christos Christodoulopoulos | Vivek Srikumar | Nicholas Rizzolo | Lev Ratinov | Guanheng Luo | Quang Do | Chen-Tse Tsai | Subhro Roy | Stephen Mayhew | Zhili Feng | John Wieting | Xiaodong Yu | Yangqiu Song | Shashank Gupta | Shyam Upadhyay | Naveen Arivazhagan | Qiang Ning | Shaoshi Ling | Dan Roth
Daniel Khashabi | Mark Sammons | Ben Zhou | Tom Redman | Christos Christodoulopoulos | Vivek Srikumar | Nicholas Rizzolo | Lev Ratinov | Guanheng Luo | Quang Do | Chen-Tse Tsai | Subhro Roy | Stephen Mayhew | Zhili Feng | John Wieting | Xiaodong Yu | Yangqiu Song | Shashank Gupta | Shyam Upadhyay | Naveen Arivazhagan | Qiang Ning | Shaoshi Ling | Dan Roth
A Framework for the Needs of Different Types of Users in Multilingual Semantic Enrichment
Jan Nehring | Felix Sasaki
Jan Nehring | Felix Sasaki
Preserving Workflow Reproducibility: The RePlay-DH Client as a Tool for Process Documentation
Markus Gärtner | Uli Hahn | Sibylle Hermann
Markus Gärtner | Uli Hahn | Sibylle Hermann
What’s Wrong, Python? – A Visual Differ and Graph Library for NLP in Python
Balázs Indig | András Simonyi | Noémi Ligeti-Nagy
Balázs Indig | András Simonyi | Noémi Ligeti-Nagy
ScholarGraph:a Chinese Knowledge Graph of Chinese Scholars
Shuo Wang | Zehui Hao | Xiaofeng Meng | Qiuyue Wang
Shuo Wang | Zehui Hao | Xiaofeng Meng | Qiuyue Wang
Enriching Frame Representations with Distributionally Induced Senses
Stefano Faralli | Alexander Panchenko | Chris Biemann | Simone Paolo Ponzetto
Stefano Faralli | Alexander Panchenko | Chris Biemann | Simone Paolo Ponzetto
An Integrated Formal Representation for Terminological and Lexical Data included in Classification Schemes
Thierry Declerck | Kseniya Egorova | Eileen Schnur
Thierry Declerck | Kseniya Egorova | Eileen Schnur
One event, many representations. Mapping action concepts through visual features.
Alessandro Panunzi | Lorenzo Gregori | Andrea Amelio Ravelli
Alessandro Panunzi | Lorenzo Gregori | Andrea Amelio Ravelli
Sentiment-Stance-Specificity (SSS) Dataset: Identifying Support-based Entailment among Opinions.
Pavithra Rajendran | Danushka Bollegala | Simon Parsons
Pavithra Rajendran | Danushka Bollegala | Simon Parsons
Resource Creation Towards Automated Sentiment Analysis in Telugu (a low resource language) and Integrating Multiple Domain Sources to Enhance Sentiment Prediction
Rama Rohit Reddy Gangula | Radhika Mamidi
Rama Rohit Reddy Gangula | Radhika Mamidi
Multilingual Multi-class Sentiment Classification Using Convolutional Neural Networks
Mohammed Attia | Younes Samih | Ali Elkahky | Laura Kallmeyer
Mohammed Attia | Younes Samih | Ali Elkahky | Laura Kallmeyer
HappyDB: A Corpus of 100,000 Crowdsourced Happy Moments
Akari Asai | Sara Evensen | Behzad Golshan | Alon Halevy | Vivian Li | Andrei Lopatenko | Daniela Stepanov | Yoshihiko Suhara | Wang-Chiew Tan | Yinzhan Xu
Akari Asai | Sara Evensen | Behzad Golshan | Alon Halevy | Vivian Li | Andrei Lopatenko | Daniela Stepanov | Yoshihiko Suhara | Wang-Chiew Tan | Yinzhan Xu
MultiBooked: A Corpus of Basque and Catalan Hotel Reviews Annotated for Aspect-level Sentiment Classification
Jeremy Barnes | Toni Badia | Patrik Lambert
Jeremy Barnes | Toni Badia | Patrik Lambert
Collecting Code-Switched Data from Social Media
Gideon Mendels | Victor Soto | Aaron Jaech | Julia Hirschberg
Gideon Mendels | Victor Soto | Aaron Jaech | Julia Hirschberg
A Taxonomy for In-depth Evaluation of Normalization for User Generated Content
Rob van der Goot | Rik van Noord | Gertjan van Noord
Rob van der Goot | Rik van Noord | Gertjan van Noord
Arap-Tweet: A Large Multi-Dialect Twitter Corpus for Gender, Age and Language Variety Identification
Wajdi Zaghouani | Anis Charfi
Wajdi Zaghouani | Anis Charfi
Transc&Anno: A Graphical Tool for the Transcription and On-the-Fly Annotation of Handwritten Documents
Nadezda Okinina | Lionel Nicolas | Verena Lyding
Nadezda Okinina | Lionel Nicolas | Verena Lyding
Correction of OCR Word Segmentation Errors in Articles from the ACL Collection through Neural Machine Translation Methods
Vivi Nastase | Julian Hitschler
Vivi Nastase | Julian Hitschler
Crowdsourced Multimodal Corpora Collection Tool
Patrik Jonell | Catharine Oertel | Dimosthenis Kontogiorgos | Jonas Beskow | Joakim Gustafson
Patrik Jonell | Catharine Oertel | Dimosthenis Kontogiorgos | Jonas Beskow | Joakim Gustafson
Expert Evaluation of a Spoken Dialogue System in a Clinical Operating Room
Juliana Miehle | Nadine Gerstenlauer | Daniel Ostler | Hubertus Feußner | Wolfgang Minker | Stefan Ultes
Juliana Miehle | Nadine Gerstenlauer | Daniel Ostler | Hubertus Feußner | Wolfgang Minker | Stefan Ultes
The Metalogue Debate Trainee Corpus: Data Collection and Annotations
Volha Petukhova | Andrei Malchanau | Youssef Oualil | Dietrich Klakow | Saturnino Luz | Fasih Haider | Nick Campbell | Dimitris Koryzis | Dimitris Spiliotopoulos | Pierre Albert | Nicklas Linz | Jan Alexandersson
Volha Petukhova | Andrei Malchanau | Youssef Oualil | Dietrich Klakow | Saturnino Luz | Fasih Haider | Nick Campbell | Dimitris Koryzis | Dimitris Spiliotopoulos | Pierre Albert | Nicklas Linz | Jan Alexandersson
Towards Continuous Dialogue Corpus Creation: writing to corpus and generating from it
Andrei Malchanau | Volha Petukhova | Harry Bunt
Andrei Malchanau | Volha Petukhova | Harry Bunt
KTH Tangrams: A Dataset for Research on Alignment and Conceptual Pacts in Task-Oriented Dialogue
Todd Shore | Theofronia Androulakaki | Gabriel Skantze
Todd Shore | Theofronia Androulakaki | Gabriel Skantze
On the Vector Representation of Utterances in Dialogue Context
Louisa Pragst | Niklas Rach | Wolfgang Minker | Stefan Ultes
Louisa Pragst | Niklas Rach | Wolfgang Minker | Stefan Ultes
ES-Port: a Spontaneous Spoken Human-Human Technical Support Corpus for Dialogue Research in Spanish
Laura García-Sardiña | Manex Serras | Arantza del Pozo
Laura García-Sardiña | Manex Serras | Arantza del Pozo
From analysis to modeling of engagement as sequences of multimodal behaviors
Soumia Dermouche | Catherine Pelachaud
Soumia Dermouche | Catherine Pelachaud
Building Literary Corpora for Computational Literary Analysis - A Prototype to Bridge the Gap between CL and DH
Andrew Frank | Christine Ivanovic
Andrew Frank | Christine Ivanovic
Towards faithfully visualizing global linguistic diversity
Garland McNew | Curdin Derungs | Steven Moran
Garland McNew | Curdin Derungs | Steven Moran
Identifying Speakers and Addressees in Dialogues Extracted from Literary Fiction
Adam Ek | Mats Wirén | Robert Östling | Kristina N. Björkenstam | Gintarė Grigonytė | Sofia Gustafson Capková
Adam Ek | Mats Wirén | Robert Östling | Kristina N. Björkenstam | Gintarė Grigonytė | Sofia Gustafson Capková
Word Embedding Evaluation Datasets and Wikipedia Title Embedding for Chinese
Chi-Yen Chen | Wei-Yun Ma
Chi-Yen Chen | Wei-Yun Ma
An Automatic Learning of an Algerian Dialect Lexicon by using Multilingual Word Embeddings
Abidi Karima | Kamel Smaïli
Abidi Karima | Kamel Smaïli
Candidate Ranking for Maintenance of an Online Dictionary
Claire Broad | Helen Langone | David Guy Brizan
Claire Broad | Helen Langone | David Guy Brizan
Tools for Building an Interlinked Synonym Lexicon Network
Zdeňka Urešová | Eva Fučíková | Eva Hajičová | Jan Hajič
Zdeňka Urešová | Eva Fučíková | Eva Hajičová | Jan Hajič
Combining Concepts and Their Translations from Structured Dictionaries of Uralic Minority Languages
Mika Hämäläinen | Liisa Lotta Tarvainen | Jack Rueter
Mika Hämäläinen | Liisa Lotta Tarvainen | Jack Rueter
Transfer of Frames from English FrameNet to Construct Chinese FrameNet: A Bilingual Corpus-Based Approach
Tsung-Han Yang | Hen-Hsen Huang | An-Zi Yen | Hsin-Hsi Chen
Tsung-Han Yang | Hen-Hsen Huang | An-Zi Yen | Hsin-Hsi Chen
EFLLex: A Graded Lexical Resource for Learners of English as a Foreign Language
Luise Dürlich | Thomas François
Luise Dürlich | Thomas François
English-Basque Statistical and Neural Machine Translation
Inigo Jauregi Unanue | Lierni Garmendia Arratibel | Ehsan Zare Borzeshi | Massimo Piccardi
Inigo Jauregi Unanue | Lierni Garmendia Arratibel | Ehsan Zare Borzeshi | Massimo Piccardi
TQ-AutoTest – An Automated Test Suite for (Machine) Translation Quality
Vivien Macketanz | Renlong Ai | Aljoscha Burchardt | Hans Uszkoreit
Vivien Macketanz | Renlong Ai | Aljoscha Burchardt | Hans Uszkoreit
Improving a Multi-Source Neural Machine Translation Model with Corpus Extension for Low-Resource Languages
Gyu-Hyeon Choi | Jong-Hun Shin | Young-Kil Kim
Gyu-Hyeon Choi | Jong-Hun Shin | Young-Kil Kim
Dynamic Oracle for Neural Machine Translation in Decoding Phase
Zi-Yi Dou | Hao Zhou | Shu-Jian Huang | Xin-Yu Dai | Jia-Jun Chen
Zi-Yi Dou | Hao Zhou | Shu-Jian Huang | Xin-Yu Dai | Jia-Jun Chen
A Parallel Corpus of Arabic-Japanese News Articles
Go Inoue | Nizar Habash | Yuji Matsumoto | Hiroyuki Aoyama
Go Inoue | Nizar Habash | Yuji Matsumoto | Hiroyuki Aoyama
Examining the Tip of the Iceberg: A Data Set for Idiom Translation
Marzieh Fadaee | Arianna Bisazza | Christof Monz
Marzieh Fadaee | Arianna Bisazza | Christof Monz
Automatic Enrichment of Terminological Resources: the IATE RDF Example
Mihael Arcan | Elena Montiel-Ponsoda | John P. McCrae | Paul Buitelaar
Mihael Arcan | Elena Montiel-Ponsoda | John P. McCrae | Paul Buitelaar
A Comparative Study of Extremely Low-Resource Transliteration of the World’s Languages
Winston Wu | David Yarowsky
Winston Wu | David Yarowsky
Translating Web Search Queries into Natural Language Questions
Adarsh Kumar | Sandipan Dandapat | Sushil Chordia
Adarsh Kumar | Sandipan Dandapat | Sushil Chordia
Acquiring Verb Classes Through Bottom-Up Semantic Verb Clustering
Olga Majewska | Diana McCarthy | Ivan Vulić | Anna Korhonen
Olga Majewska | Diana McCarthy | Ivan Vulić | Anna Korhonen
Constructing High Quality Sense-specific Corpus and Word Embedding via Unsupervised Elimination of Pseudo Multi-sense
Haoyue Shi | Xihao Wang | Yuqi Sun | Junfeng Hu
Haoyue Shi | Xihao Wang | Yuqi Sun | Junfeng Hu
Social Image Tags as a Source of Word Embeddings: A Task-oriented Evaluation
Mika Hasegawa | Tetsunori Kobayashi | Yoshihiko Hayashi
Mika Hasegawa | Tetsunori Kobayashi | Yoshihiko Hayashi
Semantic Frame Parsing for Information Extraction : the CALOR corpus
Gabriel Marzinotto | Jeremy Auguste | Frederic Bechet | Geraldine Damnati | Alexis Nasr
Gabriel Marzinotto | Jeremy Auguste | Frederic Bechet | Geraldine Damnati | Alexis Nasr
Using a Corpus of English and Chinese Political Speeches for Metaphor Analysis
Kathleen Ahrens | Huiheng Zeng | Shun-han Rebekah Wong
Kathleen Ahrens | Huiheng Zeng | Shun-han Rebekah Wong
A Multi- versus a Single-classifier Approach for the Identification of Modality in the Portuguese Language
João Sequeira | Teresa Gonçalves | Paulo Quaresma | Amália Mendes | Iris Hendrickx
João Sequeira | Teresa Gonçalves | Paulo Quaresma | Amália Mendes | Iris Hendrickx
All-words Word Sense Disambiguation Using Concept Embeddings
Rui Suzuki | Kanako Komiya | Masayuki Asahara | Minoru Sasaki | Hiroyuki Shinnou
Rui Suzuki | Kanako Komiya | Masayuki Asahara | Minoru Sasaki | Hiroyuki Shinnou
Enhancing Modern Supervised Word Sense Disambiguation Models by Semantic Lexical Resources
Stefano Melacci | Achille Globo | Leonardo Rigutini
Stefano Melacci | Achille Globo | Leonardo Rigutini
An Unsupervised Word Sense Disambiguation System for Under-Resourced Languages
Dmitry Ustalov | Denis Teslenko | Alexander Panchenko | Mikhail Chernoskutov | Chris Biemann | Simone Paolo Ponzetto
Dmitry Ustalov | Denis Teslenko | Alexander Panchenko | Mikhail Chernoskutov | Chris Biemann | Simone Paolo Ponzetto
Unsupervised Korean Word Sense Disambiguation using CoreNet
Kijong Han | Sangha Nam | Jiseong Kim | Younggyun Hahm | Key-Sun Choi
Kijong Han | Sangha Nam | Jiseong Kim | Younggyun Hahm | Key-Sun Choi
UFSAC: Unification of Sense Annotated Corpora and Tools
Loïc Vial | Benjamin Lecouteux | Didier Schwab
Loïc Vial | Benjamin Lecouteux | Didier Schwab
Retrofitting Word Representations for Unsupervised Sense Aware Word Similarities
Steffen Remus | Chris Biemann
Steffen Remus | Chris Biemann
FastSense: An Efficient Word Sense Disambiguation Classifier
Tolga Uslu | Alexander Mehler | Daniel Baumartz | Wahed Hemati
Tolga Uslu | Alexander Mehler | Daniel Baumartz | Wahed Hemati
Text Annotation Graphs: Annotating Complex Natural Language Phenomena
Angus Forbes | Kristine Lee | Gus Hahn-Powell | Marco A. Valenzuela-Escárcega | Mihai Surdeanu
Angus Forbes | Kristine Lee | Gus Hahn-Powell | Marco A. Valenzuela-Escárcega | Mihai Surdeanu
Tools for The Production of Analogical Grids and a Resource of N-gram Analogical Grids in 11 Languages
Rashel Fam | Yves Lepage
Rashel Fam | Yves Lepage
The Automatic Annotation of the Semiotic Type of Hand Gestures in Obama’ s Humorous Speeches
Costanza Navarretta
Costanza Navarretta
Annotation and Quantitative Analysis of Speaker Information in Novel Conversation Sentences in Japanese
Makoto Yamazaki | Yumi Miyazaki | Wakako Kashino
Makoto Yamazaki | Yumi Miyazaki | Wakako Kashino
PDFAnno: a Web-based Linguistic Annotation Tool for PDF Documents
Hiroyuki Shindo | Yohei Munesada | Yuji Matsumoto
Hiroyuki Shindo | Yohei Munesada | Yuji Matsumoto
An Annotation Language for Semantic Search of Legal Sources
Adeline Nazarenko | François Levy | Adam Wyner
Adeline Nazarenko | François Levy | Adam Wyner
Resource Interoperability for Sustainable Benchmarking: The Case of Events
Chantal van Son | Oana Inel | Roser Morante | Lora Aroyo | Piek Vossen
Chantal van Son | Oana Inel | Roser Morante | Lora Aroyo | Piek Vossen
Parsivar: A Language Processing Toolkit for Persian
Salar Mohtaj | Behnam Roshanfekr | Atefeh Zafarian | Habibollah Asghari
Salar Mohtaj | Behnam Roshanfekr | Atefeh Zafarian | Habibollah Asghari
Multilingual Word Segmentation: Training Many Language-Specific Tokenizers Smoothly Thanks to the Universal Dependencies Corpus
Erwan Moreau | Carl Vogel
Erwan Moreau | Carl Vogel
Building a Corpus for Personality-dependent Natural Language Understanding and Generation
Ricelli Ramos | Georges Neto | Barbara Silva | Danielle Monteiro | Ivandré Paraboni | Rafael Dias
Ricelli Ramos | Georges Neto | Barbara Silva | Danielle Monteiro | Ivandré Paraboni | Rafael Dias
Linguistic and Sociolinguistic Annotation of 17th Century Dutch Letters
Marijn Schraagen | Feike Dietz | Marjo van Koppen
Marijn Schraagen | Feike Dietz | Marjo van Koppen
ASAP++: Enriching the ASAP Automated Essay Grading Dataset with Essay Attribute Scores
Sandeep Mathias | Pushpak Bhattacharyya
Sandeep Mathias | Pushpak Bhattacharyya
MirasText: An Automatically Generated Text Corpus for Persian
Behnam Sabeti | Hossein Abedi Firouzjaee | Ali Janalizadeh Choobbasti | S.H.E. Mortazavi Najafabadi | Amir Vaheb
Behnam Sabeti | Hossein Abedi Firouzjaee | Ali Janalizadeh Choobbasti | S.H.E. Mortazavi Najafabadi | Amir Vaheb
The Reference Corpus of the Contemporary Romanian Language (CoRoLa)
Verginica Barbu Mititelu | Dan Tufiș | Elena Irimia
Verginica Barbu Mititelu | Dan Tufiș | Elena Irimia
A Corpus of Drug Usage Guidelines Annotated with Type of Advice
Sarah Masud Preum | Md. Rizwan Parvez | Kai-Wei Chang | John Stankovic
Sarah Masud Preum | Md. Rizwan Parvez | Kai-Wei Chang | John Stankovic
A Comparison Of Emotion Annotation Schemes And A New Annotated Data Set
Ian D. Wood | John P. McCrae | Vladimir Andryushechkin | Paul Buitelaar
Ian D. Wood | John P. McCrae | Vladimir Andryushechkin | Paul Buitelaar
Humor Detection in English-Hindi Code-Mixed Social Media Content : Corpus and Baseline System
Ankush Khandelwal | Sahil Swami | Syed S. Akhtar | Manish Shrivastava
Ankush Khandelwal | Sahil Swami | Syed S. Akhtar | Manish Shrivastava
Dialogue Scenario Collection of Persuasive Dialogue with Emotional Expressions via Crowdsourcing
Koichiro Yoshino | Yoko Ishikawa | Masahiro Mizukami | Yu Suzuki | Sakriani Sakti | Satoshi Nakamura
Koichiro Yoshino | Yoko Ishikawa | Masahiro Mizukami | Yu Suzuki | Sakriani Sakti | Satoshi Nakamura
Contextual Dependencies in Time-Continuous Multidimensional Affect Recognition
Dmitrii Fedotov | Denis Ivanko | Maxim Sidorov | Wolfgang Minker
Dmitrii Fedotov | Denis Ivanko | Maxim Sidorov | Wolfgang Minker
WikiArt Emotions: An Annotated Dataset of Emotions Evoked by Art
Saif Mohammad | Svetlana Kiritchenko
Saif Mohammad | Svetlana Kiritchenko
Arabic Data Science Toolkit: An API for Arabic Language Feature Extraction
Paul Rodrigues | Valerie Novak | C. Anton Rytting | Julie Yelle | Jennifer Boutz
Paul Rodrigues | Valerie Novak | C. Anton Rytting | Julie Yelle | Jennifer Boutz
Sentence and Clause Level Emotion Annotation, Detection, and Classification in a Multi-Genre Corpus
Shabnam Tafreshi | Mona Diab
Shabnam Tafreshi | Mona Diab
A Swedish Cookie-Theft Corpus
Dimitrios Kokkinakis | Kristina Lundholm Fors | Kathleen Fraser | Arto Nordlund
Dimitrios Kokkinakis | Kristina Lundholm Fors | Kathleen Fraser | Arto Nordlund
Sharing Copies of Synthetic Clinical Corpora without Physical Distribution — A Case Study to Get Around IPRs and Privacy Constraints Featuring the German JSYNCC Corpus
Christina Lohr | Sven Buechel | Udo Hahn
Christina Lohr | Sven Buechel | Udo Hahn
A Legal Perspective on Training Models for Natural Language Processing
Richard Eckart de Castilho | Giulia Dore | Thomas Margoni | Penny Labropoulou | Iryna Gurevych
Richard Eckart de Castilho | Giulia Dore | Thomas Margoni | Penny Labropoulou | Iryna Gurevych
LREMap, a Song of Resources and Evaluation
Riccardo Del Gratta | Sara Goggi | Gabriella Pardelli | Nicoletta Calzolari
Riccardo Del Gratta | Sara Goggi | Gabriella Pardelli | Nicoletta Calzolari
Metadata Collection Records for Language Resources
Henk van den Heuvel | Erwin Komen | Nelleke Oostdijk
Henk van den Heuvel | Erwin Komen | Nelleke Oostdijk
Managing Public Sector Data for Multilingual Applications Development
Stelios Piperidis | Penny Labropoulou | Miltos Deligiannis | Maria Giagkou
Stelios Piperidis | Penny Labropoulou | Miltos Deligiannis | Maria Giagkou
Bridging the LAPPS Grid and CLARIN
Erhard Hinrichs | Nancy Ide | James Pustejovsky | Jan Hajič | Marie Hinrichs | Mohammad Fazleh Elahi | Keith Suderman | Marc Verhagen | Kyeongmin Rim | Pavel Straňák | Jozef Mišutka
Erhard Hinrichs | Nancy Ide | James Pustejovsky | Jan Hajič | Marie Hinrichs | Mohammad Fazleh Elahi | Keith Suderman | Marc Verhagen | Kyeongmin Rim | Pavel Straňák | Jozef Mišutka
Fluid Annotation: A Granularity-aware Annotation Tool for Chinese Word Fluidity
Shu-Kai Hsieh | Yu-Hsiang Tseng | Chih-Yao Lee | Chiung-Yu Chiang
Shu-Kai Hsieh | Yu-Hsiang Tseng | Chih-Yao Lee | Chiung-Yu Chiang
E-magyar – A Digital Language Processing System
Tamás Váradi | Eszter Simon | Bálint Sass | Iván Mittelholcz | Attila Novák | Balázs Indig | Richárd Farkas | Veronika Vincze
Tamás Váradi | Eszter Simon | Bálint Sass | Iván Mittelholcz | Attila Novák | Balázs Indig | Richárd Farkas | Veronika Vincze
ILCM - A Virtual Research Infrastructure for Large-Scale Qualitative Data
Andreas Niekler | Arnim Bleier | Christian Kahmann | Lisa Posch | Gregor Wiedemann | Kenan Erdogan | Gerhard Heyer | Markus Strohmaier
Andreas Niekler | Arnim Bleier | Christian Kahmann | Lisa Posch | Gregor Wiedemann | Kenan Erdogan | Gerhard Heyer | Markus Strohmaier
Indra: A Word Embedding and Semantic Relatedness Server
Juliano Efson Sales | Leonardo Souza | Siamak Barzegar | Brian Davis | André Freitas | Siegfried Handschuh
Juliano Efson Sales | Leonardo Souza | Siamak Barzegar | Brian Davis | André Freitas | Siegfried Handschuh
A UIMA Database Interface for Managing NLP-related Text Annotations
Giuseppe Abrami | Alexander Mehler
Giuseppe Abrami | Alexander Mehler
European Language Resource Coordination: Collecting Language Resources for Public Sector Multilingual Information Management
Andrea Lösch | Valérie Mapelli | Stelios Piperidis | Andrejs Vasiļjevs | Lilli Smal | Thierry Declerck | Eileen Schnur | Khalid Choukri | Josef van Genabith
Andrea Lösch | Valérie Mapelli | Stelios Piperidis | Andrejs Vasiļjevs | Lilli Smal | Thierry Declerck | Eileen Schnur | Khalid Choukri | Josef van Genabith
Tilde MT Platform for Developing Client Specific MT Solutions
Mārcis Pinnis | Andrejs Vasiļjevs | Rihards Kalniņš | Roberts Rozis | Raivis Skadiņš | Valters Šics
Mārcis Pinnis | Andrejs Vasiļjevs | Rihards Kalniņš | Roberts Rozis | Raivis Skadiņš | Valters Šics
Improving homograph disambiguation with supervised machine learning
Kyle Gorman | Gleb Mazovetskiy | Vitaly Nikolaev
Kyle Gorman | Gleb Mazovetskiy | Vitaly Nikolaev
Text Normalization Infrastructure that Scales to Hundreds of Language Varieties
Mason Chua | Daan van Esch | Noah Coccaro | Eunjoon Cho | Sujeet Bhandari | Libin Jia
Mason Chua | Daan van Esch | Noah Coccaro | Eunjoon Cho | Sujeet Bhandari | Libin Jia
DeModify: A Dataset for Analyzing Contextual Constraints on Modifier Deletion
Vivi Nastase | Devon Fritz | Anette Frank
Vivi Nastase | Devon Fritz | Anette Frank
Fine-grained Semantic Textual Similarity for Serbian
Vuk Batanović | Miloš Cvetanović | Boško Nikolić
Vuk Batanović | Miloš Cvetanović | Boško Nikolić
ETPC - A Paraphrase Identification Corpus Annotated with Extended Paraphrase Typology and Negation
Venelin Kovatchev | M. Antònia Martí | Maria Salamó
Venelin Kovatchev | M. Antònia Martí | Maria Salamó
Introducing a Lexicon of Verbal Polarity Shifters for English
Marc Schulder | Michael Wiegand | Josef Ruppenhofer | Stephanie Köser
Marc Schulder | Michael Wiegand | Josef Ruppenhofer | Stephanie Köser
Quantifying Qualitative Data for Understanding Controversial Issues
Michael Wojatzki | Saif Mohammad | Torsten Zesch | Svetlana Kiritchenko
Michael Wojatzki | Saif Mohammad | Torsten Zesch | Svetlana Kiritchenko
Distribution of Emotional Reactions to News Articles in Twitter
Omar Juárez Gambino | Hiram Calvo | Consuelo-Varinia García-Mendoza
Omar Juárez Gambino | Hiram Calvo | Consuelo-Varinia García-Mendoza
Aggression-annotated Corpus of Hindi-English Code-mixed Data
Ritesh Kumar | Aishwarya N. Reganti | Akshit Bhatia | Tushar Maheshwari
Ritesh Kumar | Aishwarya N. Reganti | Akshit Bhatia | Tushar Maheshwari
Creating a Verb Synonym Lexicon Based on a Parallel Corpus
Zdeňka Urešová | Eva Fučíková | Eva Hajičová | Jan Hajič
Zdeňka Urešová | Eva Fučíková | Eva Hajičová | Jan Hajič
Evaluation of Domain-specific Word Embeddings using Knowledge Resources
Farhad Nooralahzadeh | Lilja Øvrelid | Jan Tore Lønning
Farhad Nooralahzadeh | Lilja Øvrelid | Jan Tore Lønning
Automatic Wordnet Mapping: from CoreNet to Princeton WordNet
Jiseong Kim | Younggyun Hahm | Sunggoo Kwon | Key-Sun Choi
Jiseong Kim | Younggyun Hahm | Sunggoo Kwon | Key-Sun Choi
The New Propbank: Aligning Propbank with AMR through POS Unification
Tim O’Gorman | Sameer Pradhan | Martha Palmer | Julia Bonn | Katie Conger | James Gung
Tim O’Gorman | Sameer Pradhan | Martha Palmer | Julia Bonn | Katie Conger | James Gung
The Boarnsterhim Corpus: A Bilingual Frisian-Dutch Panel and Trend Study
Marjoleine Sloos | Eduard Drenth | Wilbert Heeringa
Marjoleine Sloos | Eduard Drenth | Wilbert Heeringa
The French-Algerian Code-Switching Triggered audio corpus (FACST)
Amazouz Djegdjiga | Martine Adda-Decker | Lori Lamel
Amazouz Djegdjiga | Martine Adda-Decker | Lori Lamel
Strategies and Challenges for Crowdsourcing Regional Dialect Perception Data for Swiss German and Swiss French
Jean-Philippe Goldman | Simon Clematide | Mathieu Avanzi | Raphael Tandler
Jean-Philippe Goldman | Simon Clematide | Mathieu Avanzi | Raphael Tandler
Phonetically Balanced Code-Mixed Speech Corpus for Hindi-English Automatic Speech Recognition
Ayushi Pandey | Brij Mohan Lal Srivastava | Rohit Kumar | Bhanu Teja Nellore | Kasi Sai Teja | Suryakanth V. Gangashetty
Ayushi Pandey | Brij Mohan Lal Srivastava | Rohit Kumar | Bhanu Teja Nellore | Kasi Sai Teja | Suryakanth V. Gangashetty
Chinese-Portuguese Machine Translation: A Study on Building Parallel Corpora from Comparable Texts
Siyou Liu | Longyue Wang | Chao-Hong Liu
Siyou Liu | Longyue Wang | Chao-Hong Liu
Evaluating the WordsEye Text-to-Scene System: Imaginative and Realistic Sentences
Morgan Ulinski | Bob Coyne | Julia Hirschberg
Morgan Ulinski | Bob Coyne | Julia Hirschberg
Computer-assisted Speaker Diarization: How to Evaluate Human Corrections
Pierre-Alexandre Broux | David Doukhan | Simon Petitrenaud | Sylvain Meignier | Jean Carrive
Pierre-Alexandre Broux | David Doukhan | Simon Petitrenaud | Sylvain Meignier | Jean Carrive
Performance Impact Caused by Hidden Bias of Training Data for Recognizing Textual Entailment
Masatoshi Tsuchiya
Masatoshi Tsuchiya
A Corpus of Metaphor Novelty Scores for Syntactically-Related Word Pairs
Natalie Parde | Rodney Nielsen
Natalie Parde | Rodney Nielsen
Improving Hypernymy Extraction with Distributional Semantic Classes
Alexander Panchenko | Dmitry Ustalov | Stefano Faralli | Simone P. Ponzetto | Chris Biemann
Alexander Panchenko | Dmitry Ustalov | Stefano Faralli | Simone P. Ponzetto | Chris Biemann
Laying the Groundwork for Knowledge Base Population: Nine Years of Linguistic Resources for TAC KBP
Jeremy Getman | Joe Ellis | Stephanie Strassel | Zhiyi Song | Jennifer Tracey
Jeremy Getman | Joe Ellis | Stephanie Strassel | Zhiyi Song | Jennifer Tracey
A Dataset for Inter-Sentence Relation Extraction using Distant Supervision
Angrosh Mandya | Danushka Bollegala | Frans Coenen | Katie Atkinson
Angrosh Mandya | Danushka Bollegala | Frans Coenen | Katie Atkinson
Diacritics Restoration Using Neural Networks
Jakub Náplava | Milan Straka | Pavel Straňák | Jan Hajič
Jakub Náplava | Milan Straka | Pavel Straňák | Jan Hajič
Ensemble Romanian Dependency Parsing with Neural Networks
Radu Ion | Elena Irimia | Verginica Barbu Mititelu
Radu Ion | Elena Irimia | Verginica Barbu Mititelu
Collection of Multimodal Dialog Data and Analysis of the Result of Annotation of Users’ Interest Level
Masahiro Araki | Sayaka Tomimasu | Mikio Nakano | Kazunori Komatani | Shogo Okada | Shinya Fujie | Hiroaki Sugiyama
Masahiro Araki | Sayaka Tomimasu | Mikio Nakano | Kazunori Komatani | Shogo Okada | Shinya Fujie | Hiroaki Sugiyama
Recognizing Behavioral Factors while Driving: A Real-World Multimodal Corpus to Monitor the Driver’s Affective State
Alicia Lotz | Klas Ihme | Audrey Charnoz | Pantelis Maroudis | Ivan Dmitriev | Andreas Wendemuth
Alicia Lotz | Klas Ihme | Audrey Charnoz | Pantelis Maroudis | Ivan Dmitriev | Andreas Wendemuth
EmotionLines: An Emotion Corpus of Multi-Party Conversations
Chao-Chun Hsu | Sheng-Yeh Chen | Chuan-Chun Kuo | Ting-Hao Huang | Lun-Wei Ku
Chao-Chun Hsu | Sheng-Yeh Chen | Chuan-Chun Kuo | Ting-Hao Huang | Lun-Wei Ku
Academic-Industrial Perspective on the Development and Deployment of a Moderation System for a Newspaper Website
Dietmar Schabus | Marcin Skowron
Dietmar Schabus | Marcin Skowron
Community-Driven Crowdsourcing: Data Collection with Local Developers
Christina Funk | Michael Tseng | Ravindran Rajakumar | Linne Ha
Christina Funk | Michael Tseng | Ravindran Rajakumar | Linne Ha
Building Open Javanese and Sundanese Corpora for Multilingual Text-to-Speech
Jaka Aris Eko Wibawa | Supheakmungkol Sarin | Chenfang Li | Knot Pipatsrisawat | Keshan Sodimana | Oddur Kjartansson | Alexander Gutkin | Martin Jansche | Linne Ha
Jaka Aris Eko Wibawa | Supheakmungkol Sarin | Chenfang Li | Knot Pipatsrisawat | Keshan Sodimana | Oddur Kjartansson | Alexander Gutkin | Martin Jansche | Linne Ha
An Integrated Representation of Linguistic and Social Functions of Code-Switching
Silvana Hartmann | Monojit Choudhury | Kalika Bali
Silvana Hartmann | Monojit Choudhury | Kalika Bali
A Corpus of eRulemaking User Comments for Measuring Evaluability of Arguments
Joonsuk Park | Claire Cardie
Joonsuk Park | Claire Cardie
A Multi-layer Annotated Corpus of Argumentative Text: From Argument Schemes to Discourse Relations
Elena Musi | Tariq Alhindi | Manfred Stede | Leonard Kriese | Smaranda Muresan | Andrea Rocci
Elena Musi | Tariq Alhindi | Manfred Stede | Leonard Kriese | Smaranda Muresan | Andrea Rocci
Discourse Coherence Through the Lens of an Annotated Text Corpus: A Case Study
Eva Hajičová | Jiří Mírovský
Eva Hajičová | Jiří Mírovský
Automatic Prediction of Discourse Connectives
Eric Malmi | Daniele Pighin | Sebastian Krause | Mikhail Kozhevnikov
Eric Malmi | Daniele Pighin | Sebastian Krause | Mikhail Kozhevnikov
Handling Rare Word Problem using Synthetic Training Data for Sinhala and Tamil Neural Machine Translation
Pasindu Tennage | Prabath Sandaruwan | Malith Thilakarathne | Achini Herath | Surangika Ranathunga
Pasindu Tennage | Prabath Sandaruwan | Malith Thilakarathne | Achini Herath | Surangika Ranathunga
BDPROTO: A Database of Phonological Inventories from Ancient and Reconstructed Languages
Egidio Marsico | Sebastien Flavier | Annemarie Verkerk | Steven Moran
Egidio Marsico | Sebastien Flavier | Annemarie Verkerk | Steven Moran
Creating a Translation Matrix of the Bible’s Names Across 591 Languages
Winston Wu | Nidhi Vyas | David Yarowsky
Winston Wu | Nidhi Vyas | David Yarowsky
Building a Word Segmenter for Sanskrit Overnight
Vikas Reddy | Amrith Krishna | Vishnu Sharma | Prateek Gupta | Vineeth M R | Pawan Goyal
Vikas Reddy | Amrith Krishna | Vishnu Sharma | Prateek Gupta | Vineeth M R | Pawan Goyal
Simple Semantic Annotation and Situation Frames: Two Approaches to Basic Text Understanding in LORELEI
Kira Griffitt | Jennifer Tracey | Ann Bies | Stephanie Strassel
Kira Griffitt | Jennifer Tracey | Ann Bies | Stephanie Strassel
Abstract Meaning Representation of Constructions: The More We Include, the Better the Representation
Claire Bonial | Bianca Badarau | Kira Griffitt | Ulf Hermjakob | Kevin Knight | Tim O’Gorman | Martha Palmer | Nathan Schneider
Claire Bonial | Bianca Badarau | Kira Griffitt | Ulf Hermjakob | Kevin Knight | Tim O’Gorman | Martha Palmer | Nathan Schneider
Evaluating Scoped Meaning Representations
Rik van Noord | Lasha Abzianidze | Hessel Haagsma | Johan Bos
Rik van Noord | Lasha Abzianidze | Hessel Haagsma | Johan Bos
Huge Automatically Extracted Training-Sets for Multilingual Word SenseDisambiguation
Tommaso Pasini | Francesco Elia | Roberto Navigli
Tommaso Pasini | Francesco Elia | Roberto Navigli
A Survey on Automatically-Constructed WordNets and their Evaluation: Lexical and Word Embedding-based Approaches
Steven Neale
Steven Neale
Linguistically-driven Framework for Computationally Efficient and Scalable Sign Recognition
Dimitris Metaxas | Mark Dilsizian | Carol Neidle
Dimitris Metaxas | Mark Dilsizian | Carol Neidle
CONDUCT: An Expressive Conducting Gesture Dataset for Sound Control
Lei Chen | Sylvie Gibet | Camille Marteau
Lei Chen | Sylvie Gibet | Camille Marteau
MPST: A Corpus of Movie Plot Synopses with Tags
Sudipta Kar | Suraj Maharjan | A. Pastor López-Monroy | Thamar Solorio
Sudipta Kar | Suraj Maharjan | A. Pastor López-Monroy | Thamar Solorio
OpenSubtitles2018: Statistical Rescoring of Sentence Alignments in Large, Noisy Parallel Corpora
Pierre Lison | Jörg Tiedemann | Milen Kouylekov
Pierre Lison | Jörg Tiedemann | Milen Kouylekov
Building an Ellipsis-aware Chinese Dependency Treebank for Web Text
Xuancheng Ren | Xu Sun | Ji Wen | Bingzhen Wei | Weidong Zhan | Zhiyuan Zhang
Xuancheng Ren | Xu Sun | Ji Wen | Bingzhen Wei | Weidong Zhan | Zhiyuan Zhang
EuroGames16: Evaluating Change Detection in Online Conversation
Cyril Goutte | Yunli Wang | Fangming Liao | Zachary Zanussi | Samuel Larkin | Yuri Grinberg
Cyril Goutte | Yunli Wang | Fangming Liao | Zachary Zanussi | Samuel Larkin | Yuri Grinberg
A Deep Neural Network based Approach for Entity Extraction in Code-Mixed Indian Social Media Text
Deepak Gupta | Asif Ekbal | Pushpak Bhattacharyya
Deepak Gupta | Asif Ekbal | Pushpak Bhattacharyya
PoSTWITA-UD: an Italian Twitter Treebank in Universal Dependencies
Manuela Sanguinetti | Cristina Bosco | Alberto Lavelli | Alessandro Mazzei | Oronzo Antonelli | Fabio Tamburini
Manuela Sanguinetti | Cristina Bosco | Alberto Lavelli | Alessandro Mazzei | Oronzo Antonelli | Fabio Tamburini
Annotating If the Authors of a Tweet are Located at the Locations They Tweet About
Vivek Doudagiri | Alakananda Vempala | Eduardo Blanco
Vivek Doudagiri | Alakananda Vempala | Eduardo Blanco
MOCCA: Measure of Confidence for Corpus Analysis - Automatic Reliability Check of Transcript and Automatic Segmentation
Thomas Kisler | Florian Schiel
Thomas Kisler | Florian Schiel
Towards an ISO Standard for the Annotation of Quantification
Harry Bunt | James Pustejovsky | Kiyong Lee
Harry Bunt | James Pustejovsky | Kiyong Lee
Lightweight Grammatical Annotation in the TEI: New Perspectives
Piotr Bański | Susanne Haaf | Martin Mueller
Piotr Bański | Susanne Haaf | Martin Mueller
A Gold Standard for Multilingual Automatic Term Extraction from Comparable Corpora: Term Structure and Translation Equivalents
Ayla Rigouts Terryn | Véronique Hoste | Els Lefever
Ayla Rigouts Terryn | Véronique Hoste | Els Lefever
Handling Big Data and Sensitive Data Using EUDAT’s Generic Execution Framework and the WebLicht Workflow Engine.
Claus Zinn | Wei Qui | Marie Hinrichs | Emanuel Dima | Alexandr Chernov
Claus Zinn | Wei Qui | Marie Hinrichs | Emanuel Dima | Alexandr Chernov
Building a Web-Scale Dependency-Parsed Corpus from CommonCrawl
Alexander Panchenko | Eugen Ruppert | Stefano Faralli | Simone P. Ponzetto | Chris Biemann
Alexander Panchenko | Eugen Ruppert | Stefano Faralli | Simone P. Ponzetto | Chris Biemann
Universal Dependencies Version 2 for Japanese
Masayuki Asahara | Hiroshi Kanayama | Takaaki Tanaka | Yusuke Miyao | Sumire Uematsu | Shinsuke Mori | Yuji Matsumoto | Mai Omura | Yugo Murawaki
Masayuki Asahara | Hiroshi Kanayama | Takaaki Tanaka | Yusuke Miyao | Sumire Uematsu | Shinsuke Mori | Yuji Matsumoto | Mai Omura | Yugo Murawaki
A New Version of the Składnica Treebank of Polish Harmonised with the Walenty Valency Dictionary
Marcin Woliński | Elżbieta Hajnicz | Tomasz Bartosiak
Marcin Woliński | Elżbieta Hajnicz | Tomasz Bartosiak
Parse Me if You Can: Artificial Treebanks for Parsing Experiments on Elliptical Constructions
Kira Droganova | Daniel Zeman | Jenna Kanerva | Filip Ginter
Kira Droganova | Daniel Zeman | Jenna Kanerva | Filip Ginter
Semi-Automatic Construction of Word-Formation Networks (for Polish and Spanish)
Mateusz Lango | Magda Ševčíková | Zdeněk Žabokrtský
Mateusz Lango | Magda Ševčíková | Zdeněk Žabokrtský
UniMorph 2.0: Universal Morphology
Christo Kirov | Ryan Cotterell | John Sylak-Glassman | Géraldine Walther | Ekaterina Vylomova | Patrick Xia | Manaal Faruqui | Sabrina J. Mielke | Arya McCarthy | Sandra Kübler | David Yarowsky | Jason Eisner | Mans Hulden
Christo Kirov | Ryan Cotterell | John Sylak-Glassman | Géraldine Walther | Ekaterina Vylomova | Patrick Xia | Manaal Faruqui | Sabrina J. Mielke | Arya McCarthy | Sandra Kübler | David Yarowsky | Jason Eisner | Mans Hulden
A Computational Architecture for the Morphology of Upper Tanana
Olga Lovick | Christopher Cox | Miikka Silfverberg | Antti Arppe | Mans Hulden
Olga Lovick | Christopher Cox | Miikka Silfverberg | Antti Arppe | Mans Hulden
Expanding Abbreviations in a Strongly Inflected Language: Are Morphosyntactic Tags Sufficient?
Piotr Żelasko
Piotr Żelasko
A High-Quality Gold Standard for Citation-based Tasks
Michael Färber | Alexander Thiemann | Adam Jatowt
Michael Färber | Alexander Thiemann | Adam Jatowt
Measuring Innovation in Speech and Language Processing Publications.
Joseph Mariani | Gil Francopoulo | Patrick Paroubek
Joseph Mariani | Gil Francopoulo | Patrick Paroubek
PDFdigest: an Adaptable Layout-Aware PDF-to-XML Textual Content Extractor for Scientific Articles
Daniel Ferrés | Horacio Saggion | Francesco Ronzano | Àlex Bravo
Daniel Ferrés | Horacio Saggion | Francesco Ronzano | Àlex Bravo
Automatic Identification of Research Fields in Scientific Papers
Eric Kergosien | Amin Farvardin | Maguelonne Teisseire | Marie-Noëlle Bessagnet | Joachim Schöpfel | Stéphane Chaudiron | Bernard Jacquemin | Annig Lacayrelle | Mathieu Roche | Christian Sallaberry | Jean Philippe Tonneau
Eric Kergosien | Amin Farvardin | Maguelonne Teisseire | Marie-Noëlle Bessagnet | Joachim Schöpfel | Stéphane Chaudiron | Bernard Jacquemin | Annig Lacayrelle | Mathieu Roche | Christian Sallaberry | Jean Philippe Tonneau
Multilingual Extension of PDTB-Style Annotation: The Case of TED Multilingual Discourse Bank
Deniz Zeyrek | Amália Mendes | Murathan Kurfalı
Deniz Zeyrek | Amália Mendes | Murathan Kurfalı
Enhancing the AI2 Diagrams Dataset Using Rhetorical Structure Theory
Tuomo Hiippala | Serafina Orekhova
Tuomo Hiippala | Serafina Orekhova
QUD-Based Annotation of Discourse Structure and Information Structure: Tool and Evaluation
Kordula De Kuthy | Nils Reiter | Arndt Riester
Kordula De Kuthy | Nils Reiter | Arndt Riester
The Spot the Difference corpus: a multi-modal corpus of spontaneous task oriented spoken interactions
José Lopes | Nils Hemmingsson | Oliver Åstrand
José Lopes | Nils Hemmingsson | Oliver Åstrand
A Context-based Approach for Dialogue Act Recognition using Simple Recurrent Neural Networks
Chandrakant Bothe | Cornelius Weber | Sven Magg | Stefan Wermter
Chandrakant Bothe | Cornelius Weber | Sven Magg | Stefan Wermter
TreeAnnotator: Versatile Visual Annotation of Hierarchical Text Relations
Philipp Helfrich | Elias Rieb | Giuseppe Abrami | Andy Lücking | Alexander Mehler
Philipp Helfrich | Elias Rieb | Giuseppe Abrami | Andy Lücking | Alexander Mehler
Chats and Chunks: Annotation and Analysis of Multiparty Long Casual Conversations
Emer Gilmartin | Carl Vogel | Nick Campbell
Emer Gilmartin | Carl Vogel | Nick Campbell
Extending the gold standard for a lexical substitution task: is it worth it?
Ludovic Tanguy | Cécile Fabre | Laura Rivière
Ludovic Tanguy | Cécile Fabre | Laura Rivière
Lexical and Semantic Features for Cross-lingual Text Reuse Classification: an Experiment in English and Latin Paraphrases
Maria Moritz | David Steding
Maria Moritz | David Steding
Evaluation of Dictionary Creating Methods for Finno-Ugric Minority Languages
Zsanett Ferenczi | Iván Mittelholcz | Eszter Simon | Tamás Váradi
Zsanett Ferenczi | Iván Mittelholcz | Eszter Simon | Tamás Váradi
Dysarthric speech evaluation: automatic and perceptual approaches
Imed Laaridh | Christine Meunier | Corinne Fredouille
Imed Laaridh | Christine Meunier | Corinne Fredouille
Towards an Automatic Assessment of Crowdsourced Data for NLU
Patricia Braunger | Wolfgang Maier | Jan Wessling | Maria Schmidt
Patricia Braunger | Wolfgang Maier | Jan Wessling | Maria Schmidt
Visual Choice of Plausible Alternatives: An Evaluation of Image-based Commonsense Causal Reasoning
Jinyoung Yeo | Gyeongbok Lee | Gengyu Wang | Seungtaek Choi | Hyunsouk Cho | Reinald Kim Amplayo | Seung-won Hwang
Jinyoung Yeo | Gyeongbok Lee | Gengyu Wang | Seungtaek Choi | Hyunsouk Cho | Reinald Kim Amplayo | Seung-won Hwang
Is it worth it? Budget-related evaluation metrics for model selection
Filip Klubička | Giancarlo D. Salton | John D. Kelleher
Filip Klubička | Giancarlo D. Salton | John D. Kelleher
Matics Software Suite: New Tools for Evaluation and Data Exploration
Olivier Galibert | Guillaume Bernard | Agnes Delaborde | Sabrina Lecadre | Juliette Kahn
Olivier Galibert | Guillaume Bernard | Agnes Delaborde | Sabrina Lecadre | Juliette Kahn
MIsA: Multilingual “IsA” Extraction from Corpora
Stefano Faralli | Els Lefever | Simone Paolo Ponzetto
Stefano Faralli | Els Lefever | Simone Paolo Ponzetto
A supervised approach to taxonomy extraction using word embeddings
Rajdeep Sarkar | John P. McCrae | Paul Buitelaar
Rajdeep Sarkar | John P. McCrae | Paul Buitelaar
Korean TimeBank Including Relative Temporal Information
Chae-Gyun Lim | Young-Seob Jeong | Ho-Jin Choi
Chae-Gyun Lim | Young-Seob Jeong | Ho-Jin Choi
An Initial Test Collection for Ranked Retrieval of SMS Conversations
Rashmi Sankepally | Douglas W. Oard
Rashmi Sankepally | Douglas W. Oard
FrNewsLink : a corpus linking TV Broadcast News Segments and Press Articles
Nathalie Camelin | Géraldine Damnati | Abdessalam Bouchekif | Anais Landeau | Delphine Charlet | Yannick Estève
Nathalie Camelin | Géraldine Damnati | Abdessalam Bouchekif | Anais Landeau | Delphine Charlet | Yannick Estève
Towards Processing of the Oral History Interviews and Related Printed Documents
Zbyněk Zajíc | Lucie Skorkovská | Petr Neduchal | Pavel Ircing | Josef V. Psutka | Marek Hrúz | Aleš Pražák | Daniel Soutner | Jan Švec | Lukáš Bureš | Luděk Müller
Zbyněk Zajíc | Lucie Skorkovská | Petr Neduchal | Pavel Ircing | Josef V. Psutka | Marek Hrúz | Aleš Pražák | Daniel Soutner | Jan Švec | Lukáš Bureš | Luděk Müller
The Effects of Unimodal Representation Choices on Multimodal Learning
Fernando Tadao Ito | Helena de Medeiros Caseli | Jander Moreira
Fernando Tadao Ito | Helena de Medeiros Caseli | Jander Moreira
The WAW Corpus: The First Corpus of Interpreted Speeches and their Translations for English and Arabic
Ahmed Abdelali | Irina Temnikova | Samy Hedaya | Stephan Vogel
Ahmed Abdelali | Irina Temnikova | Samy Hedaya | Stephan Vogel
Action Verb Corpus
Stephanie Gross | Matthias Hirschmanner | Brigitte Krenn | Friedrich Neubarth | Michael Zillich
Stephanie Gross | Matthias Hirschmanner | Brigitte Krenn | Friedrich Neubarth | Michael Zillich
EMO&LY (EMOtion and AnomaLY) : A new corpus for anomaly detection in an audiovisual stream with emotional context.
Cédric Fayet | Arnaud Delhay | Damien Lolive | Pierre-François Marteau
Cédric Fayet | Arnaud Delhay | Damien Lolive | Pierre-François Marteau
Development of an Annotated Multimodal Dataset for the Investigation of Classification and Summarisation of Presentations using High-Level Paralinguistic Features
Keith Curtis | Nick Campbell | Gareth Jones
Keith Curtis | Nick Campbell | Gareth Jones
GeCoTagger: Annotation of German Verb Complements with Conditional Random Fields
Roman Schneider | Monica Fürbacher
Roman Schneider | Monica Fürbacher
AET: Web-based Adjective Exploration Tool for German
Tatiana Bladier | Esther Seyffarth | Oliver Hellwig | Wiebke Petersen
Tatiana Bladier | Esther Seyffarth | Oliver Hellwig | Wiebke Petersen
Palmyra: A Platform Independent Dependency Annotation Tool for Morphologically Rich Languages
Talha Javed | Nizar Habash | Dima Taji
Talha Javed | Nizar Habash | Dima Taji
Building Universal Dependency Treebanks in Korean
Jayeol Chun | Na-Rae Han | Jena D. Hwang | Jinho D. Choi
Jayeol Chun | Na-Rae Han | Jena D. Hwang | Jinho D. Choi
Multilingual Dependency Parsing for Low-Resource Languages: Case Studies on North Saami and Komi-Zyrian
KyungTae Lim | Niko Partanen | Thierry Poibeau
KyungTae Lim | Niko Partanen | Thierry Poibeau
FonBund: A Library for Combining Cross-lingual Phonological Segment Data
Alexander Gutkin | Martin Jansche | Tatiana Merkulova
Alexander Gutkin | Martin Jansche | Tatiana Merkulova
Voice Builder: A Tool for Building Text-To-Speech Voices
Pasindu De Silva | Theeraphol Wattanavekin | Tang Hao | Knot Pipatsrisawat
Pasindu De Silva | Theeraphol Wattanavekin | Tang Hao | Knot Pipatsrisawat
Sudachi: a Japanese Tokenizer for Business
Kazuma Takaoka | Sorami Hisamoto | Noriko Kawahara | Miho Sakamoto | Yoshitaka Uchida | Yuji Matsumoto
Kazuma Takaoka | Sorami Hisamoto | Noriko Kawahara | Miho Sakamoto | Yoshitaka Uchida | Yuji Matsumoto
Chemical Compounds Knowledge Visualization with Natural Language Processing and Linked Data
Kazunari Tanaka | Tomoya Iwakura | Yusuke Koyanagi | Noriko Ikeda | Hiroyuki Shindo | Yuji Matsumoto
Kazunari Tanaka | Tomoya Iwakura | Yusuke Koyanagi | Noriko Ikeda | Hiroyuki Shindo | Yuji Matsumoto
Using Discourse Information for Education with a Spanish-Chinese Parallel Corpus
Shuyuan Cao | Harritxu Gete
Shuyuan Cao | Harritxu Gete
A 2nd Longitudinal Corpus for Children’s Writing with Enhanced Output for Specific Spelling Patterns
Kay Berkling
Kay Berkling
Development of a Mobile Observation Support System for Students: FishWatchr Mini
Masaya Yamaguchi | Masanori Kitamura | Naomi Yanagida
Masaya Yamaguchi | Masanori Kitamura | Naomi Yanagida
The AnnCor CHILDES Treebank
Jan Odijk | Alexis Dimitriadis | Martijn van der Klis | Marjo van Koppen | Meie Otten | Remco van der Veen
Jan Odijk | Alexis Dimitriadis | Martijn van der Klis | Marjo van Koppen | Meie Otten | Remco van der Veen
BabyCloud, a Technological Platform for Parents and Researchers
Xuân-Nga Cao | Cyrille Dakhlia | Patricia Del Carmen | Mohamed-Amine Jaouani | Malik Ould-Arbi | Emmanuel Dupoux
Xuân-Nga Cao | Cyrille Dakhlia | Patricia Del Carmen | Mohamed-Amine Jaouani | Malik Ould-Arbi | Emmanuel Dupoux
Infant Word Comprehension-to-Production Index Applied to Investigation of Noun Learning Predominance Using Cross-lingual CDI database
Yasuhiro Minami | Tessei Kobayashi | Yuko Okumura
Yasuhiro Minami | Tessei Kobayashi | Yuko Okumura
Building a TOCFL Learner Corpus for Chinese Grammatical Error Diagnosis
Lung-Hao Lee | Yuen-Hsien Tseng | Li-Ping Chang
Lung-Hao Lee | Yuen-Hsien Tseng | Li-Ping Chang
MIAPARLE: Online training for the discrimination of stress contrasts
Jean-Philippe Goldman | Sandra Schwab
Jean-Philippe Goldman | Sandra Schwab
A Leveled Reading Corpus of Modern Standard Arabic
Muhamed Al Khalil | Hind Saddiki | Nizar Habash | Latifa Alfalasi
Muhamed Al Khalil | Hind Saddiki | Nizar Habash | Latifa Alfalasi
Developing New Linguistic Resources and Tools for the Galician Language
Rodrigo Agerri | Xavier Gómez Guinovart | German Rigau | Miguel Anxo Solla Portela
Rodrigo Agerri | Xavier Gómez Guinovart | German Rigau | Miguel Anxo Solla Portela
Modeling Northern Haida Verb Morphology
Jordan Lachler | Lene Antonsen | Trond Trosterud | Sjur Moshagen | Antti Arppe
Jordan Lachler | Lene Antonsen | Trond Trosterud | Sjur Moshagen | Antti Arppe
Low-resource Post Processing of Noisy OCR Output for Historical Corpus Digitisation
Caitlin Richter | Matthew Wickes | Deniz Beser | Mitch Marcus
Caitlin Richter | Matthew Wickes | Deniz Beser | Mitch Marcus
Introducing the CLARIN Knowledge Centre for Linguistic Diversity and Language Documentation
Hanna Hedeland | Timm Lehmberg | Felix Rau | Sophie Salffner | Mandana Seyfeddinipur | Andreas Witt
Hanna Hedeland | Timm Lehmberg | Felix Rau | Sophie Salffner | Mandana Seyfeddinipur | Andreas Witt
SB-CH: A Swiss German Corpus with Sentiment Annotations
Ralf Grubenmann | Don Tuggener | Pius von Däniken | Jan Deriu | Mark Cieliebak
Ralf Grubenmann | Don Tuggener | Pius von Däniken | Jan Deriu | Mark Cieliebak
Signbank: Software to Support Web Based Dictionaries of Sign Language
Steve Cassidy | Onno Crasborn | Henri Nieminen | Wessel Stoop | Micha Hulsbosch | Susan Even | Erwin Komen | Trevor Johnston
Steve Cassidy | Onno Crasborn | Henri Nieminen | Wessel Stoop | Micha Hulsbosch | Susan Even | Erwin Komen | Trevor Johnston
J-MeDic: A Japanese Disease Name Dictionary based on Real Clinical Usage
Kaoru Ito | Hiroyuki Nagai | Taro Okahisa | Shoko Wakamiya | Tomohide Iwao | Eiji Aramaki
Kaoru Ito | Hiroyuki Nagai | Taro Okahisa | Shoko Wakamiya | Tomohide Iwao | Eiji Aramaki
Building a List of Synonymous Words and Phrases of Japanese Compound Verbs
Kyoko Kanzaki | Hitoshi Isahara
Kyoko Kanzaki | Hitoshi Isahara
A Danish FrameNet Lexicon and an Annotated Corpus Used for Training and Evaluating a Semantic Frame Classifier
Bolette Pedersen | Sanni Nimb | Anders Søgaard | Mareike Hartmann | Sussi Olsen
Bolette Pedersen | Sanni Nimb | Anders Søgaard | Mareike Hartmann | Sussi Olsen
SLIDE - a Sentiment Lexicon of Common Idioms
Charles Jochim | Francesca Bonin | Roy Bar-Haim | Noam Slonim
Charles Jochim | Francesca Bonin | Roy Bar-Haim | Noam Slonim
Teanga: A Linked Data based platform for Natural Language Processing
Housam Ziad | John P. McCrae | Paul Buitelaar
Housam Ziad | John P. McCrae | Paul Buitelaar
Automatic and Manual Web Annotations in an Infrastructure to handle Fake News and other Online Media Phenomena
Georg Rehm | Julian Moreno-Schneider | Peter Bourgonje
Georg Rehm | Julian Moreno-Schneider | Peter Bourgonje
The LODeXporter: Flexible Generation of Linked Open Data Triples from NLP Frameworks for Automatic Knowledge Base Construction
René Witte | Bahar Sateli
René Witte | Bahar Sateli
LiDo RDF: From a Relational Database to a Linked Data Graph of Linguistic Terms and Bibliographic Data
Bettina Klimek | Robert Schädlich | Dustin Kröger | Edwin Knese | Benedikt Elßmann
Bettina Klimek | Robert Schädlich | Dustin Kröger | Edwin Knese | Benedikt Elßmann
Towards a Linked Open Data Edition of Sumerian Corpora
Christian Chiarcos | Émilie Pagé-Perron | Ilya Khait | Niko Schenk | Lucas Reckling
Christian Chiarcos | Émilie Pagé-Perron | Ilya Khait | Niko Schenk | Lucas Reckling
PMKI: an European Commission action for the interoperability, maintainability and sustainability of Language Resources
Peter Schmitz | Enrico Francesconi | Najeh Hajlaoui | Brahim Batouche
Peter Schmitz | Enrico Francesconi | Najeh Hajlaoui | Brahim Batouche
Collecting Language Resources from Public Administrations in the Nordic and Baltic Countries
Andrejs Vasiļjevs | Rihards Kalniņš | Roberts Rozis | Aivars Bērziņš
Andrejs Vasiļjevs | Rihards Kalniņš | Roberts Rozis | Aivars Bērziņš
LIdioms: A Multilingual Linked Idioms Data Set
Diego Moussallem | Mohamed Ahmed Sherif | Diego Esteves | Marcos Zampieri | Axel-Cyrille Ngonga Ngomo
Diego Moussallem | Mohamed Ahmed Sherif | Diego Esteves | Marcos Zampieri | Axel-Cyrille Ngonga Ngomo
Annotating Modality Expressions and Event Factuality for a Japanese Chess Commentary Corpus
Suguru Matsuyoshi | Hirotaka Kameko | Yugo Murawaki | Shinsuke Mori
Suguru Matsuyoshi | Hirotaka Kameko | Yugo Murawaki | Shinsuke Mori
Annotating Chinese Light Verb Constructions according to PARSEME guidelines
Menghan Jiang | Natalia Klyueva | Hongzhi Xu | Chu-Ren Huang
Menghan Jiang | Natalia Klyueva | Hongzhi Xu | Chu-Ren Huang
Using English Baits to Catch Serbian Multi-Word Terminology
Cvetana Krstev | Branislava Šandrih | Ranka Stanković | Miljana Mladenović
Cvetana Krstev | Branislava Šandrih | Ranka Stanković | Miljana Mladenović
Construction of Large-scale English Verbal Multiword Expression Annotated Corpus
Akihiko Kato | Hiroyuki Shindo | Yuji Matsumoto
Akihiko Kato | Hiroyuki Shindo | Yuji Matsumoto
Konbitzul: an MWE-specific database for Spanish-Basque
Uxoa Iñurrieta | Itziar Aduriz | Arantza Díaz de Ilarraza | Gorka Labaka | Kepa Sarasola
Uxoa Iñurrieta | Itziar Aduriz | Arantza Díaz de Ilarraza | Gorka Labaka | Kepa Sarasola
A Multilingual Test Collection for the Semantic Search of Entity Categories
Juliano Efson Sales | Siamak Barzegar | Wellington Franco | Bernhard Bermeitinger | Tiago Cunha | Brian Davis | André Freitas | Siegfried Handschuh
Juliano Efson Sales | Siamak Barzegar | Wellington Franco | Bernhard Bermeitinger | Tiago Cunha | Brian Davis | André Freitas | Siegfried Handschuh
Towards the Inference of Semantic Relations in Complex Nominals: a Pilot Study
Melania Cabezas-García | Pilar León-Araúz
Melania Cabezas-García | Pilar León-Araúz
Generation of a Spanish Artificial Collocation Error Corpus
Sara Rodríguez-Fernández | Roberto Carlini | Leo Wanner
Sara Rodríguez-Fernández | Roberto Carlini | Leo Wanner
Improving a Neural-based Tagger for Multiword Expressions Identification
Dušan Variš | Natalia Klyueva
Dušan Variš | Natalia Klyueva
DeepTC – An Extension of DKPro Text Classification for Fostering Reproducibility of Deep Learning Experiments
Tobias Horsmann | Torsten Zesch
Tobias Horsmann | Torsten Zesch
Improving Hate Speech Detection with Deep Learning Ensembles
Steven Zimmerman | Udo Kruschwitz | Chris Fox
Steven Zimmerman | Udo Kruschwitz | Chris Fox
Semantic Relatedness of Wikipedia Concepts – Benchmark Data and a Working Solution
Liat Ein Dor | Alon Halfon | Yoav Kantor | Ran Levy | Yosi Mass | Ruty Rinott | Eyal Shnarch | Noam Slonim
Liat Ein Dor | Alon Halfon | Yoav Kantor | Ran Levy | Yosi Mass | Ruty Rinott | Eyal Shnarch | Noam Slonim
Experiments with Convolutional Neural Networks for Multi-Label Authorship Attribution
Dainis Boumber | Yifan Zhang | Arjun Mukherjee
Dainis Boumber | Yifan Zhang | Arjun Mukherjee
A Fast and Accurate Vietnamese Word Segmenter
Dat Quoc Nguyen | Dai Quoc Nguyen | Thanh Vu | Mark Dras | Mark Johnson
Dat Quoc Nguyen | Dai Quoc Nguyen | Thanh Vu | Mark Dras | Mark Johnson
Finite-state morphological analysis for Gagauz
Sevilay Bayatli | Güllü Karanfil | Memduh Gökırmak | Francis M. Tyers
Sevilay Bayatli | Güllü Karanfil | Memduh Gökırmak | Francis M. Tyers
Morphology Injection for English-Malayalam Statistical Machine Translation
Sreelekha S | Pushpak Bhattacharyya
Sreelekha S | Pushpak Bhattacharyya
The Morpho-syntactic Annotation of Animacy for a Dependency Parser
Mohammed Attia | Vitaly Nikolaev | Ali Elkahky
Mohammed Attia | Vitaly Nikolaev | Ali Elkahky
MADARi: A Web Interface for Joint Arabic Morphological Annotation and Spelling Correction
Ossama Obeid | Salam Khalifa | Nizar Habash | Houda Bouamor | Wajdi Zaghouani | Kemal Oflazer
Ossama Obeid | Salam Khalifa | Nizar Habash | Houda Bouamor | Wajdi Zaghouani | Kemal Oflazer
Universal Morphologies for the Caucasus region
Christian Chiarcos | Kathrin Donandt | Maxim Ionov | Monika Rind-Pawlowski | Hasmik Sargsian | Jesse Wichers Schreur | Frank Abromeit | Christian Fäth
Christian Chiarcos | Kathrin Donandt | Maxim Ionov | Monika Rind-Pawlowski | Hasmik Sargsian | Jesse Wichers Schreur | Frank Abromeit | Christian Fäth
EMTC: Multilabel Corpus in Movie Domain for Emotion Analysis in Conversational Text
Duc-Anh Phan | Yuji Matsumoto
Duc-Anh Phan | Yuji Matsumoto
Complex and Precise Movie and Book Annotations in French Language for Aspect Based Sentiment Analysis
Stefania Pecore | Jeanne Villaneau
Stefania Pecore | Jeanne Villaneau
Lingmotif-lex: a Wide-coverage, State-of-the-art Lexicon for Sentiment Analysis
Antonio Moreno-Ortiz | Chantal Pérez-Hernández
Antonio Moreno-Ortiz | Chantal Pérez-Hernández
FooTweets: A Bilingual Parallel Corpus of World Cup Tweets
Henny Sluyter-Gäthje | Pintu Lohar | Haithem Afli | Andy Way
Henny Sluyter-Gäthje | Pintu Lohar | Haithem Afli | Andy Way
The SSIX Corpora: Three Gold Standard Corpora for Sentiment Analysis in English, Spanish and German Financial Microblogs
Thomas Gaillat | Manel Zarrouk | André Freitas | Brian Davis
Thomas Gaillat | Manel Zarrouk | André Freitas | Brian Davis
Sarcasm Target Identification: Dataset and An Introductory Approach
Aditya Joshi | Pranav Goel | Pushpak Bhattacharyya | Mark Carman
Aditya Joshi | Pranav Goel | Pushpak Bhattacharyya | Mark Carman
Annotating Opinions and Opinion Targets in Student Course Feedback
Janaka Chathuranga | Shanika Ediriweera | Ravindu Hasantha | Pranidhith Munasinghe | Surangika Ranathunga
Janaka Chathuranga | Shanika Ediriweera | Ravindu Hasantha | Pranidhith Munasinghe | Surangika Ranathunga
Generating a Gold Standard for a Swedish Sentiment Lexicon
Jacobo Rouces | Nina Tahmasebi | Lars Borin | Stian Rødven Eide
Jacobo Rouces | Nina Tahmasebi | Lars Borin | Stian Rødven Eide
WordKit: a Python Package for Orthographic and Phonological Featurization
Stéphan Tulkens | Dominiek Sandra | Walter Daelemans
Stéphan Tulkens | Dominiek Sandra | Walter Daelemans
Pronunciation Variants and ASR of Colloquial Speech: A Case Study on Czech
David Lukeš | Marie Kopřivová | Zuzana Komrsková | Petra Poukarová
David Lukeš | Marie Kopřivová | Zuzana Komrsková | Petra Poukarová
A Multilingual Approach to Question Classification
Aikaterini-Lida Kalouli | Katharina Kaiser | Annette Hautli-Janisz | Georg A. Kaiser | Miriam Butt
Aikaterini-Lida Kalouli | Katharina Kaiser | Annette Hautli-Janisz | Georg A. Kaiser | Miriam Butt
Dataset for the First Evaluation on Chinese Machine Reading Comprehension
Yiming Cui | Ting Liu | Zhipeng Chen | Wentao Ma | Shijin Wang | Guoping Hu
Yiming Cui | Ting Liu | Zhipeng Chen | Wentao Ma | Shijin Wang | Guoping Hu
A Multi-Domain Framework for Textual Similarity. A Case Study on Question-to-Question and Question-Answering Similarity Tasks
Amir Hazem | Basma El Amal Boussaha | Nicolas Hernandez
Amir Hazem | Basma El Amal Boussaha | Nicolas Hernandez
WorldTree: A Corpus of Explanation Graphs for Elementary Science Questions supporting Multi-hop Inference
Peter Jansen | Elizabeth Wainwright | Steven Marmorstein | Clayton Morrison
Peter Jansen | Elizabeth Wainwright | Steven Marmorstein | Clayton Morrison
Analysis of Implicit Conditions in Database Search Dialogues
Shun-ya Fukunaga | Hitoshi Nishikawa | Takenobu Tokunaga | Hikaru Yokono | Tetsuro Takahashi
Shun-ya Fukunaga | Hitoshi Nishikawa | Takenobu Tokunaga | Hikaru Yokono | Tetsuro Takahashi
An Information-Providing Closed-Domain Human-Agent Interaction Corpus
Jelte van Waterschoot | Guillaume Dubuisson Duplessis | Lorenzo Gatti | Merijn Bruijnes | Dirk Heylen
Jelte van Waterschoot | Guillaume Dubuisson Duplessis | Lorenzo Gatti | Merijn Bruijnes | Dirk Heylen
Augmenting Image Question Answering Dataset by Exploiting Image Captions
Masashi Yokota | Hideki Nakayama
Masashi Yokota | Hideki Nakayama
Semi-supervised Training Data Generation for Multilingual Question Answering
Kyungjae Lee | Kyoungho Yoon | Sunghyun Park | Seung-won Hwang
Kyungjae Lee | Kyoungho Yoon | Sunghyun Park | Seung-won Hwang
PhotoshopQuiA: A Corpus of Non-Factoid Questions and Answers for Why-Question Answering
Andrei Dulceanu | Thang Le Dinh | Walter Chang | Trung Bui | Doo Soon Kim | Manh Chien Vu | Seokhwan Kim
Andrei Dulceanu | Thang Le Dinh | Walter Chang | Trung Bui | Doo Soon Kim | Manh Chien Vu | Seokhwan Kim
BioRead: A New Dataset for Biomedical Reading Comprehension
Dimitris Pappas | Ion Androutsopoulos | Haris Papageorgiou
Dimitris Pappas | Ion Androutsopoulos | Haris Papageorgiou
MMQA: A Multi-domain Multi-lingual Question-Answering Framework for English and Hindi
Deepak Gupta | Surabhi Kumari | Asif Ekbal | Pushpak Bhattacharyya
Deepak Gupta | Surabhi Kumari | Asif Ekbal | Pushpak Bhattacharyya
Medical Sentiment Analysis using Social Media: Towards building a Patient Assisted System
Shweta Yadav | Asif Ekbal | Sriparna Saha | Pushpak Bhattacharyya
Shweta Yadav | Asif Ekbal | Sriparna Saha | Pushpak Bhattacharyya
An Italian Twitter Corpus of Hate Speech against Immigrants
Manuela Sanguinetti | Fabio Poletto | Cristina Bosco | Viviana Patti | Marco Stranisci
Manuela Sanguinetti | Fabio Poletto | Cristina Bosco | Viviana Patti | Marco Stranisci
A Large Multilingual and Multi-domain Dataset for Recommender Systems
Giorgia Di Tommaso | Stefano Faralli | Paola Velardi
Giorgia Di Tommaso | Stefano Faralli | Paola Velardi
RtGender: A Corpus for Studying Differential Responses to Gender
Rob Voigt | David Jurgens | Vinodkumar Prabhakaran | Dan Jurafsky | Yulia Tsvetkov
Rob Voigt | David Jurgens | Vinodkumar Prabhakaran | Dan Jurafsky | Yulia Tsvetkov
A Neural Network Model for Part-Of-Speech Tagging of Social Media Texts
Sara Meftah | Nasredine Semmar
Sara Meftah | Nasredine Semmar
Utilizing Large Twitter Corpora to Create Sentiment Lexica
Valerij Fredriksen | Brage Jahren | Björn Gambäck
Valerij Fredriksen | Brage Jahren | Björn Gambäck
The Nautilus Speaker Characterization Corpus: Speech Recordings and Labels of Speaker Characteristics and Voice Descriptions
Laura Fernández Gallardo | Benjamin Weiss
Laura Fernández Gallardo | Benjamin Weiss
Design and Development of Speech Corpora for Air Traffic Control Training
Luboš Šmídl | Jan Švec | Daniel Tihelka | Jindřich Matoušek | Jan Romportl | Pavel Ircing
Luboš Šmídl | Jan Švec | Daniel Tihelka | Jindřich Matoušek | Jan Romportl | Pavel Ircing
A First South African Corpus of Multilingual Code-switched Soap Opera Speech
Ewald van der Westhuizen | Thomas Niesler
Ewald van der Westhuizen | Thomas Niesler
A Web Service for Pre-segmenting Very Long Transcribed Speech Recordings
Nina Poerner | Florian Schiel
Nina Poerner | Florian Schiel
A Real-life, French-accented Corpus of Air Traffic Control Communications
Estelle Delpech | Marion Laignelet | Christophe Pimm | Céline Raynal | Michal Trzos | Alexandre Arnold | Dominique Pronto
Estelle Delpech | Marion Laignelet | Christophe Pimm | Céline Raynal | Michal Trzos | Alexandre Arnold | Dominique Pronto
Creating Lithuanian and Latvian Speech Corpora from Inaccurately Annotated Web Data
Askars Salimbajevs
Askars Salimbajevs
Discovering Canonical Indian English Accents: A Crowdsourcing-based Approach
Sunayana Sitaram | Varun Manjunath | Varun Bharadwaj | Monojit Choudhury | Kalika Bali | Michael Tjalve
Sunayana Sitaram | Varun Manjunath | Varun Bharadwaj | Monojit Choudhury | Kalika Bali | Michael Tjalve
Extending Search System based on Interactive Visualization for Speech Corpora
Tomoko Ohsuga | Yuichi Ishimoto | Tomoko Kajiyama | Shunsuke Kozawa | Kiyotaka Uchimoto | Shuichi Itahashi
Tomoko Ohsuga | Yuichi Ishimoto | Tomoko Kajiyama | Shunsuke Kozawa | Kiyotaka Uchimoto | Shuichi Itahashi
German Radio Interviews: The GRAIN Release of the SFB732 Silver Standard Collection
Katrin Schweitzer | Kerstin Eckart | Markus Gärtner | Agnieszka Falenska | Arndt Riester | Ina Rösiger | Antje Schweitzer | Sabrina Stehwien | Jonas Kuhn
Katrin Schweitzer | Kerstin Eckart | Markus Gärtner | Agnieszka Falenska | Arndt Riester | Ina Rösiger | Antje Schweitzer | Sabrina Stehwien | Jonas Kuhn
Preparing Data from Psychotherapy for Natural Language Processing
Margot Mieskes | Andreas Stiegelmayr
Margot Mieskes | Andreas Stiegelmayr
MirasVoice: A bilingual (English-Persian) speech corpus
Amir Vaheb | Ali Janalizadeh Choobbasti | S.H.E. Mortazavi Najafabadi | Saeid Safavi | Behnam Sabeti
Amir Vaheb | Ali Janalizadeh Choobbasti | S.H.E. Mortazavi Najafabadi | Saeid Safavi | Behnam Sabeti
Japanese Dialogue Corpus of Information Navigation and Attentive Listening Annotated with Extended ISO-24617-2 Dialogue Act Tags
Koichiro Yoshino | Hiroki Tanaka | Kyoshiro Sugiyama | Makoto Kondo | Satoshi Nakamura
Koichiro Yoshino | Hiroki Tanaka | Kyoshiro Sugiyama | Makoto Kondo | Satoshi Nakamura
The Niki and Julie Corpus: Collaborative Multimodal Dialogues between Humans, Robots, and Virtual Agents
Ron Artstein | Jill Boberg | Alesia Gainer | Jonathan Gratch | Emmanuel Johnson | Anton Leuski | Gale Lucas | David Traum
Ron Artstein | Jill Boberg | Alesia Gainer | Jonathan Gratch | Emmanuel Johnson | Anton Leuski | Gale Lucas | David Traum
Constructing a Chinese Medical Conversation Corpus Annotated with Conversational Structures and Actions
Nan Wang | Yan Song | Fei Xia
Nan Wang | Yan Song | Fei Xia
Modeling Collaborative Multimodal Behavior in Group Dialogues: The MULTISIMO Corpus
Maria Koutsombogera | Carl Vogel
Maria Koutsombogera | Carl Vogel
A Semi-autonomous System for Creating a Human-Machine Interaction Corpus in Virtual Reality: Application to the ACORFORMed System for Training Doctors to Break Bad News
Magalie Ochs | Philippe Blache | Grégoire de Montcheuil | Jean-Marie Pergandi | Jorane Saubesty | Daniel Francon | Daniel Mestre
Magalie Ochs | Philippe Blache | Grégoire de Montcheuil | Jean-Marie Pergandi | Jorane Saubesty | Daniel Francon | Daniel Mestre
Construction of English-French Multimodal Affective Conversational Corpus from TV Dramas
Sashi Novitasari | Quoc Truong Do | Sakriani Sakti | Dessi Lestari | Satoshi Nakamura
Sashi Novitasari | Quoc Truong Do | Sakriani Sakti | Dessi Lestari | Satoshi Nakamura
QUEST: A Natural Language Interface to Relational Databases
Vadim Sheinin | Elahe Khorashani | Hangu Yeo | Kun Xu | Ngoc Phuoc An Vo | Octavian Popescu
Vadim Sheinin | Elahe Khorashani | Hangu Yeo | Kun Xu | Ngoc Phuoc An Vo | Octavian Popescu
Grapheme-level Awareness in Word Embeddings for Morphologically Rich Languages
Suzi Park | Hyopil Shin
Suzi Park | Hyopil Shin
Building a Constraint Grammar Parser for Plains Cree Verbs and Arguments
Katherine Schmirler | Antti Arppe | Trond Trosterud | Lene Antonsen
Katherine Schmirler | Antti Arppe | Trond Trosterud | Lene Antonsen
BPEmb: Tokenization-free Pre-trained Subword Embeddings in 275 Languages
Benjamin Heinzerling | Michael Strube
Benjamin Heinzerling | Michael Strube
Reference production in human-computer interaction: Issues for Corpus-based Referring Expression Generation
Danillo Rocha | Ivandré Paraboni
Danillo Rocha | Ivandré Paraboni
Definite Description Lexical Choice: taking Speaker’s Personality into account
Alex Lan | Ivandré Paraboni
Alex Lan | Ivandré Paraboni
Incorporating Semantic Attention in Video Description Generation
Natsuda Laokulrat | Naoaki Okazaki | Hideki Nakayama
Natsuda Laokulrat | Naoaki Okazaki | Hideki Nakayama
GenDR: A Generic Deep Realizer with Complex Lexicalization
François Lareau | Florie Lambrey | Ieva Dubinskaite | Daniel Galarreta-Piquette | Maryam Nejat
François Lareau | Florie Lambrey | Ieva Dubinskaite | Daniel Galarreta-Piquette | Maryam Nejat
A Detailed Evaluation of Neural Sequence-to-Sequence Models for In-domain and Cross-domain Text Simplification
Sanja Štajner | Sergiu Nisioi
Sanja Štajner | Sergiu Nisioi
Don’t Annotate, but Validate: a Data-to-Text Method for Capturing Event Data
Piek Vossen | Filip Ilievski | Marten Postma | Roxane Segers
Piek Vossen | Filip Ilievski | Marten Postma | Roxane Segers
RDF2PT: Generating Brazilian Portuguese Texts from RDF Data
Diego Moussallem | Thiago Ferreira | Marcos Zampieri | Maria Claudia Cavalcanti | Geraldo Xexéo | Mariana Neves | Axel-Cyrille Ngonga Ngomo
Diego Moussallem | Thiago Ferreira | Marcos Zampieri | Maria Claudia Cavalcanti | Geraldo Xexéo | Mariana Neves | Axel-Cyrille Ngonga Ngomo
Neural Models of Selectional Preferences for Implicit Semantic Role Labeling
Minh Le | Antske Fokkens
Minh Le | Antske Fokkens
A database of German definitory contexts from selected web sources
Adrien Barbaresi | Lothar Lemnitzer | Alexander Geyken
Adrien Barbaresi | Lothar Lemnitzer | Alexander Geyken
Annotating Abstract Meaning Representations for Spanish
Noelia Migueles-Abraira | Rodrigo Agerri | Arantza Diaz de Ilarraza
Noelia Migueles-Abraira | Rodrigo Agerri | Arantza Diaz de Ilarraza
Browsing the Terminological Structure of a Specialized Domain: A Method Based on Lexical Functions and their Classification
Marie-Claude L’Homme | Benoît Robichaud | Nathalie Prévil
Marie-Claude L’Homme | Benoît Robichaud | Nathalie Prévil
Rollenwechsel-English: a large-scale semantic role corpus
Asad Sayeed | Pavel Shkadzko | Vera Demberg
Asad Sayeed | Pavel Shkadzko | Vera Demberg
Towards a Standardized Dataset for Noun Compound Interpretation
Girishkumar Ponkiya | Kevin Patel | Pushpak Bhattacharyya | Girish K Palshikar
Girishkumar Ponkiya | Kevin Patel | Pushpak Bhattacharyya | Girish K Palshikar
NL2Bash: A Corpus and Semantic Parser for Natural Language Interface to the Linux Operating System
Xi Victoria Lin | Chenglong Wang | Luke Zettlemoyer | Michael D. Ernst
Xi Victoria Lin | Chenglong Wang | Luke Zettlemoyer | Michael D. Ernst
World Knowledge for Abstract Meaning Representation Parsing
Charles Welch | Jonathan K. Kummerfeld | Song Feng | Rada Mihalcea
Charles Welch | Jonathan K. Kummerfeld | Song Feng | Rada Mihalcea
Improved Transcription and Indexing of Oral History Interviews for Digital Humanities Research
Michael Gref | Joachim Köhler | Almut Leh
Michael Gref | Joachim Köhler | Almut Leh
Sound Signal Processing with Seq2Tree Network
Weicheng Ma | Kai Cao | Zhaoheng Ni | Peter Chin | Xiang Li
Weicheng Ma | Kai Cao | Zhaoheng Ni | Peter Chin | Xiang Li
Open ASR for Icelandic: Resources and a Baseline System
Anna Björk Nikulásdóttir | Inga Rún Helgadóttir | Matthías Pétursson | Jón Guðnason
Anna Björk Nikulásdóttir | Inga Rún Helgadóttir | Matthías Pétursson | Jón Guðnason
Towards Neural Speaker Modeling in Multi-Party Conversation: The Task, Dataset, and Models
Zhao Meng | Lili Mou | Zhi Jin
Zhao Meng | Lili Mou | Zhi Jin
Discriminating between Similar Languages on Imbalanced Conversational Texts
Junqing He | Xian Huang | Xuemin Zhao | Yan Zhang | Yonghong Yan
Junqing He | Xian Huang | Xuemin Zhao | Yan Zhang | Yonghong Yan
Data-Driven Pronunciation Modeling of Swiss German Dialectal Speech for Automatic Speech Recognition
Michael Stadtschnitzer | Christoph Schmidt
Michael Stadtschnitzer | Christoph Schmidt
Simulating ASR errors for training SLU systems
Edwin Simonnet | Sahar Ghannay | Nathalie Camelin | Yannick Estève
Edwin Simonnet | Sahar Ghannay | Nathalie Camelin | Yannick Estève
Evaluation of Feature-Space Speaker Adaptation for End-to-End Acoustic Models
Natalia Tomashenko | Yannick Estève
Natalia Tomashenko | Yannick Estève
Creating New Language and Voice Components for the Updated MaryTTS Text-to-Speech Synthesis Platform
Ingmar Steiner | Sébastien Le Maguer
Ingmar Steiner | Sébastien Le Maguer
Speech Rate Calculations with Short Utterances: A Study from a Speech-to-Speech, Machine Translation Mediated Map Task
Akira Hayakawa | Carl Vogel | Saturnino Luz | Nick Campbell
Akira Hayakawa | Carl Vogel | Saturnino Luz | Nick Campbell
Beyond Generic Summarization: A Multi-faceted Hierarchical Summarization Corpus of Large Heterogeneous Data
Christopher Tauchmann | Thomas Arnold | Andreas Hanselowski | Christian M. Meyer | Margot Mieskes
Christopher Tauchmann | Thomas Arnold | Andreas Hanselowski | Christian M. Meyer | Margot Mieskes
A New Annotated Portuguese/Spanish Corpus for the Multi-Sentence Compression Task
Elvys Linhares Pontes | Juan-Manuel Torres-Moreno | Stéphane Huet | Andréa Carneiro Linhares
Elvys Linhares Pontes | Juan-Manuel Torres-Moreno | Stéphane Huet | Andréa Carneiro Linhares
TSix: A Human-involved-creation Dataset for Tweet Summarization
Minh-Tien Nguyen | Dac Viet Lai | Huy-Tien Nguyen | Le-Minh Nguyen
Minh-Tien Nguyen | Dac Viet Lai | Huy-Tien Nguyen | Le-Minh Nguyen
A Workbench for Rapid Generation of Cross-Lingual Summaries
Nisarg Jhaveri | Manish Gupta | Vasudeva Varma
Nisarg Jhaveri | Manish Gupta | Vasudeva Varma
Annotation and Analysis of Extractive Summaries for the Kyutech Corpus
Takashi Yamamura | Kazutaka Shimada
Takashi Yamamura | Kazutaka Shimada
Auto-hMDS: Automatic Construction of a Large Heterogeneous Multilingual Multi-Document Summarization Corpus
Markus Zopf
Markus Zopf
PyrEval: An Automated Method for Summary Content Analysis
Yanjun Gao | Andrew Warner | Rebecca Passonneau
Yanjun Gao | Andrew Warner | Rebecca Passonneau
Mapping Texts to Scripts: An Entailment Study
Simon Ostermann | Hannah Seitz | Stefan Thater | Manfred Pinkal
Simon Ostermann | Hannah Seitz | Stefan Thater | Manfred Pinkal
Semantic Equivalence Detection: Are Interrogatives Harder than Declaratives?
João Rodrigues | Chakaveh Saedi | António Branco | João Silva
João Rodrigues | Chakaveh Saedi | António Branco | João Silva
CLARIN: Towards FAIR and Responsible Data Science Using Language Resources
Franciska de Jong | Bente Maegaard | Koenraad De Smedt | Darja Fišer | Dieter Van Uytvanck
Franciska de Jong | Bente Maegaard | Koenraad De Smedt | Darja Fišer | Dieter Van Uytvanck
From ‘Solved Problems’ to New Challenges: A Report on LDC Activities
Christopher Cieri | Mark Liberman | Stephanie Strassel | Denise DiPersio | Jonathan Wright | Andrea Mazzucchi
Christopher Cieri | Mark Liberman | Stephanie Strassel | Denise DiPersio | Jonathan Wright | Andrea Mazzucchi
New directions in ELRA activities
Valérie Mapelli | Victoria Arranz | Hélène Mazo | Pawel Kamocki | Vladimir Popescu
Valérie Mapelli | Victoria Arranz | Hélène Mazo | Pawel Kamocki | Vladimir Popescu
A Framework for Multi-Language Service Design with the Language Grid
Donghui Lin | Yohei Murakami | Toru Ishida
Donghui Lin | Yohei Murakami | Toru Ishida
Language Technology for Multilingual Europe: An Analysis of a Large-Scale Survey regarding Challenges, Demands, Gaps and Needs
Georg Rehm | Stefanie Hegele
Georg Rehm | Stefanie Hegele
Annotating High-Level Structures of Short Stories and Personal Anecdotes
Boyang Li | Beth Cardier | Tong Wang | Florian Metze
Boyang Li | Beth Cardier | Tong Wang | Florian Metze
Discovering the Language of Wine Reviews: A Text Mining Account
Els Lefever | Iris Hendrickx | Ilja Croijmans | Antal van den Bosch | Asifa Majid
Els Lefever | Iris Hendrickx | Ilja Croijmans | Antal van den Bosch | Asifa Majid
Delta vs. N-Gram Tracing: Evaluating the Robustness of Authorship Attribution Methods
Thomas Proisl | Stefan Evert | Fotis Jannidis | Christof Schöch | Leonard Konle | Steffen Pielström
Thomas Proisl | Stefan Evert | Fotis Jannidis | Christof Schöch | Leonard Konle | Steffen Pielström
Face2Text: Collecting an Annotated Image Description Corpus for the Generation of Rich Face Descriptions
Albert Gatt | Marc Tanti | Adrian Muscat | Patrizia Paggio | Reuben A Farrugia | Claudia Borg | Kenneth P Camilleri | Michael Rosner | Lonneke van der Plas
Albert Gatt | Marc Tanti | Adrian Muscat | Patrizia Paggio | Reuben A Farrugia | Claudia Borg | Kenneth P Camilleri | Michael Rosner | Lonneke van der Plas
Adapting Serious Game for Fallacious Argumentation to German: Pitfalls, Insights, and Best Practices
Ivan Habernal | Patrick Pauli | Iryna Gurevych
Ivan Habernal | Patrick Pauli | Iryna Gurevych
Crowdsourcing Regional Variation Data and Automatic Geolocalisation of Speakers of European French
Jean-Philippe Goldman | Yves Scherrer | Julie Glikman | Mathieu Avanzi | Christophe Benzitoun | Philippe Boula de Mareüil
Jean-Philippe Goldman | Yves Scherrer | Julie Glikman | Mathieu Avanzi | Christophe Benzitoun | Philippe Boula de Mareüil
Improving Machine Translation of Educational Content via Crowdsourcing
Maximiliana Behnke | Antonio Valerio Miceli Barone | Rico Sennrich | Vilelmini Sosoni | Thanasis Naskos | Eirini Takoulidou | Maria Stasimioti | Menno van Zaanen | Sheila Castilho | Federico Gaspari | Panayota Georgakopoulou | Valia Kordoni | Markus Egg | Katia Lida Kermanidis
Maximiliana Behnke | Antonio Valerio Miceli Barone | Rico Sennrich | Vilelmini Sosoni | Thanasis Naskos | Eirini Takoulidou | Maria Stasimioti | Menno van Zaanen | Sheila Castilho | Federico Gaspari | Panayota Georgakopoulou | Valia Kordoni | Markus Egg | Katia Lida Kermanidis
Grounding Gradable Adjectives through Crowdsourcing
Rebecca Sharp | Mithun Paul | Ajay Nagesh | Dane Bell | Mihai Surdeanu
Rebecca Sharp | Mithun Paul | Ajay Nagesh | Dane Bell | Mihai Surdeanu
Evaluation Phonemic Transcription of Low-Resource Tonal Languages for Language Documentation
Oliver Adams | Trevor Cohn | Graham Neubig | Hilaria Cruz | Steven Bird | Alexis Michaud
Oliver Adams | Trevor Cohn | Graham Neubig | Hilaria Cruz | Steven Bird | Alexis Michaud
A Very Low Resource Language Speech Corpus for Computational Language Documentation Experiments
Pierre Godard | Gilles Adda | Martine Adda-Decker | Juan Benjumea | Laurent Besacier | Jamison Cooper-Leavitt | Guy-Noel Kouarata | Lori Lamel | Hélène Maynard | Markus Mueller | Annie Rialland | Sebastian Stueker | François Yvon | Marcely Zanon-Boito
Pierre Godard | Gilles Adda | Martine Adda-Decker | Juan Benjumea | Laurent Besacier | Jamison Cooper-Leavitt | Guy-Noel Kouarata | Lori Lamel | Hélène Maynard | Markus Mueller | Annie Rialland | Sebastian Stueker | François Yvon | Marcely Zanon-Boito
Chahta Anumpa: A multimodal corpus of the Choctaw Language
Jacqueline Brixey | Eli Pincus | Ron Artstein
Jacqueline Brixey | Eli Pincus | Ron Artstein
BULBasaa: A Bilingual Basaa-French Speech Corpus for the Evaluation of Language Documentation Tools
Fatima Hamlaoui | Emmanuel-Moselly Makasso | Markus Müller | Jonas Engelmann | Gilles Adda | Alex Waibel | Sebastian Stüker
Fatima Hamlaoui | Emmanuel-Moselly Makasso | Markus Müller | Jonas Engelmann | Gilles Adda | Alex Waibel | Sebastian Stüker
The MADAR Arabic Dialect Corpus and Lexicon
Houda Bouamor | Nizar Habash | Mohammad Salameh | Wajdi Zaghouani | Owen Rambow | Dana Abdulrahim | Ossama Obeid | Salam Khalifa | Fadhl Eryani | Alexander Erdmann | Kemal Oflazer
Houda Bouamor | Nizar Habash | Mohammad Salameh | Wajdi Zaghouani | Owen Rambow | Dana Abdulrahim | Ossama Obeid | Salam Khalifa | Fadhl Eryani | Alexander Erdmann | Kemal Oflazer
Designing a Collaborative Process to Create Bilingual Dictionaries of Indonesian Ethnic Languages
Arbi Haza Nasution | Yohei Murakami | Toru Ishida
Arbi Haza Nasution | Yohei Murakami | Toru Ishida
Learning to Map Natural Language Statements into Knowledge Base Representations for Knowledge Base Construction
Chin-Ho Lin | Hen-Hsen Huang | Hsin-Hsi Chen
Chin-Ho Lin | Hen-Hsen Huang | Hsin-Hsi Chen
Building a Knowledge Graph from Natural Language Definitions for Interpretable Text Entailment Recognition
Vivian Silva | André Freitas | Siegfried Handschuh
Vivian Silva | André Freitas | Siegfried Handschuh
Combining rule-based and embedding-based approaches to normalize textual entities with an ontology
Arnaud Ferré | Louise Deléger | Pierre Zweigenbaum | Claire Nédellec
Arnaud Ferré | Louise Deléger | Pierre Zweigenbaum | Claire Nédellec
T-REx: A Large Scale Alignment of Natural Language with Knowledge Base Triples
Hady Elsahar | Pavlos Vougiouklis | Arslen Remaci | Christophe Gravier | Jonathon Hare | Frederique Laforest | Elena Simperl
Hady Elsahar | Pavlos Vougiouklis | Arslen Remaci | Christophe Gravier | Jonathon Hare | Frederique Laforest | Elena Simperl
A Large Parallel Corpus of Full-Text Scientific Articles
Felipe Soares | Viviane Moreira | Karin Becker
Felipe Soares | Viviane Moreira | Karin Becker
The IIT Bombay English-Hindi Parallel Corpus
Anoop Kunchukuttan | Pratik Mehta | Pushpak Bhattacharyya
Anoop Kunchukuttan | Pratik Mehta | Pushpak Bhattacharyya
Extracting an English-Persian Parallel Corpus from Comparable Corpora
Akbar Karimi | Ebrahim Ansari | Bahram Sadeghi Bigham
Akbar Karimi | Ebrahim Ansari | Bahram Sadeghi Bigham
Learning Word Vectors for 157 Languages
Edouard Grave | Piotr Bojanowski | Prakhar Gupta | Armand Joulin | Tomas Mikolov
Edouard Grave | Piotr Bojanowski | Prakhar Gupta | Armand Joulin | Tomas Mikolov
SumeCzech: Large Czech News-Based Summarization Dataset
Milan Straka | Nikita Mediankin | Tom Kocmi | Zdeněk Žabokrtský | Vojtěch Hudeček | Jan Hajič
Milan Straka | Nikita Mediankin | Tom Kocmi | Zdeněk Žabokrtský | Vojtěch Hudeček | Jan Hajič
Text Simplification from Professionally Produced Corpora
Carolina Scarton | Gustavo Paetzold | Lucia Specia
Carolina Scarton | Gustavo Paetzold | Lucia Specia
Intertextual Correspondence for Integrating Corpora
Jacky Visser | Rory Duthie | John Lawrence | Chris Reed
Jacky Visser | Rory Duthie | John Lawrence | Chris Reed
Building Named Entity Recognition Taggers via Parallel Corpora
Rodrigo Agerri | Yiling Chung | Itziar Aldabe | Nora Aranberri | Gorka Labaka | German Rigau
Rodrigo Agerri | Yiling Chung | Itziar Aldabe | Nora Aranberri | Gorka Labaka | German Rigau
Cross-Document, Cross-Language Event Coreference Annotation Using Event Hoppers
Zhiyi Song | Ann Bies | Justin Mott | Xuansong Li | Stephanie Strassel | Christopher Caruso
Zhiyi Song | Ann Bies | Justin Mott | Xuansong Li | Stephanie Strassel | Christopher Caruso
TAP-DLND 1.0 : A Corpus for Document Level Novelty Detection
Tirthankar Ghosal | Amitra Salam | Swati Tiwari | Asif Ekbal | Pushpak Bhattacharyya
Tirthankar Ghosal | Amitra Salam | Swati Tiwari | Asif Ekbal | Pushpak Bhattacharyya
Analyzing Citation-Distance Networks for Evaluating Publication Impact
Drahomira Herrmannova | Petr Knoth | Robert Patton
Drahomira Herrmannova | Petr Knoth | Robert Patton
Incorporating Global Contexts into Sentence Embedding for Relational Extraction at the Paragraph Level with Distant Supervision
Eun-kyung Kim | Key-Sun Choi
Eun-kyung Kim | Key-Sun Choi
MCScript: A Novel Dataset for Assessing Machine Comprehension Using Script Knowledge
Simon Ostermann | Ashutosh Modi | Michael Roth | Stefan Thater | Manfred Pinkal
Simon Ostermann | Ashutosh Modi | Michael Roth | Stefan Thater | Manfred Pinkal
A Neural Network Based Model for Loanword Identification in Uyghur
Chenggang Mi | Yating Yang | Lei Wang | Xi Zhou | Tonghai Jiang
Chenggang Mi | Yating Yang | Lei Wang | Xi Zhou | Tonghai Jiang
Revisiting Distant Supervision for Relation Extraction
Tingsong Jiang | Jing Liu | Chin-Yew Lin | Zhifang Sui
Tingsong Jiang | Jing Liu | Chin-Yew Lin | Zhifang Sui
Incorporating Contextual Information for Language-Independent, Dynamic Disambiguation Tasks
Tobias Staron | Özge Alaçam | Wolfgang Menzel
Tobias Staron | Özge Alaçam | Wolfgang Menzel
Overcoming the Long Tail Problem: A Case Study on CO2-Footprint Estimation of Recipes using Information Retrieval
Melanie Geiger | Martin Braschler
Melanie Geiger | Martin Braschler
A vision-grounded dataset for predicting typical locations for verbs
Nelson Mukuze | Anna Rohrbach | Vera Demberg | Bernt Schiele
Nelson Mukuze | Anna Rohrbach | Vera Demberg | Bernt Schiele
Creating dialect sub-corpora by clustering: a case in Japanese for an adaptive method
Yo Sato | Kevin Heffernan
Yo Sato | Kevin Heffernan
A Fast and Flexible Webinterface for Dialect Research in the Low Countries
Roeland van Hout | Nicoline van der Sijs | Erwin Komen | Henk van den Heuvel
Roeland van Hout | Nicoline van der Sijs | Erwin Komen | Henk van den Heuvel
Arabic Dialect Identification in the Context of Bivalency and Code-Switching
Mahmoud El-Haj | Paul Rayson | Mariam Aboelezz
Mahmoud El-Haj | Paul Rayson | Mariam Aboelezz
Unified Guidelines and Resources for Arabic Dialect Orthography
Nizar Habash | Fadhl Eryani | Salam Khalifa | Owen Rambow | Dana Abdulrahim | Alexander Erdmann | Reem Faraj | Wajdi Zaghouani | Houda Bouamor | Nasser Zalmout | Sara Hassan | Faisal Al-Shargi | Sakhar Alkhereyf | Basma Abdulkareem | Ramy Eskander | Mohammad Salameh | Hind Saddiki
Nizar Habash | Fadhl Eryani | Salam Khalifa | Owen Rambow | Dana Abdulrahim | Alexander Erdmann | Reem Faraj | Wajdi Zaghouani | Houda Bouamor | Nasser Zalmout | Sara Hassan | Faisal Al-Shargi | Sakhar Alkhereyf | Basma Abdulkareem | Ramy Eskander | Mohammad Salameh | Hind Saddiki
Automatic Identification of Maghreb Dialects Using a Dictionary-Based Approach
Houda Saâdane | Hosni Seffih | Christian Fluhr | Khalid Choukri | Nasredine Semmar
Houda Saâdane | Hosni Seffih | Christian Fluhr | Khalid Choukri | Nasredine Semmar
Shami: A Corpus of Levantine Arabic Dialects
Kathrein Abu Kwaik | Motaz Saad | Stergios Chatzikyriakidis | Simon Dobnik
Kathrein Abu Kwaik | Motaz Saad | Stergios Chatzikyriakidis | Simon Dobnik
You Tweet What You Speak: A City-Level Dataset of Arabic Dialects
Muhammad Abdul-Mageed | Hassan Alhuzali | Mohamed Elaraby
Muhammad Abdul-Mageed | Hassan Alhuzali | Mohamed Elaraby
DART: A Large Dataset of Dialectal Arabic Tweets
Israa Alsarsour | Esraa Mohamed | Reem Suwaileh | Tamer Elsayed
Israa Alsarsour | Esraa Mohamed | Reem Suwaileh | Tamer Elsayed
Page Stream Segmentation with Convolutional Neural Nets Combining Textual and Visual Features
Gregor Wiedemann | Gerhard Heyer
Gregor Wiedemann | Gerhard Heyer
Automating Document Discovery in the Systematic Review Process: How to Use Chaff to Extract Wheat
Christopher Norman | Mariska Leeflang | Pierre Zweigenbaum | Aurélie Névéol
Christopher Norman | Mariska Leeflang | Pierre Zweigenbaum | Aurélie Névéol
Two Multilingual Corpora Extracted from the Tenders Electronic Daily for Machine Learning and Machine Translation Applications.
Oussama Ahmia | Nicolas Béchet | Pierre-François Marteau
Oussama Ahmia | Nicolas Béchet | Pierre-François Marteau
Using Adversarial Examples in Natural Language Processing
Petr Bělohlávek | Ondřej Plátek | Zdeněk Žabokrtský | Milan Straka
Petr Bělohlávek | Ondřej Plátek | Zdeněk Žabokrtský | Milan Straka
Automatic Annotation of Semantic Term Types in the Complete ACL Anthology Reference Corpus
Anne-Kathrin Schumann | Héctor Martínez Alonso
Anne-Kathrin Schumann | Héctor Martínez Alonso
Annotated Corpus of Scientific Conference’s Homepages for Information Extraction
Piotr Andruszkiewicz | Rafał Hazan
Piotr Andruszkiewicz | Rafał Hazan
WikiDragon: A Java Framework For Diachronic Content And Network Analysis Of MediaWikis
Rüdiger Gleim | Alexander Mehler | Sung Y. Song
Rüdiger Gleim | Alexander Mehler | Sung Y. Song
Studying Muslim Stereotyping through Microportrait Extraction
Antske Fokkens | Nel Ruigrok | Camiel Beukeboom | Gagestein Sarah | Wouter van Atteveldt
Antske Fokkens | Nel Ruigrok | Camiel Beukeboom | Gagestein Sarah | Wouter van Atteveldt
Analyzing the Quality of Counseling Conversations: the Tell-Tale Signs of High-quality Counseling
Verónica Pérez-Rosas | Xuetong Sun | Christy Li | Yuchen Wang | Kenneth Resnicow | Rada Mihalcea
Verónica Pérez-Rosas | Xuetong Sun | Christy Li | Yuchen Wang | Kenneth Resnicow | Rada Mihalcea
Interpersonal Relationship Labels for the CALLHOME Corpus
Denys Katerenchuk | David Guy Brizan | Andrew Rosenberg
Denys Katerenchuk | David Guy Brizan | Andrew Rosenberg
Text Mining for History: first steps on building a large dataset
Suemi Higuchi | Cláudia Freitas | Bruno Cuconato | Alexandre Rademaker
Suemi Higuchi | Cláudia Freitas | Bruno Cuconato | Alexandre Rademaker
Building Evaluation Datasets for Cultural Microblog Retrieval
Lorraine Goeuriot | Josiane Mothe | Philippe Mulhem | Eric SanJuan
Lorraine Goeuriot | Josiane Mothe | Philippe Mulhem | Eric SanJuan
Training and Adapting Multilingual NMT for Less-resourced and Morphologically Rich Languages
Matīss Rikters | Mārcis Pinnis | Rihards Krišlauks
Matīss Rikters | Mārcis Pinnis | Rihards Krišlauks
Cross-lingual Terminology Extraction for Translation Quality Estimation
Yu Yuan | Yuze Gao | Yue Zhang | Serge Sharoff
Yu Yuan | Yuze Gao | Yue Zhang | Serge Sharoff
Machine Translation of Low-Resource Spoken Dialects: Strategies for Normalizing Swiss German
Pierre-Edouard Honnet | Andrei Popescu-Belis | Claudiu Musat | Michael Baeriswyl
Pierre-Edouard Honnet | Andrei Popescu-Belis | Claudiu Musat | Michael Baeriswyl
Improving domain-specific SMT for low-resourced languages using data from different domains
Fathima Farhath | Pranavan Theivendiram | Surangika Ranathunga | Sanath Jayasena | Gihan Dias
Fathima Farhath | Pranavan Theivendiram | Surangika Ranathunga | Sanath Jayasena | Gihan Dias
Discovering Parallel Language Resources for Training MT Engines
Vassilis Papavassiliou | Prokopis Prokopidis | Stelios Piperidis
Vassilis Papavassiliou | Prokopis Prokopidis | Stelios Piperidis
A fine-grained error analysis of NMT, SMT and RBMT output for English-to-Dutch
Laura Van Brussel | Arda Tezcan | Lieve Macken
Laura Van Brussel | Arda Tezcan | Lieve Macken
Collection and Analysis of Code-switch Egyptian Arabic-English Speech Corpus
Injy Hamed | Mohamed Elmahdy | Slim Abdennadher
Injy Hamed | Mohamed Elmahdy | Slim Abdennadher
Evaluation of Machine Translation Performance Across Multiple Genres and Languages
Marlies van der Wees | Arianna Bisazza | Christof Monz
Marlies van der Wees | Arianna Bisazza | Christof Monz
A Multilingual Dataset for Evaluating Parallel Sentence Extraction from Comparable Corpora
Pierre Zweigenbaum | Serge Sharoff | Reinhard Rapp
Pierre Zweigenbaum | Serge Sharoff | Reinhard Rapp
A Morphologically Annotated Corpus of Emirati Arabic
Salam Khalifa | Nizar Habash | Fadhl Eryani | Ossama Obeid | Dana Abdulrahim | Meera Al Kaabi
Salam Khalifa | Nizar Habash | Fadhl Eryani | Ossama Obeid | Dana Abdulrahim | Meera Al Kaabi
CoNLL-UL: Universal Morphological Lattices for Universal Dependency Parsing
Amir More | Özlem Çetinoğlu | Çağrı Çöltekin | Nizar Habash | Benoît Sagot | Djamé Seddah | Dima Taji | Reut Tsarfaty
Amir More | Özlem Çetinoğlu | Çağrı Çöltekin | Nizar Habash | Benoît Sagot | Djamé Seddah | Dima Taji | Reut Tsarfaty
Manually Annotated Corpus of Polish Texts Published between 1830 and 1918
Witold Kieraś | Marcin Woliński
Witold Kieraś | Marcin Woliński
Evaluating Inflectional Complexity Crosslinguistically: a Processing Perspective
Claudia Marzi | Marcello Ferro | Ouafae Nahli | Patrizia Belik | Stavros Bompolas | Vito Pirrelli
Claudia Marzi | Marcello Ferro | Ouafae Nahli | Patrizia Belik | Stavros Bompolas | Vito Pirrelli
Parser combinators for Tigrinya and Oromo morphology
Patrick Littell | Tom McCoy | Na-Rae Han | Shruti Rijhwani | Zaid Sheikh | David Mortensen | Teruko Mitamura | Lori Levin
Patrick Littell | Tom McCoy | Na-Rae Han | Shruti Rijhwani | Zaid Sheikh | David Mortensen | Teruko Mitamura | Lori Levin
Building a Morphological Treebank for German from a Linguistic Database
Petra Steiner | Josef Ruppenhofer
Petra Steiner | Josef Ruppenhofer
CATS: A Tool for Customized Alignment of Text Simplification Corpora
Sanja Štajner | Marc Franco-Salvador | Paolo Rosso | Simone Paolo Ponzetto
Sanja Štajner | Marc Franco-Salvador | Paolo Rosso | Simone Paolo Ponzetto
KIT-Multi: A Translation-Oriented Multilingual Embedding Corpus
Thanh-Le Ha | Jan Niehues | Matthias Sperber | Ngoc Quan Pham | Alexander Waibel
Thanh-Le Ha | Jan Niehues | Matthias Sperber | Ngoc Quan Pham | Alexander Waibel
Multi-lingual Argumentative Corpora in English, Turkish, Greek, Albanian, Croatian, Serbian, Macedonian, Bulgarian, Romanian and Arabic
Alfred Sliwa | Yuan Ma | Ruishen Liu | Niravkumar Borad | Seyedeh Ziyaei | Mina Ghobadi | Firas Sabbah | Ahmet Aker
Alfred Sliwa | Yuan Ma | Ruishen Liu | Niravkumar Borad | Seyedeh Ziyaei | Mina Ghobadi | Firas Sabbah | Ahmet Aker
SemR-11: A Multi-Lingual Gold-Standard for Semantic Similarity and Relatedness for Eleven Languages
Siamak Barzegar | Brian Davis | Manel Zarrouk | Siegfried Handschuh | Andre Freitas
Siamak Barzegar | Brian Davis | Manel Zarrouk | Siegfried Handschuh | Andre Freitas
Corpora with Part-of-Speech Annotations for Three Regional Languages of France: Alsatian, Occitan and Picard
Delphine Bernhard | Anne-Laure Ligozat | Fanny Martin | Myriam Bras | Pierre Magistry | Marianne Vergez-Couret | Lucie Steiblé | Pascale Erhart | Nabil Hathout | Dominique Huck | Christophe Rey | Philippe Reynés | Sophie Rosset | Jean Sibille | Thomas Lavergne
Delphine Bernhard | Anne-Laure Ligozat | Fanny Martin | Myriam Bras | Pierre Magistry | Marianne Vergez-Couret | Lucie Steiblé | Pascale Erhart | Nabil Hathout | Dominique Huck | Christophe Rey | Philippe Reynés | Sophie Rosset | Jean Sibille | Thomas Lavergne
Part-of-Speech Tagging for Arabic Gulf Dialect Using Bi-LSTM
Randah Alharbi | Walid Magdy | Kareem Darwish | Ahmed AbdelAli | Hamdy Mubarak
Randah Alharbi | Walid Magdy | Kareem Darwish | Ahmed AbdelAli | Hamdy Mubarak
HiNTS: A Tagset for Middle Low German
Fabian Barteld | Sarah Ihden | Katharina Dreessen | Ingrid Schröder
Fabian Barteld | Sarah Ihden | Katharina Dreessen | Ingrid Schröder
Leveraging Lexical Resources and Constraint Grammar for Rule-Based Part-of-Speech Tagging in Welsh
Steven Neale | Kevin Donnelly | Gareth Watkins | Dawn Knight
Steven Neale | Kevin Donnelly | Gareth Watkins | Dawn Knight
Graph Based Semi-Supervised Learning Approach for Tamil POS tagging
Mokanarangan Thayaparan | Surangika Ranathunga | Uthayasanker Thayasivam
Mokanarangan Thayaparan | Surangika Ranathunga | Uthayasanker Thayasivam
What Causes the Differences in Communication Styles? A Multicultural Study on Directness and Elaborateness
Juliana Miehle | Wolfgang Minker | Stefan Ultes
Juliana Miehle | Wolfgang Minker | Stefan Ultes
FARMI: A FrAmework for Recording Multi-Modal Interactions
Patrik Jonell | Mattias Bystedt | Per Fallgren | Dimosthenis Kontogiorgos | José Lopes | Zofia Malisz | Samuel Mascarenhas | Catharine Oertel | Eran Raveh | Todd Shore
Patrik Jonell | Mattias Bystedt | Per Fallgren | Dimosthenis Kontogiorgos | José Lopes | Zofia Malisz | Samuel Mascarenhas | Catharine Oertel | Eran Raveh | Todd Shore
Creating Large-Scale Argumentation Structures for Dialogue Systems
Kazuki Sakai | Akari Inago | Ryuichiro Higashinaka | Yuichiro Yoshikawa | Hiroshi Ishiguro | Junji Tomita
Kazuki Sakai | Akari Inago | Ryuichiro Higashinaka | Yuichiro Yoshikawa | Hiroshi Ishiguro | Junji Tomita
Exploring Conversational Language Generation for Rich Content about Hotels
Marilyn Walker | Albry Smither | Shereen Oraby | Vrindavan Harrison | Hadar Shemtov
Marilyn Walker | Albry Smither | Shereen Oraby | Vrindavan Harrison | Hadar Shemtov
A Vietnamese Dialog Act Corpus Based on ISO 24617-2 standard
Thi-Lan Ngo | Pham Khac Linh | Hideaki Takeda
Thi-Lan Ngo | Pham Khac Linh | Hideaki Takeda
Annotating Attribution Relations in Arabic
Amal Alsaif | Tasniem Alyahya | Madawi Alotaibi | Huda Almuzaini | Abeer Algahtani
Amal Alsaif | Tasniem Alyahya | Madawi Alotaibi | Huda Almuzaini | Abeer Algahtani
The ADELE Corpus of Dyadic Social Text Conversations:Dialog Act Annotation with ISO 24617-2
Emer Gilmartin | Christian Saam | Brendan Spillane | Maria O’Reilly | Ketong Su | Arturo Calvo | Loredana Cerrato | Killian Levacher | Nick Campbell | Vincent Wade
Emer Gilmartin | Christian Saam | Brendan Spillane | Maria O’Reilly | Ketong Su | Arturo Calvo | Loredana Cerrato | Killian Levacher | Nick Campbell | Vincent Wade
An Assessment of Explicit Inter- and Intra-sentential Discourse Connectives in Turkish Discourse Bank
Deniz Zeyrek | Murathan Kurfalı
Deniz Zeyrek | Murathan Kurfalı
Compilation of Corpora for the Study of the Information Structure–Prosody Interface
Alicia Burga | Mónica Domínguez | Mireia Farrús | Leo Wanner
Alicia Burga | Mónica Domínguez | Mireia Farrús | Leo Wanner
Preliminary Analysis of Embodied Interactions between Science Communicators and Visitors Based on a Multimodal Corpus of Japanese Conversations in a Science Museum
Rui Sakaida | Ryosaku Makino | Mayumi Bono
Rui Sakaida | Ryosaku Makino | Mayumi Bono
Improving Crowdsourcing-Based Annotation of Japanese Discourse Relations
Yudai Kishimoto | Shinnosuke Sawada | Yugo Murawaki | Daisuke Kawahara | Sadao Kurohashi
Yudai Kishimoto | Shinnosuke Sawada | Yugo Murawaki | Daisuke Kawahara | Sadao Kurohashi
Automatic Labeling of Problem-Solving Dialogues for Computational Microgenetic Learning Analytics
Yuanliang Meng | Anna Rumshisky | Florence Sullivan
Yuanliang Meng | Anna Rumshisky | Florence Sullivan
Increasing Argument Annotation Reproducibility by Using Inter-annotator Agreement to Improve Guidelines
Milagro Teruel | Cristian Cardellino | Fernando Cardellino | Laura Alonso Alemany | Serena Villata
Milagro Teruel | Cristian Cardellino | Fernando Cardellino | Laura Alonso Alemany | Serena Villata
Analyzing Vocabulary Commonality Index Using Large-scaled Database of Child Language Development
Yan Cao | Yasuhiro Minami | Yuko Okumura | Tessei Kobayashi
Yan Cao | Yasuhiro Minami | Yuko Okumura | Tessei Kobayashi
Revita: a Language-learning Platform at the Intersection of ITS and CALL
Anisia Katinskaia | Javad Nouri | Roman Yangarber
Anisia Katinskaia | Javad Nouri | Roman Yangarber
The Distribution and Prosodic Realization of Verb Forms in German Infant-Directed Speech
Bettina Braun | Katharina Zahner
Bettina Braun | Katharina Zahner
Cross-linguistically Small World Networks are Ubiquitous in Child-directed Speech
Steven Moran | Danica Pajović | Sabine Stoll
Steven Moran | Danica Pajović | Sabine Stoll
L1-L2 Parallel Treebank of Learner Chinese: Overused and Underused Syntactic Structures
Keying Li | John Lee
Keying Li | John Lee
The Use of Text Alignment in Semi-Automatic Error Analysis: Use Case in the Development of the Corpus of the Latvian Language Learners
Roberts Darģis | Ilze Auziņa | Kristīne Levāne-Petrova
Roberts Darģis | Ilze Auziņa | Kristīne Levāne-Petrova
An SLA Corpus Annotated with Pedagogically Relevant Grammatical Structures
Leonardo Zilio | Rodrigo Wilkens | Cédrick Fairon
Leonardo Zilio | Rodrigo Wilkens | Cédrick Fairon
Portable Spelling Corrector for a Less-Resourced Language: Amharic
Andargachew Mekonnen Gezmu | Andreas Nürnberger | Binyam Ephrem Seyoum
Andargachew Mekonnen Gezmu | Andreas Nürnberger | Binyam Ephrem Seyoum
A Speaking Atlas of the Regional Languages of France
Philippe Boula de Mareüil | Albert Rilliard | Frédéric Vernier
Philippe Boula de Mareüil | Albert Rilliard | Frédéric Vernier
Pronunciation Dictionaries for the Alsatian Dialects to Analyze Spelling and Phonetic Variation
Lucie Steiblé | Delphine Bernhard
Lucie Steiblé | Delphine Bernhard
ChAnot: An Intelligent Annotation Tool for Indigenous and Highly Agglutinative Languages in Peru
Rodolfo Mercado-Gonzales | José Pereira-Noriega | Marco Sobrevilla | Arturo Oncevay
Rodolfo Mercado-Gonzales | José Pereira-Noriega | Marco Sobrevilla | Arturo Oncevay
The DLDP Survey on Digital Use and Usability of EU Regional and Minority Languages
Claudia Soria | Valeria Quochi | Irene Russo
Claudia Soria | Valeria Quochi | Irene Russo
ASR for Documenting Acutely Under-Resourced Indigenous Languages
Robbie Jimerson | Emily Prud’hommeaux
Robbie Jimerson | Emily Prud’hommeaux
Building a Sentiment Corpus of Tweets in Brazilian Portuguese
Henrico Brum | Maria das Graças Volpe Nunes
Henrico Brum | Maria das Graças Volpe Nunes
‘Aye’ or ‘No’? Speech-level Sentiment Analysis of Hansard UK Parliamentary Debate Transcripts
Gavin Abercrombie | Riza Batista-Navarro
Gavin Abercrombie | Riza Batista-Navarro
NoReC: The Norwegian Review Corpus
Erik Velldal | Lilja Øvrelid | Eivind Alexander Bergem | Cathrine Stadsnes | Samia Touileb | Fredrik Jørgensen
Erik Velldal | Lilja Øvrelid | Eivind Alexander Bergem | Cathrine Stadsnes | Samia Touileb | Fredrik Jørgensen
SenSALDO: Creating a Sentiment Lexicon for Swedish
Jacobo Rouces | Nina Tahmasebi | Lars Borin | Stian Rødven Eide
Jacobo Rouces | Nina Tahmasebi | Lars Borin | Stian Rødven Eide
Corpus Building and Evaluation of Aspect-based Opinion Summaries from Tweets in Spanish
Daniel Peñaloza | Rodrigo López | Juanjosé Tenorio | Héctor Gómez | Arturo Oncevay-Marcos | Marco A. Sobrevilla Cabezudo
Daniel Peñaloza | Rodrigo López | Juanjosé Tenorio | Héctor Gómez | Arturo Oncevay-Marcos | Marco A. Sobrevilla Cabezudo
Application and Analysis of a Multi-layered Scheme for Irony on the Italian Twitter Corpus TWITTIRÒ
Alessandra Teresa Cignarella | Cristina Bosco | Viviana Patti | Mirko Lai
Alessandra Teresa Cignarella | Cristina Bosco | Viviana Patti | Mirko Lai
SMILE Swiss German Sign Language Dataset
Sarah Ebling | Necati Cihan Camgöz | Penny Boyes Braem | Katja Tissi | Sandra Sidler-Miserez | Stephanie Stoll | Simon Hadfield | Tobias Haug | Richard Bowden | Sandrine Tornay | Marzieh Razavi | Mathew Magimai-Doss
Sarah Ebling | Necati Cihan Camgöz | Penny Boyes Braem | Katja Tissi | Sandra Sidler-Miserez | Stephanie Stoll | Simon Hadfield | Tobias Haug | Richard Bowden | Sandrine Tornay | Marzieh Razavi | Mathew Magimai-Doss
IPSL: A Database of Iconicity Patterns in Sign Languages. Creation and Use
Vadim Kimmelman | Anna Klezovich | George Moroz
Vadim Kimmelman | Anna Klezovich | George Moroz
Sign Languages and the Online World Online Dictionaries & Lexicostatistics
Shi Yu | Carlo Geraci | Natasha Abner
Shi Yu | Carlo Geraci | Natasha Abner
Elicitation protocol and material for a corpus of long prepared monologues in Sign Language
Michael Filhol | Mohamed Nassime Hadjadj
Michael Filhol | Mohamed Nassime Hadjadj
Deep JSLC: A Multimodal Corpus Collection for Data-driven Generation of Japanese Sign Language Expressions
Heike Brock | Kazuhiro Nakadai
Heike Brock | Kazuhiro Nakadai
Modeling French Sign Language: a proposal for a semantically compositional system
Mohamed Nassime Hadjadj | Michael Filhol | Annelies Braffort
Mohamed Nassime Hadjadj | Michael Filhol | Annelies Braffort
Construction of the Corpus of Everyday Japanese Conversation: An Interim Report
Hanae Koiso | Yasuharu Den | Yuriko Iseki | Wakako Kashino | Yoshiko Kawabata | Ken’ya Nishikawa | Yayoi Tanaka | Yasuyuki Usuda
Hanae Koiso | Yasuharu Den | Yuriko Iseki | Wakako Kashino | Yoshiko Kawabata | Ken’ya Nishikawa | Yayoi Tanaka | Yasuyuki Usuda
Carcinologic Speech Severity Index Project: A Database of Speech Disorder Productions to Assess Quality of Life Related to Speech After Cancer
Corine Astésano | Mathieu Balaguer | Jérôme Farinas | Corinne Fredouille | Pascal Gaillard | Alain Ghio | Imed Laaridh | Muriel Lalain | Benoît Lepage | Julie Mauclair | Olivier Nocaudie | Julien Pinquier | Oriol Pont | Gilles Pouchoulin | Michèle Puech | Danièle Robert | Etienne Sicard | Virginie Woisard
Corine Astésano | Mathieu Balaguer | Jérôme Farinas | Corinne Fredouille | Pascal Gaillard | Alain Ghio | Imed Laaridh | Muriel Lalain | Benoît Lepage | Julie Mauclair | Olivier Nocaudie | Julien Pinquier | Oriol Pont | Gilles Pouchoulin | Michèle Puech | Danièle Robert | Etienne Sicard | Virginie Woisard
Parallel Corpora in Mboshi (Bantu C25, Congo-Brazzaville)
Annie Rialland | Martine Adda-Decker | Guy-Noël Kouarata | Gilles Adda | Laurent Besacier | Lori Lamel | Elodie Gauthier | Pierre Godard | Jamison Cooper-Leavitt
Annie Rialland | Martine Adda-Decker | Guy-Noël Kouarata | Gilles Adda | Laurent Besacier | Lori Lamel | Elodie Gauthier | Pierre Godard | Jamison Cooper-Leavitt
A Multimodal Corpus of Expert Gaze and Behavior during Phonetic Segmentation Tasks
Arif Khan | Ingmar Steiner | Yusuke Sugano | Andreas Bulling | Ross Macdonald
Arif Khan | Ingmar Steiner | Yusuke Sugano | Andreas Bulling | Ross Macdonald
Statistical Analysis of Missing Translation in Simultaneous Interpretation Using A Large-scale Bilingual Speech Corpus
Zhongxi Cai | Koichiro Ryu | Shigeki Matsubara
Zhongxi Cai | Koichiro Ryu | Shigeki Matsubara
SynPaFlex-Corpus: An Expressive French Audiobooks Corpus dedicated to expressive speech synthesis.
Aghilas Sini | Damien Lolive | Gaëlle Vidal | Marie Tahon | Élisabeth Delais-Roussarie
Aghilas Sini | Damien Lolive | Gaëlle Vidal | Marie Tahon | Élisabeth Delais-Roussarie
The MonPaGe_HA Database for the Documentation of Spoken French Throughout Adulthood
Cécile Fougeron | Véronique Delvaux | Lucie Ménard | Marina Laganaro
Cécile Fougeron | Véronique Delvaux | Lucie Ménard | Marina Laganaro
Bringing Order to Chaos: A Non-Sequential Approach for Browsing Large Sets of Found Audio Data
Per Fallgren | Zofia Malisz | Jens Edlund
Per Fallgren | Zofia Malisz | Jens Edlund
CoLoSS: Cognitive Load Corpus with Speech and Performance Data from a Symbol-Digit Dual-Task
Robert Herms | Maria Wirzberger | Maximilian Eibl | Günter Daniel Rey
Robert Herms | Maria Wirzberger | Maximilian Eibl | Günter Daniel Rey
Edit me: A Corpus and a Framework for Understanding Natural Language Image Editing
Ramesh Manuvinakurike | Jacqueline Brixey | Trung Bui | Walter Chang | Doo Soon Kim | Ron Artstein | Kallirroi Georgila
Ramesh Manuvinakurike | Jacqueline Brixey | Trung Bui | Walter Chang | Doo Soon Kim | Ron Artstein | Kallirroi Georgila
Enriching a Lexicon of Discourse Connectives with Corpus-based Data
Anna Feltracco | Elisabetta Jezek | Bernardo Magnini
Anna Feltracco | Elisabetta Jezek | Bernardo Magnini
SimPA: A Sentence-Level Simplification Corpus for the Public Administration Domain
Carolina Scarton | Gustavo Paetzold | Lucia Specia
Carolina Scarton | Gustavo Paetzold | Lucia Specia
The brWaC Corpus: A New Open Resource for Brazilian Portuguese
Jorge A. Wagner Filho | Rodrigo Wilkens | Marco Idiart | Aline Villavicencio
Jorge A. Wagner Filho | Rodrigo Wilkens | Marco Idiart | Aline Villavicencio
The German Reference Corpus DeReKo: New Developments – New Opportunities
Marc Kupietz | Harald Lüngen | Paweł Kamocki | Andreas Witt
Marc Kupietz | Harald Lüngen | Paweł Kamocki | Andreas Witt
Risamálheild: A Very Large Icelandic Text Corpus
Steinþór Steingrímsson | Sigrún Helgadóttir | Eiríkur Rögnvaldsson | Starkaður Barkarson | Jón Guðnason
Steinþór Steingrímsson | Sigrún Helgadóttir | Eiríkur Rögnvaldsson | Starkaður Barkarson | Jón Guðnason
TriMED: A Multilingual Terminological Database
Federica Vezzani | Giorgio Maria Di Nunzio | Geneviève Henrot
Federica Vezzani | Giorgio Maria Di Nunzio | Geneviève Henrot
Preparation and Usage of Xhosa Lexicographical Data for a Multilingual, Federated Environment
Sonja Bosch | Thomas Eckart | Bettina Klimek | Dirk Goldhahn | Uwe Quasthoff
Sonja Bosch | Thomas Eckart | Bettina Klimek | Dirk Goldhahn | Uwe Quasthoff
A Lexicon of Discourse Markers for Portuguese – LDM-PT
Amália Mendes | Iria del Rio | Manfred Stede | Felix Dombek
Amália Mendes | Iria del Rio | Manfred Stede | Felix Dombek
One Language to rule them all: modelling Morphological Patterns in a Large Scale Italian Lexicon with SWRL
Fahad Khan | Andrea Bellandi | Francesca Frontini | Monica Monachini
Fahad Khan | Andrea Bellandi | Francesca Frontini | Monica Monachini
WordNet-Shp: Towards the Building of a Lexical Database for a Peruvian Minority Language
Diego Maguiño-Valencia | Arturo Oncevay-Marcos | Marco A. Sobrevilla Cabezudo
Diego Maguiño-Valencia | Arturo Oncevay-Marcos | Marco A. Sobrevilla Cabezudo
Retrieving Information from the French Lexical Network in RDF/OWL Format
Alexsandro Fonseca | Fatiha Sadat | François Lareau
Alexsandro Fonseca | Fatiha Sadat | François Lareau
Transforming Wikipedia into a Large-Scale Fine-Grained Entity Type Corpus
Abbas Ghaddar | Philippe Langlais
Abbas Ghaddar | Philippe Langlais
Error Analysis of Uyghur Name Tagging: Language-specific Techniques and Remaining Challenges
Halidanmu Abudukelimu | Abudoukelimu Abulizi | Boliang Zhang | Xiaoman Pan | Di Lu | Heng Ji | Yang Liu
Halidanmu Abudukelimu | Abudoukelimu Abulizi | Boliang Zhang | Xiaoman Pan | Di Lu | Heng Ji | Yang Liu
BiLSTM-CRF for Persian Named-Entity Recognition ArmanPersoNERCorpus: the First Entity-Annotated Persian Dataset
Hanieh Poostchi | Ehsan Zare Borzeshi | Massimo Piccardi
Hanieh Poostchi | Ehsan Zare Borzeshi | Massimo Piccardi
Data Anonymization for Requirements Quality Analysis: a Reproducible Automatic Error Detection Task
Juyeon Kang | Jungyeul Park
Juyeon Kang | Jungyeul Park
A German Corpus for Fine-Grained Named Entity Recognition and Relation Extraction of Traffic and Industry Events
Martin Schiersch | Veselina Mironova | Maximilian Schmitt | Philippe Thomas | Aleksandra Gabryszak | Leonhard Hennig
Martin Schiersch | Veselina Mironova | Maximilian Schmitt | Philippe Thomas | Aleksandra Gabryszak | Leonhard Hennig
A Corpus Study and Annotation Schema for Named Entity Recognition and Relation Extraction of Business Products
Saskia Schön | Veselina Mironova | Aleksandra Gabryszak | Leonhard Hennig
Saskia Schön | Veselina Mironova | Aleksandra Gabryszak | Leonhard Hennig
Portuguese Named Entity Recognition using Conditional Random Fields and Local Grammars
Juliana Pirovani | Elias Oliveira
Juliana Pirovani | Elias Oliveira
M-CNER: A Corpus for Chinese Named Entity Recognition in Multi-Domains
Qi Lu | YaoSheng Yang | Zhenghua Li | Wenliang Chen | Min Zhang
Qi Lu | YaoSheng Yang | Zhenghua Li | Wenliang Chen | Min Zhang
SlugNERDS: A Named Entity Recognition Tool for Open Domain Dialogue Systems
Kevin Bowden | Jiaqi Wu | Shereen Oraby | Amita Misra | Marilyn Walker
Kevin Bowden | Jiaqi Wu | Shereen Oraby | Amita Misra | Marilyn Walker
Transfer Learning for Named-Entity Recognition with Neural Networks
Ji Young Lee | Franck Dernoncourt | Peter Szolovits
Ji Young Lee | Franck Dernoncourt | Peter Szolovits
ForFun 1.0: Prague Database of Forms and Functions – An Invaluable Resource for Linguistic Research
Marie Mikulová | Eduard Bejček
Marie Mikulová | Eduard Bejček
The LIA Treebank of Spoken Norwegian Dialects
Lilja Øvrelid | Andre Kåsen | Kristin Hagen | Anders Nøklestad | Per Erik Solberg | Janne Bondi Johannessen
Lilja Øvrelid | Andre Kåsen | Kristin Hagen | Anders Nøklestad | Per Erik Solberg | Janne Bondi Johannessen
Errator: a Tool to Help Detect Annotation Errors in the Universal Dependencies Project
Guillaume Wisniewski
Guillaume Wisniewski
SandhiKosh: A Benchmark Corpus for Evaluating Sanskrit Sandhi Tools
Shubham Bhardwaj | Neelamadhav Gantayat | Nikhil Chaturvedi | Rahul Garg | Sumeet Agarwal
Shubham Bhardwaj | Neelamadhav Gantayat | Nikhil Chaturvedi | Rahul Garg | Sumeet Agarwal
Creation of a Balanced State-of-the-Art Multilayer Corpus for NLU
Normunds Gruzitis | Lauma Pretkalnina | Baiba Saulite | Laura Rituma | Gunta Nespore-Berzkalne | Arturs Znotins | Peteris Paikens
Normunds Gruzitis | Lauma Pretkalnina | Baiba Saulite | Laura Rituma | Gunta Nespore-Berzkalne | Arturs Znotins | Peteris Paikens
Adding Syntactic Annotations to Flickr30k Entities Corpus for Multimodal Ambiguous Prepositional-Phrase Attachment Resolution
Sebastien Delecraz | Alexis Nasr | Frederic Bechet | Benoit Favre
Sebastien Delecraz | Alexis Nasr | Frederic Bechet | Benoit Favre
Analyzing Middle High German Syntax with RDF and SPARQL
Christian Chiarcos | Benjamin Kosmehl | Christian Fäth | Maria Sukhareva
Christian Chiarcos | Benjamin Kosmehl | Christian Fäth | Maria Sukhareva
Cheating a Parser to Death: Data-driven Cross-Treebank Annotation Transfer
Djamé Seddah | Eric de la Clergerie | Benoît Sagot | Héctor Martínez Alonso | Marie Candito
Djamé Seddah | Eric de la Clergerie | Benoît Sagot | Héctor Martínez Alonso | Marie Candito
Universal Dependencies and Quantitative Typological Trends. A Case Study on Word Order
Chiara Alzetta | Felice Dell’Orletta | Simonetta Montemagni | Giulia Venturi
Chiara Alzetta | Felice Dell’Orletta | Simonetta Montemagni | Giulia Venturi
Interoperability of Language-related Information: Mapping the BLL Thesaurus to Lexvo and Glottolog
Vanya Dimitrova | Christian Fäth | Christian Chiarcos | Heike Renner-Westermann | Frank Abromeit
Vanya Dimitrova | Christian Fäth | Christian Chiarcos | Heike Renner-Westermann | Frank Abromeit
Browsing and Supporting Pluricentric Global Wordnet, or just your Wordnet of Interest
António Branco | Ruben Branco | Chakaveh Saedi | João Silva
António Branco | Ruben Branco | Chakaveh Saedi | João Silva
Extended HowNet 2.0 – An Entity-Relation Common-Sense Representation Model
Wei-Yun Ma | Yueh-Yin Shih
Wei-Yun Ma | Yueh-Yin Shih
The Circumstantial Event Ontology (CEO) and ECB+/CEO: an Ontology and Corpus for Implicit Causal Relations between Events
Roxane Segers | Tommaso Caselli | Piek Vossen
Roxane Segers | Tommaso Caselli | Piek Vossen