International Conference on Language Resources and Evaluation (2018)
up
Proceedings of the Eleventh International Conference on Language Resources and Evaluation (LREC 2018)
Proceedings of the Eleventh International Conference on Language Resources and Evaluation (LREC 2018)
Nicoletta Calzolari | Khalid Choukri | Christopher Cieri | Thierry Declerck | Sara Goggi | Koiti Hasida | Hitoshi Isahara | Bente Maegaard | Joseph Mariani | Hélène Mazo | Asuncion Moreno | Jan Odijk | Stelios Piperidis | Takenobu Tokunaga
Nicoletta Calzolari | Khalid Choukri | Christopher Cieri | Thierry Declerck | Sara Goggi | Koiti Hasida | Hitoshi Isahara | Bente Maegaard | Joseph Mariani | Hélène Mazo | Asuncion Moreno | Jan Odijk | Stelios Piperidis | Takenobu Tokunaga
Augmenting Librispeech with French Translations: A Multimodal Corpus for Direct Speech Translation Evaluation
Ali Can Kocabiyikoglu | Laurent Besacier | Olivier Kraif
Ali Can Kocabiyikoglu | Laurent Besacier | Olivier Kraif
Evaluating Domain Adaptation for Machine Translation Across Scenarios
Thierry Etchegoyhen | Anna Fernández Torné | Andoni Azpeitia | Eva Martínez Garcia | Anna Matamala
Thierry Etchegoyhen | Anna Fernández Torné | Andoni Azpeitia | Eva Martínez Garcia | Anna Matamala
Upping the Ante: Towards a Better Benchmark for Chinese-to-English Machine Translation
Christian Hadiwinoto | Hwee Tou Ng
Christian Hadiwinoto | Hwee Tou Ng
ESCAPE: a Large-scale Synthetic Corpus for Automatic Post-Editing
Matteo Negri | Marco Turchi | Rajen Chatterjee | Nicola Bertoldi
Matteo Negri | Marco Turchi | Rajen Chatterjee | Nicola Bertoldi
Evaluating Machine Translation Performance on Chinese Idioms with a Blacklist Method
Yutong Shao | Rico Sennrich | Bonnie Webber | Federico Fancellu
Yutong Shao | Rico Sennrich | Bonnie Webber | Federico Fancellu
Cross-Lingual Generation and Evaluation of a Wide-Coverage Lexical Semantic Resource
Attila Novák | Borbála Novák
Attila Novák | Borbála Novák
Advances in Pre-Training Distributed Word Representations
Tomas Mikolov | Edouard Grave | Piotr Bojanowski | Christian Puhrsch | Armand Joulin
Tomas Mikolov | Edouard Grave | Piotr Bojanowski | Christian Puhrsch | Armand Joulin
Integrating Generative Lexicon Event Structures into VerbNet
Susan Windisch Brown | James Pustejovsky | Annie Zaenen | Martha Palmer
Susan Windisch Brown | James Pustejovsky | Annie Zaenen | Martha Palmer
Multi-layer Annotation of the Rigveda
Oliver Hellwig | Heinrich Hettrich | Ashutosh Modi | Manfred Pinkal
Oliver Hellwig | Heinrich Hettrich | Ashutosh Modi | Manfred Pinkal
The Natural Stories Corpus
Richard Futrell | Edward Gibson | Harry J. Tily | Idan Blank | Anastasia Vishnevetsky | Steven Piantadosi | Evelina Fedorenko
Richard Futrell | Edward Gibson | Harry J. Tily | Idan Blank | Anastasia Vishnevetsky | Steven Piantadosi | Evelina Fedorenko
Semi-automatic Korean FrameNet Annotation over KAIST Treebank
Younggyun Hahm | Jiseong Kim | Sunggoo Kwon | Key-Sun Choi
Younggyun Hahm | Jiseong Kim | Sunggoo Kwon | Key-Sun Choi
Handling Normalization Issues for Part-of-Speech Tagging of Online Conversational Text
Géraldine Damnati | Jeremy Auguste | Alexis Nasr | Delphine Charlet | Johannes Heinecke | Frédéric Béchet
Géraldine Damnati | Jeremy Auguste | Alexis Nasr | Delphine Charlet | Johannes Heinecke | Frédéric Béchet
Multi-Dialect Arabic POS Tagging: A CRF Approach
Kareem Darwish | Hamdy Mubarak | Ahmed Abdelali | Mohamed Eldesouki | Younes Samih | Randah Alharbi | Mohammed Attia | Walid Magdy | Laura Kallmeyer
Kareem Darwish | Hamdy Mubarak | Ahmed Abdelali | Mohamed Eldesouki | Younes Samih | Randah Alharbi | Mohammed Attia | Walid Magdy | Laura Kallmeyer
A Corpus for Modeling Word Importance in Spoken Dialogue Transcripts
Sushant Kafle | Matt Huenerfauth
Sushant Kafle | Matt Huenerfauth
Dialogue Structure Annotation for Multi-Floor Interaction
David Traum | Cassidy Henry | Stephanie Lukin | Ron Artstein | Felix Gervits | Kimberly Pollard | Claire Bonial | Su Lei | Clare Voss | Matthew Marge | Cory Hayes | Susan Hill
David Traum | Cassidy Henry | Stephanie Lukin | Ron Artstein | Felix Gervits | Kimberly Pollard | Claire Bonial | Su Lei | Clare Voss | Matthew Marge | Cory Hayes | Susan Hill
Effects of Gender Stereotypes on Trust and Likability in Spoken Human-Robot Interaction
Matthias Kraus | Johannes Kraus | Martin Baumann | Wolfgang Minker
Matthias Kraus | Johannes Kraus | Martin Baumann | Wolfgang Minker
A Multimodal Corpus for Mutual Gaze and Joint Attention in Multiparty Situated Interaction
Dimosthenis Kontogiorgos | Vanya Avramova | Simon Alexanderson | Patrik Jonell | Catharine Oertel | Jonas Beskow | Gabriel Skantze | Joakim Gustafson
Dimosthenis Kontogiorgos | Vanya Avramova | Simon Alexanderson | Patrik Jonell | Catharine Oertel | Jonas Beskow | Gabriel Skantze | Joakim Gustafson
Improving Dialogue Act Classification for Spontaneous Arabic Speech and Instant Messages at Utterance Level
AbdelRahim Elmadany | Sherif Abdou | Mervat Gheith
AbdelRahim Elmadany | Sherif Abdou | Mervat Gheith
Data Management Plan (DMP) for Language Data under the New General Da-ta Protection Regulation (GDPR)
Pawel Kamocki | Valérie Mapelli | Khalid Choukri
Pawel Kamocki | Valérie Mapelli | Khalid Choukri
We Are Depleting Our Research Subject as We Are Investigating It: In Language Technology, more Replication and Diversity Are Needed
António Branco
António Branco
Lessons Learned: On the Challenges of Migrating a Research Data Repository from a Research Institution to a University Library.
Thorsten Trippel | Claus Zinn
Thorsten Trippel | Claus Zinn
Introducing NIEUW: Novel Incentives and Workflows for Eliciting Linguistic Data
Christopher Cieri | James Fiumara | Mark Liberman | Chris Callison-Burch | Jonathan Wright
Christopher Cieri | James Fiumara | Mark Liberman | Chris Callison-Burch | Jonathan Wright
Three Dimensions of Reproducibility in Natural Language Processing
K. Bretonnel Cohen | Jingbo Xia | Pierre Zweigenbaum | Tiffany Callahan | Orin Hargraves | Foster Goss | Nancy Ide | Aurélie Névéol | Cyril Grouin | Lawrence E. Hunter
K. Bretonnel Cohen | Jingbo Xia | Pierre Zweigenbaum | Tiffany Callahan | Orin Hargraves | Foster Goss | Nancy Ide | Aurélie Névéol | Cyril Grouin | Lawrence E. Hunter
Representation Mapping: A Novel Approach to Generate High-Quality Multi-Lingual Emotion Lexicons
Sven Buechel | Udo Hahn
Sven Buechel | Udo Hahn
Unfolding the External Behavior and Inner Affective State of Teammates through Ensemble Learning: Experimental Evidence from a Dyadic Team Corpus
Aggeliki Vlachostergiou | Mark Dennison | Catherine Neubauer | Stefan Scherer | Peter Khooshabeh | Andre Harrison
Aggeliki Vlachostergiou | Mark Dennison | Catherine Neubauer | Stefan Scherer | Peter Khooshabeh | Andre Harrison
Understanding Emotions: A Dataset of Tweets to Study Interactions between Affect Categories
Saif Mohammad | Svetlana Kiritchenko
Saif Mohammad | Svetlana Kiritchenko
When ACE met KBP: End-to-End Evaluation of Knowledge Base Population with Component-level Annotation
Bonan Min | Marjorie Freedman | Roger Bock | Ralph Weischedel
Bonan Min | Marjorie Freedman | Roger Bock | Ralph Weischedel
Simple Large-scale Relation Extraction from Unstructured Text
Christos Christodoulopoulos | Arpit Mittal
Christos Christodoulopoulos | Arpit Mittal
Comparing Pretrained Multilingual Word Embeddings on an Ontology Alignment Task
Dagmar Gromann | Thierry Declerck
Dagmar Gromann | Thierry Declerck
A Large Resource of Patterns for Verbal Paraphrases
Octavian Popescu | Ngoc Phuoc An Vo | Vadim Sheinin
Octavian Popescu | Ngoc Phuoc An Vo | Vadim Sheinin
A Recorded Debating Dataset
Shachar Mirkin | Michal Jacovi | Tamar Lavee | Hong-Kwang Kuo | Samuel Thomas | Leslie Sager | Lili Kotlerman | Elad Venezian | Noam Slonim
Shachar Mirkin | Michal Jacovi | Tamar Lavee | Hong-Kwang Kuo | Samuel Thomas | Leslie Sager | Lili Kotlerman | Elad Venezian | Noam Slonim
Building a Corpus from Handwritten Picture Postcards: Transcription, Annotation and Part-of-Speech Tagging
Kyoko Sugisaki | Nicolas Wiedmer | Heiko Hausendorf
Kyoko Sugisaki | Nicolas Wiedmer | Heiko Hausendorf
A Lexical Tool for Academic Writing in Spanish based on Expert and Novice Corpora
Marcos García Salido | Marcos García | Milka Villayandre-Llamazares | Margarita Alonso-Ramos
Marcos García Salido | Marcos García | Milka Villayandre-Llamazares | Margarita Alonso-Ramos
Framing Named Entity Linking Error Types
Adrian Braşoveanu | Giuseppe Rizzo | Philipp Kuntschik | Albert Weichselbraun | Lyndon J.B. Nixon
Adrian Braşoveanu | Giuseppe Rizzo | Philipp Kuntschik | Albert Weichselbraun | Lyndon J.B. Nixon
A FrameNet for Cancer Information in Clinical Narratives: Schema and Annotation
Kirk Roberts | Yuqi Si | Anshul Gandhi | Elmer Bernstam
Kirk Roberts | Yuqi Si | Anshul Gandhi | Elmer Bernstam
A New Corpus to Support Text Mining for the Curation of Metabolites in the ChEBI Database
Matthew Shardlow | Nhung Nguyen | Gareth Owen | Claire O’Donovan | Andrew Leach | John McNaught | Steve Turner | Sophia Ananiadou
Matthew Shardlow | Nhung Nguyen | Gareth Owen | Claire O’Donovan | Andrew Leach | John McNaught | Steve Turner | Sophia Ananiadou
Parallel Corpora for the Biomedical Domain
Aurélie Névéol | Antonio Jimeno Yepes | Mariana Neves | Karin Verspoor
Aurélie Névéol | Antonio Jimeno Yepes | Mariana Neves | Karin Verspoor
Medical Entity Corpus with PICO elements and Sentiment Analysis
Markus Zlabinger | Linda Andersson | Allan Hanbury | Michael Andersson | Vanessa Quasnik | Jon Brassey
Markus Zlabinger | Linda Andersson | Allan Hanbury | Michael Andersson | Vanessa Quasnik | Jon Brassey
A Large Automatically-Acquired All-Words List of Multiword Expressions Scored for Compositionality
Will Roberts | Markus Egg
Will Roberts | Markus Egg
A Hybrid Approach for Automatic Extraction of Bilingual Multiword Expressions from Parallel Corpora
Nasredine Semmar
Nasredine Semmar
No more beating about the bush : A Step towards Idiom Handling for Indian Language NLP
Ruchit Agrawal | Vighnesh Chenthil Kumar | Vigneshwaran Muralidharan | Dipti Sharma
Ruchit Agrawal | Vighnesh Chenthil Kumar | Vigneshwaran Muralidharan | Dipti Sharma
Sentence Level Temporality Detection using an Implicit Time-sensed Resource
Sabyasachi Kamila | Asif Ekbal | Pushpak Bhattacharyya
Sabyasachi Kamila | Asif Ekbal | Pushpak Bhattacharyya
Comprehensive Annotation of Various Types of Temporal Information on the Time Axis
Tomohiro Sakaguchi | Daisuke Kawahara | Sadao Kurohashi
Tomohiro Sakaguchi | Daisuke Kawahara | Sadao Kurohashi
Systems’ Agreements and Disagreements in Temporal Processing: An Extensive Error Analysis of the TempEval-3 Task
Tommaso Caselli | Roser Morante
Tommaso Caselli | Roser Morante
Annotating Temporally-Anchored Spatial Knowledge by Leveraging Syntactic Dependencies
Alakananda Vempala | Eduardo Blanco
Alakananda Vempala | Eduardo Blanco
SW4ALL: a CEFR Classified and Aligned Corpus for Language Learning
Rodrigo Wilkens | Leonardo Zilio | Cédrick Fairon
Rodrigo Wilkens | Leonardo Zilio | Cédrick Fairon
Towards a Diagnosis of Textual Difficulties for Children with Dyslexia
Solen Quiniou | Béatrice Daille
Solen Quiniou | Béatrice Daille
Deep Neural Networks for Coreference Resolution for Polish
Bartłomiej Nitoń | Paweł Morawiecki | Maciej Ogrodniczuk
Bartłomiej Nitoń | Paweł Morawiecki | Maciej Ogrodniczuk
SzegedKoref: A Hungarian Coreference Corpus
Veronika Vincze | Klára Hegedűs | Alex Sliz-Nagy | Richárd Farkas
Veronika Vincze | Klára Hegedűs | Alex Sliz-Nagy | Richárd Farkas
Sanaphor++: Combining Deep Neural Networks with Semantics for Coreference Resolution
Julien Plu | Roman Prokofyev | Alberto Tonon | Philippe Cudré-Mauroux | Djellel Eddine Difallah | Raphaël Troncy | Giuseppe Rizzo
Julien Plu | Roman Prokofyev | Alberto Tonon | Philippe Cudré-Mauroux | Djellel Eddine Difallah | Raphaël Troncy | Giuseppe Rizzo
ANCOR-AS: Enriching the ANCOR Corpus with Syntactic Annotations
Loïc Grobol | Isabelle Tellier | Éric de la Clergerie | Marco Dinarelli | Frédéric Landragin
Loïc Grobol | Isabelle Tellier | Éric de la Clergerie | Marco Dinarelli | Frédéric Landragin
ParCorFull: a Parallel Corpus Annotated with Full Coreference
Ekaterina Lapshinova-Koltunski | Christian Hardmeier | Pauline Krielke
Ekaterina Lapshinova-Koltunski | Christian Hardmeier | Pauline Krielke
An Application for Building a Polish Telephone Speech Corpus
Bartosz Ziółko | Piotr Żelasko | Ireneusz Gawlik | Tomasz Pędzimąż | Tomasz Jadczyk
Bartosz Ziółko | Piotr Żelasko | Ireneusz Gawlik | Tomasz Pędzimąż | Tomasz Jadczyk
CPJD Corpus: Crowdsourced Parallel Speech Corpus of Japanese Dialects
Shinnosuke Takamichi | Hiroshi Saruwatari
Shinnosuke Takamichi | Hiroshi Saruwatari
Korean L2 Vocabulary Prediction: Can a Large Annotated Corpus be Used to Train Better Models for Predicting Unknown Words?
Kevin Yancey | Yves Lepage
Kevin Yancey | Yves Lepage
Crowdsourcing-based Annotation of the Accounting Registers of the Italian Comedy
Adeline Granet | Benjamin Hervy | Geoffrey Roman-Jimenez | Marouane Hachicha | Emmanuel Morin | Harold Mouchère | Solen Quiniou | Guillaume Raschia | Françoise Rubellin | Christian Viard-Gaudin
Adeline Granet | Benjamin Hervy | Geoffrey Roman-Jimenez | Marouane Hachicha | Emmanuel Morin | Harold Mouchère | Solen Quiniou | Guillaume Raschia | Françoise Rubellin | Christian Viard-Gaudin
FEIDEGGER: A Multi-modal Corpus of Fashion Images and Descriptions in German
Leonidas Lefakis | Alan Akbik | Roland Vollgraf
Leonidas Lefakis | Alan Akbik | Roland Vollgraf
Toward a Lightweight Solution for Less-resourced Languages: Creating a POS Tagger for Alsatian Using Voluntary Crowdsourcing
Alice Millour | Karën Fort
Alice Millour | Karën Fort
Crowdsourced Corpus of Sentence Simplification with Core Vocabulary
Akihiro Katsuta | Kazuhide Yamamoto
Akihiro Katsuta | Kazuhide Yamamoto
A Multilingual Wikified Data Set of Educational Material
Iris Hendrickx | Eirini Takoulidou | Thanasis Naskos | Katia Lida Kermanidis | Vilelmini Sosoni | Hugo de Vos | Maria Stasimioti | Menno van Zaanen | Panayota Georgakopoulou | Valia Kordoni | Maja Popovic | Markus Egg | Antal van den Bosch
Iris Hendrickx | Eirini Takoulidou | Thanasis Naskos | Katia Lida Kermanidis | Vilelmini Sosoni | Hugo de Vos | Maria Stasimioti | Menno van Zaanen | Panayota Georgakopoulou | Valia Kordoni | Maja Popovic | Markus Egg | Antal van den Bosch
Translation Crowdsourcing: Creating a Multilingual Corpus of Online Educational Content
Vilelmini Sosoni | Katia Lida Kermanidis | Maria Stasimioti | Thanasis Naskos | Eirini Takoulidou | Menno van Zaanen | Sheila Castilho | Panayota Georgakopoulou | Valia Kordoni | Markus Egg
Vilelmini Sosoni | Katia Lida Kermanidis | Maria Stasimioti | Thanasis Naskos | Eirini Takoulidou | Menno van Zaanen | Sheila Castilho | Panayota Georgakopoulou | Valia Kordoni | Markus Egg
Building an English Vocabulary Knowledge Dataset of Japanese English-as-a-Second-Language Learners Using Crowdsourcing
Yo Ehara
Yo Ehara
The UIR Uncertainty Corpus for Chinese: Annotating Chinese Microblog Corpus for Uncertainty Identification from Social Media
Binyang Li | Jun Xiang | Le Chen | Xu Han | Xiaoyan Yu | Ruifeng Xu | Tengjiao Wang | Kam-fai Wong
Binyang Li | Jun Xiang | Le Chen | Xu Han | Xiaoyan Yu | Ruifeng Xu | Tengjiao Wang | Kam-fai Wong
EventWiki: A Knowledge Base of Major Events
Tao Ge | Lei Cui | Baobao Chang | Zhifang Sui | Furu Wei | Ming Zhou
Tao Ge | Lei Cui | Baobao Chang | Zhifang Sui | Furu Wei | Ming Zhou
Annotating Spin in Biomedical Scientific Publications : the case of Random Controlled Trials (RCTs)
Anna Koroleva | Patrick Paroubek
Anna Koroleva | Patrick Paroubek
Visualization of the occurrence trend of infectious diseases using Twitter
Ryusei Matsumoto | Minoru Yoshida | Kazuyuki Matsumoto | Hironobu Matsuda | Kenji Kita
Ryusei Matsumoto | Minoru Yoshida | Kazuyuki Matsumoto | Hironobu Matsuda | Kenji Kita
Towards a Gold Standard Corpus for Variable Detection and Linking in Social Science Publications
Andrea Zielinski | Peter Mutschke
Andrea Zielinski | Peter Mutschke
KRAUTS: A German Temporally Annotated News Corpus
Jannik Strötgen | Anne-Lyse Minard | Lukas Lange | Manuela Speranza | Bernardo Magnini
Jannik Strötgen | Anne-Lyse Minard | Lukas Lange | Manuela Speranza | Bernardo Magnini
CogCompNLP: Your Swiss Army Knife for NLP
Daniel Khashabi | Mark Sammons | Ben Zhou | Tom Redman | Christos Christodoulopoulos | Vivek Srikumar | Nicholas Rizzolo | Lev Ratinov | Guanheng Luo | Quang Do | Chen-Tse Tsai | Subhro Roy | Stephen Mayhew | Zhili Feng | John Wieting | Xiaodong Yu | Yangqiu Song | Shashank Gupta | Shyam Upadhyay | Naveen Arivazhagan | Qiang Ning | Shaoshi Ling | Dan Roth
Daniel Khashabi | Mark Sammons | Ben Zhou | Tom Redman | Christos Christodoulopoulos | Vivek Srikumar | Nicholas Rizzolo | Lev Ratinov | Guanheng Luo | Quang Do | Chen-Tse Tsai | Subhro Roy | Stephen Mayhew | Zhili Feng | John Wieting | Xiaodong Yu | Yangqiu Song | Shashank Gupta | Shyam Upadhyay | Naveen Arivazhagan | Qiang Ning | Shaoshi Ling | Dan Roth
A Framework for the Needs of Different Types of Users in Multilingual Semantic Enrichment
Jan Nehring | Felix Sasaki
Jan Nehring | Felix Sasaki
Preserving Workflow Reproducibility: The RePlay-DH Client as a Tool for Process Documentation
Markus Gärtner | Uli Hahn | Sibylle Hermann
Markus Gärtner | Uli Hahn | Sibylle Hermann
What’s Wrong, Python? – A Visual Differ and Graph Library for NLP in Python
Balázs Indig | András Simonyi | Noémi Ligeti-Nagy
Balázs Indig | András Simonyi | Noémi Ligeti-Nagy
ScholarGraph:a Chinese Knowledge Graph of Chinese Scholars
Shuo Wang | Zehui Hao | Xiaofeng Meng | Qiuyue Wang
Shuo Wang | Zehui Hao | Xiaofeng Meng | Qiuyue Wang
Enriching Frame Representations with Distributionally Induced Senses
Stefano Faralli | Alexander Panchenko | Chris Biemann | Simone Paolo Ponzetto
Stefano Faralli | Alexander Panchenko | Chris Biemann | Simone Paolo Ponzetto
An Integrated Formal Representation for Terminological and Lexical Data included in Classification Schemes
Thierry Declerck | Kseniya Egorova | Eileen Schnur
Thierry Declerck | Kseniya Egorova | Eileen Schnur
One event, many representations. Mapping action concepts through visual features.
Alessandro Panunzi | Lorenzo Gregori | Andrea Amelio Ravelli
Alessandro Panunzi | Lorenzo Gregori | Andrea Amelio Ravelli
Sentiment-Stance-Specificity (SSS) Dataset: Identifying Support-based Entailment among Opinions.
Pavithra Rajendran | Danushka Bollegala | Simon Parsons
Pavithra Rajendran | Danushka Bollegala | Simon Parsons
Resource Creation Towards Automated Sentiment Analysis in Telugu (a low resource language) and Integrating Multiple Domain Sources to Enhance Sentiment Prediction
Rama Rohit Reddy Gangula | Radhika Mamidi
Rama Rohit Reddy Gangula | Radhika Mamidi
Multilingual Multi-class Sentiment Classification Using Convolutional Neural Networks
Mohammed Attia | Younes Samih | Ali Elkahky | Laura Kallmeyer
Mohammed Attia | Younes Samih | Ali Elkahky | Laura Kallmeyer
HappyDB: A Corpus of 100,000 Crowdsourced Happy Moments
Akari Asai | Sara Evensen | Behzad Golshan | Alon Halevy | Vivian Li | Andrei Lopatenko | Daniela Stepanov | Yoshihiko Suhara | Wang-Chiew Tan | Yinzhan Xu
Akari Asai | Sara Evensen | Behzad Golshan | Alon Halevy | Vivian Li | Andrei Lopatenko | Daniela Stepanov | Yoshihiko Suhara | Wang-Chiew Tan | Yinzhan Xu
MultiBooked: A Corpus of Basque and Catalan Hotel Reviews Annotated for Aspect-level Sentiment Classification
Jeremy Barnes | Toni Badia | Patrik Lambert
Jeremy Barnes | Toni Badia | Patrik Lambert
Collecting Code-Switched Data from Social Media
Gideon Mendels | Victor Soto | Aaron Jaech | Julia Hirschberg
Gideon Mendels | Victor Soto | Aaron Jaech | Julia Hirschberg
A Taxonomy for In-depth Evaluation of Normalization for User Generated Content
Rob van der Goot | Rik van Noord | Gertjan van Noord
Rob van der Goot | Rik van Noord | Gertjan van Noord
Arap-Tweet: A Large Multi-Dialect Twitter Corpus for Gender, Age and Language Variety Identification
Wajdi Zaghouani | Anis Charfi
Wajdi Zaghouani | Anis Charfi
Transc&Anno: A Graphical Tool for the Transcription and On-the-Fly Annotation of Handwritten Documents
Nadezda Okinina | Lionel Nicolas | Verena Lyding
Nadezda Okinina | Lionel Nicolas | Verena Lyding
Correction of OCR Word Segmentation Errors in Articles from the ACL Collection through Neural Machine Translation Methods
Vivi Nastase | Julian Hitschler
Vivi Nastase | Julian Hitschler
Crowdsourced Multimodal Corpora Collection Tool
Patrik Jonell | Catharine Oertel | Dimosthenis Kontogiorgos | Jonas Beskow | Joakim Gustafson
Patrik Jonell | Catharine Oertel | Dimosthenis Kontogiorgos | Jonas Beskow | Joakim Gustafson
Expert Evaluation of a Spoken Dialogue System in a Clinical Operating Room
Juliana Miehle | Nadine Gerstenlauer | Daniel Ostler | Hubertus Feußner | Wolfgang Minker | Stefan Ultes
Juliana Miehle | Nadine Gerstenlauer | Daniel Ostler | Hubertus Feußner | Wolfgang Minker | Stefan Ultes
The Metalogue Debate Trainee Corpus: Data Collection and Annotations
Volha Petukhova | Andrei Malchanau | Youssef Oualil | Dietrich Klakow | Saturnino Luz | Fasih Haider | Nick Campbell | Dimitris Koryzis | Dimitris Spiliotopoulos | Pierre Albert | Nicklas Linz | Jan Alexandersson
Volha Petukhova | Andrei Malchanau | Youssef Oualil | Dietrich Klakow | Saturnino Luz | Fasih Haider | Nick Campbell | Dimitris Koryzis | Dimitris Spiliotopoulos | Pierre Albert | Nicklas Linz | Jan Alexandersson
Towards Continuous Dialogue Corpus Creation: writing to corpus and generating from it
Andrei Malchanau | Volha Petukhova | Harry Bunt
Andrei Malchanau | Volha Petukhova | Harry Bunt
KTH Tangrams: A Dataset for Research on Alignment and Conceptual Pacts in Task-Oriented Dialogue
Todd Shore | Theofronia Androulakaki | Gabriel Skantze
Todd Shore | Theofronia Androulakaki | Gabriel Skantze
On the Vector Representation of Utterances in Dialogue Context
Louisa Pragst | Niklas Rach | Wolfgang Minker | Stefan Ultes
Louisa Pragst | Niklas Rach | Wolfgang Minker | Stefan Ultes
ES-Port: a Spontaneous Spoken Human-Human Technical Support Corpus for Dialogue Research in Spanish
Laura García-Sardiña | Manex Serras | Arantza del Pozo
Laura García-Sardiña | Manex Serras | Arantza del Pozo
From analysis to modeling of engagement as sequences of multimodal behaviors
Soumia Dermouche | Catherine Pelachaud
Soumia Dermouche | Catherine Pelachaud
Building Literary Corpora for Computational Literary Analysis - A Prototype to Bridge the Gap between CL and DH
Andrew Frank | Christine Ivanovic
Andrew Frank | Christine Ivanovic
Towards faithfully visualizing global linguistic diversity
Garland McNew | Curdin Derungs | Steven Moran
Garland McNew | Curdin Derungs | Steven Moran
Identifying Speakers and Addressees in Dialogues Extracted from Literary Fiction
Adam Ek | Mats Wirén | Robert Östling | Kristina N. Björkenstam | Gintarė Grigonytė | Sofia Gustafson Capková
Adam Ek | Mats Wirén | Robert Östling | Kristina N. Björkenstam | Gintarė Grigonytė | Sofia Gustafson Capková
Word Embedding Evaluation Datasets and Wikipedia Title Embedding for Chinese
Chi-Yen Chen | Wei-Yun Ma
Chi-Yen Chen | Wei-Yun Ma
An Automatic Learning of an Algerian Dialect Lexicon by using Multilingual Word Embeddings
Abidi Karima | Kamel Smaïli
Abidi Karima | Kamel Smaïli
Candidate Ranking for Maintenance of an Online Dictionary
Claire Broad | Helen Langone | David Guy Brizan
Claire Broad | Helen Langone | David Guy Brizan
Tools for Building an Interlinked Synonym Lexicon Network
Zdeňka Urešová | Eva Fučíková | Eva Hajičová | Jan Hajič
Zdeňka Urešová | Eva Fučíková | Eva Hajičová | Jan Hajič
Combining Concepts and Their Translations from Structured Dictionaries of Uralic Minority Languages
Mika Hämäläinen | Liisa Lotta Tarvainen | Jack Rueter
Mika Hämäläinen | Liisa Lotta Tarvainen | Jack Rueter
Transfer of Frames from English FrameNet to Construct Chinese FrameNet: A Bilingual Corpus-Based Approach
Tsung-Han Yang | Hen-Hsen Huang | An-Zi Yen | Hsin-Hsi Chen
Tsung-Han Yang | Hen-Hsen Huang | An-Zi Yen | Hsin-Hsi Chen
EFLLex: A Graded Lexical Resource for Learners of English as a Foreign Language
Luise Dürlich | Thomas François
Luise Dürlich | Thomas François
English-Basque Statistical and Neural Machine Translation
Inigo Jauregi Unanue | Lierni Garmendia Arratibel | Ehsan Zare Borzeshi | Massimo Piccardi
Inigo Jauregi Unanue | Lierni Garmendia Arratibel | Ehsan Zare Borzeshi | Massimo Piccardi
TQ-AutoTest – An Automated Test Suite for (Machine) Translation Quality
Vivien Macketanz | Renlong Ai | Aljoscha Burchardt | Hans Uszkoreit
Vivien Macketanz | Renlong Ai | Aljoscha Burchardt | Hans Uszkoreit
Improving a Multi-Source Neural Machine Translation Model with Corpus Extension for Low-Resource Languages
Gyu-Hyeon Choi | Jong-Hun Shin | Young-Kil Kim
Gyu-Hyeon Choi | Jong-Hun Shin | Young-Kil Kim
Dynamic Oracle for Neural Machine Translation in Decoding Phase
Zi-Yi Dou | Hao Zhou | Shu-Jian Huang | Xin-Yu Dai | Jia-Jun Chen
Zi-Yi Dou | Hao Zhou | Shu-Jian Huang | Xin-Yu Dai | Jia-Jun Chen
A Parallel Corpus of Arabic-Japanese News Articles
Go Inoue | Nizar Habash | Yuji Matsumoto | Hiroyuki Aoyama
Go Inoue | Nizar Habash | Yuji Matsumoto | Hiroyuki Aoyama
Examining the Tip of the Iceberg: A Data Set for Idiom Translation
Marzieh Fadaee | Arianna Bisazza | Christof Monz
Marzieh Fadaee | Arianna Bisazza | Christof Monz
Automatic Enrichment of Terminological Resources: the IATE RDF Example
Mihael Arcan | Elena Montiel-Ponsoda | John P. McCrae | Paul Buitelaar
Mihael Arcan | Elena Montiel-Ponsoda | John P. McCrae | Paul Buitelaar
A Comparative Study of Extremely Low-Resource Transliteration of the World’s Languages
Winston Wu | David Yarowsky
Winston Wu | David Yarowsky
Translating Web Search Queries into Natural Language Questions
Adarsh Kumar | Sandipan Dandapat | Sushil Chordia
Adarsh Kumar | Sandipan Dandapat | Sushil Chordia
Acquiring Verb Classes Through Bottom-Up Semantic Verb Clustering
Olga Majewska | Diana McCarthy | Ivan Vulić | Anna Korhonen
Olga Majewska | Diana McCarthy | Ivan Vulić | Anna Korhonen
Constructing High Quality Sense-specific Corpus and Word Embedding via Unsupervised Elimination of Pseudo Multi-sense
Haoyue Shi | Xihao Wang | Yuqi Sun | Junfeng Hu
Haoyue Shi | Xihao Wang | Yuqi Sun | Junfeng Hu
Social Image Tags as a Source of Word Embeddings: A Task-oriented Evaluation
Mika Hasegawa | Tetsunori Kobayashi | Yoshihiko Hayashi
Mika Hasegawa | Tetsunori Kobayashi | Yoshihiko Hayashi
Semantic Frame Parsing for Information Extraction : the CALOR corpus
Gabriel Marzinotto | Jeremy Auguste | Frederic Bechet | Geraldine Damnati | Alexis Nasr
Gabriel Marzinotto | Jeremy Auguste | Frederic Bechet | Geraldine Damnati | Alexis Nasr
Using a Corpus of English and Chinese Political Speeches for Metaphor Analysis
Kathleen Ahrens | Huiheng Zeng | Shun-han Rebekah Wong
Kathleen Ahrens | Huiheng Zeng | Shun-han Rebekah Wong
A Multi- versus a Single-classifier Approach for the Identification of Modality in the Portuguese Language
João Sequeira | Teresa Gonçalves | Paulo Quaresma | Amália Mendes | Iris Hendrickx
João Sequeira | Teresa Gonçalves | Paulo Quaresma | Amália Mendes | Iris Hendrickx
All-words Word Sense Disambiguation Using Concept Embeddings
Rui Suzuki | Kanako Komiya | Masayuki Asahara | Minoru Sasaki | Hiroyuki Shinnou
Rui Suzuki | Kanako Komiya | Masayuki Asahara | Minoru Sasaki | Hiroyuki Shinnou
Enhancing Modern Supervised Word Sense Disambiguation Models by Semantic Lexical Resources
Stefano Melacci | Achille Globo | Leonardo Rigutini
Stefano Melacci | Achille Globo | Leonardo Rigutini
An Unsupervised Word Sense Disambiguation System for Under-Resourced Languages
Dmitry Ustalov | Denis Teslenko | Alexander Panchenko | Mikhail Chernoskutov | Chris Biemann | Simone Paolo Ponzetto
Dmitry Ustalov | Denis Teslenko | Alexander Panchenko | Mikhail Chernoskutov | Chris Biemann | Simone Paolo Ponzetto
Unsupervised Korean Word Sense Disambiguation using CoreNet
Kijong Han | Sangha Nam | Jiseong Kim | Younggyun Hahm | Key-Sun Choi
Kijong Han | Sangha Nam | Jiseong Kim | Younggyun Hahm | Key-Sun Choi
UFSAC: Unification of Sense Annotated Corpora and Tools
Loïc Vial | Benjamin Lecouteux | Didier Schwab
Loïc Vial | Benjamin Lecouteux | Didier Schwab
Retrofitting Word Representations for Unsupervised Sense Aware Word Similarities
Steffen Remus | Chris Biemann
Steffen Remus | Chris Biemann
FastSense: An Efficient Word Sense Disambiguation Classifier
Tolga Uslu | Alexander Mehler | Daniel Baumartz | Wahed Hemati
Tolga Uslu | Alexander Mehler | Daniel Baumartz | Wahed Hemati
Text Annotation Graphs: Annotating Complex Natural Language Phenomena
Angus Forbes | Kristine Lee | Gus Hahn-Powell | Marco A. Valenzuela-Escárcega | Mihai Surdeanu
Angus Forbes | Kristine Lee | Gus Hahn-Powell | Marco A. Valenzuela-Escárcega | Mihai Surdeanu
Tools for The Production of Analogical Grids and a Resource of N-gram Analogical Grids in 11 Languages
Rashel Fam | Yves Lepage
Rashel Fam | Yves Lepage
The Automatic Annotation of the Semiotic Type of Hand Gestures in Obama’ s Humorous Speeches
Costanza Navarretta
Costanza Navarretta
Annotation and Quantitative Analysis of Speaker Information in Novel Conversation Sentences in Japanese
Makoto Yamazaki | Yumi Miyazaki | Wakako Kashino
Makoto Yamazaki | Yumi Miyazaki | Wakako Kashino
PDFAnno: a Web-based Linguistic Annotation Tool for PDF Documents
Hiroyuki Shindo | Yohei Munesada | Yuji Matsumoto
Hiroyuki Shindo | Yohei Munesada | Yuji Matsumoto
An Annotation Language for Semantic Search of Legal Sources
Adeline Nazarenko | François Levy | Adam Wyner
Adeline Nazarenko | François Levy | Adam Wyner
Resource Interoperability for Sustainable Benchmarking: The Case of Events
Chantal van Son | Oana Inel | Roser Morante | Lora Aroyo | Piek Vossen
Chantal van Son | Oana Inel | Roser Morante | Lora Aroyo | Piek Vossen
Parsivar: A Language Processing Toolkit for Persian
Salar Mohtaj | Behnam Roshanfekr | Atefeh Zafarian | Habibollah Asghari
Salar Mohtaj | Behnam Roshanfekr | Atefeh Zafarian | Habibollah Asghari
Multilingual Word Segmentation: Training Many Language-Specific Tokenizers Smoothly Thanks to the Universal Dependencies Corpus
Erwan Moreau | Carl Vogel
Erwan Moreau | Carl Vogel
Building a Corpus for Personality-dependent Natural Language Understanding and Generation
Ricelli Ramos | Georges Neto | Barbara Silva | Danielle Monteiro | Ivandré Paraboni | Rafael Dias
Ricelli Ramos | Georges Neto | Barbara Silva | Danielle Monteiro | Ivandré Paraboni | Rafael Dias
Linguistic and Sociolinguistic Annotation of 17th Century Dutch Letters
Marijn Schraagen | Feike Dietz | Marjo van Koppen
Marijn Schraagen | Feike Dietz | Marjo van Koppen
ASAP++: Enriching the ASAP Automated Essay Grading Dataset with Essay Attribute Scores
Sandeep Mathias | Pushpak Bhattacharyya
Sandeep Mathias | Pushpak Bhattacharyya
MirasText: An Automatically Generated Text Corpus for Persian
Behnam Sabeti | Hossein Abedi Firouzjaee | Ali Janalizadeh Choobbasti | S.H.E. Mortazavi Najafabadi | Amir Vaheb
Behnam Sabeti | Hossein Abedi Firouzjaee | Ali Janalizadeh Choobbasti | S.H.E. Mortazavi Najafabadi | Amir Vaheb
The Reference Corpus of the Contemporary Romanian Language (CoRoLa)
Verginica Barbu Mititelu | Dan Tufiș | Elena Irimia
Verginica Barbu Mititelu | Dan Tufiș | Elena Irimia
A Corpus of Drug Usage Guidelines Annotated with Type of Advice
Sarah Masud Preum | Md. Rizwan Parvez | Kai-Wei Chang | John Stankovic
Sarah Masud Preum | Md. Rizwan Parvez | Kai-Wei Chang | John Stankovic
A Comparison Of Emotion Annotation Schemes And A New Annotated Data Set
Ian D. Wood | John P. McCrae | Vladimir Andryushechkin | Paul Buitelaar
Ian D. Wood | John P. McCrae | Vladimir Andryushechkin | Paul Buitelaar
Humor Detection in English-Hindi Code-Mixed Social Media Content : Corpus and Baseline System
Ankush Khandelwal | Sahil Swami | Syed S. Akhtar | Manish Shrivastava
Ankush Khandelwal | Sahil Swami | Syed S. Akhtar | Manish Shrivastava
Dialogue Scenario Collection of Persuasive Dialogue with Emotional Expressions via Crowdsourcing
Koichiro Yoshino | Yoko Ishikawa | Masahiro Mizukami | Yu Suzuki | Sakriani Sakti | Satoshi Nakamura
Koichiro Yoshino | Yoko Ishikawa | Masahiro Mizukami | Yu Suzuki | Sakriani Sakti | Satoshi Nakamura
Contextual Dependencies in Time-Continuous Multidimensional Affect Recognition
Dmitrii Fedotov | Denis Ivanko | Maxim Sidorov | Wolfgang Minker
Dmitrii Fedotov | Denis Ivanko | Maxim Sidorov | Wolfgang Minker
WikiArt Emotions: An Annotated Dataset of Emotions Evoked by Art
Saif Mohammad | Svetlana Kiritchenko
Saif Mohammad | Svetlana Kiritchenko
Arabic Data Science Toolkit: An API for Arabic Language Feature Extraction
Paul Rodrigues | Valerie Novak | C. Anton Rytting | Julie Yelle | Jennifer Boutz
Paul Rodrigues | Valerie Novak | C. Anton Rytting | Julie Yelle | Jennifer Boutz
Sentence and Clause Level Emotion Annotation, Detection, and Classification in a Multi-Genre Corpus
Shabnam Tafreshi | Mona Diab
Shabnam Tafreshi | Mona Diab
A Swedish Cookie-Theft Corpus
Dimitrios Kokkinakis | Kristina Lundholm Fors | Kathleen Fraser | Arto Nordlund
Dimitrios Kokkinakis | Kristina Lundholm Fors | Kathleen Fraser | Arto Nordlund
Sharing Copies of Synthetic Clinical Corpora without Physical Distribution — A Case Study to Get Around IPRs and Privacy Constraints Featuring the German JSYNCC Corpus
Christina Lohr | Sven Buechel | Udo Hahn
Christina Lohr | Sven Buechel | Udo Hahn
A Legal Perspective on Training Models for Natural Language Processing
Richard Eckart de Castilho | Giulia Dore | Thomas Margoni | Penny Labropoulou | Iryna Gurevych
Richard Eckart de Castilho | Giulia Dore | Thomas Margoni | Penny Labropoulou | Iryna Gurevych
LREMap, a Song of Resources and Evaluation
Riccardo Del Gratta | Sara Goggi | Gabriella Pardelli | Nicoletta Calzolari
Riccardo Del Gratta | Sara Goggi | Gabriella Pardelli | Nicoletta Calzolari
Metadata Collection Records for Language Resources
Henk van den Heuvel | Erwin Komen | Nelleke Oostdijk
Henk van den Heuvel | Erwin Komen | Nelleke Oostdijk
Managing Public Sector Data for Multilingual Applications Development
Stelios Piperidis | Penny Labropoulou | Miltos Deligiannis | Maria Giagkou
Stelios Piperidis | Penny Labropoulou | Miltos Deligiannis | Maria Giagkou
Bridging the LAPPS Grid and CLARIN
Erhard Hinrichs | Nancy Ide | James Pustejovsky | Jan Hajič | Marie Hinrichs | Mohammad Fazleh Elahi | Keith Suderman | Marc Verhagen | Kyeongmin Rim | Pavel Straňák | Jozef Mišutka
Erhard Hinrichs | Nancy Ide | James Pustejovsky | Jan Hajič | Marie Hinrichs | Mohammad Fazleh Elahi | Keith Suderman | Marc Verhagen | Kyeongmin Rim | Pavel Straňák | Jozef Mišutka
Fluid Annotation: A Granularity-aware Annotation Tool for Chinese Word Fluidity
Shu-Kai Hsieh | Yu-Hsiang Tseng | Chih-Yao Lee | Chiung-Yu Chiang
Shu-Kai Hsieh | Yu-Hsiang Tseng | Chih-Yao Lee | Chiung-Yu Chiang
E-magyar – A Digital Language Processing System
Tamás Váradi | Eszter Simon | Bálint Sass | Iván Mittelholcz | Attila Novák | Balázs Indig | Richárd Farkas | Veronika Vincze
Tamás Váradi | Eszter Simon | Bálint Sass | Iván Mittelholcz | Attila Novák | Balázs Indig | Richárd Farkas | Veronika Vincze
ILCM - A Virtual Research Infrastructure for Large-Scale Qualitative Data
Andreas Niekler | Arnim Bleier | Christian Kahmann | Lisa Posch | Gregor Wiedemann | Kenan Erdogan | Gerhard Heyer | Markus Strohmaier
Andreas Niekler | Arnim Bleier | Christian Kahmann | Lisa Posch | Gregor Wiedemann | Kenan Erdogan | Gerhard Heyer | Markus Strohmaier
Indra: A Word Embedding and Semantic Relatedness Server
Juliano Efson Sales | Leonardo Souza | Siamak Barzegar | Brian Davis | André Freitas | Siegfried Handschuh
Juliano Efson Sales | Leonardo Souza | Siamak Barzegar | Brian Davis | André Freitas | Siegfried Handschuh
A UIMA Database Interface for Managing NLP-related Text Annotations
Giuseppe Abrami | Alexander Mehler
Giuseppe Abrami | Alexander Mehler
European Language Resource Coordination: Collecting Language Resources for Public Sector Multilingual Information Management
Andrea Lösch | Valérie Mapelli | Stelios Piperidis | Andrejs Vasiļjevs | Lilli Smal | Thierry Declerck | Eileen Schnur | Khalid Choukri | Josef van Genabith
Andrea Lösch | Valérie Mapelli | Stelios Piperidis | Andrejs Vasiļjevs | Lilli Smal | Thierry Declerck | Eileen Schnur | Khalid Choukri | Josef van Genabith
Tilde MT Platform for Developing Client Specific MT Solutions
Mārcis Pinnis | Andrejs Vasiļjevs | Rihards Kalniņš | Roberts Rozis | Raivis Skadiņš | Valters Šics
Mārcis Pinnis | Andrejs Vasiļjevs | Rihards Kalniņš | Roberts Rozis | Raivis Skadiņš | Valters Šics
Improving homograph disambiguation with supervised machine learning
Kyle Gorman | Gleb Mazovetskiy | Vitaly Nikolaev
Kyle Gorman | Gleb Mazovetskiy | Vitaly Nikolaev
Text Normalization Infrastructure that Scales to Hundreds of Language Varieties
Mason Chua | Daan van Esch | Noah Coccaro | Eunjoon Cho | Sujeet Bhandari | Libin Jia
Mason Chua | Daan van Esch | Noah Coccaro | Eunjoon Cho | Sujeet Bhandari | Libin Jia
DeModify: A Dataset for Analyzing Contextual Constraints on Modifier Deletion
Vivi Nastase | Devon Fritz | Anette Frank
Vivi Nastase | Devon Fritz | Anette Frank
Fine-grained Semantic Textual Similarity for Serbian
Vuk Batanović | Miloš Cvetanović | Boško Nikolić
Vuk Batanović | Miloš Cvetanović | Boško Nikolić
ETPC - A Paraphrase Identification Corpus Annotated with Extended Paraphrase Typology and Negation
Venelin Kovatchev | M. Antònia Martí | Maria Salamó
Venelin Kovatchev | M. Antònia Martí | Maria Salamó
Introducing a Lexicon of Verbal Polarity Shifters for English
Marc Schulder | Michael Wiegand | Josef Ruppenhofer | Stephanie Köser
Marc Schulder | Michael Wiegand | Josef Ruppenhofer | Stephanie Köser
Quantifying Qualitative Data for Understanding Controversial Issues
Michael Wojatzki | Saif Mohammad | Torsten Zesch | Svetlana Kiritchenko
Michael Wojatzki | Saif Mohammad | Torsten Zesch | Svetlana Kiritchenko
Distribution of Emotional Reactions to News Articles in Twitter
Omar Juárez Gambino | Hiram Calvo | Consuelo-Varinia García-Mendoza
Omar Juárez Gambino | Hiram Calvo | Consuelo-Varinia García-Mendoza
Aggression-annotated Corpus of Hindi-English Code-mixed Data
Ritesh Kumar | Aishwarya N. Reganti | Akshit Bhatia | Tushar Maheshwari
Ritesh Kumar | Aishwarya N. Reganti | Akshit Bhatia | Tushar Maheshwari
Creating a Verb Synonym Lexicon Based on a Parallel Corpus
Zdeňka Urešová | Eva Fučíková | Eva Hajičová | Jan Hajič
Zdeňka Urešová | Eva Fučíková | Eva Hajičová | Jan Hajič
Evaluation of Domain-specific Word Embeddings using Knowledge Resources
Farhad Nooralahzadeh | Lilja Øvrelid | Jan Tore Lønning
Farhad Nooralahzadeh | Lilja Øvrelid | Jan Tore Lønning
Automatic Wordnet Mapping: from CoreNet to Princeton WordNet
Jiseong Kim | Younggyun Hahm | Sunggoo Kwon | Key-Sun Choi
Jiseong Kim | Younggyun Hahm | Sunggoo Kwon | Key-Sun Choi
The New Propbank: Aligning Propbank with AMR through POS Unification
Tim O’Gorman | Sameer Pradhan | Martha Palmer | Julia Bonn | Katie Conger | James Gung
Tim O’Gorman | Sameer Pradhan | Martha Palmer | Julia Bonn | Katie Conger | James Gung
The Boarnsterhim Corpus: A Bilingual Frisian-Dutch Panel and Trend Study
Marjoleine Sloos | Eduard Drenth | Wilbert Heeringa
Marjoleine Sloos | Eduard Drenth | Wilbert Heeringa
The French-Algerian Code-Switching Triggered audio corpus (FACST)
Amazouz Djegdjiga | Martine Adda-Decker | Lori Lamel
Amazouz Djegdjiga | Martine Adda-Decker | Lori Lamel
Strategies and Challenges for Crowdsourcing Regional Dialect Perception Data for Swiss German and Swiss French
Jean-Philippe Goldman | Simon Clematide | Mathieu Avanzi | Raphael Tandler
Jean-Philippe Goldman | Simon Clematide | Mathieu Avanzi | Raphael Tandler
Phonetically Balanced Code-Mixed Speech Corpus for Hindi-English Automatic Speech Recognition
Ayushi Pandey | Brij Mohan Lal Srivastava | Rohit Kumar | Bhanu Teja Nellore | Kasi Sai Teja | Suryakanth V. Gangashetty
Ayushi Pandey | Brij Mohan Lal Srivastava | Rohit Kumar | Bhanu Teja Nellore | Kasi Sai Teja | Suryakanth V. Gangashetty
Chinese-Portuguese Machine Translation: A Study on Building Parallel Corpora from Comparable Texts
Siyou Liu | Longyue Wang | Chao-Hong Liu
Siyou Liu | Longyue Wang | Chao-Hong Liu
Evaluating the WordsEye Text-to-Scene System: Imaginative and Realistic Sentences
Morgan Ulinski | Bob Coyne | Julia Hirschberg
Morgan Ulinski | Bob Coyne | Julia Hirschberg
Computer-assisted Speaker Diarization: How to Evaluate Human Corrections
Pierre-Alexandre Broux | David Doukhan | Simon Petitrenaud | Sylvain Meignier | Jean Carrive
Pierre-Alexandre Broux | David Doukhan | Simon Petitrenaud | Sylvain Meignier | Jean Carrive
Performance Impact Caused by Hidden Bias of Training Data for Recognizing Textual Entailment
Masatoshi Tsuchiya
Masatoshi Tsuchiya
A Corpus of Metaphor Novelty Scores for Syntactically-Related Word Pairs
Natalie Parde | Rodney Nielsen
Natalie Parde | Rodney Nielsen
Improving Hypernymy Extraction with Distributional Semantic Classes
Alexander Panchenko | Dmitry Ustalov | Stefano Faralli | Simone P. Ponzetto | Chris Biemann
Alexander Panchenko | Dmitry Ustalov | Stefano Faralli | Simone P. Ponzetto | Chris Biemann
Laying the Groundwork for Knowledge Base Population: Nine Years of Linguistic Resources for TAC KBP
Jeremy Getman | Joe Ellis | Stephanie Strassel | Zhiyi Song | Jennifer Tracey
Jeremy Getman | Joe Ellis | Stephanie Strassel | Zhiyi Song | Jennifer Tracey
A Dataset for Inter-Sentence Relation Extraction using Distant Supervision
Angrosh Mandya | Danushka Bollegala | Frans Coenen | Katie Atkinson
Angrosh Mandya | Danushka Bollegala | Frans Coenen | Katie Atkinson
Diacritics Restoration Using Neural Networks
Jakub Náplava | Milan Straka | Pavel Straňák | Jan Hajič
Jakub Náplava | Milan Straka | Pavel Straňák | Jan Hajič
Ensemble Romanian Dependency Parsing with Neural Networks
Radu Ion | Elena Irimia | Verginica Barbu Mititelu
Radu Ion | Elena Irimia | Verginica Barbu Mititelu
Collection of Multimodal Dialog Data and Analysis of the Result of Annotation of Users’ Interest Level
Masahiro Araki | Sayaka Tomimasu | Mikio Nakano | Kazunori Komatani | Shogo Okada | Shinya Fujie | Hiroaki Sugiyama
Masahiro Araki | Sayaka Tomimasu | Mikio Nakano | Kazunori Komatani | Shogo Okada | Shinya Fujie | Hiroaki Sugiyama
Recognizing Behavioral Factors while Driving: A Real-World Multimodal Corpus to Monitor the Driver’s Affective State
Alicia Lotz | Klas Ihme | Audrey Charnoz | Pantelis Maroudis | Ivan Dmitriev | Andreas Wendemuth
Alicia Lotz | Klas Ihme | Audrey Charnoz | Pantelis Maroudis | Ivan Dmitriev | Andreas Wendemuth
EmotionLines: An Emotion Corpus of Multi-Party Conversations
Chao-Chun Hsu | Sheng-Yeh Chen | Chuan-Chun Kuo | Ting-Hao Huang | Lun-Wei Ku
Chao-Chun Hsu | Sheng-Yeh Chen | Chuan-Chun Kuo | Ting-Hao Huang | Lun-Wei Ku
Academic-Industrial Perspective on the Development and Deployment of a Moderation System for a Newspaper Website
Dietmar Schabus | Marcin Skowron
Dietmar Schabus | Marcin Skowron
Community-Driven Crowdsourcing: Data Collection with Local Developers
Christina Funk | Michael Tseng | Ravindran Rajakumar | Linne Ha
Christina Funk | Michael Tseng | Ravindran Rajakumar | Linne Ha
Building Open Javanese and Sundanese Corpora for Multilingual Text-to-Speech
Jaka Aris Eko Wibawa | Supheakmungkol Sarin | Chenfang Li | Knot Pipatsrisawat | Keshan Sodimana | Oddur Kjartansson | Alexander Gutkin | Martin Jansche | Linne Ha
Jaka Aris Eko Wibawa | Supheakmungkol Sarin | Chenfang Li | Knot Pipatsrisawat | Keshan Sodimana | Oddur Kjartansson | Alexander Gutkin | Martin Jansche | Linne Ha
An Integrated Representation of Linguistic and Social Functions of Code-Switching
Silvana Hartmann | Monojit Choudhury | Kalika Bali
Silvana Hartmann | Monojit Choudhury | Kalika Bali
A Corpus of eRulemaking User Comments for Measuring Evaluability of Arguments
Joonsuk Park | Claire Cardie
Joonsuk Park | Claire Cardie
A Multi-layer Annotated Corpus of Argumentative Text: From Argument Schemes to Discourse Relations
Elena Musi | Tariq Alhindi | Manfred Stede | Leonard Kriese | Smaranda Muresan | Andrea Rocci
Elena Musi | Tariq Alhindi | Manfred Stede | Leonard Kriese | Smaranda Muresan | Andrea Rocci
Discourse Coherence Through the Lens of an Annotated Text Corpus: A Case Study
Eva Hajičová | Jiří Mírovský
Eva Hajičová | Jiří Mírovský
Automatic Prediction of Discourse Connectives
Eric Malmi | Daniele Pighin | Sebastian Krause | Mikhail Kozhevnikov
Eric Malmi | Daniele Pighin | Sebastian Krause | Mikhail Kozhevnikov
Handling Rare Word Problem using Synthetic Training Data for Sinhala and Tamil Neural Machine Translation
Pasindu Tennage | Prabath Sandaruwan | Malith Thilakarathne | Achini Herath | Surangika Ranathunga
Pasindu Tennage | Prabath Sandaruwan | Malith Thilakarathne | Achini Herath | Surangika Ranathunga
BDPROTO: A Database of Phonological Inventories from Ancient and Reconstructed Languages
Egidio Marsico | Sebastien Flavier | Annemarie Verkerk | Steven Moran
Egidio Marsico | Sebastien Flavier | Annemarie Verkerk | Steven Moran
Creating a Translation Matrix of the Bible’s Names Across 591 Languages
Winston Wu | Nidhi Vyas | David Yarowsky
Winston Wu | Nidhi Vyas | David Yarowsky
Building a Word Segmenter for Sanskrit Overnight
Vikas Reddy | Amrith Krishna | Vishnu Sharma | Prateek Gupta | Vineeth M R | Pawan Goyal
Vikas Reddy | Amrith Krishna | Vishnu Sharma | Prateek Gupta | Vineeth M R | Pawan Goyal
Simple Semantic Annotation and Situation Frames: Two Approaches to Basic Text Understanding in LORELEI
Kira Griffitt | Jennifer Tracey | Ann Bies | Stephanie Strassel
Kira Griffitt | Jennifer Tracey | Ann Bies | Stephanie Strassel
Abstract Meaning Representation of Constructions: The More We Include, the Better the Representation
Claire Bonial | Bianca Badarau | Kira Griffitt | Ulf Hermjakob | Kevin Knight | Tim O’Gorman | Martha Palmer | Nathan Schneider
Claire Bonial | Bianca Badarau | Kira Griffitt | Ulf Hermjakob | Kevin Knight | Tim O’Gorman | Martha Palmer | Nathan Schneider
Evaluating Scoped Meaning Representations
Rik van Noord | Lasha Abzianidze | Hessel Haagsma | Johan Bos
Rik van Noord | Lasha Abzianidze | Hessel Haagsma | Johan Bos
Huge Automatically Extracted Training-Sets for Multilingual Word SenseDisambiguation
Tommaso Pasini | Francesco Elia | Roberto Navigli
Tommaso Pasini | Francesco Elia | Roberto Navigli
A Survey on Automatically-Constructed WordNets and their Evaluation: Lexical and Word Embedding-based Approaches
Steven Neale
Steven Neale
Linguistically-driven Framework for Computationally Efficient and Scalable Sign Recognition
Dimitris Metaxas | Mark Dilsizian | Carol Neidle
Dimitris Metaxas | Mark Dilsizian | Carol Neidle
CONDUCT: An Expressive Conducting Gesture Dataset for Sound Control
Lei Chen | Sylvie Gibet | Camille Marteau
Lei Chen | Sylvie Gibet | Camille Marteau
MPST: A Corpus of Movie Plot Synopses with Tags
Sudipta Kar | Suraj Maharjan | A. Pastor López-Monroy | Thamar Solorio
Sudipta Kar | Suraj Maharjan | A. Pastor López-Monroy | Thamar Solorio
OpenSubtitles2018: Statistical Rescoring of Sentence Alignments in Large, Noisy Parallel Corpora
Pierre Lison | Jörg Tiedemann | Milen Kouylekov
Pierre Lison | Jörg Tiedemann | Milen Kouylekov
Building an Ellipsis-aware Chinese Dependency Treebank for Web Text
Xuancheng Ren | Xu Sun | Ji Wen | Bingzhen Wei | Weidong Zhan | Zhiyuan Zhang
Xuancheng Ren | Xu Sun | Ji Wen | Bingzhen Wei | Weidong Zhan | Zhiyuan Zhang
EuroGames16: Evaluating Change Detection in Online Conversation
Cyril Goutte | Yunli Wang | Fangming Liao | Zachary Zanussi | Samuel Larkin | Yuri Grinberg
Cyril Goutte | Yunli Wang | Fangming Liao | Zachary Zanussi | Samuel Larkin | Yuri Grinberg
A Deep Neural Network based Approach for Entity Extraction in Code-Mixed Indian Social Media Text
Deepak Gupta | Asif Ekbal | Pushpak Bhattacharyya
Deepak Gupta | Asif Ekbal | Pushpak Bhattacharyya
PoSTWITA-UD: an Italian Twitter Treebank in Universal Dependencies
Manuela Sanguinetti | Cristina Bosco | Alberto Lavelli | Alessandro Mazzei | Oronzo Antonelli | Fabio Tamburini
Manuela Sanguinetti | Cristina Bosco | Alberto Lavelli | Alessandro Mazzei | Oronzo Antonelli | Fabio Tamburini
Annotating If the Authors of a Tweet are Located at the Locations They Tweet About
Vivek Doudagiri | Alakananda Vempala | Eduardo Blanco
Vivek Doudagiri | Alakananda Vempala | Eduardo Blanco
MOCCA: Measure of Confidence for Corpus Analysis - Automatic Reliability Check of Transcript and Automatic Segmentation
Thomas Kisler | Florian Schiel
Thomas Kisler | Florian Schiel
Towards an ISO Standard for the Annotation of Quantification
Harry Bunt | James Pustejovsky | Kiyong Lee
Harry Bunt | James Pustejovsky | Kiyong Lee
Lightweight Grammatical Annotation in the TEI: New Perspectives
Piotr Bański | Susanne Haaf | Martin Mueller
Piotr Bański | Susanne Haaf | Martin Mueller
A Gold Standard for Multilingual Automatic Term Extraction from Comparable Corpora: Term Structure and Translation Equivalents
Ayla Rigouts Terryn | Véronique Hoste | Els Lefever
Ayla Rigouts Terryn | Véronique Hoste | Els Lefever
Handling Big Data and Sensitive Data Using EUDAT’s Generic Execution Framework and the WebLicht Workflow Engine.
Claus Zinn | Wei Qui | Marie Hinrichs | Emanuel Dima | Alexandr Chernov
Claus Zinn | Wei Qui | Marie Hinrichs | Emanuel Dima | Alexandr Chernov
Building a Web-Scale Dependency-Parsed Corpus from CommonCrawl
Alexander Panchenko | Eugen Ruppert | Stefano Faralli | Simone P. Ponzetto | Chris Biemann
Alexander Panchenko | Eugen Ruppert | Stefano Faralli | Simone P. Ponzetto | Chris Biemann
Universal Dependencies Version 2 for Japanese
Masayuki Asahara | Hiroshi Kanayama | Takaaki Tanaka | Yusuke Miyao | Sumire Uematsu | Shinsuke Mori | Yuji Matsumoto | Mai Omura | Yugo Murawaki
Masayuki Asahara | Hiroshi Kanayama | Takaaki Tanaka | Yusuke Miyao | Sumire Uematsu | Shinsuke Mori | Yuji Matsumoto | Mai Omura | Yugo Murawaki
A New Version of the Składnica Treebank of Polish Harmonised with the Walenty Valency Dictionary
Marcin Woliński | Elżbieta Hajnicz | Tomasz Bartosiak
Marcin Woliński | Elżbieta Hajnicz | Tomasz Bartosiak
Parse Me if You Can: Artificial Treebanks for Parsing Experiments on Elliptical Constructions
Kira Droganova | Daniel Zeman | Jenna Kanerva | Filip Ginter
Kira Droganova | Daniel Zeman | Jenna Kanerva | Filip Ginter
Semi-Automatic Construction of Word-Formation Networks (for Polish and Spanish)
Mateusz Lango | Magda Ševčíková | Zdeněk Žabokrtský
Mateusz Lango | Magda Ševčíková | Zdeněk Žabokrtský
UniMorph 2.0: Universal Morphology
Christo Kirov | Ryan Cotterell | John Sylak-Glassman | Géraldine Walther | Ekaterina Vylomova | Patrick Xia | Manaal Faruqui | Sabrina J. Mielke | Arya McCarthy | Sandra Kübler | David Yarowsky | Jason Eisner | Mans Hulden
Christo Kirov | Ryan Cotterell | John Sylak-Glassman | Géraldine Walther | Ekaterina Vylomova | Patrick Xia | Manaal Faruqui | Sabrina J. Mielke | Arya McCarthy | Sandra Kübler | David Yarowsky | Jason Eisner | Mans Hulden
A Computational Architecture for the Morphology of Upper Tanana
Olga Lovick | Christopher Cox | Miikka Silfverberg | Antti Arppe | Mans Hulden
Olga Lovick | Christopher Cox | Miikka Silfverberg | Antti Arppe | Mans Hulden
Expanding Abbreviations in a Strongly Inflected Language: Are Morphosyntactic Tags Sufficient?
Piotr Żelasko
Piotr Żelasko
A High-Quality Gold Standard for Citation-based Tasks
Michael Färber | Alexander Thiemann | Adam Jatowt
Michael Färber | Alexander Thiemann | Adam Jatowt
Measuring Innovation in Speech and Language Processing Publications.
Joseph Mariani | Gil Francopoulo | Patrick Paroubek
Joseph Mariani | Gil Francopoulo | Patrick Paroubek
PDFdigest: an Adaptable Layout-Aware PDF-to-XML Textual Content Extractor for Scientific Articles
Daniel Ferrés | Horacio Saggion | Francesco Ronzano | Àlex Bravo
Daniel Ferrés | Horacio Saggion | Francesco Ronzano | Àlex Bravo
Automatic Identification of Research Fields in Scientific Papers
Eric Kergosien | Amin Farvardin | Maguelonne Teisseire | Marie-Noëlle Bessagnet | Joachim Schöpfel | Stéphane Chaudiron | Bernard Jacquemin | Annig Lacayrelle | Mathieu Roche | Christian Sallaberry | Jean Philippe Tonneau
Eric Kergosien | Amin Farvardin | Maguelonne Teisseire | Marie-Noëlle Bessagnet | Joachim Schöpfel | Stéphane Chaudiron | Bernard Jacquemin | Annig Lacayrelle | Mathieu Roche | Christian Sallaberry | Jean Philippe Tonneau
Multilingual Extension of PDTB-Style Annotation: The Case of TED Multilingual Discourse Bank
Deniz Zeyrek | Amália Mendes | Murathan Kurfalı
Deniz Zeyrek | Amália Mendes | Murathan Kurfalı
Enhancing the AI2 Diagrams Dataset Using Rhetorical Structure Theory
Tuomo Hiippala | Serafina Orekhova
Tuomo Hiippala | Serafina Orekhova
QUD-Based Annotation of Discourse Structure and Information Structure: Tool and Evaluation
Kordula De Kuthy | Nils Reiter | Arndt Riester
Kordula De Kuthy | Nils Reiter | Arndt Riester
The Spot the Difference corpus: a multi-modal corpus of spontaneous task oriented spoken interactions
José Lopes | Nils Hemmingsson | Oliver Åstrand
José Lopes | Nils Hemmingsson | Oliver Åstrand
A Context-based Approach for Dialogue Act Recognition using Simple Recurrent Neural Networks
Chandrakant Bothe | Cornelius Weber | Sven Magg | Stefan Wermter
Chandrakant Bothe | Cornelius Weber | Sven Magg | Stefan Wermter
TreeAnnotator: Versatile Visual Annotation of Hierarchical Text Relations
Philipp Helfrich | Elias Rieb | Giuseppe Abrami | Andy Lücking | Alexander Mehler
Philipp Helfrich | Elias Rieb | Giuseppe Abrami | Andy Lücking | Alexander Mehler
Chats and Chunks: Annotation and Analysis of Multiparty Long Casual Conversations
Emer Gilmartin | Carl Vogel | Nick Campbell
Emer Gilmartin | Carl Vogel | Nick Campbell
Extending the gold standard for a lexical substitution task: is it worth it?
Ludovic Tanguy | Cécile Fabre | Laura Rivière
Ludovic Tanguy | Cécile Fabre | Laura Rivière
Lexical and Semantic Features for Cross-lingual Text Reuse Classification: an Experiment in English and Latin Paraphrases
Maria Moritz | David Steding
Maria Moritz | David Steding
Evaluation of Dictionary Creating Methods for Finno-Ugric Minority Languages
Zsanett Ferenczi | Iván Mittelholcz | Eszter Simon | Tamás Váradi
Zsanett Ferenczi | Iván Mittelholcz | Eszter Simon | Tamás Váradi
Dysarthric speech evaluation: automatic and perceptual approaches
Imed Laaridh | Christine Meunier | Corinne Fredouille
Imed Laaridh | Christine Meunier | Corinne Fredouille
Towards an Automatic Assessment of Crowdsourced Data for NLU
Patricia Braunger | Wolfgang Maier | Jan Wessling | Maria Schmidt
Patricia Braunger | Wolfgang Maier | Jan Wessling | Maria Schmidt
Visual Choice of Plausible Alternatives: An Evaluation of Image-based Commonsense Causal Reasoning
Jinyoung Yeo | Gyeongbok Lee | Gengyu Wang | Seungtaek Choi | Hyunsouk Cho | Reinald Kim Amplayo | Seung-won Hwang
Jinyoung Yeo | Gyeongbok Lee | Gengyu Wang | Seungtaek Choi | Hyunsouk Cho | Reinald Kim Amplayo | Seung-won Hwang
Is it worth it? Budget-related evaluation metrics for model selection
Filip Klubička | Giancarlo D. Salton | John D. Kelleher
Filip Klubička | Giancarlo D. Salton | John D. Kelleher
Matics Software Suite: New Tools for Evaluation and Data Exploration
Olivier Galibert | Guillaume Bernard | Agnes Delaborde | Sabrina Lecadre | Juliette Kahn
Olivier Galibert | Guillaume Bernard | Agnes Delaborde | Sabrina Lecadre | Juliette Kahn
MIsA: Multilingual “IsA” Extraction from Corpora
Stefano Faralli | Els Lefever | Simone Paolo Ponzetto
Stefano Faralli | Els Lefever | Simone Paolo Ponzetto
A supervised approach to taxonomy extraction using word embeddings
Rajdeep Sarkar | John P. McCrae | Paul Buitelaar
Rajdeep Sarkar | John P. McCrae | Paul Buitelaar
Korean TimeBank Including Relative Temporal Information
Chae-Gyun Lim | Young-Seob Jeong | Ho-Jin Choi
Chae-Gyun Lim | Young-Seob Jeong | Ho-Jin Choi
An Initial Test Collection for Ranked Retrieval of SMS Conversations
Rashmi Sankepally | Douglas W. Oard
Rashmi Sankepally | Douglas W. Oard
FrNewsLink : a corpus linking TV Broadcast News Segments and Press Articles
Nathalie Camelin | Géraldine Damnati | Abdessalam Bouchekif | Anais Landeau | Delphine Charlet | Yannick Estève
Nathalie Camelin | Géraldine Damnati | Abdessalam Bouchekif | Anais Landeau | Delphine Charlet | Yannick Estève
Towards Processing of the Oral History Interviews and Related Printed Documents
Zbyněk Zajíc | Lucie Skorkovská | Petr Neduchal | Pavel Ircing | Josef V. Psutka | Marek Hrúz | Aleš Pražák | Daniel Soutner | Jan Švec | Lukáš Bureš | Luděk Müller
Zbyněk Zajíc | Lucie Skorkovská | Petr Neduchal | Pavel Ircing | Josef V. Psutka | Marek Hrúz | Aleš Pražák | Daniel Soutner | Jan Švec | Lukáš Bureš | Luděk Müller
The Effects of Unimodal Representation Choices on Multimodal Learning
Fernando Tadao Ito | Helena de Medeiros Caseli | Jander Moreira
Fernando Tadao Ito | Helena de Medeiros Caseli | Jander Moreira
The WAW Corpus: The First Corpus of Interpreted Speeches and their Translations for English and Arabic
Ahmed Abdelali | Irina Temnikova | Samy Hedaya | Stephan Vogel
Ahmed Abdelali | Irina Temnikova | Samy Hedaya | Stephan Vogel
Action Verb Corpus
Stephanie Gross | Matthias Hirschmanner | Brigitte Krenn | Friedrich Neubarth | Michael Zillich
Stephanie Gross | Matthias Hirschmanner | Brigitte Krenn | Friedrich Neubarth | Michael Zillich
EMO&LY (EMOtion and AnomaLY) : A new corpus for anomaly detection in an audiovisual stream with emotional context.
Cédric Fayet | Arnaud Delhay | Damien Lolive | Pierre-François Marteau
Cédric Fayet | Arnaud Delhay | Damien Lolive | Pierre-François Marteau
Development of an Annotated Multimodal Dataset for the Investigation of Classification and Summarisation of Presentations using High-Level Paralinguistic Features
Keith Curtis | Nick Campbell | Gareth Jones
Keith Curtis | Nick Campbell | Gareth Jones
GeCoTagger: Annotation of German Verb Complements with Conditional Random Fields
Roman Schneider | Monica Fürbacher
Roman Schneider | Monica Fürbacher
AET: Web-based Adjective Exploration Tool for German
Tatiana Bladier | Esther Seyffarth | Oliver Hellwig | Wiebke Petersen
Tatiana Bladier | Esther Seyffarth | Oliver Hellwig | Wiebke Petersen
Palmyra: A Platform Independent Dependency Annotation Tool for Morphologically Rich Languages
Talha Javed | Nizar Habash | Dima Taji
Talha Javed | Nizar Habash | Dima Taji
Building Universal Dependency Treebanks in Korean
Jayeol Chun | Na-Rae Han | Jena D. Hwang | Jinho D. Choi
Jayeol Chun | Na-Rae Han | Jena D. Hwang | Jinho D. Choi
Multilingual Dependency Parsing for Low-Resource Languages: Case Studies on North Saami and Komi-Zyrian
KyungTae Lim | Niko Partanen | Thierry Poibeau
KyungTae Lim | Niko Partanen | Thierry Poibeau
FonBund: A Library for Combining Cross-lingual Phonological Segment Data
Alexander Gutkin | Martin Jansche | Tatiana Merkulova
Alexander Gutkin | Martin Jansche | Tatiana Merkulova
Voice Builder: A Tool for Building Text-To-Speech Voices
Pasindu De Silva | Theeraphol Wattanavekin | Tang Hao | Knot Pipatsrisawat
Pasindu De Silva | Theeraphol Wattanavekin | Tang Hao | Knot Pipatsrisawat
Sudachi: a Japanese Tokenizer for Business
Kazuma Takaoka | Sorami Hisamoto | Noriko Kawahara | Miho Sakamoto | Yoshitaka Uchida | Yuji Matsumoto
Kazuma Takaoka | Sorami Hisamoto | Noriko Kawahara | Miho Sakamoto | Yoshitaka Uchida | Yuji Matsumoto
Chemical Compounds Knowledge Visualization with Natural Language Processing and Linked Data
Kazunari Tanaka | Tomoya Iwakura | Yusuke Koyanagi | Noriko Ikeda | Hiroyuki Shindo | Yuji Matsumoto
Kazunari Tanaka | Tomoya Iwakura | Yusuke Koyanagi | Noriko Ikeda | Hiroyuki Shindo | Yuji Matsumoto
Using Discourse Information for Education with a Spanish-Chinese Parallel Corpus
Shuyuan Cao | Harritxu Gete
Shuyuan Cao | Harritxu Gete
A 2nd Longitudinal Corpus for Children’s Writing with Enhanced Output for Specific Spelling Patterns
Kay Berkling
Kay Berkling
Development of a Mobile Observation Support System for Students: FishWatchr Mini
Masaya Yamaguchi | Masanori Kitamura | Naomi Yanagida
Masaya Yamaguchi | Masanori Kitamura | Naomi Yanagida
The AnnCor CHILDES Treebank
Jan Odijk | Alexis Dimitriadis | Martijn van der Klis | Marjo van Koppen | Meie Otten | Remco van der Veen
Jan Odijk | Alexis Dimitriadis | Martijn van der Klis | Marjo van Koppen | Meie Otten | Remco van der Veen
BabyCloud, a Technological Platform for Parents and Researchers
Xuân-Nga Cao | Cyrille Dakhlia | Patricia Del Carmen | Mohamed-Amine Jaouani | Malik Ould-Arbi | Emmanuel Dupoux
Xuân-Nga Cao | Cyrille Dakhlia | Patricia Del Carmen | Mohamed-Amine Jaouani | Malik Ould-Arbi | Emmanuel Dupoux
Infant Word Comprehension-to-Production Index Applied to Investigation of Noun Learning Predominance Using Cross-lingual CDI database
Yasuhiro Minami | Tessei Kobayashi | Yuko Okumura
Yasuhiro Minami | Tessei Kobayashi | Yuko Okumura
Building a TOCFL Learner Corpus for Chinese Grammatical Error Diagnosis
Lung-Hao Lee | Yuen-Hsien Tseng | Li-Ping Chang
Lung-Hao Lee | Yuen-Hsien Tseng | Li-Ping Chang
MIAPARLE: Online training for the discrimination of stress contrasts
Jean-Philippe Goldman | Sandra Schwab
Jean-Philippe Goldman | Sandra Schwab
A Leveled Reading Corpus of Modern Standard Arabic
Muhamed Al Khalil | Hind Saddiki | Nizar Habash | Latifa Alfalasi
Muhamed Al Khalil | Hind Saddiki | Nizar Habash | Latifa Alfalasi
Developing New Linguistic Resources and Tools for the Galician Language
Rodrigo Agerri | Xavier Gómez Guinovart | German Rigau | Miguel Anxo Solla Portela
Rodrigo Agerri | Xavier Gómez Guinovart | German Rigau | Miguel Anxo Solla Portela
Modeling Northern Haida Verb Morphology
Jordan Lachler | Lene Antonsen | Trond Trosterud | Sjur Moshagen | Antti Arppe
Jordan Lachler | Lene Antonsen | Trond Trosterud | Sjur Moshagen | Antti Arppe
Low-resource Post Processing of Noisy OCR Output for Historical Corpus Digitisation
Caitlin Richter | Matthew Wickes | Deniz Beser | Mitch Marcus
Caitlin Richter | Matthew Wickes | Deniz Beser | Mitch Marcus
Introducing the CLARIN Knowledge Centre for Linguistic Diversity and Language Documentation
Hanna Hedeland | Timm Lehmberg | Felix Rau | Sophie Salffner | Mandana Seyfeddinipur | Andreas Witt
Hanna Hedeland | Timm Lehmberg | Felix Rau | Sophie Salffner | Mandana Seyfeddinipur | Andreas Witt
SB-CH: A Swiss German Corpus with Sentiment Annotations
Ralf Grubenmann | Don Tuggener | Pius von Däniken | Jan Deriu | Mark Cieliebak
Ralf Grubenmann | Don Tuggener | Pius von Däniken | Jan Deriu | Mark Cieliebak
Signbank: Software to Support Web Based Dictionaries of Sign Language
Steve Cassidy | Onno Crasborn | Henri Nieminen | Wessel Stoop | Micha Hulsbosch | Susan Even | Erwin Komen | Trevor Johnston
Steve Cassidy | Onno Crasborn | Henri Nieminen | Wessel Stoop | Micha Hulsbosch | Susan Even | Erwin Komen | Trevor Johnston
J-MeDic: A Japanese Disease Name Dictionary based on Real Clinical Usage
Kaoru Ito | Hiroyuki Nagai | Taro Okahisa | Shoko Wakamiya | Tomohide Iwao | Eiji Aramaki
Kaoru Ito | Hiroyuki Nagai | Taro Okahisa | Shoko Wakamiya | Tomohide Iwao | Eiji Aramaki
Building a List of Synonymous Words and Phrases of Japanese Compound Verbs
Kyoko Kanzaki | Hitoshi Isahara
Kyoko Kanzaki | Hitoshi Isahara
A Danish FrameNet Lexicon and an Annotated Corpus Used for Training and Evaluating a Semantic Frame Classifier
Bolette Pedersen | Sanni Nimb | Anders Søgaard | Mareike Hartmann | Sussi Olsen
Bolette Pedersen | Sanni Nimb | Anders Søgaard | Mareike Hartmann | Sussi Olsen
SLIDE - a Sentiment Lexicon of Common Idioms
Charles Jochim | Francesca Bonin | Roy Bar-Haim | Noam Slonim
Charles Jochim | Francesca Bonin | Roy Bar-Haim | Noam Slonim
Teanga: A Linked Data based platform for Natural Language Processing
Housam Ziad | John P. McCrae | Paul Buitelaar
Housam Ziad | John P. McCrae | Paul Buitelaar
Automatic and Manual Web Annotations in an Infrastructure to handle Fake News and other Online Media Phenomena
Georg Rehm | Julian Moreno-Schneider | Peter Bourgonje
Georg Rehm | Julian Moreno-Schneider | Peter Bourgonje
The LODeXporter: Flexible Generation of Linked Open Data Triples from NLP Frameworks for Automatic Knowledge Base Construction
René Witte | Bahar Sateli
René Witte | Bahar Sateli
LiDo RDF: From a Relational Database to a Linked Data Graph of Linguistic Terms and Bibliographic Data
Bettina Klimek | Robert Schädlich | Dustin Kröger | Edwin Knese | Benedikt Elßmann
Bettina Klimek | Robert Schädlich | Dustin Kröger | Edwin Knese | Benedikt Elßmann
Towards a Linked Open Data Edition of Sumerian Corpora
Christian Chiarcos | Émilie Pagé-Perron | Ilya Khait | Niko Schenk | Lucas Reckling
Christian Chiarcos | Émilie Pagé-Perron | Ilya Khait | Niko Schenk | Lucas Reckling
PMKI: an European Commission action for the interoperability, maintainability and sustainability of Language Resources
Peter Schmitz | Enrico Francesconi | Najeh Hajlaoui | Brahim Batouche
Peter Schmitz | Enrico Francesconi | Najeh Hajlaoui | Brahim Batouche
Collecting Language Resources from Public Administrations in the Nordic and Baltic Countries
Andrejs Vasiļjevs | Rihards Kalniņš | Roberts Rozis | Aivars Bērziņš
Andrejs Vasiļjevs | Rihards Kalniņš | Roberts Rozis | Aivars Bērziņš
LIdioms: A Multilingual Linked Idioms Data Set
Diego Moussallem | Mohamed Ahmed Sherif | Diego Esteves | Marcos Zampieri | Axel-Cyrille Ngonga Ngomo
Diego Moussallem | Mohamed Ahmed Sherif | Diego Esteves | Marcos Zampieri | Axel-Cyrille Ngonga Ngomo
Annotating Modality Expressions and Event Factuality for a Japanese Chess Commentary Corpus
Suguru Matsuyoshi | Hirotaka Kameko | Yugo Murawaki | Shinsuke Mori
Suguru Matsuyoshi | Hirotaka Kameko | Yugo Murawaki | Shinsuke Mori
Annotating Chinese Light Verb Constructions according to PARSEME guidelines
Menghan Jiang | Natalia Klyueva | Hongzhi Xu | Chu-Ren Huang
Menghan Jiang | Natalia Klyueva | Hongzhi Xu | Chu-Ren Huang
Using English Baits to Catch Serbian Multi-Word Terminology
Cvetana Krstev | Branislava Šandrih | Ranka Stanković | Miljana Mladenović
Cvetana Krstev | Branislava Šandrih | Ranka Stanković | Miljana Mladenović
Construction of Large-scale English Verbal Multiword Expression Annotated Corpus
Akihiko Kato | Hiroyuki Shindo | Yuji Matsumoto
Akihiko Kato | Hiroyuki Shindo | Yuji Matsumoto
Konbitzul: an MWE-specific database for Spanish-Basque
Uxoa Iñurrieta | Itziar Aduriz | Arantza Díaz de Ilarraza | Gorka Labaka | Kepa Sarasola
Uxoa Iñurrieta | Itziar Aduriz | Arantza Díaz de Ilarraza | Gorka Labaka | Kepa Sarasola
A Multilingual Test Collection for the Semantic Search of Entity Categories
Juliano Efson Sales | Siamak Barzegar | Wellington Franco | Bernhard Bermeitinger | Tiago Cunha | Brian Davis | André Freitas | Siegfried Handschuh
Juliano Efson Sales | Siamak Barzegar | Wellington Franco | Bernhard Bermeitinger | Tiago Cunha | Brian Davis | André Freitas | Siegfried Handschuh
Towards the Inference of Semantic Relations in Complex Nominals: a Pilot Study
Melania Cabezas-García | Pilar León-Araúz
Melania Cabezas-García | Pilar León-Araúz
Generation of a Spanish Artificial Collocation Error Corpus
Sara Rodríguez-Fernández | Roberto Carlini | Leo Wanner
Sara Rodríguez-Fernández | Roberto Carlini | Leo Wanner
Improving a Neural-based Tagger for Multiword Expressions Identification
Dušan Variš | Natalia Klyueva
Dušan Variš | Natalia Klyueva
DeepTC – An Extension of DKPro Text Classification for Fostering Reproducibility of Deep Learning Experiments
Tobias Horsmann | Torsten Zesch
Tobias Horsmann | Torsten Zesch
Improving Hate Speech Detection with Deep Learning Ensembles
Steven Zimmerman | Udo Kruschwitz | Chris Fox
Steven Zimmerman | Udo Kruschwitz | Chris Fox
Semantic Relatedness of Wikipedia Concepts – Benchmark Data and a Working Solution
Liat Ein Dor | Alon Halfon | Yoav Kantor | Ran Levy | Yosi Mass | Ruty Rinott | Eyal Shnarch | Noam Slonim
Liat Ein Dor | Alon Halfon | Yoav Kantor | Ran Levy | Yosi Mass | Ruty Rinott | Eyal Shnarch | Noam Slonim
Experiments with Convolutional Neural Networks for Multi-Label Authorship Attribution
Dainis Boumber | Yifan Zhang | Arjun Mukherjee
Dainis Boumber | Yifan Zhang | Arjun Mukherjee
A Fast and Accurate Vietnamese Word Segmenter
Dat Quoc Nguyen | Dai Quoc Nguyen | Thanh Vu | Mark Dras | Mark Johnson
Dat Quoc Nguyen | Dai Quoc Nguyen | Thanh Vu | Mark Dras | Mark Johnson
Finite-state morphological analysis for Gagauz
Sevilay Bayatli | Güllü Karanfil | Memduh Gökırmak | Francis M. Tyers
Sevilay Bayatli | Güllü Karanfil | Memduh Gökırmak | Francis M. Tyers
Morphology Injection for English-Malayalam Statistical Machine Translation
Sreelekha S | Pushpak Bhattacharyya
Sreelekha S | Pushpak Bhattacharyya
The Morpho-syntactic Annotation of Animacy for a Dependency Parser
Mohammed Attia | Vitaly Nikolaev | Ali Elkahky
Mohammed Attia | Vitaly Nikolaev | Ali Elkahky
MADARi: A Web Interface for Joint Arabic Morphological Annotation and Spelling Correction
Ossama Obeid | Salam Khalifa | Nizar Habash | Houda Bouamor | Wajdi Zaghouani | Kemal Oflazer
Ossama Obeid | Salam Khalifa | Nizar Habash | Houda Bouamor | Wajdi Zaghouani | Kemal Oflazer
Universal Morphologies for the Caucasus region
Christian Chiarcos | Kathrin Donandt | Maxim Ionov | Monika Rind-Pawlowski | Hasmik Sargsian | Jesse Wichers Schreur | Frank Abromeit | Christian Fäth
Christian Chiarcos | Kathrin Donandt | Maxim Ionov | Monika Rind-Pawlowski | Hasmik Sargsian | Jesse Wichers Schreur | Frank Abromeit | Christian Fäth
EMTC: Multilabel Corpus in Movie Domain for Emotion Analysis in Conversational Text
Duc-Anh Phan | Yuji Matsumoto
Duc-Anh Phan | Yuji Matsumoto
Complex and Precise Movie and Book Annotations in French Language for Aspect Based Sentiment Analysis
Stefania Pecore | Jeanne Villaneau
Stefania Pecore | Jeanne Villaneau
Lingmotif-lex: a Wide-coverage, State-of-the-art Lexicon for Sentiment Analysis
Antonio Moreno-Ortiz | Chantal Pérez-Hernández
Antonio Moreno-Ortiz | Chantal Pérez-Hernández
FooTweets: A Bilingual Parallel Corpus of World Cup Tweets
Henny Sluyter-Gäthje | Pintu Lohar | Haithem Afli | Andy Way
Henny Sluyter-Gäthje | Pintu Lohar | Haithem Afli | Andy Way
The SSIX Corpora: Three Gold Standard Corpora for Sentiment Analysis in English, Spanish and German Financial Microblogs
Thomas Gaillat | Manel Zarrouk | André Freitas | Brian Davis
Thomas Gaillat | Manel Zarrouk | André Freitas | Brian Davis
Sarcasm Target Identification: Dataset and An Introductory Approach
Aditya Joshi | Pranav Goel | Pushpak Bhattacharyya | Mark Carman
Aditya Joshi | Pranav Goel | Pushpak Bhattacharyya | Mark Carman
Annotating Opinions and Opinion Targets in Student Course Feedback
Janaka Chathuranga | Shanika Ediriweera | Ravindu Hasantha | Pranidhith Munasinghe | Surangika Ranathunga
Janaka Chathuranga | Shanika Ediriweera | Ravindu Hasantha | Pranidhith Munasinghe | Surangika Ranathunga
Generating a Gold Standard for a Swedish Sentiment Lexicon
Jacobo Rouces | Nina Tahmasebi | Lars Borin | Stian Rødven Eide
Jacobo Rouces | Nina Tahmasebi | Lars Borin | Stian Rødven Eide
WordKit: a Python Package for Orthographic and Phonological Featurization
Stéphan Tulkens | Dominiek Sandra | Walter Daelemans
Stéphan Tulkens | Dominiek Sandra | Walter Daelemans
Pronunciation Variants and ASR of Colloquial Speech: A Case Study on Czech
David Lukeš | Marie Kopřivová | Zuzana Komrsková | Petra Poukarová
David Lukeš | Marie Kopřivová | Zuzana Komrsková | Petra Poukarová
A Multilingual Approach to Question Classification
Aikaterini-Lida Kalouli | Katharina Kaiser | Annette Hautli-Janisz | Georg A. Kaiser | Miriam Butt
Aikaterini-Lida Kalouli | Katharina Kaiser | Annette Hautli-Janisz | Georg A. Kaiser | Miriam Butt
Dataset for the First Evaluation on Chinese Machine Reading Comprehension
Yiming Cui | Ting Liu | Zhipeng Chen | Wentao Ma | Shijin Wang | Guoping Hu
Yiming Cui | Ting Liu | Zhipeng Chen | Wentao Ma | Shijin Wang | Guoping Hu
A Multi-Domain Framework for Textual Similarity. A Case Study on Question-to-Question and Question-Answering Similarity Tasks
Amir Hazem | Basma El Amal Boussaha | Nicolas Hernandez
Amir Hazem | Basma El Amal Boussaha | Nicolas Hernandez
WorldTree: A Corpus of Explanation Graphs for Elementary Science Questions supporting Multi-hop Inference
Peter Jansen | Elizabeth Wainwright | Steven Marmorstein | Clayton Morrison
Peter Jansen | Elizabeth Wainwright | Steven Marmorstein | Clayton Morrison
Analysis of Implicit Conditions in Database Search Dialogues
Shun-ya Fukunaga | Hitoshi Nishikawa | Takenobu Tokunaga | Hikaru Yokono | Tetsuro Takahashi
Shun-ya Fukunaga | Hitoshi Nishikawa | Takenobu Tokunaga | Hikaru Yokono | Tetsuro Takahashi
An Information-Providing Closed-Domain Human-Agent Interaction Corpus
Jelte van Waterschoot | Guillaume Dubuisson Duplessis | Lorenzo Gatti | Merijn Bruijnes | Dirk Heylen
Jelte van Waterschoot | Guillaume Dubuisson Duplessis | Lorenzo Gatti | Merijn Bruijnes | Dirk Heylen
Augmenting Image Question Answering Dataset by Exploiting Image Captions
Masashi Yokota | Hideki Nakayama
Masashi Yokota | Hideki Nakayama
Semi-supervised Training Data Generation for Multilingual Question Answering
Kyungjae Lee | Kyoungho Yoon | Sunghyun Park | Seung-won Hwang
Kyungjae Lee | Kyoungho Yoon | Sunghyun Park | Seung-won Hwang
PhotoshopQuiA: A Corpus of Non-Factoid Questions and Answers for Why-Question Answering
Andrei Dulceanu | Thang Le Dinh | Walter Chang | Trung Bui | Doo Soon Kim | Manh Chien Vu | Seokhwan Kim
Andrei Dulceanu | Thang Le Dinh | Walter Chang | Trung Bui | Doo Soon Kim | Manh Chien Vu | Seokhwan Kim
BioRead: A New Dataset for Biomedical Reading Comprehension
Dimitris Pappas | Ion Androutsopoulos | Haris Papageorgiou
Dimitris Pappas | Ion Androutsopoulos | Haris Papageorgiou
MMQA: A Multi-domain Multi-lingual Question-Answering Framework for English and Hindi
Deepak Gupta | Surabhi Kumari | Asif Ekbal | Pushpak Bhattacharyya
Deepak Gupta | Surabhi Kumari | Asif Ekbal | Pushpak Bhattacharyya
Medical Sentiment Analysis using Social Media: Towards building a Patient Assisted System
Shweta Yadav | Asif Ekbal | Sriparna Saha | Pushpak Bhattacharyya
Shweta Yadav | Asif Ekbal | Sriparna Saha | Pushpak Bhattacharyya
An Italian Twitter Corpus of Hate Speech against Immigrants
Manuela Sanguinetti | Fabio Poletto | Cristina Bosco | Viviana Patti | Marco Stranisci
Manuela Sanguinetti | Fabio Poletto | Cristina Bosco | Viviana Patti | Marco Stranisci
A Large Multilingual and Multi-domain Dataset for Recommender Systems
Giorgia Di Tommaso | Stefano Faralli | Paola Velardi
Giorgia Di Tommaso | Stefano Faralli | Paola Velardi
RtGender: A Corpus for Studying Differential Responses to Gender
Rob Voigt | David Jurgens | Vinodkumar Prabhakaran | Dan Jurafsky | Yulia Tsvetkov
Rob Voigt | David Jurgens | Vinodkumar Prabhakaran | Dan Jurafsky | Yulia Tsvetkov
A Neural Network Model for Part-Of-Speech Tagging of Social Media Texts
Sara Meftah | Nasredine Semmar
Sara Meftah | Nasredine Semmar
Utilizing Large Twitter Corpora to Create Sentiment Lexica
Valerij Fredriksen | Brage Jahren | Björn Gambäck
Valerij Fredriksen | Brage Jahren | Björn Gambäck
The Nautilus Speaker Characterization Corpus: Speech Recordings and Labels of Speaker Characteristics and Voice Descriptions
Laura Fernández Gallardo | Benjamin Weiss
Laura Fernández Gallardo | Benjamin Weiss
Design and Development of Speech Corpora for Air Traffic Control Training
Luboš Šmídl | Jan Švec | Daniel Tihelka | Jindřich Matoušek | Jan Romportl | Pavel Ircing
Luboš Šmídl | Jan Švec | Daniel Tihelka | Jindřich Matoušek | Jan Romportl | Pavel Ircing
A First South African Corpus of Multilingual Code-switched Soap Opera Speech
Ewald van der Westhuizen | Thomas Niesler
Ewald van der Westhuizen | Thomas Niesler
A Web Service for Pre-segmenting Very Long Transcribed Speech Recordings
Nina Poerner | Florian Schiel
Nina Poerner | Florian Schiel
A Real-life, French-accented Corpus of Air Traffic Control Communications
Estelle Delpech | Marion Laignelet | Christophe Pimm | Céline Raynal | Michal Trzos | Alexandre Arnold | Dominique Pronto
Estelle Delpech | Marion Laignelet | Christophe Pimm | Céline Raynal | Michal Trzos | Alexandre Arnold | Dominique Pronto
Creating Lithuanian and Latvian Speech Corpora from Inaccurately Annotated Web Data
Askars Salimbajevs
Askars Salimbajevs
Discovering Canonical Indian English Accents: A Crowdsourcing-based Approach
Sunayana Sitaram | Varun Manjunath | Varun Bharadwaj | Monojit Choudhury | Kalika Bali | Michael Tjalve
Sunayana Sitaram | Varun Manjunath | Varun Bharadwaj | Monojit Choudhury | Kalika Bali | Michael Tjalve
Extending Search System based on Interactive Visualization for Speech Corpora
Tomoko Ohsuga | Yuichi Ishimoto | Tomoko Kajiyama | Shunsuke Kozawa | Kiyotaka Uchimoto | Shuichi Itahashi
Tomoko Ohsuga | Yuichi Ishimoto | Tomoko Kajiyama | Shunsuke Kozawa | Kiyotaka Uchimoto | Shuichi Itahashi
German Radio Interviews: The GRAIN Release of the SFB732 Silver Standard Collection
Katrin Schweitzer | Kerstin Eckart | Markus Gärtner | Agnieszka Falenska | Arndt Riester | Ina Rösiger | Antje Schweitzer | Sabrina Stehwien | Jonas Kuhn
Katrin Schweitzer | Kerstin Eckart | Markus Gärtner | Agnieszka Falenska | Arndt Riester | Ina Rösiger | Antje Schweitzer | Sabrina Stehwien | Jonas Kuhn
Preparing Data from Psychotherapy for Natural Language Processing
Margot Mieskes | Andreas Stiegelmayr
Margot Mieskes | Andreas Stiegelmayr
MirasVoice: A bilingual (English-Persian) speech corpus
Amir Vaheb | Ali Janalizadeh Choobbasti | S.H.E. Mortazavi Najafabadi | Saeid Safavi | Behnam Sabeti
Amir Vaheb | Ali Janalizadeh Choobbasti | S.H.E. Mortazavi Najafabadi | Saeid Safavi | Behnam Sabeti
Japanese Dialogue Corpus of Information Navigation and Attentive Listening Annotated with Extended ISO-24617-2 Dialogue Act Tags
Koichiro Yoshino | Hiroki Tanaka | Kyoshiro Sugiyama | Makoto Kondo | Satoshi Nakamura
Koichiro Yoshino | Hiroki Tanaka | Kyoshiro Sugiyama | Makoto Kondo | Satoshi Nakamura
The Niki and Julie Corpus: Collaborative Multimodal Dialogues between Humans, Robots, and Virtual Agents
Ron Artstein | Jill Boberg | Alesia Gainer | Jonathan Gratch | Emmanuel Johnson | Anton Leuski | Gale Lucas | David Traum
Ron Artstein | Jill Boberg | Alesia Gainer | Jonathan Gratch | Emmanuel Johnson | Anton Leuski | Gale Lucas | David Traum
Constructing a Chinese Medical Conversation Corpus Annotated with Conversational Structures and Actions
Nan Wang | Yan Song | Fei Xia
Nan Wang | Yan Song | Fei Xia
Modeling Collaborative Multimodal Behavior in Group Dialogues: The MULTISIMO Corpus
Maria Koutsombogera | Carl Vogel
Maria Koutsombogera | Carl Vogel
A Semi-autonomous System for Creating a Human-Machine Interaction Corpus in Virtual Reality: Application to the ACORFORMed System for Training Doctors to Break Bad News
Magalie Ochs | Philippe Blache | Grégoire de Montcheuil | Jean-Marie Pergandi | Jorane Saubesty | Daniel Francon | Daniel Mestre
Magalie Ochs | Philippe Blache | Grégoire de Montcheuil | Jean-Marie Pergandi | Jorane Saubesty | Daniel Francon | Daniel Mestre
Construction of English-French Multimodal Affective Conversational Corpus from TV Dramas
Sashi Novitasari | Quoc Truong Do | Sakriani Sakti | Dessi Lestari | Satoshi Nakamura
Sashi Novitasari | Quoc Truong Do | Sakriani Sakti | Dessi Lestari | Satoshi Nakamura
QUEST: A Natural Language Interface to Relational Databases
Vadim Sheinin | Elahe Khorashani | Hangu Yeo | Kun Xu | Ngoc Phuoc An Vo | Octavian Popescu
Vadim Sheinin | Elahe Khorashani | Hangu Yeo | Kun Xu | Ngoc Phuoc An Vo | Octavian Popescu
Grapheme-level Awareness in Word Embeddings for Morphologically Rich Languages
Suzi Park | Hyopil Shin
Suzi Park | Hyopil Shin
Building a Constraint Grammar Parser for Plains Cree Verbs and Arguments
Katherine Schmirler | Antti Arppe | Trond Trosterud | Lene Antonsen
Katherine Schmirler | Antti Arppe | Trond Trosterud | Lene Antonsen
BPEmb: Tokenization-free Pre-trained Subword Embeddings in 275 Languages
Benjamin Heinzerling | Michael Strube
Benjamin Heinzerling | Michael Strube
Reference production in human-computer interaction: Issues for Corpus-based Referring Expression Generation
Danillo Rocha | Ivandré Paraboni
Danillo Rocha | Ivandré Paraboni
Definite Description Lexical Choice: taking Speaker’s Personality into account
Alex Lan | Ivandré Paraboni
Alex Lan | Ivandré Paraboni
Incorporating Semantic Attention in Video Description Generation
Natsuda Laokulrat | Naoaki Okazaki | Hideki Nakayama
Natsuda Laokulrat | Naoaki Okazaki | Hideki Nakayama
GenDR: A Generic Deep Realizer with Complex Lexicalization
François Lareau | Florie Lambrey | Ieva Dubinskaite | Daniel Galarreta-Piquette | Maryam Nejat
François Lareau | Florie Lambrey | Ieva Dubinskaite | Daniel Galarreta-Piquette | Maryam Nejat
A Detailed Evaluation of Neural Sequence-to-Sequence Models for In-domain and Cross-domain Text Simplification
Sanja Štajner | Sergiu Nisioi
Sanja Štajner | Sergiu Nisioi
Don’t Annotate, but Validate: a Data-to-Text Method for Capturing Event Data
Piek Vossen | Filip Ilievski | Marten Postma | Roxane Segers
Piek Vossen | Filip Ilievski | Marten Postma | Roxane Segers
RDF2PT: Generating Brazilian Portuguese Texts from RDF Data
Diego Moussallem | Thiago Ferreira | Marcos Zampieri | Maria Claudia Cavalcanti | Geraldo Xexéo | Mariana Neves | Axel-Cyrille Ngonga Ngomo
Diego Moussallem | Thiago Ferreira | Marcos Zampieri | Maria Claudia Cavalcanti | Geraldo Xexéo | Mariana Neves | Axel-Cyrille Ngonga Ngomo
Neural Models of Selectional Preferences for Implicit Semantic Role Labeling
Minh Le | Antske Fokkens
Minh Le | Antske Fokkens
A database of German definitory contexts from selected web sources
Adrien Barbaresi | Lothar Lemnitzer | Alexander Geyken
Adrien Barbaresi | Lothar Lemnitzer | Alexander Geyken
Annotating Abstract Meaning Representations for Spanish
Noelia Migueles-Abraira | Rodrigo Agerri | Arantza Diaz de Ilarraza
Noelia Migueles-Abraira | Rodrigo Agerri | Arantza Diaz de Ilarraza
Browsing the Terminological Structure of a Specialized Domain: A Method Based on Lexical Functions and their Classification
Marie-Claude L’Homme | Benoît Robichaud | Nathalie Prévil
Marie-Claude L’Homme | Benoît Robichaud | Nathalie Prévil
Rollenwechsel-English: a large-scale semantic role corpus
Asad Sayeed | Pavel Shkadzko | Vera Demberg
Asad Sayeed | Pavel Shkadzko | Vera Demberg
Towards a Standardized Dataset for Noun Compound Interpretation
Girishkumar Ponkiya | Kevin Patel | Pushpak Bhattacharyya | Girish K Palshikar
Girishkumar Ponkiya | Kevin Patel | Pushpak Bhattacharyya | Girish K Palshikar
NL2Bash: A Corpus and Semantic Parser for Natural Language Interface to the Linux Operating System
Xi Victoria Lin | Chenglong Wang | Luke Zettlemoyer | Michael D. Ernst
Xi Victoria Lin | Chenglong Wang | Luke Zettlemoyer | Michael D. Ernst
World Knowledge for Abstract Meaning Representation Parsing
Charles Welch | Jonathan K. Kummerfeld | Song Feng | Rada Mihalcea
Charles Welch | Jonathan K. Kummerfeld | Song Feng | Rada Mihalcea
Improved Transcription and Indexing of Oral History Interviews for Digital Humanities Research
Michael Gref | Joachim Köhler | Almut Leh
Michael Gref | Joachim Köhler | Almut Leh
Sound Signal Processing with Seq2Tree Network
Weicheng Ma | Kai Cao | Zhaoheng Ni | Peter Chin | Xiang Li
Weicheng Ma | Kai Cao | Zhaoheng Ni | Peter Chin | Xiang Li
Open ASR for Icelandic: Resources and a Baseline System
Anna Björk Nikulásdóttir | Inga Rún Helgadóttir | Matthías Pétursson | Jón Guðnason
Anna Björk Nikulásdóttir | Inga Rún Helgadóttir | Matthías Pétursson | Jón Guðnason
Towards Neural Speaker Modeling in Multi-Party Conversation: The Task, Dataset, and Models
Zhao Meng | Lili Mou | Zhi Jin
Zhao Meng | Lili Mou | Zhi Jin
Discriminating between Similar Languages on Imbalanced Conversational Texts
Junqing He | Xian Huang | Xuemin Zhao | Yan Zhang | Yonghong Yan
Junqing He | Xian Huang | Xuemin Zhao | Yan Zhang | Yonghong Yan
Data-Driven Pronunciation Modeling of Swiss German Dialectal Speech for Automatic Speech Recognition
Michael Stadtschnitzer | Christoph Schmidt
Michael Stadtschnitzer | Christoph Schmidt
Simulating ASR errors for training SLU systems
Edwin Simonnet | Sahar Ghannay | Nathalie Camelin | Yannick Estève
Edwin Simonnet | Sahar Ghannay | Nathalie Camelin | Yannick Estève
Evaluation of Feature-Space Speaker Adaptation for End-to-End Acoustic Models
Natalia Tomashenko | Yannick Estève
Natalia Tomashenko | Yannick Estève
Creating New Language and Voice Components for the Updated MaryTTS Text-to-Speech Synthesis Platform
Ingmar Steiner | Sébastien Le Maguer
Ingmar Steiner | Sébastien Le Maguer
Speech Rate Calculations with Short Utterances: A Study from a Speech-to-Speech, Machine Translation Mediated Map Task
Akira Hayakawa | Carl Vogel | Saturnino Luz | Nick Campbell
Akira Hayakawa | Carl Vogel | Saturnino Luz | Nick Campbell
Beyond Generic Summarization: A Multi-faceted Hierarchical Summarization Corpus of Large Heterogeneous Data
Christopher Tauchmann | Thomas Arnold | Andreas Hanselowski | Christian M. Meyer | Margot Mieskes
Christopher Tauchmann | Thomas Arnold | Andreas Hanselowski | Christian M. Meyer | Margot Mieskes
A New Annotated Portuguese/Spanish Corpus for the Multi-Sentence Compression Task
Elvys Linhares Pontes | Juan-Manuel Torres-Moreno | Stéphane Huet | Andréa Carneiro Linhares
Elvys Linhares Pontes | Juan-Manuel Torres-Moreno | Stéphane Huet | Andréa Carneiro Linhares
TSix: A Human-involved-creation Dataset for Tweet Summarization
Minh-Tien Nguyen | Dac Viet Lai | Huy-Tien Nguyen | Le-Minh Nguyen
Minh-Tien Nguyen | Dac Viet Lai | Huy-Tien Nguyen | Le-Minh Nguyen
A Workbench for Rapid Generation of Cross-Lingual Summaries
Nisarg Jhaveri | Manish Gupta | Vasudeva Varma
Nisarg Jhaveri | Manish Gupta | Vasudeva Varma
Annotation and Analysis of Extractive Summaries for the Kyutech Corpus
Takashi Yamamura | Kazutaka Shimada
Takashi Yamamura | Kazutaka Shimada
Auto-hMDS: Automatic Construction of a Large Heterogeneous Multilingual Multi-Document Summarization Corpus
Markus Zopf
Markus Zopf
PyrEval: An Automated Method for Summary Content Analysis
Yanjun Gao | Andrew Warner | Rebecca Passonneau
Yanjun Gao | Andrew Warner | Rebecca Passonneau
Mapping Texts to Scripts: An Entailment Study
Simon Ostermann | Hannah Seitz | Stefan Thater | Manfred Pinkal
Simon Ostermann | Hannah Seitz | Stefan Thater | Manfred Pinkal
Semantic Equivalence Detection: Are Interrogatives Harder than Declaratives?
João Rodrigues | Chakaveh Saedi | António Branco | João Silva
João Rodrigues | Chakaveh Saedi | António Branco | João Silva
CLARIN: Towards FAIR and Responsible Data Science Using Language Resources
Franciska de Jong | Bente Maegaard | Koenraad De Smedt | Darja Fišer | Dieter Van Uytvanck
Franciska de Jong | Bente Maegaard | Koenraad De Smedt | Darja Fišer | Dieter Van Uytvanck
From ‘Solved Problems’ to New Challenges: A Report on LDC Activities
Christopher Cieri | Mark Liberman | Stephanie Strassel | Denise DiPersio | Jonathan Wright | Andrea Mazzucchi
Christopher Cieri | Mark Liberman | Stephanie Strassel | Denise DiPersio | Jonathan Wright | Andrea Mazzucchi
New directions in ELRA activities
Valérie Mapelli | Victoria Arranz | Hélène Mazo | Pawel Kamocki | Vladimir Popescu
Valérie Mapelli | Victoria Arranz | Hélène Mazo | Pawel Kamocki | Vladimir Popescu
A Framework for Multi-Language Service Design with the Language Grid
Donghui Lin | Yohei Murakami | Toru Ishida
Donghui Lin | Yohei Murakami | Toru Ishida
Language Technology for Multilingual Europe: An Analysis of a Large-Scale Survey regarding Challenges, Demands, Gaps and Needs
Georg Rehm | Stefanie Hegele
Georg Rehm | Stefanie Hegele
Annotating High-Level Structures of Short Stories and Personal Anecdotes
Boyang Li | Beth Cardier | Tong Wang | Florian Metze
Boyang Li | Beth Cardier | Tong Wang | Florian Metze
Discovering the Language of Wine Reviews: A Text Mining Account
Els Lefever | Iris Hendrickx | Ilja Croijmans | Antal van den Bosch | Asifa Majid
Els Lefever | Iris Hendrickx | Ilja Croijmans | Antal van den Bosch | Asifa Majid
Delta vs. N-Gram Tracing: Evaluating the Robustness of Authorship Attribution Methods
Thomas Proisl | Stefan Evert | Fotis Jannidis | Christof Schöch | Leonard Konle | Steffen Pielström
Thomas Proisl | Stefan Evert | Fotis Jannidis | Christof Schöch | Leonard Konle | Steffen Pielström
Face2Text: Collecting an Annotated Image Description Corpus for the Generation of Rich Face Descriptions
Albert Gatt | Marc Tanti | Adrian Muscat | Patrizia Paggio | Reuben A Farrugia | Claudia Borg | Kenneth P Camilleri | Michael Rosner | Lonneke van der Plas
Albert Gatt | Marc Tanti | Adrian Muscat | Patrizia Paggio | Reuben A Farrugia | Claudia Borg | Kenneth P Camilleri | Michael Rosner | Lonneke van der Plas
Adapting Serious Game for Fallacious Argumentation to German: Pitfalls, Insights, and Best Practices
Ivan Habernal | Patrick Pauli | Iryna Gurevych
Ivan Habernal | Patrick Pauli | Iryna Gurevych
Crowdsourcing Regional Variation Data and Automatic Geolocalisation of Speakers of European French
Jean-Philippe Goldman | Yves Scherrer | Julie Glikman | Mathieu Avanzi | Christophe Benzitoun | Philippe Boula de Mareüil
Jean-Philippe Goldman | Yves Scherrer | Julie Glikman | Mathieu Avanzi | Christophe Benzitoun | Philippe Boula de Mareüil
Improving Machine Translation of Educational Content via Crowdsourcing
Maximiliana Behnke | Antonio Valerio Miceli Barone | Rico Sennrich | Vilelmini Sosoni | Thanasis Naskos | Eirini Takoulidou | Maria Stasimioti | Menno van Zaanen | Sheila Castilho | Federico Gaspari | Panayota Georgakopoulou | Valia Kordoni | Markus Egg | Katia Lida Kermanidis
Maximiliana Behnke | Antonio Valerio Miceli Barone | Rico Sennrich | Vilelmini Sosoni | Thanasis Naskos | Eirini Takoulidou | Maria Stasimioti | Menno van Zaanen | Sheila Castilho | Federico Gaspari | Panayota Georgakopoulou | Valia Kordoni | Markus Egg | Katia Lida Kermanidis
Grounding Gradable Adjectives through Crowdsourcing
Rebecca Sharp | Mithun Paul | Ajay Nagesh | Dane Bell | Mihai Surdeanu
Rebecca Sharp | Mithun Paul | Ajay Nagesh | Dane Bell | Mihai Surdeanu
Evaluation Phonemic Transcription of Low-Resource Tonal Languages for Language Documentation
Oliver Adams | Trevor Cohn | Graham Neubig | Hilaria Cruz | Steven Bird | Alexis Michaud
Oliver Adams | Trevor Cohn | Graham Neubig | Hilaria Cruz | Steven Bird | Alexis Michaud
A Very Low Resource Language Speech Corpus for Computational Language Documentation Experiments
Pierre Godard | Gilles Adda | Martine Adda-Decker | Juan Benjumea | Laurent Besacier | Jamison Cooper-Leavitt | Guy-Noel Kouarata | Lori Lamel | Hélène Maynard | Markus Mueller | Annie Rialland | Sebastian Stueker | François Yvon | Marcely Zanon-Boito
Pierre Godard | Gilles Adda | Martine Adda-Decker | Juan Benjumea | Laurent Besacier | Jamison Cooper-Leavitt | Guy-Noel Kouarata | Lori Lamel | Hélène Maynard | Markus Mueller | Annie Rialland | Sebastian Stueker | François Yvon | Marcely Zanon-Boito
Chahta Anumpa: A multimodal corpus of the Choctaw Language
Jacqueline Brixey | Eli Pincus | Ron Artstein
Jacqueline Brixey | Eli Pincus | Ron Artstein
BULBasaa: A Bilingual Basaa-French Speech Corpus for the Evaluation of Language Documentation Tools
Fatima Hamlaoui | Emmanuel-Moselly Makasso | Markus Müller | Jonas Engelmann | Gilles Adda | Alex Waibel | Sebastian Stüker
Fatima Hamlaoui | Emmanuel-Moselly Makasso | Markus Müller | Jonas Engelmann | Gilles Adda | Alex Waibel | Sebastian Stüker
The MADAR Arabic Dialect Corpus and Lexicon
Houda Bouamor | Nizar Habash | Mohammad Salameh | Wajdi Zaghouani | Owen Rambow | Dana Abdulrahim | Ossama Obeid | Salam Khalifa | Fadhl Eryani | Alexander Erdmann | Kemal Oflazer
Houda Bouamor | Nizar Habash | Mohammad Salameh | Wajdi Zaghouani | Owen Rambow | Dana Abdulrahim | Ossama Obeid | Salam Khalifa | Fadhl Eryani | Alexander Erdmann | Kemal Oflazer
Designing a Collaborative Process to Create Bilingual Dictionaries of Indonesian Ethnic Languages
Arbi Haza Nasution | Yohei Murakami | Toru Ishida
Arbi Haza Nasution | Yohei Murakami | Toru Ishida
Learning to Map Natural Language Statements into Knowledge Base Representations for Knowledge Base Construction
Chin-Ho Lin | Hen-Hsen Huang | Hsin-Hsi Chen
Chin-Ho Lin | Hen-Hsen Huang | Hsin-Hsi Chen
Building a Knowledge Graph from Natural Language Definitions for Interpretable Text Entailment Recognition
Vivian Silva | André Freitas | Siegfried Handschuh
Vivian Silva | André Freitas | Siegfried Handschuh
Combining rule-based and embedding-based approaches to normalize textual entities with an ontology
Arnaud Ferré | Louise Deléger | Pierre Zweigenbaum | Claire Nédellec
Arnaud Ferré | Louise Deléger | Pierre Zweigenbaum | Claire Nédellec
T-REx: A Large Scale Alignment of Natural Language with Knowledge Base Triples
Hady Elsahar | Pavlos Vougiouklis | Arslen Remaci | Christophe Gravier | Jonathon Hare | Frederique Laforest | Elena Simperl
Hady Elsahar | Pavlos Vougiouklis | Arslen Remaci | Christophe Gravier | Jonathon Hare | Frederique Laforest | Elena Simperl
A Large Parallel Corpus of Full-Text Scientific Articles
Felipe Soares | Viviane Moreira | Karin Becker
Felipe Soares | Viviane Moreira | Karin Becker
The IIT Bombay English-Hindi Parallel Corpus
Anoop Kunchukuttan | Pratik Mehta | Pushpak Bhattacharyya
Anoop Kunchukuttan | Pratik Mehta | Pushpak Bhattacharyya
Extracting an English-Persian Parallel Corpus from Comparable Corpora
Akbar Karimi | Ebrahim Ansari | Bahram Sadeghi Bigham
Akbar Karimi | Ebrahim Ansari | Bahram Sadeghi Bigham
Learning Word Vectors for 157 Languages
Edouard Grave | Piotr Bojanowski | Prakhar Gupta | Armand Joulin | Tomas Mikolov
Edouard Grave | Piotr Bojanowski | Prakhar Gupta | Armand Joulin | Tomas Mikolov
SumeCzech: Large Czech News-Based Summarization Dataset
Milan Straka | Nikita Mediankin | Tom Kocmi | Zdeněk Žabokrtský | Vojtěch Hudeček | Jan Hajič
Milan Straka | Nikita Mediankin | Tom Kocmi | Zdeněk Žabokrtský | Vojtěch Hudeček | Jan Hajič
Text Simplification from Professionally Produced Corpora
Carolina Scarton | Gustavo Paetzold | Lucia Specia
Carolina Scarton | Gustavo Paetzold | Lucia Specia
Intertextual Correspondence for Integrating Corpora
Jacky Visser | Rory Duthie | John Lawrence | Chris Reed
Jacky Visser | Rory Duthie | John Lawrence | Chris Reed
Building Named Entity Recognition Taggers via Parallel Corpora
Rodrigo Agerri | Yiling Chung | Itziar Aldabe | Nora Aranberri | Gorka Labaka | German Rigau
Rodrigo Agerri | Yiling Chung | Itziar Aldabe | Nora Aranberri | Gorka Labaka | German Rigau
Cross-Document, Cross-Language Event Coreference Annotation Using Event Hoppers
Zhiyi Song | Ann Bies | Justin Mott | Xuansong Li | Stephanie Strassel | Christopher Caruso
Zhiyi Song | Ann Bies | Justin Mott | Xuansong Li | Stephanie Strassel | Christopher Caruso
TAP-DLND 1.0 : A Corpus for Document Level Novelty Detection
Tirthankar Ghosal | Amitra Salam | Swati Tiwari | Asif Ekbal | Pushpak Bhattacharyya
Tirthankar Ghosal | Amitra Salam | Swati Tiwari | Asif Ekbal | Pushpak Bhattacharyya
Analyzing Citation-Distance Networks for Evaluating Publication Impact
Drahomira Herrmannova | Petr Knoth | Robert Patton
Drahomira Herrmannova | Petr Knoth | Robert Patton
Incorporating Global Contexts into Sentence Embedding for Relational Extraction at the Paragraph Level with Distant Supervision
Eun-kyung Kim | Key-Sun Choi
Eun-kyung Kim | Key-Sun Choi
MCScript: A Novel Dataset for Assessing Machine Comprehension Using Script Knowledge
Simon Ostermann | Ashutosh Modi | Michael Roth | Stefan Thater | Manfred Pinkal
Simon Ostermann | Ashutosh Modi | Michael Roth | Stefan Thater | Manfred Pinkal
A Neural Network Based Model for Loanword Identification in Uyghur
Chenggang Mi | Yating Yang | Lei Wang | Xi Zhou | Tonghai Jiang
Chenggang Mi | Yating Yang | Lei Wang | Xi Zhou | Tonghai Jiang
Revisiting Distant Supervision for Relation Extraction
Tingsong Jiang | Jing Liu | Chin-Yew Lin | Zhifang Sui
Tingsong Jiang | Jing Liu | Chin-Yew Lin | Zhifang Sui
Incorporating Contextual Information for Language-Independent, Dynamic Disambiguation Tasks
Tobias Staron | Özge Alaçam | Wolfgang Menzel
Tobias Staron | Özge Alaçam | Wolfgang Menzel
Overcoming the Long Tail Problem: A Case Study on CO2-Footprint Estimation of Recipes using Information Retrieval
Melanie Geiger | Martin Braschler
Melanie Geiger | Martin Braschler
A vision-grounded dataset for predicting typical locations for verbs
Nelson Mukuze | Anna Rohrbach | Vera Demberg | Bernt Schiele
Nelson Mukuze | Anna Rohrbach | Vera Demberg | Bernt Schiele
Creating dialect sub-corpora by clustering: a case in Japanese for an adaptive method
Yo Sato | Kevin Heffernan
Yo Sato | Kevin Heffernan
A Fast and Flexible Webinterface for Dialect Research in the Low Countries
Roeland van Hout | Nicoline van der Sijs | Erwin Komen | Henk van den Heuvel
Roeland van Hout | Nicoline van der Sijs | Erwin Komen | Henk van den Heuvel
Arabic Dialect Identification in the Context of Bivalency and Code-Switching
Mahmoud El-Haj | Paul Rayson | Mariam Aboelezz
Mahmoud El-Haj | Paul Rayson | Mariam Aboelezz
Unified Guidelines and Resources for Arabic Dialect Orthography
Nizar Habash | Fadhl Eryani | Salam Khalifa | Owen Rambow | Dana Abdulrahim | Alexander Erdmann | Reem Faraj | Wajdi Zaghouani | Houda Bouamor | Nasser Zalmout | Sara Hassan | Faisal Al-Shargi | Sakhar Alkhereyf | Basma Abdulkareem | Ramy Eskander | Mohammad Salameh | Hind Saddiki
Nizar Habash | Fadhl Eryani | Salam Khalifa | Owen Rambow | Dana Abdulrahim | Alexander Erdmann | Reem Faraj | Wajdi Zaghouani | Houda Bouamor | Nasser Zalmout | Sara Hassan | Faisal Al-Shargi | Sakhar Alkhereyf | Basma Abdulkareem | Ramy Eskander | Mohammad Salameh | Hind Saddiki
Automatic Identification of Maghreb Dialects Using a Dictionary-Based Approach
Houda Saâdane | Hosni Seffih | Christian Fluhr | Khalid Choukri | Nasredine Semmar
Houda Saâdane | Hosni Seffih | Christian Fluhr | Khalid Choukri | Nasredine Semmar
Shami: A Corpus of Levantine Arabic Dialects
Kathrein Abu Kwaik | Motaz Saad | Stergios Chatzikyriakidis | Simon Dobnik
Kathrein Abu Kwaik | Motaz Saad | Stergios Chatzikyriakidis | Simon Dobnik
You Tweet What You Speak: A City-Level Dataset of Arabic Dialects
Muhammad Abdul-Mageed | Hassan Alhuzali | Mohamed Elaraby
Muhammad Abdul-Mageed | Hassan Alhuzali | Mohamed Elaraby
DART: A Large Dataset of Dialectal Arabic Tweets
Israa Alsarsour | Esraa Mohamed | Reem Suwaileh | Tamer Elsayed
Israa Alsarsour | Esraa Mohamed | Reem Suwaileh | Tamer Elsayed
Page Stream Segmentation with Convolutional Neural Nets Combining Textual and Visual Features
Gregor Wiedemann | Gerhard Heyer
Gregor Wiedemann | Gerhard Heyer
Automating Document Discovery in the Systematic Review Process: How to Use Chaff to Extract Wheat
Christopher Norman | Mariska Leeflang | Pierre Zweigenbaum | Aurélie Névéol
Christopher Norman | Mariska Leeflang | Pierre Zweigenbaum | Aurélie Névéol
Two Multilingual Corpora Extracted from the Tenders Electronic Daily for Machine Learning and Machine Translation Applications.
Oussama Ahmia | Nicolas Béchet | Pierre-François Marteau
Oussama Ahmia | Nicolas Béchet | Pierre-François Marteau
Using Adversarial Examples in Natural Language Processing
Petr Bělohlávek | Ondřej Plátek | Zdeněk Žabokrtský | Milan Straka
Petr Bělohlávek | Ondřej Plátek | Zdeněk Žabokrtský | Milan Straka
Automatic Annotation of Semantic Term Types in the Complete ACL Anthology Reference Corpus
Anne-Kathrin Schumann | Héctor Martínez Alonso
Anne-Kathrin Schumann | Héctor Martínez Alonso
Annotated Corpus of Scientific Conference’s Homepages for Information Extraction
Piotr Andruszkiewicz | Rafał Hazan
Piotr Andruszkiewicz | Rafał Hazan
WikiDragon: A Java Framework For Diachronic Content And Network Analysis Of MediaWikis
Rüdiger Gleim | Alexander Mehler | Sung Y. Song
Rüdiger Gleim | Alexander Mehler | Sung Y. Song
Studying Muslim Stereotyping through Microportrait Extraction
Antske Fokkens | Nel Ruigrok | Camiel Beukeboom | Gagestein Sarah | Wouter van Atteveldt
Antske Fokkens | Nel Ruigrok | Camiel Beukeboom | Gagestein Sarah | Wouter van Atteveldt
Analyzing the Quality of Counseling Conversations: the Tell-Tale Signs of High-quality Counseling
Verónica Pérez-Rosas | Xuetong Sun | Christy Li | Yuchen Wang | Kenneth Resnicow | Rada Mihalcea
Verónica Pérez-Rosas | Xuetong Sun | Christy Li | Yuchen Wang | Kenneth Resnicow | Rada Mihalcea
Interpersonal Relationship Labels for the CALLHOME Corpus
Denys Katerenchuk | David Guy Brizan | Andrew Rosenberg
Denys Katerenchuk | David Guy Brizan | Andrew Rosenberg
Text Mining for History: first steps on building a large dataset
Suemi Higuchi | Cláudia Freitas | Bruno Cuconato | Alexandre Rademaker
Suemi Higuchi | Cláudia Freitas | Bruno Cuconato | Alexandre Rademaker
Building Evaluation Datasets for Cultural Microblog Retrieval
Lorraine Goeuriot | Josiane Mothe | Philippe Mulhem | Eric SanJuan
Lorraine Goeuriot | Josiane Mothe | Philippe Mulhem | Eric SanJuan
Training and Adapting Multilingual NMT for Less-resourced and Morphologically Rich Languages
Matīss Rikters | Mārcis Pinnis | Rihards Krišlauks
Matīss Rikters | Mārcis Pinnis | Rihards Krišlauks
Cross-lingual Terminology Extraction for Translation Quality Estimation
Yu Yuan | Yuze Gao | Yue Zhang | Serge Sharoff
Yu Yuan | Yuze Gao | Yue Zhang | Serge Sharoff
Machine Translation of Low-Resource Spoken Dialects: Strategies for Normalizing Swiss German
Pierre-Edouard Honnet | Andrei Popescu-Belis | Claudiu Musat | Michael Baeriswyl
Pierre-Edouard Honnet | Andrei Popescu-Belis | Claudiu Musat | Michael Baeriswyl
Improving domain-specific SMT for low-resourced languages using data from different domains
Fathima Farhath | Pranavan Theivendiram | Surangika Ranathunga | Sanath Jayasena | Gihan Dias
Fathima Farhath | Pranavan Theivendiram | Surangika Ranathunga | Sanath Jayasena | Gihan Dias
Discovering Parallel Language Resources for Training MT Engines
Vassilis Papavassiliou | Prokopis Prokopidis | Stelios Piperidis
Vassilis Papavassiliou | Prokopis Prokopidis | Stelios Piperidis
A fine-grained error analysis of NMT, SMT and RBMT output for English-to-Dutch
Laura Van Brussel | Arda Tezcan | Lieve Macken
Laura Van Brussel | Arda Tezcan | Lieve Macken
Collection and Analysis of Code-switch Egyptian Arabic-English Speech Corpus
Injy Hamed | Mohamed Elmahdy | Slim Abdennadher
Injy Hamed | Mohamed Elmahdy | Slim Abdennadher
Evaluation of Machine Translation Performance Across Multiple Genres and Languages
Marlies van der Wees | Arianna Bisazza | Christof Monz
Marlies van der Wees | Arianna Bisazza | Christof Monz
A Multilingual Dataset for Evaluating Parallel Sentence Extraction from Comparable Corpora
Pierre Zweigenbaum | Serge Sharoff | Reinhard Rapp
Pierre Zweigenbaum | Serge Sharoff | Reinhard Rapp
A Morphologically Annotated Corpus of Emirati Arabic
Salam Khalifa | Nizar Habash | Fadhl Eryani | Ossama Obeid | Dana Abdulrahim | Meera Al Kaabi
Salam Khalifa | Nizar Habash | Fadhl Eryani | Ossama Obeid | Dana Abdulrahim | Meera Al Kaabi
CoNLL-UL: Universal Morphological Lattices for Universal Dependency Parsing
Amir More | Özlem Çetinoğlu | Çağrı Çöltekin | Nizar Habash | Benoît Sagot | Djamé Seddah | Dima Taji | Reut Tsarfaty
Amir More | Özlem Çetinoğlu | Çağrı Çöltekin | Nizar Habash | Benoît Sagot | Djamé Seddah | Dima Taji | Reut Tsarfaty
Manually Annotated Corpus of Polish Texts Published between 1830 and 1918
Witold Kieraś | Marcin Woliński
Witold Kieraś | Marcin Woliński
Evaluating Inflectional Complexity Crosslinguistically: a Processing Perspective
Claudia Marzi | Marcello Ferro | Ouafae Nahli | Patrizia Belik | Stavros Bompolas | Vito Pirrelli
Claudia Marzi | Marcello Ferro | Ouafae Nahli | Patrizia Belik | Stavros Bompolas | Vito Pirrelli
Parser combinators for Tigrinya and Oromo morphology
Patrick Littell | Tom McCoy | Na-Rae Han | Shruti Rijhwani | Zaid Sheikh | David Mortensen | Teruko Mitamura | Lori Levin
Patrick Littell | Tom McCoy | Na-Rae Han | Shruti Rijhwani | Zaid Sheikh | David Mortensen | Teruko Mitamura | Lori Levin
Building a Morphological Treebank for German from a Linguistic Database
Petra Steiner | Josef Ruppenhofer
Petra Steiner | Josef Ruppenhofer
CATS: A Tool for Customized Alignment of Text Simplification Corpora
Sanja Štajner | Marc Franco-Salvador | Paolo Rosso | Simone Paolo Ponzetto
Sanja Štajner | Marc Franco-Salvador | Paolo Rosso | Simone Paolo Ponzetto
KIT-Multi: A Translation-Oriented Multilingual Embedding Corpus
Thanh-Le Ha | Jan Niehues | Matthias Sperber | Ngoc Quan Pham | Alexander Waibel
Thanh-Le Ha | Jan Niehues | Matthias Sperber | Ngoc Quan Pham | Alexander Waibel
Multi-lingual Argumentative Corpora in English, Turkish, Greek, Albanian, Croatian, Serbian, Macedonian, Bulgarian, Romanian and Arabic
Alfred Sliwa | Yuan Ma | Ruishen Liu | Niravkumar Borad | Seyedeh Ziyaei | Mina Ghobadi | Firas Sabbah | Ahmet Aker
Alfred Sliwa | Yuan Ma | Ruishen Liu | Niravkumar Borad | Seyedeh Ziyaei | Mina Ghobadi | Firas Sabbah | Ahmet Aker
SemR-11: A Multi-Lingual Gold-Standard for Semantic Similarity and Relatedness for Eleven Languages
Siamak Barzegar | Brian Davis | Manel Zarrouk | Siegfried Handschuh | Andre Freitas
Siamak Barzegar | Brian Davis | Manel Zarrouk | Siegfried Handschuh | Andre Freitas
Corpora with Part-of-Speech Annotations for Three Regional Languages of France: Alsatian, Occitan and Picard
Delphine Bernhard | Anne-Laure Ligozat | Fanny Martin | Myriam Bras | Pierre Magistry | Marianne Vergez-Couret | Lucie Steiblé | Pascale Erhart | Nabil Hathout | Dominique Huck | Christophe Rey | Philippe Reynés | Sophie Rosset | Jean Sibille | Thomas Lavergne
Delphine Bernhard | Anne-Laure Ligozat | Fanny Martin | Myriam Bras | Pierre Magistry | Marianne Vergez-Couret | Lucie Steiblé | Pascale Erhart | Nabil Hathout | Dominique Huck | Christophe Rey | Philippe Reynés | Sophie Rosset | Jean Sibille | Thomas Lavergne
Part-of-Speech Tagging for Arabic Gulf Dialect Using Bi-LSTM
Randah Alharbi | Walid Magdy | Kareem Darwish | Ahmed AbdelAli | Hamdy Mubarak
Randah Alharbi | Walid Magdy | Kareem Darwish | Ahmed AbdelAli | Hamdy Mubarak
HiNTS: A Tagset for Middle Low German
Fabian Barteld | Sarah Ihden | Katharina Dreessen | Ingrid Schröder
Fabian Barteld | Sarah Ihden | Katharina Dreessen | Ingrid Schröder
Leveraging Lexical Resources and Constraint Grammar for Rule-Based Part-of-Speech Tagging in Welsh
Steven Neale | Kevin Donnelly | Gareth Watkins | Dawn Knight
Steven Neale | Kevin Donnelly | Gareth Watkins | Dawn Knight
Graph Based Semi-Supervised Learning Approach for Tamil POS tagging
Mokanarangan Thayaparan | Surangika Ranathunga | Uthayasanker Thayasivam
Mokanarangan Thayaparan | Surangika Ranathunga | Uthayasanker Thayasivam
What Causes the Differences in Communication Styles? A Multicultural Study on Directness and Elaborateness
Juliana Miehle | Wolfgang Minker | Stefan Ultes
Juliana Miehle | Wolfgang Minker | Stefan Ultes
FARMI: A FrAmework for Recording Multi-Modal Interactions
Patrik Jonell | Mattias Bystedt | Per Fallgren | Dimosthenis Kontogiorgos | José Lopes | Zofia Malisz | Samuel Mascarenhas | Catharine Oertel | Eran Raveh | Todd Shore
Patrik Jonell | Mattias Bystedt | Per Fallgren | Dimosthenis Kontogiorgos | José Lopes | Zofia Malisz | Samuel Mascarenhas | Catharine Oertel | Eran Raveh | Todd Shore
Creating Large-Scale Argumentation Structures for Dialogue Systems
Kazuki Sakai | Akari Inago | Ryuichiro Higashinaka | Yuichiro Yoshikawa | Hiroshi Ishiguro | Junji Tomita
Kazuki Sakai | Akari Inago | Ryuichiro Higashinaka | Yuichiro Yoshikawa | Hiroshi Ishiguro | Junji Tomita
Exploring Conversational Language Generation for Rich Content about Hotels
Marilyn Walker | Albry Smither | Shereen Oraby | Vrindavan Harrison | Hadar Shemtov
Marilyn Walker | Albry Smither | Shereen Oraby | Vrindavan Harrison | Hadar Shemtov
A Vietnamese Dialog Act Corpus Based on ISO 24617-2 standard
Thi-Lan Ngo | Pham Khac Linh | Hideaki Takeda
Thi-Lan Ngo | Pham Khac Linh | Hideaki Takeda
Annotating Attribution Relations in Arabic
Amal Alsaif | Tasniem Alyahya | Madawi Alotaibi | Huda Almuzaini | Abeer Algahtani
Amal Alsaif | Tasniem Alyahya | Madawi Alotaibi | Huda Almuzaini | Abeer Algahtani
The ADELE Corpus of Dyadic Social Text Conversations:Dialog Act Annotation with ISO 24617-2
Emer Gilmartin | Christian Saam | Brendan Spillane | Maria O’Reilly | Ketong Su | Arturo Calvo | Loredana Cerrato | Killian Levacher | Nick Campbell | Vincent Wade
Emer Gilmartin | Christian Saam | Brendan Spillane | Maria O’Reilly | Ketong Su | Arturo Calvo | Loredana Cerrato | Killian Levacher | Nick Campbell | Vincent Wade
An Assessment of Explicit Inter- and Intra-sentential Discourse Connectives in Turkish Discourse Bank
Deniz Zeyrek | Murathan Kurfalı
Deniz Zeyrek | Murathan Kurfalı
Compilation of Corpora for the Study of the Information Structure–Prosody Interface
Alicia Burga | Mónica Domínguez | Mireia Farrús | Leo Wanner
Alicia Burga | Mónica Domínguez | Mireia Farrús | Leo Wanner
Preliminary Analysis of Embodied Interactions between Science Communicators and Visitors Based on a Multimodal Corpus of Japanese Conversations in a Science Museum
Rui Sakaida | Ryosaku Makino | Mayumi Bono
Rui Sakaida | Ryosaku Makino | Mayumi Bono
Improving Crowdsourcing-Based Annotation of Japanese Discourse Relations
Yudai Kishimoto | Shinnosuke Sawada | Yugo Murawaki | Daisuke Kawahara | Sadao Kurohashi
Yudai Kishimoto | Shinnosuke Sawada | Yugo Murawaki | Daisuke Kawahara | Sadao Kurohashi
Automatic Labeling of Problem-Solving Dialogues for Computational Microgenetic Learning Analytics
Yuanliang Meng | Anna Rumshisky | Florence Sullivan
Yuanliang Meng | Anna Rumshisky | Florence Sullivan
Increasing Argument Annotation Reproducibility by Using Inter-annotator Agreement to Improve Guidelines
Milagro Teruel | Cristian Cardellino | Fernando Cardellino | Laura Alonso Alemany | Serena Villata
Milagro Teruel | Cristian Cardellino | Fernando Cardellino | Laura Alonso Alemany | Serena Villata
Analyzing Vocabulary Commonality Index Using Large-scaled Database of Child Language Development
Yan Cao | Yasuhiro Minami | Yuko Okumura | Tessei Kobayashi
Yan Cao | Yasuhiro Minami | Yuko Okumura | Tessei Kobayashi
Revita: a Language-learning Platform at the Intersection of ITS and CALL
Anisia Katinskaia | Javad Nouri | Roman Yangarber
Anisia Katinskaia | Javad Nouri | Roman Yangarber
The Distribution and Prosodic Realization of Verb Forms in German Infant-Directed Speech
Bettina Braun | Katharina Zahner
Bettina Braun | Katharina Zahner
Cross-linguistically Small World Networks are Ubiquitous in Child-directed Speech
Steven Moran | Danica Pajović | Sabine Stoll
Steven Moran | Danica Pajović | Sabine Stoll
L1-L2 Parallel Treebank of Learner Chinese: Overused and Underused Syntactic Structures
Keying Li | John Lee
Keying Li | John Lee
The Use of Text Alignment in Semi-Automatic Error Analysis: Use Case in the Development of the Corpus of the Latvian Language Learners
Roberts Darģis | Ilze Auziņa | Kristīne Levāne-Petrova
Roberts Darģis | Ilze Auziņa | Kristīne Levāne-Petrova
An SLA Corpus Annotated with Pedagogically Relevant Grammatical Structures
Leonardo Zilio | Rodrigo Wilkens | Cédrick Fairon
Leonardo Zilio | Rodrigo Wilkens | Cédrick Fairon
Portable Spelling Corrector for a Less-Resourced Language: Amharic
Andargachew Mekonnen Gezmu | Andreas Nürnberger | Binyam Ephrem Seyoum
Andargachew Mekonnen Gezmu | Andreas Nürnberger | Binyam Ephrem Seyoum
A Speaking Atlas of the Regional Languages of France
Philippe Boula de Mareüil | Albert Rilliard | Frédéric Vernier
Philippe Boula de Mareüil | Albert Rilliard | Frédéric Vernier
Pronunciation Dictionaries for the Alsatian Dialects to Analyze Spelling and Phonetic Variation
Lucie Steiblé | Delphine Bernhard
Lucie Steiblé | Delphine Bernhard
ChAnot: An Intelligent Annotation Tool for Indigenous and Highly Agglutinative Languages in Peru
Rodolfo Mercado-Gonzales | José Pereira-Noriega | Marco Sobrevilla | Arturo Oncevay
Rodolfo Mercado-Gonzales | José Pereira-Noriega | Marco Sobrevilla | Arturo Oncevay
The DLDP Survey on Digital Use and Usability of EU Regional and Minority Languages
Claudia Soria | Valeria Quochi | Irene Russo
Claudia Soria | Valeria Quochi | Irene Russo
ASR for Documenting Acutely Under-Resourced Indigenous Languages
Robbie Jimerson | Emily Prud’hommeaux
Robbie Jimerson | Emily Prud’hommeaux
Building a Sentiment Corpus of Tweets in Brazilian Portuguese
Henrico Brum | Maria das Graças Volpe Nunes
Henrico Brum | Maria das Graças Volpe Nunes
‘Aye’ or ‘No’? Speech-level Sentiment Analysis of Hansard UK Parliamentary Debate Transcripts
Gavin Abercrombie | Riza Batista-Navarro
Gavin Abercrombie | Riza Batista-Navarro
NoReC: The Norwegian Review Corpus
Erik Velldal | Lilja Øvrelid | Eivind Alexander Bergem | Cathrine Stadsnes | Samia Touileb | Fredrik Jørgensen
Erik Velldal | Lilja Øvrelid | Eivind Alexander Bergem | Cathrine Stadsnes | Samia Touileb | Fredrik Jørgensen
SenSALDO: Creating a Sentiment Lexicon for Swedish
Jacobo Rouces | Nina Tahmasebi | Lars Borin | Stian Rødven Eide
Jacobo Rouces | Nina Tahmasebi | Lars Borin | Stian Rødven Eide
Corpus Building and Evaluation of Aspect-based Opinion Summaries from Tweets in Spanish
Daniel Peñaloza | Rodrigo López | Juanjosé Tenorio | Héctor Gómez | Arturo Oncevay-Marcos | Marco A. Sobrevilla Cabezudo
Daniel Peñaloza | Rodrigo López | Juanjosé Tenorio | Héctor Gómez | Arturo Oncevay-Marcos | Marco A. Sobrevilla Cabezudo
Application and Analysis of a Multi-layered Scheme for Irony on the Italian Twitter Corpus TWITTIRÒ
Alessandra Teresa Cignarella | Cristina Bosco | Viviana Patti | Mirko Lai
Alessandra Teresa Cignarella | Cristina Bosco | Viviana Patti | Mirko Lai
SMILE Swiss German Sign Language Dataset
Sarah Ebling | Necati Cihan Camgöz | Penny Boyes Braem | Katja Tissi | Sandra Sidler-Miserez | Stephanie Stoll | Simon Hadfield | Tobias Haug | Richard Bowden | Sandrine Tornay | Marzieh Razavi | Mathew Magimai-Doss
Sarah Ebling | Necati Cihan Camgöz | Penny Boyes Braem | Katja Tissi | Sandra Sidler-Miserez | Stephanie Stoll | Simon Hadfield | Tobias Haug | Richard Bowden | Sandrine Tornay | Marzieh Razavi | Mathew Magimai-Doss
IPSL: A Database of Iconicity Patterns in Sign Languages. Creation and Use
Vadim Kimmelman | Anna Klezovich | George Moroz
Vadim Kimmelman | Anna Klezovich | George Moroz
Sign Languages and the Online World Online Dictionaries & Lexicostatistics
Shi Yu | Carlo Geraci | Natasha Abner
Shi Yu | Carlo Geraci | Natasha Abner
Elicitation protocol and material for a corpus of long prepared monologues in Sign Language
Michael Filhol | Mohamed Nassime Hadjadj
Michael Filhol | Mohamed Nassime Hadjadj
Deep JSLC: A Multimodal Corpus Collection for Data-driven Generation of Japanese Sign Language Expressions
Heike Brock | Kazuhiro Nakadai
Heike Brock | Kazuhiro Nakadai
Modeling French Sign Language: a proposal for a semantically compositional system
Mohamed Nassime Hadjadj | Michael Filhol | Annelies Braffort
Mohamed Nassime Hadjadj | Michael Filhol | Annelies Braffort
Construction of the Corpus of Everyday Japanese Conversation: An Interim Report
Hanae Koiso | Yasuharu Den | Yuriko Iseki | Wakako Kashino | Yoshiko Kawabata | Ken’ya Nishikawa | Yayoi Tanaka | Yasuyuki Usuda
Hanae Koiso | Yasuharu Den | Yuriko Iseki | Wakako Kashino | Yoshiko Kawabata | Ken’ya Nishikawa | Yayoi Tanaka | Yasuyuki Usuda
Carcinologic Speech Severity Index Project: A Database of Speech Disorder Productions to Assess Quality of Life Related to Speech After Cancer
Corine Astésano | Mathieu Balaguer | Jérôme Farinas | Corinne Fredouille | Pascal Gaillard | Alain Ghio | Imed Laaridh | Muriel Lalain | Benoît Lepage | Julie Mauclair | Olivier Nocaudie | Julien Pinquier | Oriol Pont | Gilles Pouchoulin | Michèle Puech | Danièle Robert | Etienne Sicard | Virginie Woisard
Corine Astésano | Mathieu Balaguer | Jérôme Farinas | Corinne Fredouille | Pascal Gaillard | Alain Ghio | Imed Laaridh | Muriel Lalain | Benoît Lepage | Julie Mauclair | Olivier Nocaudie | Julien Pinquier | Oriol Pont | Gilles Pouchoulin | Michèle Puech | Danièle Robert | Etienne Sicard | Virginie Woisard
Parallel Corpora in Mboshi (Bantu C25, Congo-Brazzaville)
Annie Rialland | Martine Adda-Decker | Guy-Noël Kouarata | Gilles Adda | Laurent Besacier | Lori Lamel | Elodie Gauthier | Pierre Godard | Jamison Cooper-Leavitt
Annie Rialland | Martine Adda-Decker | Guy-Noël Kouarata | Gilles Adda | Laurent Besacier | Lori Lamel | Elodie Gauthier | Pierre Godard | Jamison Cooper-Leavitt
A Multimodal Corpus of Expert Gaze and Behavior during Phonetic Segmentation Tasks
Arif Khan | Ingmar Steiner | Yusuke Sugano | Andreas Bulling | Ross Macdonald
Arif Khan | Ingmar Steiner | Yusuke Sugano | Andreas Bulling | Ross Macdonald
Statistical Analysis of Missing Translation in Simultaneous Interpretation Using A Large-scale Bilingual Speech Corpus
Zhongxi Cai | Koichiro Ryu | Shigeki Matsubara
Zhongxi Cai | Koichiro Ryu | Shigeki Matsubara
SynPaFlex-Corpus: An Expressive French Audiobooks Corpus dedicated to expressive speech synthesis.
Aghilas Sini | Damien Lolive | Gaëlle Vidal | Marie Tahon | Élisabeth Delais-Roussarie
Aghilas Sini | Damien Lolive | Gaëlle Vidal | Marie Tahon | Élisabeth Delais-Roussarie
The MonPaGe_HA Database for the Documentation of Spoken French Throughout Adulthood
Cécile Fougeron | Véronique Delvaux | Lucie Ménard | Marina Laganaro
Cécile Fougeron | Véronique Delvaux | Lucie Ménard | Marina Laganaro
Bringing Order to Chaos: A Non-Sequential Approach for Browsing Large Sets of Found Audio Data
Per Fallgren | Zofia Malisz | Jens Edlund
Per Fallgren | Zofia Malisz | Jens Edlund
CoLoSS: Cognitive Load Corpus with Speech and Performance Data from a Symbol-Digit Dual-Task
Robert Herms | Maria Wirzberger | Maximilian Eibl | Günter Daniel Rey
Robert Herms | Maria Wirzberger | Maximilian Eibl | Günter Daniel Rey
Edit me: A Corpus and a Framework for Understanding Natural Language Image Editing
Ramesh Manuvinakurike | Jacqueline Brixey | Trung Bui | Walter Chang | Doo Soon Kim | Ron Artstein | Kallirroi Georgila
Ramesh Manuvinakurike | Jacqueline Brixey | Trung Bui | Walter Chang | Doo Soon Kim | Ron Artstein | Kallirroi Georgila
Enriching a Lexicon of Discourse Connectives with Corpus-based Data
Anna Feltracco | Elisabetta Jezek | Bernardo Magnini
Anna Feltracco | Elisabetta Jezek | Bernardo Magnini
SimPA: A Sentence-Level Simplification Corpus for the Public Administration Domain
Carolina Scarton | Gustavo Paetzold | Lucia Specia
Carolina Scarton | Gustavo Paetzold | Lucia Specia
The brWaC Corpus: A New Open Resource for Brazilian Portuguese
Jorge A. Wagner Filho | Rodrigo Wilkens | Marco Idiart | Aline Villavicencio
Jorge A. Wagner Filho | Rodrigo Wilkens | Marco Idiart | Aline Villavicencio
The German Reference Corpus DeReKo: New Developments – New Opportunities
Marc Kupietz | Harald Lüngen | Paweł Kamocki | Andreas Witt
Marc Kupietz | Harald Lüngen | Paweł Kamocki | Andreas Witt
Risamálheild: A Very Large Icelandic Text Corpus
Steinþór Steingrímsson | Sigrún Helgadóttir | Eiríkur Rögnvaldsson | Starkaður Barkarson | Jón Guðnason
Steinþór Steingrímsson | Sigrún Helgadóttir | Eiríkur Rögnvaldsson | Starkaður Barkarson | Jón Guðnason
TriMED: A Multilingual Terminological Database
Federica Vezzani | Giorgio Maria Di Nunzio | Geneviève Henrot
Federica Vezzani | Giorgio Maria Di Nunzio | Geneviève Henrot
Preparation and Usage of Xhosa Lexicographical Data for a Multilingual, Federated Environment
Sonja Bosch | Thomas Eckart | Bettina Klimek | Dirk Goldhahn | Uwe Quasthoff
Sonja Bosch | Thomas Eckart | Bettina Klimek | Dirk Goldhahn | Uwe Quasthoff
A Lexicon of Discourse Markers for Portuguese – LDM-PT
Amália Mendes | Iria del Rio | Manfred Stede | Felix Dombek
Amália Mendes | Iria del Rio | Manfred Stede | Felix Dombek
One Language to rule them all: modelling Morphological Patterns in a Large Scale Italian Lexicon with SWRL
Fahad Khan | Andrea Bellandi | Francesca Frontini | Monica Monachini
Fahad Khan | Andrea Bellandi | Francesca Frontini | Monica Monachini
WordNet-Shp: Towards the Building of a Lexical Database for a Peruvian Minority Language
Diego Maguiño-Valencia | Arturo Oncevay-Marcos | Marco A. Sobrevilla Cabezudo
Diego Maguiño-Valencia | Arturo Oncevay-Marcos | Marco A. Sobrevilla Cabezudo
Retrieving Information from the French Lexical Network in RDF/OWL Format
Alexsandro Fonseca | Fatiha Sadat | François Lareau
Alexsandro Fonseca | Fatiha Sadat | François Lareau
Transforming Wikipedia into a Large-Scale Fine-Grained Entity Type Corpus
Abbas Ghaddar | Philippe Langlais
Abbas Ghaddar | Philippe Langlais
Error Analysis of Uyghur Name Tagging: Language-specific Techniques and Remaining Challenges
Halidanmu Abudukelimu | Abudoukelimu Abulizi | Boliang Zhang | Xiaoman Pan | Di Lu | Heng Ji | Yang Liu
Halidanmu Abudukelimu | Abudoukelimu Abulizi | Boliang Zhang | Xiaoman Pan | Di Lu | Heng Ji | Yang Liu
BiLSTM-CRF for Persian Named-Entity Recognition ArmanPersoNERCorpus: the First Entity-Annotated Persian Dataset
Hanieh Poostchi | Ehsan Zare Borzeshi | Massimo Piccardi
Hanieh Poostchi | Ehsan Zare Borzeshi | Massimo Piccardi
Data Anonymization for Requirements Quality Analysis: a Reproducible Automatic Error Detection Task
Juyeon Kang | Jungyeul Park
Juyeon Kang | Jungyeul Park
A German Corpus for Fine-Grained Named Entity Recognition and Relation Extraction of Traffic and Industry Events
Martin Schiersch | Veselina Mironova | Maximilian Schmitt | Philippe Thomas | Aleksandra Gabryszak | Leonhard Hennig
Martin Schiersch | Veselina Mironova | Maximilian Schmitt | Philippe Thomas | Aleksandra Gabryszak | Leonhard Hennig
A Corpus Study and Annotation Schema for Named Entity Recognition and Relation Extraction of Business Products
Saskia Schön | Veselina Mironova | Aleksandra Gabryszak | Leonhard Hennig
Saskia Schön | Veselina Mironova | Aleksandra Gabryszak | Leonhard Hennig
Portuguese Named Entity Recognition using Conditional Random Fields and Local Grammars
Juliana Pirovani | Elias Oliveira
Juliana Pirovani | Elias Oliveira
M-CNER: A Corpus for Chinese Named Entity Recognition in Multi-Domains
Qi Lu | YaoSheng Yang | Zhenghua Li | Wenliang Chen | Min Zhang
Qi Lu | YaoSheng Yang | Zhenghua Li | Wenliang Chen | Min Zhang
SlugNERDS: A Named Entity Recognition Tool for Open Domain Dialogue Systems
Kevin Bowden | Jiaqi Wu | Shereen Oraby | Amita Misra | Marilyn Walker
Kevin Bowden | Jiaqi Wu | Shereen Oraby | Amita Misra | Marilyn Walker
Transfer Learning for Named-Entity Recognition with Neural Networks
Ji Young Lee | Franck Dernoncourt | Peter Szolovits
Ji Young Lee | Franck Dernoncourt | Peter Szolovits
ForFun 1.0: Prague Database of Forms and Functions – An Invaluable Resource for Linguistic Research
Marie Mikulová | Eduard Bejček
Marie Mikulová | Eduard Bejček
The LIA Treebank of Spoken Norwegian Dialects
Lilja Øvrelid | Andre Kåsen | Kristin Hagen | Anders Nøklestad | Per Erik Solberg | Janne Bondi Johannessen
Lilja Øvrelid | Andre Kåsen | Kristin Hagen | Anders Nøklestad | Per Erik Solberg | Janne Bondi Johannessen
Errator: a Tool to Help Detect Annotation Errors in the Universal Dependencies Project
Guillaume Wisniewski
Guillaume Wisniewski
SandhiKosh: A Benchmark Corpus for Evaluating Sanskrit Sandhi Tools
Shubham Bhardwaj | Neelamadhav Gantayat | Nikhil Chaturvedi | Rahul Garg | Sumeet Agarwal
Shubham Bhardwaj | Neelamadhav Gantayat | Nikhil Chaturvedi | Rahul Garg | Sumeet Agarwal
Creation of a Balanced State-of-the-Art Multilayer Corpus for NLU
Normunds Gruzitis | Lauma Pretkalnina | Baiba Saulite | Laura Rituma | Gunta Nespore-Berzkalne | Arturs Znotins | Peteris Paikens
Normunds Gruzitis | Lauma Pretkalnina | Baiba Saulite | Laura Rituma | Gunta Nespore-Berzkalne | Arturs Znotins | Peteris Paikens
Adding Syntactic Annotations to Flickr30k Entities Corpus for Multimodal Ambiguous Prepositional-Phrase Attachment Resolution
Sebastien Delecraz | Alexis Nasr | Frederic Bechet | Benoit Favre
Sebastien Delecraz | Alexis Nasr | Frederic Bechet | Benoit Favre
Analyzing Middle High German Syntax with RDF and SPARQL
Christian Chiarcos | Benjamin Kosmehl | Christian Fäth | Maria Sukhareva
Christian Chiarcos | Benjamin Kosmehl | Christian Fäth | Maria Sukhareva
Cheating a Parser to Death: Data-driven Cross-Treebank Annotation Transfer
Djamé Seddah | Eric de la Clergerie | Benoît Sagot | Héctor Martínez Alonso | Marie Candito
Djamé Seddah | Eric de la Clergerie | Benoît Sagot | Héctor Martínez Alonso | Marie Candito
Universal Dependencies and Quantitative Typological Trends. A Case Study on Word Order
Chiara Alzetta | Felice Dell’Orletta | Simonetta Montemagni | Giulia Venturi
Chiara Alzetta | Felice Dell’Orletta | Simonetta Montemagni | Giulia Venturi
Interoperability of Language-related Information: Mapping the BLL Thesaurus to Lexvo and Glottolog
Vanya Dimitrova | Christian Fäth | Christian Chiarcos | Heike Renner-Westermann | Frank Abromeit
Vanya Dimitrova | Christian Fäth | Christian Chiarcos | Heike Renner-Westermann | Frank Abromeit
Browsing and Supporting Pluricentric Global Wordnet, or just your Wordnet of Interest
António Branco | Ruben Branco | Chakaveh Saedi | João Silva
António Branco | Ruben Branco | Chakaveh Saedi | João Silva
Extended HowNet 2.0 – An Entity-Relation Common-Sense Representation Model
Wei-Yun Ma | Yueh-Yin Shih
Wei-Yun Ma | Yueh-Yin Shih
The Circumstantial Event Ontology (CEO) and ECB+/CEO: an Ontology and Corpus for Implicit Causal Relations between Events
Roxane Segers | Tommaso Caselli | Piek Vossen
Roxane Segers | Tommaso Caselli | Piek Vossen