André Lynum


2014

pdf
NTNU: Measuring Semantic Similarity with Sublexical Feature Representations and Soft Cardinality
André Lynum | Partha Pakray | Björn Gambäck | Sergio Jimenez
Proceedings of the 8th International Workshop on Semantic Evaluation (SemEval 2014)

2013

pdf
Improving Word Translation Disambiguation by Capturing Multiword Expressions with Dictionaries
Lars Bungum | Björn Gambäck | André Lynum | Erwin Marsi
Proceedings of the 9th Workshop on Multiword Expressions

pdf
Native Language Identification using large scale lexical features
André Lynum
Proceedings of the Eighth Workshop on Innovative Use of NLP for Building Educational Applications

pdf
NTNU-CORE: Combining strong features for semantic similarity
Erwin Marsi | Hans Moen | Lars Bungum | Gleb Sizov | Björn Gambäck | André Lynum
Second Joint Conference on Lexical and Computational Semantics (*SEM), Volume 1: Proceedings of the Main Conference and the Shared Task: Semantic Textual Similarity

2012

pdf
The Nordic Dialect Corpus
Janne Bondi Johannessen | Joel Priestley | Kristin Hagen | Anders Nøklestad | André Lynum
Proceedings of the Eighth International Conference on Language Resources and Evaluation (LREC'12)

In this paper, we describe the Nordic Dialect Corpus, which has recently been completed. The corpus has a variety of features that combined makes it an advanced tool for language researchers. These features include: Linguistic contents (dialects from five closely related languages), annotation (tagging and two types of transcription), search interface (advanced possibilities for combining a large array of search criteria and results presentation in an intuitive and simple interface), many search variables (linguistics-based, informant-based, time-based), multimedia display (linking of sound and video to transcriptions), display of results in maps, display of informant details (number of words and other information on informants), advanced results handling (concordances, collocations, counts and statistics shown in a variety of graphical modes, plus further processing). Finally, and importantly, the corpus is freely available for research on the web. We give examples of both various kinds of searches, of displays of results and of results handling.