Adam Meyers

Also published as: A. Meyers


2024

pdf
Towards Better Inclusivity: A Diverse Tweet Corpus of English Varieties
Nhi Pham | Lachlan Pham | Adam Meyers
Proceedings of The 18th Linguistic Annotation Workshop (LAW-XVIII)

The prevalence of social media presents a growing opportunity to collect and analyse examples of English varieties. Whilst usage of these varieties is often used only in spoken contexts or hard-to-access private messages, social media sites like Twitter provide a platform for users to communicate informally in a scrapeable format. Notably, Indian English (Hinglish), Singaporean English (Singlish), and African-American English (AAE) can be commonly found online. These varieties pose a challenge to existing natural language processing (NLP) tools as they often differ orthographically and syntactically from standard English for which the majority of these tools are built. NLP models trained on standard English texts produced biased outcomes for users of underrepresented varieties (Blodgett and O’Connor, 2017). Some research has aimed to overcome the inherent biases caused by unrepresentative data through techniques like data augmentation or adjusting training models. We aim to address the issue of bias at its root - the data itself. We curate a dataset of tweets from countries with high proportions of underserved English variety speakers, and propose an annotation framework of six categorical classifications along a pseudo-spectrum that measures the degree of standard English and that thereby indirectly aims to surface the manifestations of English varieties in these tweets.

2023

pdf
Classification of US Supreme Court Cases Using BERT-Based Techniques
Shubham Vatsal | Adam Meyers | John E. Ortega
Proceedings of the 14th International Conference on Recent Advances in Natural Language Processing

Models based on bidirectional encoder representations from transformers (BERT) produce state of the art (SOTA) results on many natural language processing (NLP) tasks such as named entity recognition (NER), part-of-speech (POS) tagging etc. An interesting phenomenon occurs when classifying long documents such as those from the US supreme court where BERT-based models can be considered difficult to use on a first-pass or out-of-the-box basis. In this paper, we experiment with several BERT-based classification techniques for US supreme court decisions or supreme court database (SCDB) and compare them with the previous SOTA results. We then compare our results specifically with SOTA models for long documents. We compare our results for two classification tasks: (1) a broad classification task with 15 categories and (2) a fine-grained classification task with 279 categories. Our best result produces an accuracy of 80% on the 15 broad categories and 60% on the fine-grained 279 categories which marks an improvement of 8% and 28% respectively from previously reported SOTA results.

2022

pdf bib
On Breadth Alone: Improving the Precision of Terminology Extraction Systems on Patent Corpora
Sean Nordquist | Adam Meyers
Proceedings of the Natural Legal Language Processing Workshop 2022

Automatic Terminology Extraction (ATE) methods are a class of linguistic, statistical, machine learning or hybrid techniques for identifying terminology in a set of documents. Most modern ATE methods use a statistical measure of how important or characteristic a potential term is to a foreground corpus by using a second background corpus as a baseline. While many variables with ATE methods have been carefully evaluated and tuned in the literature, the effects of choosing a particular background corpus over another are not obvious. In this paper, we propose a methodology that allows us to adjust the relative breadth of the foreground and background corpora in patent documents by taking advantage of the Cooperative Patent Classification (CPC) scheme. Our results show that for every foreground corpus, the broadest background corpus gave the worst performance, in the worst case that difference is 17%. Similarly, the least broad background corpus gave suboptimal performance in all three experiments. We also demonstrate qualitative differences between background corpora – narrower background corpora tend towards more technical output. We expect our results to generalize to terminology extraction for other legal and technical documents and, generally, to the foreground/background approach to ATE.

2019

pdf bib
Proceedings of the Natural Legal Language Processing Workshop 2019
Nikolaos Aletras | Elliott Ash | Leslie Barrett | Daniel Chen | Adam Meyers | Daniel Preotiuc-Pietro | David Rosenberg | Amanda Stent
Proceedings of the Natural Legal Language Processing Workshop 2019

2018

pdf
Letting a Neural Network Decide Which Machine Translation System to Use for Black-Box Fuzzy-Match Repair
John E. Ortega | Weiyi Lu | Adam Meyers | Kyunghyun Cho
Proceedings of the 21st Annual Conference of the European Association for Machine Translation

While systems using the Neural Network-based Machine Translation (NMT) paradigm achieve the highest scores on recent shared tasks, phrase-based (PBMT) systems, rule-based (RBMT) systems and other systems may get better results for individual examples. Therefore, combined systems should achieve the best results for MT, particularly if the system combination method can take advantage of the strengths of each paradigm. In this paper, we describe a system that predicts whether a NMT, PBMT or RBMT will get the best Spanish translation result for a particular English sentence in DGT-TM 20161. Then we use fuzzy-match repair (FMR) as a mechanism to show that the combined system outperforms individual systems in a black-box machine translation setting.

2015

pdf bib
Proceedings of the 9th Linguistic Annotation Workshop
Adam Meyers | Ines Rehbein | Heike Zinsmeister
Proceedings of the 9th Linguistic Annotation Workshop

2014

pdf
Corpus and Method for Identifying Citations in Non-Academic Text
Yifan He | Adam Meyers
Proceedings of the Ninth International Conference on Language Resources and Evaluation (LREC'14)

We attempt to identify citations in non-academic text such as patents. Unlike academic articles which often provide bibliographies and follow consistent citation styles, non-academic text cites scientific research in a more ad-hoc manner. We manually annotate citations in 50 patents, train a CRF classifier to find new citations, and apply a reranker to incorporate non-local information. Our best system achieves 0.83 F-score on 5-fold cross validation.

pdf
Annotating Relations in Scientific Articles
Adam Meyers | Giancarlo Lee | Angus Grieve-Smith | Yifan He | Harriet Taber
Proceedings of the Ninth International Conference on Language Resources and Evaluation (LREC'14)

Relations (ABBREVIATE, EXEMPLIFY, ORIGINATE, REL_WORK, OPINION) between entities (citations, jargon, people, organizations) are annotated for PubMed scientific articles. We discuss our specifications, pre-processing and evaluation

pdf bib
Proceedings of the COLING Workshop on Synchronic and Diachronic Approaches to Analyzing Technical Language
Adam Meyers | Yifan He | Ralph Grishman
Proceedings of the COLING Workshop on Synchronic and Diachronic Approaches to Analyzing Technical Language

pdf bib
Jargon-Term Extraction by Chunking
Adam Meyers | Zachary Glass | Angus Grieve-Smith | Yifan He | Shasha Liao | Ralph Grishman
Proceedings of the COLING Workshop on Synchronic and Diachronic Approaches to Analyzing Technical Language

2013

pdf
A Preliminary Study of Tweet Summarization using Information Extraction
Wei Xu | Ralph Grishman | Adam Meyers | Alan Ritter
Proceedings of the Workshop on Language Analysis in Social Media

pdf
Towards Fine-grained Citation Function Classification
Xiang Li | Yifan He | Adam Meyers | Ralph Grishman
Proceedings of the International Conference Recent Advances in Natural Language Processing RANLP 2013

pdf
Contrasting and Corroborating Citations in Journal Articles
Adam Meyers
Proceedings of the International Conference Recent Advances in Natural Language Processing RANLP 2013

2011

pdf bib
Proceedings of the 5th Linguistic Annotation Workshop
Nancy Ide | Adam Meyers | Sameer Pradhan | Katrin Tomanek
Proceedings of the 5th Linguistic Annotation Workshop

pdf
Improving MT Word Alignment Using Aligned Multi-Stage Parses
Adam Meyers | Michiko Kosaka | Shasha Liao | Nianwen Xue
Proceedings of Fifth Workshop on Syntax, Semantics and Structure in Statistical Translation

2009

pdf bib
The CoNLL-2009 Shared Task: Syntactic and Semantic Dependencies in Multiple Languages
Jan Hajič | Massimiliano Ciaramita | Richard Johansson | Daisuke Kawahara | Maria Antònia Martí | Lluís Màrquez | Adam Meyers | Joakim Nivre | Sebastian Padó | Jan Štěpánek | Pavel Straňák | Mihai Surdeanu | Nianwen Xue | Yi Zhang
Proceedings of the Thirteenth Conference on Computational Natural Language Learning (CoNLL 2009): Shared Task

pdf
Automatic Recognition of Logical Relations for English, Chinese and Japanese in the GLARF Framework
Adam Meyers | Michiko Kosaka | Nianwen Xue | Heng Ji | Ang Sun | Shasha Liao | Wei Xu
Proceedings of the Workshop on Semantic Evaluations: Recent Achievements and Future Directions (SEW-2009)

pdf bib
Proceedings of the Third Linguistic Annotation Workshop (LAW III)
Manfred Stede | Chu-Ren Huang | Nancy Ide | Adam Meyers
Proceedings of the Third Linguistic Annotation Workshop (LAW III)

pdf
Transducing Logical Relations from Automatic and Manual GLARF
Adam Meyers | Michiko Kosaka | Heng Ji | Nianwen Xue | Mary Harper | Ang Sun | Wei Xu | Shasha Liao
Proceedings of the Third Linguistic Annotation Workshop (LAW III)

pdf
Who, What, When, Where, Why? Comparing Multiple Approaches to the Cross-Lingual 5W Task
Kristen Parton | Kathleen R. McKeown | Bob Coyne | Mona T. Diab | Ralph Grishman | Dilek Hakkani-Tür | Mary Harper | Heng Ji | Wei Yun Ma | Adam Meyers | Sara Stolbach | Ang Sun | Gokhan Tur | Wei Xu | Sibel Yaman
Proceedings of the Joint Conference of the 47th Annual Meeting of the ACL and the 4th International Joint Conference on Natural Language Processing of the AFNLP

pdf
The Role of Implicit Argumentation in Nominal SRL
Matthew Gerber | Joyce Chai | Adam Meyers
Proceedings of Human Language Technologies: The 2009 Annual Conference of the North American Chapter of the Association for Computational Linguistics

2008

pdf
The CoNLL 2008 Shared Task on Joint Parsing of Syntactic and Semantic Dependencies
Mihai Surdeanu | Richard Johansson | Adam Meyers | Lluís Màrquez | Joakim Nivre
CoNLL 2008: Proceedings of the Twelfth Conference on Computational Natural Language Learning

2007

pdf bib
Proceedings of the Linguistic Annotation Workshop
Branimir Boguraev | Nancy Ide | Adam Meyers | Shigeko Nariyama | Manfred Stede | Janyce Wiebe | Graham Wilcock
Proceedings of the Linguistic Annotation Workshop

pdf
Shared Corpora Working Group Report
Adam Meyers | Nancy Ide | Ludovic Denoyer | Yusuke Shinyama
Proceedings of the Linguistic Annotation Workshop

2006

pdf bib
Proceedings of the Workshop on Frontiers in Linguistically Annotated Corpora 2006
Timothy Baldwin | Francis Bond | Adam Meyers | Shigeko Nariyama
Proceedings of the Workshop on Frontiers in Linguistically Annotated Corpora 2006

pdf
Annotation Compatibility Working Group Report
Adam Meyers
Proceedings of the Workshop on Frontiers in Linguistically Annotated Corpora 2006

2005

pdf bib
Proceedings of the Workshop on Frontiers in Corpus Annotations II: Pie in the Sky
Adam Meyers
Proceedings of the Workshop on Frontiers in Corpus Annotations II: Pie in the Sky

pdf bib
Introduction to Frontiers in Corpus Annotation II: Pie in the Sky
Adam Meyers
Proceedings of the Workshop on Frontiers in Corpus Annotations II: Pie in the Sky

pdf bib
Merging PropBank, NomBank, TimeBank, Penn Discourse Treebank and Coreference
James Pustejovsky | Adam Meyers | Martha Palmer | Massimo Poesio
Proceedings of the Workshop on Frontiers in Corpus Annotations II: Pie in the Sky

2004

pdf
NP-External Arguments: A Study of Argument Sharing in English
Adam Meyers | Ruth Reeves | Catherine Macleod
Proceedings of the Workshop on Multiword Expressions: Integrating Processing

pdf bib
Introduction to Frontiers in Corpus Annotation
Adam Meyers
Proceedings of the Workshop Frontiers in Corpus Annotation at HLT-NAACL 2004

pdf
The NomBank Project: An Interim Report
Adam Meyers | Ruth Reeves | Catherine Macleod | Rachel Szekely | Veronika Zielinska | Brian Young | Ralph Grishman
Proceedings of the Workshop Frontiers in Corpus Annotation at HLT-NAACL 2004

pdf
Discriminative Slot Detection Using Kernel Methods
Shubin Zhao | Adam Meyers | Ralph Grishman
COLING 2004: Proceedings of the 20th International Conference on Computational Linguistics

pdf
The Cross-Breeding of Dictionaries
Adam Meyers | Ruth Reeves | Catherine Macleod | Rachel Szekely | Veronika Zielinska | Brian Young
Proceedings of the Fourth International Conference on Language Resources and Evaluation (LREC’04)

pdf
Annotating Noun Argument Structure for NomBank
Adam Meyers | Ruth Reeves | Catherine Macleod | Rachel Szekely | Veronika Zielinska | Brian Young | Ralph Grishman
Proceedings of the Fourth International Conference on Language Resources and Evaluation (LREC’04)

2002

pdf
Formal Mechanisms for Capturing Regularizations
Adam Meyers | Ralph Grishman | Michiko Kosaka
Proceedings of the Third International Conference on Language Resources and Evaluation (LREC’02)

2001

pdf
Covering Treebanks with GLARF
A. Meyers | Ralph Grishman | Michiko Kosaka | Shubin Zhao
Proceedings of the ACL 2001 Workshop on Sharing Tools and Resources

2000

pdf
Chart-Based Transfer Rule Application in Machine Translation
Adam Meyers | Michiko Kosaka | Ralph Grishman
COLING 2000 Volume 1: The 18th International Conference on Computational Linguistics

1998

pdf
A multilingual procedure for dictionary-based sentence alignment
Adam Meyers | Michiko Kosaka | Ralph Grishman
Proceedings of the Third Conference of the Association for Machine Translation in the Americas: Technical Papers

This paper describes a sentence alignment technique based on a machine readable dictionary. Alignment takes place in a single pass through the text, based on the scores of matches between pairs of source and target sentences. Pairings consisting of sets of matches are evaluated using a version of the Gale-Shapely solution to the stable marriage problem. An algorithm is described which can handle N-to-1 (or 1-to-N) matches, for n ≥ 0, i.e., deletions, 1-to-1 (including scrambling), and 1-to-many matches. A simple frequency based method for acquiring supplemental dictionary entries is also discussed. We achieve high quality alignments using available bilingual dictionaries, both for closely related language pairs (Spanish/English) and more distantly related pairs (Japanese/English).

pdf
Deriving Transfer Rules from Dominance-Preserving Alignments
Adam Meyers | Roman Yangarber | Ralph Grishman | Catherine Macleod | Antonio Moreno-Sandoval
COLING 1998 Volume 2: The 17th International Conference on Computational Linguistics

pdf
Deriving Transfer Rules from Dominance-Preserving Alignments
Adam Meyers | Roman Yangarber | Ralph Grishman | Catherine Macleod | Antonio Moreno-Sandoval
36th Annual Meeting of the Association for Computational Linguistics and 17th International Conference on Computational Linguistics, Volume 2

pdf
Using NOMLEX to Produce Nominalization Patterns for Information Extraction
Adam Meyers | Catherine Macleod | Roman Yangarber | Ralph Grishman | Leslie Barrett | Ruth Reeves
The Computational Treatment of Nominals

1996

pdf
Alignment of Shared Forests for Bilingual Corpora
Adam Meyers | Roman Yangarber | Ralph Grishman
COLING 1996 Volume 1: The 16th International Conference on Computational Linguistics

pdf
The Influence of Tagging on the Classification of Lexical Complements
Catherine Macleod | Adam Meyers | Ralph Grishman
COLING 1996 Volume 1: The 16th International Conference on Computational Linguistics

1994

pdf
Comlex Syntax: Building a Computational Lexicon
Ralph Grishman | Catherine Macleod | Adam Meyers
COLING 1994 Volume 1: The 15th International Conference on Computational Linguistics

pdf
The Comlex Syntax Project: The First Year
Catherine Macleod | Ralph Grishman | Adam Meyers
Human Language Technology: Proceedings of a Workshop held at Plainsboro, New Jersey, March 8-11, 1994

1993

pdf
A Unification-Based Parser for Relational Grammar
David E. Johnson | Adam Meyers | Lawrence S. Moss
31st Annual Meeting of the Association for Computational Linguistics