Abstract
Natural Language Processing continues to grow in popularity in a range of research and commercial applications, yet managing the wide array of potential NLP components remains a difficult problem. This paper describes Curator, an NLP management framework designed to address some common problems and inefficiencies associated with building NLP process pipelines; and Edison, an NLP data structure library in Java that provides streamlined interactions with Curator and offers a range of useful supporting functionality.- Anthology ID:
- L12-1388
- Volume:
- Proceedings of the Eighth International Conference on Language Resources and Evaluation (LREC'12)
- Month:
- May
- Year:
- 2012
- Address:
- Istanbul, Turkey
- Venue:
- LREC
- SIG:
- Publisher:
- European Language Resources Association (ELRA)
- Note:
- Pages:
- 3276–3283
- Language:
- URL:
- http://www.lrec-conf.org/proceedings/lrec2012/pdf/664_Paper.pdf
- DOI:
- Cite (ACL):
- James Clarke, Vivek Srikumar, Mark Sammons, and Dan Roth. 2012. An NLP Curator (or: How I Learned to Stop Worrying and Love NLP Pipelines). In Proceedings of the Eighth International Conference on Language Resources and Evaluation (LREC'12), pages 3276–3283, Istanbul, Turkey. European Language Resources Association (ELRA).
- Cite (Informal):
- An NLP Curator (or: How I Learned to Stop Worrying and Love NLP Pipelines) (Clarke et al., LREC 2012)
- PDF:
- http://www.lrec-conf.org/proceedings/lrec2012/pdf/664_Paper.pdf