James Clarke & Research

An NLP Curator (or: How I Learned to Stop Worrying and Love NLP Pipelines)

James Clarke, Vivek Srikumar, Mark Sammons, and Dan Roth. 2012. An NLP Curator (or: How I Learned to Stop Worrying and Love NLP Pipelines). In Proceedings of the Eighth International Conference on Language Resources and Evaluation (LREC'12), pages 3267–3283. Istanbul, Turkey.

Abstract

Natural Language Processing continues to grow in popularity in a range of research and commercial applications, yet managing the wide array of potential NLP components remains a difficult problem. This paper describes CURATOR, an NLP management framework designed to address some common problems and inefficiencies associated with building NLP process pipelines and EDISON, an NLP data structure library in Java that provide sstreamlined interactions with CURATOR and offers a range of useful supporting functionality.

Bibtex

@inproceedings{Clarke:Srikumar:Sammons:Roth:2012,
  author =       {James Clarke and Vivek Srikumar and Mark Sammons 
                  and Dan Roth},
  title =        {An NLP Curator (or: How I Learned to Stop Worrying 
                  and Love NLP Pipelines)},
  booktitle =    {Proceedings of the Eighth International Conference 
                  on Language Resources and Evaluation (LREC'12)},
  pages =        {3267--3283},
  year =         2012,
  address =      {Istanbul, Turkey},
  URL =          {http://jamesclarke.net/media/papers/clarke-etal-lrec12.pdf},
}