Noun Phrase Chunker
An implementation of the Ramshaw and Marcus BaseNP chunker, which marks noun phrases with a NounChunk annotation. This application also includes a tokeniser, sentence splitter and POS tagger as these are required by the chunking algorithm.
Default annotations | |
:NounChunk | Noun chunks discovered by the chunker |
Additional annotations available if selected | |
:Token | The individual tokens of the text, with "category" feature for POS |
:SpaceToken | The spaces between tokens |
:Sentence | Sentences detected by the sentence splitter |
Use this pipeline
You can process up to 1,200 documents per day free of charge using the REST API, at an average rate of 2 documents/sec. Higher quotas are available for research users by arrangement, contact us for details.
The API endpoint for this pipeline is:
You can process any amount of data with this pipeline on a pay-as-you-go basis, for £0.80 per hour. This can be data you upload yourself, data you collected from Twitter, or the results of a previous job.