CYMRIE Welsh Named Entity Recognizer

CYMRIE is a named entity recognition pipeline for the Welsh language. It identifies basic entity types such as Person, Location and Organization, as well as monetary amounts and time and date expressions.

CYMRIE is a Welsh-language version of ANNIE, GATE's prototypical information extraction pipeline. It is part of the Welsh Natural Language Toolkit, a project funded by the Welsh Government. CYMRIE is distributed with the GATE framework.

Default annotations
:Person Standard named entity types
:Location
:Organization
:Date
:Address Includes email and IP addresses as well as street addresses
Additional annotations available if selected
:Money Monetary amounts
:Percent Expressions representing percentages
:Token The individual tokens of the text, with a "category" feature holding the part-of-speech (POS) tag
:SpaceToken The spaces between tokens
:Sentence Sentences detected by the sentence splitter
1,200 free requests / day
Larger batches £0.80 / CPU hour

Use this pipeline

Single documents

You can process up to 1,200 documents per day free of charge using the REST API, at an average rate of 2 documents/sec. Higher quotas are available for research users by arrangement; contact us for details.
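A client can stay inside the free allowance by pacing its own calls. The following is a minimal sketch only: the class name `QuotaPacer` and its parameters are hypothetical, it spaces requests evenly (more conservative than the "average" rate quoted above), and it does not know when the service resets its daily count, so a fresh pacer would have to be created for each day.

```python
import time


class QuotaPacer:
    """Hypothetical helper: spaces calls out and stops at a daily limit."""

    def __init__(self, rate_per_sec=2.0, daily_limit=1200,
                 clock=time.monotonic, sleep=time.sleep):
        self.min_interval = 1.0 / rate_per_sec  # seconds between requests
        self.daily_limit = daily_limit
        self.clock = clock    # injectable so the pacer can be tested
        self.sleep = sleep
        self.sent = 0
        self.last = None      # time of the previous request, if any

    def wait_turn(self):
        """Wait until the next request may go out.

        Returns False, without waiting, once the daily limit is reached.
        """
        if self.sent >= self.daily_limit:
            return False
        now = self.clock()
        if self.last is not None:
            remaining = self.min_interval - (now - self.last)
            if remaining > 0:
                self.sleep(remaining)
                now = self.clock()
        self.last = now
        self.sent += 1
        return True
```

Call `wait_turn()` before each request and stop submitting documents when it returns False.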

The API endpoint for this pipeline is:

https://cloud-api.gate.ac.uk/process-document/cymrie-welsh-named-entity-recogniser
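As an illustration, a document could be sent to this endpoint along the following lines. This is a sketch under stated assumptions, not a definitive client: it assumes the service accepts a plain-text POST body, authenticates with HTTP Basic using an API key ID and password, takes an optional `annotations` query parameter containing selectors such as `:Person`, and returns JSON with a `text` string and an `entities` map whose entries carry `indices` character offsets. The function names are hypothetical. Check the API documentation before relying on any of these details.

```python
import base64
import json
import urllib.parse
import urllib.request

ENDPOINT = ("https://cloud-api.gate.ac.uk/process-document/"
            "cymrie-welsh-named-entity-recogniser")


def build_request(text, key_id, password, annotations=None):
    """Build (but do not send) a POST request for one document.

    Assumptions: HTTP Basic auth with an API key, a plain-text body,
    and an 'annotations' query parameter listing the selectors wanted.
    """
    url = ENDPOINT
    if annotations:
        url += "?" + urllib.parse.urlencode(
            {"annotations": ",".join(annotations)})
    token = base64.b64encode(
        f"{key_id}:{password}".encode("utf-8")).decode("ascii")
    return urllib.request.Request(
        url,
        data=text.encode("utf-8"),
        method="POST",
        headers={
            "Content-Type": "text/plain; charset=UTF-8",
            "Accept": "application/json",
            "Authorization": "Basic " + token,
        },
    )


def extract_entities(response):
    """Turn the assumed JSON response shape into (type, text) pairs."""
    text = response.get("text", "")
    found = []
    for ann_type, spans in response.get("entities", {}).items():
        for span in spans:
            start, end = span["indices"]  # assumed character offsets
            found.append((ann_type, text[start:end]))
    return found


def annotate(text, key_id, password, annotations=None):
    """Send one document and return the entities found (network call)."""
    req = build_request(text, key_id, password, annotations)
    with urllib.request.urlopen(req) as resp:
        return extract_entities(json.load(resp))
```

For example, `annotate("Mae Caerdydd yng Nghymru.", key_id, password, [":Person", ":Location"])` would request only Person and Location annotations, using your own key ID and password.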

Batches of documents

You can process any amount of data with this pipeline on a pay-as-you-go basis, for £0.80 per CPU hour. The input can be data you upload yourself, data you have collected from Twitter, or the results of a previous job.
