ChemDataExtractor

This service annotates chemical named entities using the open-source ChemDataExtractor.

More details on ChemDataExtractor, including a number of example texts, can be found in this publication. One simple example from the documentation is "UV-vis spectrum of 5,10,15,20-Tetra(4-carboxyphenyl)porphyrin in Tetrahydrofuran (THF)." This results in three annotations, covering 5,10,15,20-Tetra(4-carboxyphenyl)porphyrin, Tetrahydrofuran, and THF.

Note that the extracted information can be quite complex and is difficult, if not impossible, to display in full within the demo "Test This Pipeline" viewer. Full details of the extracted information can be seen by using the download link once results appear, or by calling the service via the standard REST API.

Default annotations
:ChemicalEntity Annotation spanning each extracted chemical named entity.
Additional annotations
:Sentence Annotation spanning each detected sentence.
:Token Annotation spanning each extracted token. Features include the part-of-speech category and normalized string.
1,200 free requests / day
Batch processing not available

Use this pipeline

Single documents

You can process up to 1,200 documents per day free of charge using the REST API, at an average rate of 2 documents/sec. Higher quotas are available for research users by arrangement; contact us for details.
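One way a client could stay within the stated rate is to space its requests at least half a second apart. The sketch below is a minimal client-side throttle under that assumption. The Throttle class is hypothetical and not part of any GATE library, and even spacing is a conservative reading of "average rate". It does not track the daily quota.

```python
import time


class Throttle:
    """Space calls so that at most max_per_sec of them start per second."""

    def __init__(self, max_per_sec=2.0, clock=time.monotonic, sleep=time.sleep):
        self.min_interval = 1.0 / max_per_sec
        self._clock = clock          # injectable so the class can be tested
        self._sleep = sleep
        self._next_allowed = None    # earliest time the next call may start

    def wait(self):
        """Block until the next request is allowed, then reserve a slot."""
        now = self._clock()
        if self._next_allowed is not None and now < self._next_allowed:
            self._sleep(self._next_allowed - now)
            now = self._next_allowed
        self._next_allowed = now + self.min_interval
```

Calling wait() immediately before each request keeps consecutive requests at least 0.5 seconds apart with the default setting.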

The API endpoint for this pipeline is:

https://cloud-api.gate.ac.uk/process/chemdataextractor
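A call to this endpoint might look like the following sketch, which uses only the Python standard library. Several details are assumptions to verify against the API documentation: that the document is sent as a plain-text POST body, that JSON is requested through the Accept header, and that the API key is supplied as HTTP Basic credentials (a key ID and a password). The helper names build_request and send are hypothetical.

```python
import base64
import json
import urllib.request

ENDPOINT = "https://cloud-api.gate.ac.uk/process/chemdataextractor"


def build_request(text, key_id=None, password=None):
    """Build (but do not send) a POST request carrying one plain-text document."""
    headers = {
        "Content-Type": "text/plain; charset=utf-8",  # assumed input format
        "Accept": "application/json",                 # assumed output format
    }
    if key_id and password:
        # Assumption: the API key is presented as HTTP Basic credentials.
        raw = f"{key_id}:{password}".encode("utf-8")
        headers["Authorization"] = "Basic " + base64.b64encode(raw).decode("ascii")
    return urllib.request.Request(
        ENDPOINT, data=text.encode("utf-8"), headers=headers, method="POST"
    )


def send(request, timeout=30):
    """Send the request and decode the JSON body of the response."""
    with urllib.request.urlopen(request, timeout=timeout) as resp:
        return json.load(resp)
```

For example, send(build_request("UV-vis spectrum of 5,10,15,20-Tetra(4-carboxyphenyl)porphyrin in Tetrahydrofuran (THF).", key_id, password)) would submit the example sentence from the documentation, with key_id and password standing in for your own credentials.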
