ChemDataExtractor
This services annotates chemical named entities using the open source ChemDataExtractor.
More details on ChemDataExtractor, including a number of example texts, can be found in this publication. One simple example, from the documentation would be "UV-vis spectrum of 5,10,15,20-Tetra(4-carboxyphenyl)porphyrin in Tetrahydrofuran (THF)." which results in three annotations over 5,10,15,20-Tetra(4-carboxyphenyl)porphyrin, Tetrahydrofuran, and THF.
Note that the information extracted can be quite complex and is difficult, if not impossible, to easily display within the demo "Test This Pipeline" viewer. Full details of the extracted information can be seen by using the download link once results appear, or by calling the service via the standard REST API.
Default annotations | |
:ChemicalEntity | Annotation spanning each extracted chemical named entity. |
Additional annotations | |
:Sentence | Annotation spanning each detected sentence. |
:Token | Annotation spanning each extracted token. Features include the part-of-speech category and normalized string. |
Use this pipeline
You can process up to 1,200 documents per day free of charge using the REST API, at an average rate of 2 documents/sec. Higher quotas are available for research users by arrangement, contact us for details.
The API endpoint for this pipeline is: