Pleiades Geolocation Recognizer

This service identifies geographical named entities and disambiguates them against the Pleiades dataset.
The approach taken is to use all the names from each entry in Pleiades (that contains a representative point) to build a simple gazetteer. Locations which are ambiguous (i.e. those where multiple lookups overlap) are disambiguated using a geometrical approach. We assume that, in a similar way to word sense disambiguation, a document is likely to be discussing a single area, and so we choose the set of locations which minimise the area covered by the set of selected points; this is currently done by calculating axis aligned bounding boxes for efficiency purposes.
Note that this service is under active development and so results may vary over time.
Default annotations | |
:Location | An annotation which spans each recognised location. Each annotation contains contains a number of features related to the Pleiades entry:
|
Use this pipeline
You can process up to 150 documents per day free of charge using the REST API, at an average rate of 2 documents/sec. Higher quotas are available for research users by arrangement, contact us for details.
The API endpoint for this pipeline is: