Pleiades Geolocation Recognizer

This service identifies geographical named entities and disambiguates them against the Pleiades dataset.

The approach taken is to use all the names from each entry in Pleiades (that contains a representative point) to build a simple gazetteer. Locations which are ambiguous (i.e. those where multiple lookups overlap) are disambiguated using a geometrical approach. We assume that, in a similar way to word sense disambiguation, a document is likely to be discussing a single area, and so we choose the set of locations which minimise the area covered by the set of selected points; this is currently done by calculating axis aligned bounding boxes for efficiency purposes.

Note that this service is under active development and so results may vary over time.

Default annotations
:Location An annotation which spans each recognised location. Each annotation contains contains a number of features related to the Pleiades entry:
  • id, the ID of the location within Pleiades
  • lat, the latitude of the location
  • lon, the longitude of the location
  • types, a comma separated list of the location types as given in Pleiades
  • start, the start year associated with the name
  • end, the end year associated with the name
150 free requests / day
Batch processing not available

Use this pipeline

Single documents

You can process up to 150 documents per day free of charge using the REST API, at an average rate of 2 documents/sec. Higher quotas are available for research users by arrangement, contact us for details.

The API endpoint for this pipeline is:

https://cloud-api.gate.ac.uk/process/pleiades-ner

Create API Key