Twitter user classification

A pipeline that attempts to classify the author of a tweet as a person, location or organization, based on clues found in the "user" profile metadata within the tweet. Within each broad "major type", a number of narrower "minor type" categories are also used.

Output is given as an AuthorClassification annotation spanning the whole document. When Twitter JSON is selected as the output format, the classification is also added as a "gate_classification" property on the top-level "user" object in the tweet.
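As a rough illustration of consuming the Twitter JSON output, the sketch below reads the "gate_classification" property from the top-level "user" object. The shape of the property's value is not described here, so the code treats it as opaque; the function name and the sample tweet (including the placeholder value) are hypothetical.

```python
import json


def get_author_classification(tweet_json):
    """Return the value of user.gate_classification, or None if it is absent.

    The structure of the returned value is not assumed; it is passed
    through exactly as found in the JSON.
    """
    tweet = json.loads(tweet_json)
    if not isinstance(tweet, dict):
        return None
    user = tweet.get("user")
    if not isinstance(user, dict):
        return None
    return user.get("gate_classification")


# Hypothetical sample: the placeholder value stands in for whatever the
# pipeline actually writes and is NOT the real output format.
sample = json.dumps({
    "text": "example tweet",
    "user": {
        "screen_name": "example",
        "gate_classification": {"placeholder": "value"},
    },
})

print(get_author_classification(sample))
```

Returning None when the property is missing lets a caller distinguish tweets that were never classified from those that were, without raising on incomplete input.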

Note: because this pipeline operates on the user profile information and not on the actual text of the document, it can only run on documents that are in Twitter JSON format. It will not produce useful output on plain text or other non-JSON documents.
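Since the pipeline needs the "user" profile metadata, it may be worth filtering input before submitting it. The following is a minimal sketch of such a pre-check, assuming that a usable document is a JSON object with a top-level "user" object; the function name is hypothetical, and the pipeline's own handling of input may differ.

```python
import json


def has_user_profile(raw):
    """Return True if raw parses as a JSON object with a top-level "user" object."""
    try:
        doc = json.loads(raw)
    except (TypeError, ValueError):
        # Plain text and other non-JSON input ends up here
        # (json.JSONDecodeError is a subclass of ValueError).
        return False
    return isinstance(doc, dict) and isinstance(doc.get("user"), dict)


# Only documents carrying user metadata are worth sending to the pipeline.
candidates = [
    '{"text": "hello", "user": {"screen_name": "example"}}',
    "just some plain text",
    '{"text": "no user object here"}',
]
usable = [c for c in candidates if has_user_profile(c)]
print(len(usable))
```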

