we at Max Planck Institue for Informatics are glad to make an improved version of YAGO
YAGO2 is a huge semantic knowledge base, derived from mostly from Wikipedia, but also from
WordNet and GeoNames. Currently, YAGO2 has knowledge of more than 10 million entities
(like persons, organizations, cities, etc.) and contains more than 80 million facts about
In YAGO2, we made an effort to treat time and location data as first-class citizen,
extending the basic triple model by special fields for time and location for querying.
Also, we took special care to consistently attach temporal and spatial data to all facts
where it is semantically meaningful and where time and location can be derived from
Wikipedia. Unlike many other automatically assembled knowledge bases, YAGO2 has a
confirmed accuracy of 95%.
You can download the complete data set at our website:
Your feedback is of course very much appreciated!