First of all, my query will be: " https://en.wikipedia.org/w/api.php?action=query&format=xml&titles=List_of_cities_proper_by_population&prop=revisions&rvprop=content " (this page contains a list of the most populated cities in the world). The response is the whole Wikipedia page, and the page text is inside the <rev> tag. My task is then to parse the response text and extract the city names from the list. I would like to know whether there is any possibility of obtaining only the list of cities, since that would make it much easier to get them.
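To make this first option concrete, here is a minimal Python sketch of what I have in mind. It builds the same query, pulls the wikitext out of the <rev> element, and then guesses the city names. The helper names, the User-Agent string, and above all the heuristic (take the first wikilink in each table row, skipping File:-style links) are my own assumptions; I have not checked them against the real table layout of the page.

```python
import re
import urllib.parse
import urllib.request
import xml.etree.ElementTree as ET

API = "https://en.wikipedia.org/w/api.php"

def build_query_url(title):
    # Same parameters as the query quoted above.
    params = {
        "action": "query",
        "format": "xml",
        "titles": title,
        "prop": "revisions",
        "rvprop": "content",
    }
    return API + "?" + urllib.parse.urlencode(params)

def fetch(url):
    # The User-Agent value is a placeholder (assumption), replace it with your own.
    req = urllib.request.Request(url, headers={"User-Agent": "city-list-sketch/0.1"})
    with urllib.request.urlopen(req) as resp:
        return resp.read().decode("utf-8")

def wikitext_from_xml(xml_text):
    # The page text sits inside the first <rev> element of the response.
    rev = ET.fromstring(xml_text).find(".//rev")
    return rev.text if rev is not None and rev.text else ""

# Matches [[Target]], [[Target|label]] and [[Target#section|label]].
WIKILINK = re.compile(r"\[\[([^\]|#]+)(?:#[^\]|]*)?(?:\|[^\]]*)?\]\]")

def first_link_per_row(wikitext):
    # Assumed heuristic: the first plain wikilink in each table row is the city.
    names = []
    for row in wikitext.split("\n|-")[1:]:
        for match in WIKILINK.finditer(row):
            target = match.group(1).strip()
            if ":" not in target:  # skip File:, Image:, Category: links
                names.append(target)
                break
    return names

if __name__ == "__main__":
    url = build_query_url("List_of_cities_proper_by_population")
    print(first_link_per_row(wikitext_from_xml(fetch(url))))
```

This is the "more work" route: it depends on the wikitext layout of the page, so it may break whenever the table is edited.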
The second option is to use DBpedia. I found it while searching and it could also work, but my doubt is: when I request " dbpedia.org/data/Tokyo ", are the properties and values the same as on the Wikipedia page for the city? I have discovered that I can get the response in JSON or XML format. That way, I could get information about the city by looking up each tag or property, since they are well identified.
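As a rough sketch of this second option, the Python below reads one property of a city from the JSON form of the DBpedia data. I am assuming that the JSON is keyed first by the resource URI (http://dbpedia.org/resource/Tokyo) and then by property URI, with each value being a list of objects that carry a "value" field and sometimes a "lang" field. The helper names are mine, and whether a particular city actually has a property such as populationTotal is also an assumption.

```python
import json
import urllib.request

RESOURCE = "http://dbpedia.org/resource/"
DATA = "https://dbpedia.org/data/"
RDFS_LABEL = "http://www.w3.org/2000/01/rdf-schema#label"
DBO_POPULATION = "http://dbpedia.org/ontology/populationTotal"

def resource_name(city):
    # DBpedia resource names use underscores instead of spaces.
    return city.replace(" ", "_")

def fetch_city(city):
    # Downloads the JSON document for one city, e.g. .../data/Tokyo.json
    url = DATA + resource_name(city) + ".json"
    with urllib.request.urlopen(url) as resp:
        return json.load(resp)

def property_values(doc, city, prop, lang=None):
    # Look up subject -> property -> list of value objects (assumed layout).
    entries = doc.get(RESOURCE + resource_name(city), {}).get(prop, [])
    return [e["value"] for e in entries if lang is None or e.get("lang") == lang]

if __name__ == "__main__":
    tokyo = fetch_city("Tokyo")
    print(property_values(tokyo, "Tokyo", RDFS_LABEL, lang="en"))
    print(property_values(tokyo, "Tokyo", DBO_POPULATION))
```

If this layout holds, each piece of information is reachable by its property URI, which is what I mean by "well identified".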
I haven't found a way to obtain more detailed, structured information about an entity, in this case a city (I would like to have the main information organized in tags, so that it is easier to extract the results). Failing that, the other option is to follow the process I have explained; it requires more work but would also be valid.
Thanks for your time.
Yours sincerely,
Guillem.