Hello everybody,
I am currently working on a project that would have the need to collect information about all the music artists present in the wikipedia database and I know it represent a lot of request to send.
It's the reason why I'm calling for your help.
Is this possible to do this without being blocked by the wikipedia servers ?
I looked at the api page to work with, the better idea I had would be to list every artist page I need with the help of the query action, look for the "lastrevid" key, and if it has changed, request the new page.
But still, the initial import would be very important.
How can I do that the proper way ?
I really appreciate for your help.
Thanks.
Cyril.
2010/6/2 Cyril Nicodème cx42net@gmail.com:
Hello everybody,
I am currently working on a project that would have the need to collect information about all the music artists present in the wikipedia database and I know it represent a lot of request to send.
It's the reason why I'm calling for your help.
Is this possible to do this without being blocked by the wikipedia servers ?
I looked at the api page to work with, the better idea I had would be to list every artist page I need with the help of the query action, look for the "lastrevid" key, and if it has changed, request the new page.
But still, the initial import would be very important.
How can I do that the proper way ?
I really appreciate for your help.
I recommend you do the initial import from a data dump instead: http://meta.wikimedia.org/wiki/Data_dumps
Roan Kattouw (Catrope)
That is great ! I didn't find that ! thanks to you :)
And what about the frequencies of my request to keep my database up to date ? (there only will be artist/albums/image) ?
Where can I find the rules that indicates how not being ban from the server ?
Is my original idea correct ? I would list all the data I have with the "pageid" to look for the "lastrevid", if it has changed, I will look for the page modifications.
Thank you for your help.
Cyril.
2010/6/2 Roan Kattouw roan.kattouw@gmail.com
2010/6/2 Cyril Nicodème cx42net@gmail.com:
Hello everybody,
I am currently working on a project that would have the need to collect information about all the music artists present in the wikipedia database and I know it represent a lot of request to send.
It's the reason why I'm calling for your help.
Is this possible to do this without being blocked by the wikipedia
servers ?
I looked at the api page to work with, the better idea I had would be to list every artist page I need with the help of the query action, look for the "lastrevid" key, and if it has changed, request the new page.
But still, the initial import would be very important.
How can I do that the proper way ?
I really appreciate for your help.
I recommend you do the initial import from a data dump instead: http://meta.wikimedia.org/wiki/Data_dumps
Roan Kattouw (Catrope)
Mediawiki-api mailing list Mediawiki-api@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/mediawiki-api
mediawiki-api@lists.wikimedia.org