Hi,
While using the Wikipedia API I noticed an odd behavior. It seems that I'm only able to retrive the size of a revision in some revisions.
Works fine: http://en.wikipedia.org/w/api.php?action=query&titles=Porto&prop=rev...
Size does not shows up although the call is the same: http://en.wikipedia.org/w/api.php?action=query&titles=Porto&prop=rev...
It seems like a bug. Is it? I do know if Wikipedia is using the latest version of the Wikimedia API. Maybe this is already corrected in the development repository.
Btw, size is given in bytes. Correct?
Can you please comment on this issue?
Thanks! -- Sérgio Nunes
On 11/28/07, Roan Kattouw roan.kattouw@home.nl wrote:
S. Nunes schreef:
It seems like a bug. Is it?
Not really. Revision size was added to the database in version 1.10, so for ancient (pre-1.10) revisions, there is no size in the database.
Thanks for the quick answer. Can't you rebuilt the sizes for the old entries?
When was version 1.10 added to Wikipedia?
Regards, -- Sérgio Nunes
Hi,
I'm trying to visualize the Wikipedia revision history, so having the size for all older versions would be great and serve me :) I think that digging the Wikipedia past will be increasingly important.
Maybe starting a "low volume" slow batch job would not spend a lot of resources.
It is not crucial for me but would surely be a plus for "wikimedia mining" tasks.
Regards, -- Sérgio Nunes
On 11/28/07, Roan Kattouw roan.kattouw@home.nl wrote:
S. Nunes schreef:
Thanks for the quick answer. Can't you rebuilt the sizes for the old entries?
No, that'd require an insane amount of work for the servers and wouldn't serve any real purpose.
When was version 1.10 added to Wikipedia?
In early May 2007.
Roan Kattouw (Catrope)
Mediawiki-api mailing list Mediawiki-api@lists.wikimedia.org http://lists.wikimedia.org/mailman/listinfo/mediawiki-api
On Nov 28, 2007 3:56 PM, Roan Kattouw roan.kattouw@home.nl wrote:
Can't you rebuilt the sizes for the old entries?
No, that'd require an insane amount of work for the servers and wouldn't serve any real purpose.
Maybe you could consider computing the size of a revision only when it is specifically requested (and then store it). This would surely lighten the load of the servers and eventually all old entries would have the size computed.
-- Sérgio Nunes
Please file request to run a size recompute on the entire wiki as a bugzilla request (not in the API category) - admins tend to run this kinds of scripts once in a while. It will be much easier then updating databases on the fly during "read" queries.
On Nov 30, 2007 12:30 PM, S. Nunes snunes@gmail.com wrote:
On Nov 28, 2007 3:56 PM, Roan Kattouw roan.kattouw@home.nl wrote:
Can't you rebuilt the sizes for the old entries?
No, that'd require an insane amount of work for the servers and wouldn't serve any real purpose.
Maybe you could consider computing the size of a revision only when it is specifically requested (and then store it). This would surely lighten the load of the servers and eventually all old entries would have the size computed.
-- Sérgio Nunes
Mediawiki-api mailing list Mediawiki-api@lists.wikimedia.org http://lists.wikimedia.org/mailman/listinfo/mediawiki-api
Regarding this issue, please consider voting on this bug:
http://bugzilla.wikimedia.org/show_bug.cgi?id=12188
Having access to all size values would be important for my research.
Thanks! -- Sérgio Nunes
On Nov 30, 2007 5:34 PM, Yuri Astrakhan yuriastrakhan@gmail.com wrote:
Please file request to run a size recompute on the entire wiki as a bugzilla request (not in the API category) - admins tend to run this kinds of scripts once in a while. It will be much easier then updating databases on the fly during "read" queries.
On Nov 30, 2007 12:30 PM, S. Nunes snunes@gmail.com wrote:
On Nov 28, 2007 3:56 PM, Roan Kattouw roan.kattouw@home.nl wrote:
Can't you rebuilt the sizes for the old entries?
No, that'd require an insane amount of work for the servers and wouldn't serve any real purpose.
Maybe you could consider computing the size of a revision only when it is specifically requested (and then store it). This would surely lighten the load of the servers and eventually all old entries would have the size computed.
-- Sérgio Nunes
Mediawiki-api mailing list Mediawiki-api@lists.wikimedia.org http://lists.wikimedia.org/mailman/listinfo/mediawiki-api
Mediawiki-api mailing list Mediawiki-api@lists.wikimedia.org http://lists.wikimedia.org/mailman/listinfo/mediawiki-api
Roan Kattouw wrote:
S. Nunes schreef:
It seems like a bug. Is it?
Not really. Revision size was added to the database in version 1.10, so for ancient (pre-1.10) revisions, there is no size in the database.
Surely for consistency this feature should either be disabled or a maintenance script run to generate the values for old revisions (preferably the latter). Also pre-1.10 revisions are not particularly "ancient" - we have to remember that MediaWiki and it's api is not used solely by Wikipedia. Some projects are stale for months at a time so pre-1.10 revisions would still be relevant.
MinuteElectron.
mediawiki-api@lists.wikimedia.org