Hi there,
we are doing information retrieval research on the Wikipedia history. Currently we are thinking about including the archive of Deleted Articles in the analysis.
What is the current regulation on access to the Deleted Archive? According to this page, admins can permit access to single articles on request: http://en.wikipedia.org/wiki/Wikipedia:Deletion_policy#Access_to_deleted_pag...
However, is there a way (for researchers) to either: a) access (API) or download the whole archive of Deleted Articles b) get statistics or meta data about the Deleted Archive (article counts, revision meta information, logs) ?
Kind regards, Katja Mueller
There is a precedent for this which includes being granted the "researcher" user right. See http://en.wikipedia.org/wiki/Wikipedia:Researchers#Researcher
You probably need to contact the Research Committee:
http://meta.wikimedia.org/wiki/Research:Committee
Tom
On 11 December 2011 12:32, Katja Müller Katja_Mueller@lavabit.com wrote:
Hi there,
we are doing information retrieval research on the Wikipedia history. Currently we are thinking about including the archive of Deleted Articles in the analysis.
What is the current regulation on access to the Deleted Archive? According to this page, admins can permit access to single articles on request:
http://en.wikipedia.org/wiki/Wikipedia:Deletion_policy#Access_to_deleted_pag...
However, is there a way (for researchers) to either: a) access (API) or download the whole archive of Deleted Articles b) get statistics or meta data about the Deleted Archive (article counts, revision meta information, logs) ?
Kind regards, Katja Mueller
Wikitech-l mailing list Wikitech-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikitech-l
Ok, thanks - so this is the way to get access to the titles of deleted pages (no revision content)? Or is there more meta information available for "researcher" status users?
Kind regards, Katja
Am 11.12.2011 13:38, schrieb Thomas Morton:
There is a precedent for this which includes being granted the "researcher" user right. See http://en.wikipedia.org/wiki/Wikipedia:Researchers#Researcher
You probably need to contact the Research Committee:
http://meta.wikimedia.org/wiki/Research:Committee
Tom
On 11 December 2011 12:32, Katja MüllerKatja_Mueller@lavabit.com wrote:
Hi there,
we are doing information retrieval research on the Wikipedia history. Currently we are thinking about including the archive of Deleted Articles in the analysis.
What is the current regulation on access to the Deleted Archive? According to this page, admins can permit access to single articles on request:
http://en.wikipedia.org/wiki/Wikipedia:Deletion_policy#Access_to_deleted_pag...
However, is there a way (for researchers) to either: a) access (API) or download the whole archive of Deleted Articles b) get statistics or meta data about the Deleted Archive (article counts, revision meta information, logs) ?
Kind regards, Katja Mueller
Wikitech-l mailing list Wikitech-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikitech-l
Wikitech-l mailing list Wikitech-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikitech-l
Sonnerie Bbs qui rigole pour votre mobile Tlcharger maintenant http://click.lavabit.com/wngjt6cihdwgi8ixey1mn11i8fe6mb7ra6qcbuu4hto4rg9t8cr... ____________________________________________________________________________________
Ok, thanks - so this is the way to get access to the titles of deleted pages (no revision content)? Or is there more meta information available for "researcher" status users?
Kind regards, Katja
Am 11.12.2011 13:38, schrieb Thomas Morton:
There is a precedent for this which includes being granted the "researcher" user right. See http://en.wikipedia.org/wiki/Wikipedia:Researchers#Researcher
You probably need to contact the Research Committee:
http://meta.wikimedia.org/wiki/Research:Committee
Tom
On 11 December 2011 12:32, Katja MüllerKatja_Mueller@lavabit.com wrote:
Hi there,
we are doing information retrieval research on the Wikipedia history. Currently we are thinking about including the archive of Deleted Articles in the analysis.
What is the current regulation on access to the Deleted Archive? According to this page, admins can permit access to single articles on request:
http://en.wikipedia.org/wiki/Wikipedia:Deletion_policy#Access_to_deleted_pag...
However, is there a way (for researchers) to either: a) access (API) or download the whole archive of Deleted Articles b) get statistics or meta data about the Deleted Archive (article counts, revision meta information, logs) ?
Kind regards, Katja Mueller
Wikitech-l mailing list Wikitech-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikitech-l
Wikitech-l mailing list Wikitech-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikitech-l
Sonnerie Bbs qui rigole pour votre mobile Tlcharger maintenant http://click.lavabit.com/wngjt6cihdwgi8ixey1mn11i8fe6mb7ra6qcbuu4hto4rg9t8cr... ____________________________________________________________________________________
On Sun, Dec 11, 2011 at 3:34 PM, Katja Müller Katja_Mueller@lavabit.com wrote:
Ok, thanks - so this is the way to get access to the titles of deleted pages (no revision content)? Or is there more meta information available for "researcher" status users?
The titles of deleted pages (as well as when they were deleted, by whom and why) can be found in the deletion log, see http://en.wikipedia.org/wiki/Special:Log/delete .
Roan
I have also wanted this for a long time. http://www.petitiononline.com/urmwpnow/petition.html http://undeletewikipedia.blogspot.com/2009/10/clarity-in-petition.html mike
On Sun, Dec 11, 2011 at 1:32 PM, Katja Müller Katja_Mueller@lavabit.com wrote:
Hi there,
we are doing information retrieval research on the Wikipedia history. Currently we are thinking about including the archive of Deleted Articles in the analysis.
What is the current regulation on access to the Deleted Archive? According to this page, admins can permit access to single articles on request: http://en.wikipedia.org/wiki/Wikipedia:Deletion_policy#Access_to_deleted_pag...
However, is there a way (for researchers) to either: a) access (API) or download the whole archive of Deleted Articles b) get statistics or meta data about the Deleted Archive (article counts, revision meta information, logs) ?
Kind regards, Katja Mueller
Wikitech-l mailing list Wikitech-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikitech-l
On 11/12/11 13:38, Mike Dupont wrote:
I have also wanted this for a long time. http://www.petitiononline.com/urmwpnow/petition.html http://undeletewikipedia.blogspot.com/2009/10/clarity-in-petition.html mike
I don't think any sysop would reject a reasonable petition of a good wikipedian of the content of an article he previously wrote.
("good wikipedian" meaning it's not an instance of "I want that copyvio content to recreate it from sockpuppet accounts using proxies")
There are many articles that have been deleted, and we dont even know the full list of them.
That's not true. The list of deleted pages is available at http://en.wikipedia.org/wiki/Special:Log/delete For very old deletions (before December 23, 2004) see http://en.wikipedia.org/wiki/Wikipedia:Deletion_log
However, for most articles the deleted text would (should?) be in the lines of Article: John Smith Content: He's fat child in classroom.
In other cases they may be well-written articles which violate the copyright of eg. Encarta. So they can't be shown either.
You seem to be targetting good articles deleted due to the target being non-notable, and you indeed have a point for them. You could launch a project to host those files if you wish to (I think there was already one doing it? at least many wikis have spun off to their own wiki about their topic).
On 11 December 2011 21:43, Platonides Platonides@gmail.com wrote:
I don't think any sysop would reject a reasonable petition of a good wikipedian of the content of an article he previously wrote. ("good wikipedian" meaning it's not an instance of "I want that copyvio content to recreate it from sockpuppet accounts using proxies")
In general, there's no problem on en:wp with giving people copies of uncontroversial deleted content. (Not copyvio, not BLP violation, possibly other similar rules.)
- d.
On Sun, Dec 11, 2011 at 10:43 PM, Platonides Platonides@gmail.com wrote:
On 11/12/11 13:38, Mike Dupont wrote:
I have also wanted this for a long time. http://www.petitiononline.com/urmwpnow/petition.html http://undeletewikipedia.blogspot.com/2009/10/clarity-in-petition.html mike
I don't think any sysop would reject a reasonable petition of a good wikipedian of the content of an article he previously wrote.
("good wikipedian" meaning it's not an instance of "I want that copyvio content to recreate it from sockpuppet accounts using proxies")
There are many articles that have been deleted, and we dont even know the full list of them.
That's not true. The list of deleted pages is available at http://en.wikipedia.org/wiki/Special:Log/delete For very old deletions (before December 23, 2004) see http://en.wikipedia.org/wiki/Wikipedia:Deletion_log
However, for most articles the deleted text would (should?) be in the lines of Article: John Smith Content: He's fat child in classroom.
In other cases they may be well-written articles which violate the copyright of eg. Encarta. So they can't be shown either.
You seem to be targetting good articles deleted due to the target being non-notable, and you indeed have a point for them. You could launch a project to host those files if you wish to (I think there was already one doing it? at least many wikis have spun off to their own wiki about their topic).
Well, I dont know what to say, except that I put work into my articles, and having them deleted hurts. I would like to have them in my userspace at least.
mike
On 11/12/11 22:44, Mike Dupont wrote:
Well, I dont know what to say, except that I put work into my articles, and having them deleted hurts. I would like to have them in my userspace at least.
mike
I understand you. Having them in your user space may be a bit controversial (I see arguments for both for and against) but there would certainly be no problem in you publishing them in eg. your own blog.
I can understand that about the user space. good point about the blog.
mike
On Sun, Dec 11, 2011 at 11:38 PM, Platonides Platonides@gmail.com wrote:
On 11/12/11 22:44, Mike Dupont wrote:
Well, I dont know what to say, except that I put work into my articles, and having them deleted hurts. I would like to have them in my userspace at least.
mike
I understand you. Having them in your user space may be a bit controversial (I see arguments for both for and against) but there would certainly be no problem in you publishing them in eg. your own blog.
Wikitech-l mailing list Wikitech-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikitech-l
I'm not sure if we like it (in all cases?), but have to mention this exists as well:
http://deletionpedia.dbatley.com
Hi Katja,
Op 11-12-2011 13:32, Katja Müller schreef:
Hi there,
we are doing information retrieval research on the Wikipedia history. Currently we are thinking about including the archive of Deleted Articles in the analysis.
What is the current regulation on access to the Deleted Archive? According to this page, admins can permit access to single articles on request: http://en.wikipedia.org/wiki/Wikipedia:Deletion_policy#Access_to_deleted_pag...
However, is there a way (for researchers) to either: a) access (API) or download the whole archive of Deleted Articles b) get statistics or meta data about the Deleted Archive (article counts, revision meta information, logs) ?
If you just want the metadata (and not the actual text) a Toolserver account might be useful. In the archive table you can find all metadata for deleted revisions. More info about this table at https://www.mediawiki.org/wiki/Manual:Archive_table . And about getting a Toolserver account: https://wiki.toolserver.org/view/Account_approval_process
Maarten
Hi,
thanks to all for the helpful responses!
Kind regards, Katja Müller
wikitech-l@lists.wikimedia.org