PyWikipedians,
Regardless of whether this talk gets accepted at Wikimania, we would like
to talk with anyone interested in building this kind of bot. Please
contact me off-list if this is your kind of thing.
-jrf
http://wikimania2012.wikimedia.org/wiki/Submissions/TREC-KBA-Mining-Content…
TREC KBA - Mining Content Streams to Recommend Page Updates to Editors
Abstract: We have organized a new session in NIST's Text Retrieval
Conference (TREC) called Knowledge Base Acceleration (KBA). TREC KBA
challenges computer science researchers to develop algorithms that mine
content streams, such as news and blogs, to recommend edits to knowledge
bases (KB), such as Wikipedia. We consider a KB to be "large" if the
number of entities described by the KB is larger than the number of humans
maintaining the KB. As entities change and evolve in the real world, large
KBs often lag behind by months or years. Such large KBs are an
increasingly important tool in several industries, including biomedical
research, law enforcement, and financial services. TREC KBA aims to
develop algorithms for helping KB editors stay abreast of changes to the
organizations, people, proteins, and other entities described by their
KBs. In this talk, we will give an overview of the TREC KBA data sets and
tasks for 2012 and future years. In addition to developing text analytics,
we are also working on a wikipedia bot for connecting KBA-type systems to
users' talk pages in mediawiki. After presenting the current state of our
bot development, we hope to engage the audience in an open discussion
about how such algorithms might be most fruitfully employed in the
Wikipedia community.
http://trec-kba.org/
(If you want this talk to get accepted in Wikimania this July, consider
putting your name on the "interested" list in the wiki linked above.)
Is it possible for Pywikipediabot to access the number of editors of a
certain page? Or at least, whether there has been more than one editor? If
not, is there another tool I can use to create a list of pages on a wiki
with only one editor?
I think there are quite a few old spam pages on our wiki, mainly in "User
talk" space. A helpful script to identify them would be of the form:
If (number of editors == 1) then replace "(*.http://.*)" with
"{{delete\|tagged by ChriswaterguyBot as possible spam.}}\\1"
Or, if there's another tool that can do the first part (produce a list of
pages with only one editor) then replace.py can do the rest. Any
suggestions?
(If we could also check whether that user has edited any other pages, that
would be wonderful... but that might be asking a bit much.)
Thanks
--
Chris Watkins
Appropedia.org - Sharing knowledge to build rich, sustainable lives.
Hi,
we have a diligent editor who puts defaultsorts and categories onto file
description pages whose files are in Commons, not in huwiki. We tend to
remove these cats and defsorts. So we have to review description pages of
non-existing files for categories. Do we have any tool for this task, or do
I have to write it from the base?
--
Bináris
---------- Forwarded message ----------
From: Sumana Harihareswara <sumanah(a)wikimedia.org>
Date: Sat, Mar 10, 2012 at 11:50
Subject: Re: [Toolserver-l] Submit a talk to Open Source Bridge by Mar
16th, get WMF subsidy
To: Wikimedia developers <wikitech-l(a)lists.wikimedia.org>,
toolserver-l(a)lists.wikimedia.org, wiki-research-l(a)lists.wikimedia.org,
mediawiki-enterprise(a)lists.wikimedia.org
On 03/10/2012 08:25 AM, Sumana Harihareswara wrote:
> This year I want the Wikimedia technical community to have a strong
> presence at Open Source Bridge <http://opensourcebridge.org/> in
> Portland, Oregon, USA, June 26-29. OSB is tech talks & hack sessions
> with hands-on technologists we want, for Foundation staff recruiting
> (the Portland tech scene has good people looking for jobs) and for
> volunteer recruiting and collaboration (tons of Mozilla people went
> there last year). Good talks, clueful people, great food. :-)
>
> If you submit a talk and it gets accepted, tell me, and Wikimedia
> Foundation will partially subsidize or fully pay for your flight and
> hotel. If you submit a talk and it doesn't get accepted but you still
> want to go, talk with me and I'll see what I can do.
>
> Call for talks:
> <http://opensourcebridge.org/blog/2012/01/announcing-the-2012-call-for-propo…>
> Ideas: the parser rewrite, Wikimedia Labs, how we scale and optimize
> performance on a shoestring budget, our git/gerrit migration, securing
> PHP-based webapps, various approaches to making our data more
> structured/semantic, collaborative design, lessons from our communities,
> JS hacks, hetdeploy, offline/mobile, geodata...
>
> Please forward.
(I should add -- also there would be complications if somehow lots of
Wikimedia people get their talks accepted and I can't budget to
subsidize all of them, or you can't get a visa to the US in time, or
some unforeseen thing comes up like that. But I don't predict those
problems happening.)
--
Sumana Harihareswara
Volunteer Development Coordinator
Wikimedia Foundation
-----BEGIN PGP SIGNED MESSAGE-----
Hash: SHA1
- -------- Original Message --------
Subject: [Toolserver-l] Submit a talk to Open Source Bridge by Mar
16th, get WMF subsidy
Date: Sat, 10 Mar 2012 08:25:34 -0800
From: Sumana Harihareswara <sumanah(a)wikimedia.org>
Reply-To: toolserver-l(a)lists.wikimedia.org
Organization: Wikimedia Foundation
To: Wikimedia developers <wikitech-l(a)lists.wikimedia.org>,
toolserver-l(a)lists.wikimedia.org, wiki-research-l(a)lists.wikimedia.org,
mediawiki-enterprise(a)lists.wikimedia.org
This year I want the Wikimedia technical community to have a strong
presence at Open Source Bridge <http://opensourcebridge.org/> in
Portland, Oregon, USA, June 26-29. OSB is tech talks & hack sessions
with hands-on technologists we want, for Foundation staff recruiting
(the Portland tech scene has good people looking for jobs) and for
volunteer recruiting and collaboration (tons of Mozilla people went
there last year). Good talks, clueful people, great food. :-)
If you submit a talk and it gets accepted, tell me, and Wikimedia
Foundation will partially subsidize or fully pay for your flight and
hotel. If you submit a talk and it doesn't get accepted but you still
want to go, talk with me and I'll see what I can do.
Call for talks:
<http://opensourcebridge.org/blog/2012/01/announcing-the-2012-call-for-propo…>
Ideas: the parser rewrite, Wikimedia Labs, how we scale and optimize
performance on a shoestring budget, our git/gerrit migration, securing
PHP-based webapps, various approaches to making our data more
structured/semantic, collaborative design, lessons from our communities,
JS hacks, hetdeploy, offline/mobile, geodata...
Please forward.
- --
Sumana Harihareswara
Volunteer Development Coordinator
Wikimedia Foundation
_______________________________________________
Toolserver-l mailing list (Toolserver-l(a)lists.wikimedia.org)
https://lists.wikimedia.org/mailman/listinfo/toolserver-l
Posting guidelines for this list:
https://wiki.toolserver.org/view/Mailing_list_etiquette
-----BEGIN PGP SIGNATURE-----
Version: GnuPG v1.4.12 (GNU/Linux)
Comment: Using GnuPG with Mozilla - http://enigmail.mozdev.org/
iEYEARECAAYFAk9bhVoACgkQAXWvBxzBrDDacwCfTBVnrKwJ24lrFeoJHHm09G4x
MncAoN//GXwXHl57A6cL/TYlgntXm2IV
=3yo7
-----END PGP SIGNATURE-----
I have a local wiki on my computer, Pywikibot naturally supplied. :-)
I want to tell the bot not to sleep when it is time to work. I will sleep
in the night and that's what I expect the bot to do.
How can I tell the bot in config that for 'local' family it should
*never*sleep and wait and throttle and lag?
--
Bináris