----- Original Message ----- From: wiki-research-l-request@lists.wikimedia.org To: wiki-research-l@lists.wikimedia.org Sent: Wednesday, May 19, 2010 7:40 PM Subject: Wiki-research-l Digest, Vol 57, Issue 2
Send Wiki-research-l mailing list submissions to wiki-research-l@lists.wikimedia.org
To subscribe or unsubscribe via the World Wide Web, visit https://lists.wikimedia.org/mailman/listinfo/wiki-research-l or, via email, send a message with subject or body 'help' to wiki-research-l-request@lists.wikimedia.org
You can reach the person managing the list at wiki-research-l-owner@lists.wikimedia.org
When replying, please edit your Subject line so it is more specific than "Re: Contents of Wiki-research-l digest..."
Today's Topics:
- Re: Analyses of Wikipedia quality (Guillaume Paumier)
- Re: Analyses of Wikipedia quality (Piotr Konieczny)
- Final CfP COLING 2010 - 2nd Workshop on Collaboratively Constructed Semantic Resources (Torsten Zesch)
- Balisage 2010 Contest - Solve the Modern Tower of Babel (Eric Bloch)
- Anyone interested in a Wikimania panel on research ethics? (R.Stuart Geiger)
- Re: Anyone interested in a Wikimania panel onresearch ethics? (Fuster, Mayo)
Message: 1 Date: Tue, 11 May 2010 10:26:42 -0700 From: Guillaume Paumier guillom.pom@gmail.com Subject: Re: [Wiki-research-l] Analyses of Wikipedia quality To: Research into Wikimedia content and communities wiki-research-l@lists.wikimedia.org Message-ID: AANLkTildJmrZOU8TOZFIxvefeeXrWTDnlxyj4Hj4X9lA@mail.gmail.com Content-Type: text/plain; charset=UTF-8
Greetings,
On Mon, May 10, 2010 at 6:48 PM, Adrianne Wadewitz wadewitz@gmail.com wrote:
I am looking at submitting a grant so that I can work on an analyses of en.wikipedia's coverage of art, literature, and history articles - their accuracy, etc. I am looking to establish a complete bibliography of all of the "quality" assessments that have been done about Wikipedia - on any language. Could we pool our knowledge?
I would advise to take a look at http://www.citeulike.org/group/382/tag/quality
-- Guillaume Paumier [[m:User:guillom]] http://www.gpaumier.org
Message: 2 Date: Tue, 11 May 2010 13:50:04 -0400 From: Piotr Konieczny piokon@post.pl Subject: Re: [Wiki-research-l] Analyses of Wikipedia quality To: Research into Wikimedia content and communities wiki-research-l@lists.wikimedia.org Message-ID: 4BE998CC.5060906@post.pl Content-Type: text/plain; charset=UTF-8; format=flowed
Guillaume Paumier wrote:
Greetings,
On Mon, May 10, 2010 at 6:48 PM, Adrianne Wadewitz wadewitz@gmail.com wrote:
I am looking at submitting a grant so that I can work on an analyses of en.wikipedia's coverage of art, literature, and history articles - their accuracy, etc. I am looking to establish a complete bibliography of all of the "quality" assessments that have been done about Wikipedia - on any language. Could we pool our knowledge?
I would advise to take a look at http://www.citeulike.org/group/382/tag/quality
Also, search this page for the string "quality": http://en.wikipedia.org/wiki/Wikipedia:Academic_studies_of_Wikipedia
-- Piotr Konieczny PhD Candidate Dept of Sociology Uni of Pittsburgh
"To be defeated and not submit, is victory; to be victorious and rest on one's laurels, is defeat." --J?zef Pi?sudski
Message: 3 Date: Wed, 12 May 2010 10:12:19 +0200 From: Torsten Zesch zesch@tk.informatik.tu-darmstadt.de Subject: [Wiki-research-l] Final CfP COLING 2010 - 2nd Workshop on Collaboratively Constructed Semantic Resources To: "'wiki-research-l@lists.wikimedia.org'" wiki-research-l@lists.wikimedia.org Message-ID: 1FF6659CB8948640A0694C9954BB5A390507AD42F0@pandora.tk.informatik.tu-darmstadt.de
Content-Type: text/plain; charset="iso-8859-1"
COLING 2010
2nd Workshop on "The People's Web meets NLP: Collaboratively Constructed Semantic Resources"
Beijing August 28th, 2010 http://www.ukp.tu-darmstadt.de/scientific-community/coling-2010-workshop/
Keywords: Wikipedia, Wiktionary, Mechanical Turk, Games with a purpose, Folksonomies, Twitter, Social Networks
INTRODUCTION
The workshop builds upon the success of the first ACL "The People's Web meets NLP" Workshop in 2009 that attracted 21 submissions. Accepted submissions included papers on Wikipedia [1], Wiktionary [2], Mechanical Turk [3], and game-based construction of semantic resources [4]. This clearly demonstrates a substantial and growing interest of the NLP community in collaboratively constructed semantic resources (CSRs), also evidenced by the increasing number of publications in this area and the EMNLP 2009 Web 2.0 track. In many works, CSRs have been used to overcome the knowledge acquisition bottleneck and coverage problems pertinent to conventional lexical semantic resources. The greatest popularity in this respect can so far certainly be attributed to Wikipedia [1]. However, other resources, such as folksonomies or the multilingual collaboratively constructed dictionary Wiktionary, have also shown great potential. Thus, the scope of the workshop deliberately includes any collaboratively constructed resource, not only Wikipedia.
Effective deployment of CSRs to enhance NLP introduces a pressing need to address a set of fundamental challenges, e.g. the interoperability with existing resources, or the quality of the extracted lexical semantic knowledge. Interoperability between resources is crucial as no single resource provides perfect coverage. The quality of CSRs is a fundamental issue, as they lack editorial control and entries are often incomplete. Thus, techniques for link prediction [5] or information extraction [6] have been proposed to guide the "crowds" while constructing resources of better quality.
[1] Olena Medelyan, David Milne, Catherine Legg and Ian H. Witten. Mining meaning from Wikipedia. In: International Journal of Human-Computer Studies. 67(9), 2009. [2] Torsten Zesch, Christof Mueller and Iryna Gurevych Extracting Lexical Semantic Knowledge from Wikipedia and Wiktionary Proceedings of the Conference on Language Resources and Evaluation (LREC), 2008. http://www.ukp.tu-darmstadt.de/software/jwpl/ http://www.ukp.tu-darmstadt.de/software/jwktl/ [3] Rion Snow, Brendan O'Connor, Daniel Jurafsky and Andrew Y. Ng. Cheap and Fast---But is it Good? Evaluating Non-Expert Annotations for Natural Language Tasks. Proceedings of EMNLP. 2008. [4] Luis von Ahn and Laura Dabbish. General Techniques for Designing Games with a Purpose. Communications of the ACM, 2008. [5] Rada Mihalcea and Andras Csomai Wikify!: Linking Documents to Encyclopedic Knowledge. Proceedings of the Sixteenth ACM Conference on Information and Knowledge Management, CIKM 2007. [6] Daniel S. Weld et al. Intelligence in Wikipedia. Twenty-Third Conference on Artificial Intelligence (AAAI), 2008.
TOPICS
The workshop will bring together researchers from different worlds, for example those using collaboratively constructed resources as sources of lexical semantic information for NLP purposes such as information retrieval, named entity recognition, or keyword extraction, and those using NLP techniques to improve the resources or extract and analyze different types of lexical semantic information from them. We will especially welcome contributions of interdisciplinary nature, e.g. those applying discourse analysis techniques from computational linguistics to the content of CSRs to better understand their properties.
Specific topics include but are not limited to:
- Analysis of collaboratively constructed resources, such as wiki-based platforms, folksonomies, Twitter, or social networks;
- Using collaboratively constructed resources for NLP purposes such as information retrieval, text categorization, information extraction, etc.;
- Using special features of collaboratively constructed resources to create novel resource types, for example revision-based corpora, simplified versions of resources, etc.;
- Analyzing the structure of collaboratively constructed resources related to their use in NLP;
- Interoperability of collaboratively constructed resources with conventional lexical semantic resources and between themselves;
- Mining social and collaborative content for constructing structured semantic resources and the corresponding tools;
- Mining multilingual information from collaboratively constructed resources;
- Quality and reliability of collaboratively constructed semantic resources.
We especially encourage short papers describing publicly available tools for accessing or analyzing collaboratively constructed resources that can serve as a multiplier in the NLP community.
The workshop is intended to be highly interdisciplinary. Thus, we encourage the participation of researchers working on computational linguistics aspects (e.g. parsing or discourse analysis) or NLP applications (e.g. information retrieval, information extraction, question answering, and knowledge representation) as well as researchers from other areas who might benefit from collaboratively constructed semantic resources.
Substantially extended versions of the best papers from the workshop can be submitted to a planned Special Issue in one of the major computational linguistics journals. The revised papers will have to undergo a separate reviewing process required for journal publications.
IMPORTANT DATES
Paper submission deadline (full and short): May 30, 2010 Notification of acceptance of papers: June 30, 2010 Camera-ready copy of papers due: July 10, 2010 COLING 2010 Workshop: Aug 28, 2010
ORGANIZERS
Iryna Gurevych Torsten Zesch
Ubiquitous Knowledge Processing Lab Technische Universit?t Darmstadt, Germany
PROGRAM COMMITTEE
Andras Csomai Google Inc. Anette Frank Heidelberg University Benno Stein Bauhaus University Weimar Bernardo Magnini ITC-irst Trento Christiane Fellbaum Princeton University Dan Moldovan University of Texas at Dallas Delphine Bernhard LIMSI-CNRS, Orsay Diana McCarthy Lexical Computing Ltd Elke Teich Technische Universit?t Darmstadt Emily Pitler University of Pennsylvania Eneko Agirre University of the Basque Country Erhard Hinrichs Eberhard Karls Universit?t T?bingen Ernesto De Luca Technische Universit?t Berlin Florian Laws University of Stuttgart Gerard de Melo MPI Saarbr?cken German Rigau University of the Basque Country Graeme Hirst University of Toronto G?nter Neumman DFKI Saarbr?cken Gy?rgy Szarvas Technische Universit?t Darmstadt Hans-Peter Zorn European Media Lab, Heidelberg Jos? Iria University of Sheffield Laurent Raumary LORIA, Nancy Magnus Sahlgren Swedish Institute of Computer Science Manfred Stede Potsdam University Omar Alonso A9.com, Inc. Pablo Castells Universidad Aut?nonoma de Madrid Paul Buitelaar DERI, National University of Ireland, Galway Philipp Cimiano Delft University of Technology Razvan Bunescu University of Texas at Austin Rene Witte Concordia University Montr?al Roxana Girju University of Illinois at Urbana-Champaign Saif Mohammad University of Maryland Samer Hassan University of North Texas S?ren Auer Leipzig University Tonio Wandmacher CEA, Paris
Message: 4 Date: Tue, 18 May 2010 15:43:30 -0700 From: Eric Bloch Eric.Bloch@marklogic.com Subject: [Wiki-research-l] Balisage 2010 Contest - Solve the Modern Tower of Babel To: "wiki-research-l@lists.wikimedia.org" wiki-research-l@lists.wikimedia.org Message-ID: 94F9268D-7633-4EC0-A89E-78C4176C0A4A@marklogic.com Content-Type: text/plain; charset="us-ascii"
Hi Folks,
Just thought I'd share news of a wiki-related contest:
As part of the Balisage 2010 Conference, MarkLogic has put forth a challenge in the form of a contest. The goal of the contest is to encourage markup experts to review and to research the current state of wiki markup languages and to generate a proposal that serves to de- babelize the current state of affairs for the long haul.
See http://developer.marklogic.com/news/marklogic-sponsors-balisage-2010-contest for details.
Best, Eric
Message: 5 Date: Wed, 19 May 2010 13:27:06 -0400 From: "R.Stuart Geiger" sgeiger@gmail.com Subject: [Wiki-research-l] Anyone interested in a Wikimania panel on research ethics? To: Research into Wikimedia content and communities wiki-research-l@lists.wikimedia.org Message-ID: AANLkTinkGy12BxQNw8sIDA49IofXGXPBcJK7j4-C25Al@mail.gmail.com Content-Type: text/plain; charset="iso-8859-1"
Greetings wiki researchers,
Looking through the submissions for Wikimania, I see that there is nothing yet on the subject of researchers in Wikipedia and there could be a good panel or workshop discussion on this topic. At the moment, the theme would be something along the lines of "Researchers in my Wikipedia? It's more likely than you think" and have wiki researchers on the panel who will talk about ethics, protocols, methodologies, and their relation to community norms and policies. Topics like the SRAG are especially relevant, and there are lots of other issues out there with ethnographic research, archival/data privacy, and more. My idea for the Wikimania panel is to build a dialog with the community on these issues, but if there is enough interest, an excellent open space session at WikiSym could easily be organized to deal with the more academic side of this issue. I'm also open to other ideas if other people have them.
So if anyone who is going to Wikimania and Wikisym is interested in this, please let me know today or tomorrow. Sorry for doing this so late, but I just realized that the deadline is the 20th.
Thanks, Stuart Geiger