Saw this on the Linguist List and thought it might be of interest to folks here.
cheers
Brianna
USER-CONTRIBUTED KNOWLEDGE AND ARTIFICIAL INTELLIGENCE: AN EVOLVING SYNERGY
IJCAI 2009 Workshop
<http://lit.csci.unt.edu/~wikiai09/index.php/Main_Page>
==Overview==
The performance of an Artificial Intelligence system often depends on
the amount of world knowledge available to it. During the last decade,
the AI community has witnessed the emergence of a number of highly
structured knowledge repositories whose collaborative nature has led
to a dramatic increase in the amount of world knowledge that can now
be exploited in AI applications. Arguably, the best-known repository
of user-contributed knowledge is Wikipedia. Since its inception less
than eight years ago, it has become one of the largest and fastest
growing online sources of encyclopedic knowledge. One of the reasons
why Wikipedia is appealing to contributors and users alike is the
richness of its embedded structural information: articles are
hyperlinked to each other and connected to categories from an ever
expanding taxonomy; pervasive language phenomena such as synonymy and
polysemy are addressed through redirection and disambiguation pages;
entities of the same type are described in a consistent format using
infoboxes; related articles are grouped together in series templates.
Many more repositories of user-contributed knowledge exist besides
Wikipedia. Collaborative tagging in Delicious and community-driven
question answering in Yahoo! Answers and Wiki Answers are only a few
examples of knowledge sources that, like Wikipedia, can become a
valuable asset for AI researchers. Furthermore, AI methods have the
potential to improve these resources, as demonstrated recently by
research on personalized tag recommendations, or on matching user
questions with previously answered questions. Consequently, we believe
the time is ripe for a dedicated event focused on the synergy between
repositories of user-contributed knowledge and the research in
Artificial Intelligence.
The workshop is intended to be highly interdisciplinary. We encourage
participation of researchers from different perspectives, including
(but not limited to) machine learning, computational linguistics,
information retrieval, information extraction, question answering,
knowledge representation, and others. We also encourage participation
of researchers from other areas who might benefit from the use of
large bodies of machine-readable knowledge.
== Topics==
Topics covered by this workshop include, but are not limited to:
* Using user-contributed knowledge as a source of training data for AI tasks
* Automatic methods for improving the quality of user contributions
* Routing tasks to people who have the expertise to perform them well
* Integrating Wikipedia with existing ontologies (e.g. WordNet, CYC, ODP)
* Extracting annotated data from user contributions
* Enriching user contributions with new types of structural information
* User-contributed knowledge and the Semantic Web / Web 2.0
* Automatic extraction and use of cross-lingual information
* Computerized use of satellite Wiki projects such as Wiktionary,
Wikibooks or Wikispecies
==Workshop Format==
The workshop is planned as a one-day event (full day), which will
consist of an invited talk, paper and demo presentations, and a
discussion panel.
==Submission Info==
We invite the submission of regular full papers (up to 6 pages), short
papers reporting on late-breaking results (up to 3 pages), and
descriptions of system demonstrations (up to 1 page) using the IJCAI
style. Submissions that have been accepted for publication elsewhere
or are under review for another conference must clearly state so on
the front page of the paper.
Submissions should be properly anonymized to make them suitable for
double-blind review. The papers will be submitted through the
EasyChair site at http://www.easychair.org/conferences/?conf=wikiai09
== Important Dates==
Deadline for long paper submission: March 6th, 2009
Deadline for short papers and demos: March 27th, 2009
Notification of acceptance: April 17th, 2009
Camera-ready papers due at IJCAI: May 8th, 2009
Workshop date: one day between July 11 and July 13, 2009 (to be defined later)
--
They've just been waiting in a mountain for the right moment:
http://modernthings.org/
Dear friend,
We are conducting a study on the motivation of the knowledge sharing on the
Wikipedia community.
The contributors’ experience to Linux is very important to the design and
management of this knowledge platform.
Would you please post the following on-line questionnaire message to the
Wikipedia platform or forward the message to the members?
After the survey is done, we will randomly select twenty persons and present
them with USB 2GB Flash Drives.
Besides, with each valid questionnaire, we will donate US $1 dollar to the
Wikimedia Foundation.
The result of this survey is analyzed in an anonymous way and is only
regarded as the academic use.
Please help us to complete the data collection.
Thanks so much for your help.
Cheers,
Joanne
[The Message content]
Dear friends,
We are conducting a study on the motivation of the knowledge sharing on
Wikipedia. Your experience of the read from and write to Wikipedia is very
important to the design and management of this knowledge platform. The
survey will take about two minutes. We deeply appreciate your help on
answering the following questions.
After the survey is done, we will randomly select twenty persons and
present them with USB 2GB Flash Drives. Besides, with each valid
questionnaire, we will donate US $1 dollar to the Wikimedia Foundation. The
result of this survey is analyzed in an anonymous way and is only regarded
as the academic use. Please feel free to fill out the questionnaire. Thanks
again for your time and valuable input.
May happiness and health be with you everyday!
★ On-line Questionnaire: http://140.119.19.152:8080/wiki/
Shari S. C. Shang
Eldon Y. Li
Professor,
Department of Management Information Systems,
National Chengchi University
Tel.: +886-2-82374038; Fax: +886-2-29393754 ; E-mail: s1213527(a)yahoo.com.
tw
Hello all,
This is sort of related to the previous thread on consent, and is
something that's been on my mind for a while.
As someone who is reasonably visible in the wiki research community
and Wikipedia, I get asked fairly often to either a) help recruit
participants for studies of Wikipedia, or b) participate myself in
such studies.
The common theme of such requests is often:
* The researcher wants to find people who are invested in Wikipedia,
for qualitative studies of contributors
* The researcher does not know how to go about doing this
* What "invested" means is often poorly defined, since the researcher
is often trying to figure out what participation looks like more
generally
* The researcher has done the standard things (posted on the mailing
list, on the village pump) and hasn't gotten any results; or has
semi-randomly posted on people's talk pages, potentially getting a
warning about spamming in the process
As a result:
* many of the same people (i.e. very visible contributors) keep
getting asked to participate in different studies; or
* the researcher is left with a self-selected group of people from the
mailing lists or other places, which may in no way represent 'the
community' (my hypothesis is that we have many small communities,
working under the greater umbrella of Wikipedia); and who may be
people who are particularly outspoken or disgruntled; or
* the researcher does not get enough participants to do a good study
So:
* Is there a good solution for these problems?
* Can we come up with "best practices" or advice for people who are
trying to recruit Wikipedians for studies?
* What about some sort of infrastructure or wikiproject to support
these requests? Every time I get one of these emails I would really
like to pass it on to a group of people to deal with, but I am not
sure who, and this mailing list seems too small and focused to support
such requests.
best,
Phoebe
p.s. I think "any wiki with a large base of contributors" could be
substituted for "Wikipedia" here -- this is probably a problem with
studying any large community-run site. But most of my requests have
come from people specifically interested in Wikipedia.
- phoebe s. ayers | phoebe.ayers(a)gmail.com
Dear All,
My name is Avanidhar Chandrasekaran
(http://en.wikipedia.org/wiki/User_talk:Avanidhar).
I work with GroupLens Research at the University of Minnesota, Twin Cities.
As part of my research, I am involved in analyzing the usefulness and
Necessity of author reputation in Wikipedia.
In lieu of this, I have simulated an Interface to color words in an article
based on their Age.
Being experienced contributors to Wikipedia, I invite you to participate in
this study, which involves the following.
1. Please visit the following Instances of wikipedia and evaluate the
interface components which have been incorporated into each of them. Each
of these use their own algorithm to color text.
a) The Wikitrust project
http://wiki-trust.cse.ucsc.edu/index.php/Main_Page
b) The Wiki-reputation project at Grouplens research
http://wiki-reputation.cs.umn.edu/index.php/Main_Page
2) Once you have evaluated the two interfaces, kindly complete this survey
on Wikipedia quality
http://www.surveymonkey.com/s.aspx?sm=hagN5S1JZHxH6pF9SmXkkA_3d_3d
We hope to get your valuable feedback on these interfaces and how Wikipedia
article quality can be improved.
Thanks for your time
Avanidhar Chandrasekaran,
GroupLens Research, University of Minnesota
Hello,
>From time to time I ask myself (and others) what is a "regular
contributor" to a Wikipedia language edition. According to "Tell us
about your Wikipedia" the definitions are quite different.
At eo.WP I once checked a week long (in this August) who was making
edits, and I calculated a "regular contributor" if someone
* made at least one edit in that week
* obviously speaks Esperanto (is no "foreign helper" like someone who
does Interwiki linking)
* made his first edit at least six months ago
* made at least ten edits at all
My result was: 71, compared to 141 "active users" and 50 "very active
users" (Wikimedia Statistics, May 2008).
What do you think about this definition?
Kind regards
Ziko van Dijk
--
Ziko van Dijk
NL-Silvolde
> Statistics, with "Wikipedians", "active" and "very active users";
> like often, Zachte's Statistics are great, but easily misleading.
Also keep in mind that most figures in wikistats still include bot edits.
IMO it becomes more and more urgent to present separate counts for humans
and bots.
For instance in eo: 54% of total edits for all time were bot edits, but most
of these will be from recent years, so the percentage will be even higher
for recent years.
http://stats.wikimedia.org/EN/BotActivityMatrix.htm
Erik Zachte
Greetings fellow wiki-researchers,
I'm currently a Wikipedian and a graduate student at Georgetown
University's Communication, Culture, and Technology program in
Washington, D.C. Some of you might remember me from this summer's
Wikimania, where I presented "Conceptions and Misconceptions Academics
Hold About Wikipedia: An Ethnography of Academics for the Wikipedian
Community." Now, as I head into the final stage of my degree, I plan
to perform the reverse study: an ethnography of Wikipedia, although
hopefully for both academics and Wikipedians.
The main reason I am writing this rather lengthy message (which is
also going to other places after some feedback here) is to inform the
community about my research and hopefully gain some feedback about my
specific protocols and techniques. Most importantly, I have to get the
permission of my university's Institutional Review Board before I
begin this research, and they require that I perform a good amount of
ethical "due diligence" with the community beforehand. First, however,
let me explain what I plan to do, which will lead into issues of
informed consent, privacy, vulnerable subjects (children), and other
thorny topics that require a response from members of the community.
Participant-observation guides my methodology. This involves entering
the community as an editor and working with all of you in the course
of doing what it is that Wikipedians do, as well as asking many
questions along the way. In one sense, I will do what I have been
doing over the past four years as an editor here: editing articles,
participating in discussions and debates, and other
encyclopedia-building tasks. However, an essential component that
distinguishes this research is the interactive and
intentionally-ignorant questioning of current community practices and
beliefs as they happen. Particularly in disputes in which I will and
will not be involved, I plan on asking Wikipedians to explain events,
outcomes, and justifications that may seem trivial or commonsensical.
The objective is to arrive at a better understanding of the way in
which the community and the project operates on multiple levels –
which my previous research indicates is grossly misrepresented in
contemporary academic and popular culture.
The main issue with such a study is informed consent, which means
making sure that any "interventions" I make while researching do no
harm to human subjects. If a researcher is using surveys, interviews,
or clinical trials, informed consent is usually secured with a signed
form or click-through page that generally states the participant knows
he/she is the subject of research and agrees to have their actions
made public in certain ways. If I were only doing surveys or
interviews, then this wouldn't be an issue; nor would it matter if I
were simply observing Wikipedia but not contributing. The issue
arises when I become an active participant in Wikipedia as a
researcher representing my university for the purpose of collecting
and publishing data.
For obvious reasons, it is incredibly difficult if each time I
entered, for example, a deletion debate, I had to get the formal
consent of everyone involved before participating. I am told that
because of the public nature of Wikipedia's on-wiki communication, I
can get the informed consent requirement waived – if and only if I can
show an alternative way of establishing informed consent that reflects
current community practices and norms, as agreed-upon by community
leaders or representatives. Now, the traditional anthropological
strategy would require me to go to Jimmy Wales or the Board and
negotiate with him/them about the various protocols. However, I think
there is a better way specific to Wikipedia, and that is creating a
page in the Wikipedia namespace where we work out what protocols any
generic ethnographic researcher ought to follow. This way, any other
ethnographers don't have to re-invent the wheel.
So this is where all of you come in, I trust. I've created a page at
Wikipedia:Ethically researching Wikipedia. It the following tentative
guidelines/protocols that I – or any ethnographer – would follow in
order to make sure that on-wiki interventions inform participants of
my research and protect everyone involved:
1.I will recognize that as an ethnographer, I am a guest of the
Wikipedian community and the Wikimedia Foundation. As such, I will
respect any decisions made by the community, the Arbitration
Committee, or the Wikimedia Foundation regarding the way in which I
participate in the project and collect data about my experiences.
2.I will fully disclose myself as a researcher of Wikipedia on my
account's userpage and user talk page. Here, I will explain who I am,
what I am doing and why, my research protocols, ways to opt-out of
research, and University administrators or faculty members who can be
contacted if concerns arise with my research.
3.I will have a signature that shows my status as a researcher of
Wikipedia to let editors know that I am interacting with them in such
a role. This will include a link to the above research description and
my talk page. Feor example:[[User:Staeiou|Staeiou]]
<sup>[[User:Staeiou#My Research|I'm researching
Wikipedia]]</sup><sub>[[User_talk:Staeiou|Questions, concerns,
comments?]]</sub>. I will sign every contribution I make to talk or
process pages.
4.When collecting data and publishing results, I can refer to th
specific actions of editors or quote them using their username. I can
also publish information they have made public on userpage, their
edit/log history, and the results of various programs that analyse
publicly available data like Interiot's edit counter.
5.I will let editors opt-out of my research. Any editor will be free
to tell me that he or she does not wish to be a subject in my
research. If this happens, I will not communicate with him or her
further, and I will exclude from my research any existing data
specifically based on my interactions with him or her.
6.If my research leads me to communicate with Wikipedians off-wiki –
whether via e-mail, chat, in person, or other medium outside of the
public wikispace – I will use established interview-based research
protocols to establish informed consent. This means that those who
communicate with me off-wiki will be initially informed of my research
project and asked to digitally consent to such communication being
used for research purposes. I will work to mutually establish the
privacy of data collected in each situation: if the conversation can
be quoted, paraphrased, or alluded to; if the author can be attributed
by name or username; or if the entire conversation is off-the-record.
7.I will work to minimize risks to subjects by focusing on topics
directly or indirectly related to Wikipedia, encyclopedia-building,
and the community. To protect subjects, I will not discuss personally
sensitive topics, such as editors' past or current illegal behavior,
sexual behavior, medical or psychological care, and drug or alcohol
use. If editors express these or other personally sensitive topics, I
will not include them in my research.
If anyone has any modifications or additions, please let me know, or
better yet, change it yourself! I've marked it with {{proposed}}, and
I'd like for it to get some sort of review or consensus.
Thanks for reading this long message. I am very excited to finally be
working on this research project, and hope to hear from some of you
soon.
R. Stuart Geiger (Staeiou)