Hi Emir
This is a great initiative, thanks for setting it up.
I have a question: isn't it usual to list authors like this: Lastname Firstname?
At present you have Firstname Lastname, as in:
Adam S
Alejandro L
Angela B
etc
Is this appropriate - sorry for being awkward!?
cheers,
Mathieu
>
> Date: Sun, 12 Feb 2012 14:17:24 +0100
> From: emijrp <emijrp(a)gmail.com>
> To: Research into Wikimedia content and communities
> <wiki-research-l(a)lists.wikimedia.org>
> Subject: Re: [Wiki-research-l] WikiPapers: all the literature about
> wikis compiled in a... wiki
> Message-ID:
> <CAPgALA5Vf0d+drs0jvsE=u-9AiK=1NWXDqpbaCvk2fdpvWA+HA(a)mail.gmail.com>
> Content-Type: text/plain; charset="iso-8859-1"
>
> After a week, some stuff has been improved in WikiPapers:
>
> * Lists have now graphs, timelines and tag clouds using Semantic Results
> Formats MediaWiki extension:
> http://wikipapers.referata.com/wiki/List_of_publications,
> http://wikipapers.referata.com/wiki/List_of_authors,
> http://wikipapers.referata.com/wiki/List_of_keywords and others
> * You can export all the publications metadata in BibTex, CSV, JSON and RDF.
> * Added 4 new social networks to share content
> * Links [+] in every infobox field to improve usability and editability
> * Milestones: 1200+ pages, 500+ publications
>
> If you have any suggestion, please, drop a line.
>
> 2012/2/6 emijrp <emijrp(a)gmail.com>
>
> > You can now discover new colleagues and publications next to you using the
> > country-by-country pages http://wikipapers.referata.com/wiki/Spain
> >
> >
> > 2012/2/4 emijrp <emijrp(a)gmail.com>
> >
> >> Playing with Semantic Maps http://wikipapers.referata.com/wiki/WikiSym
> >>
> >> No WikiSym in the southern hemisphere yet and Asia is alone too : (
> >>
> >
> >
>
Hi Andrew,
Thanks for the aggregate pageview count data link. I may eventually be
interested in the MySQL parser and will follow up off-list if that later
becomes needed.
I don't necessarily need an IP address for registered users; the
username (or IP if anon. user) would be sufficient to link view and
rating. Ideally I'd transform the data into a few bits about viewed
talk page/edited talk page/edited article, and maybe some overall
measures of Wikipedia experience level, with the article feedback
ratings that user left.
>Its not my intention to discourage you, but have you thought about
>looking at this in a more aggregate fashion (i.e., average daily
>talk-page views vs. article quality rating)? -AW
I would expect any correlation there to be "explained away" when
accounting for article-page views; the relevant metric there would
probably be a ratio between article and talk page views. I expect that
ratio to be fairly uniform, perhaps a Gaussian distribution with
relatively low variance, especially when partitioned by assessment class
or "controversial" status. If there is any significant correlation
between the ratio and the article quality ratings, that would be
interesting but still not enough to make causal claims about how
exposure to the discussion or the type of discussion impacts perceived
article quality.
We'll see what we can do with the information available- and if anybody
has other ideas, please reply!
Grace and peace,
Ben
--
W. Ben Towne
wbt+wiki(a)cs.cmu.edu
Computation, Organizations, & Society <http://www.cos.cs.cmu.edu/>
Carnegie Mellon University
> Date: Tue, 07 Feb 2012 15:43:39 -0500
> From: "Andrew G. West"<westand(a)cis.upenn.edu>
> To: Research into Wikimedia content and communities
> <wiki-research-l(a)lists.wikimedia.org>
> Subject: Re: [Wiki-research-l] Pageviews and ratings
> Message-ID:<4F318CFB.7010505(a)cis.upenn.edu>
> Content-Type: text/plain; charset=ISO-8859-1; format=flowed
>
> Hi Ben,
>
> If you are interested in "pageviews", the best available public resource is:
>
> [http://dumps.wikimedia.org/other/pagecounts-raw/]
>
> which provides an aggregate count of views for a page, by hour (and I
> have a parser to store all this to a MySQLDB if it interests you).
> However, this does not map views to a particular identifier (username or
> IP address) or an exacting time-stamp, as you seem to desire. This might
> be tough because:
>
> * The WMF treats the IP addresses of registered editors as confidential
> information. IP address is used for "unregistered" editing. Regardless,
> no data pertaining to simple access is available in a public-facing
> fashion to my knowledge (and if it were, it would be trivial to
> determine the IP addresses of registered editors)
>
> * Assuming you were allowed to view it, even for an hour's time, the
> apache-like log of en:wp access would be LARGE. Consider that the terse
> and aggregate format they make available is already on the order of
> ~80MB/hour zipped.
>
> I am not terribly familiar with the article ratings tool and its
> operation, but I assume it would incur the same privacy concerns.
> Ratings data does seem to be accessible via the API:
>
> [http://en.wikipedia.org/w/api.php]
>
> But there are no fields describing the user/IP that left that feedback.
>
> -----
>
> Of course, I speak only of publicly available data. If you are able to
> convince the administration to collect and confidentially share this
> data, it would become more feasible (although you'd be trying to trace
> user click-paths from a -ton- of data).
>
> Its not my intention to discourage you, but have you thought about
> looking at this in a more aggregate fashion (i.e., average daily
> talk-page views vs. article quality rating)? -AW
>
>
Hi all;
I saw the discussion about creating a site to compile all the literature
about wikis many times in this mailing list in the past. It was discussed
in WikiSym 2011 too.[1] As a wikipedian interested on wiki research and as
a predoctoral student, I need to make a state of the art.
I have started WikiPapers[2] a wiki using Semantic MediaWiki, which is very
powerful to establish relations between pages and to generate dynamic
lists.[3] It doesn't only include papers, but info about tools and
datasets, to replicate results.
>From time to time, I post a backup link of the entire wiki in the mainpage,
in the case you want a copy or a disaster occurs ; ) So nothing will be
lost (some previous efforts in this area finished losing all the info).
When WikiPapers grows a bit more, I guess that some cool stats about itself
could be generated, as researchers by country, most studied topics (and
those with little literature), biases between English Wikipedia analysis
and other wikis, etc.
I have added over 40 publications by now[4], you can subscribe to the RSS
feeds[5] (the Firefox dynamic bookmark is great).
I'm not sure if you want to join to the effort : ). You are more than
welcome to add your publications, tools and datasets. Any suggestion would
be great.
Regards,
emijrp
[1]
http://wikipapers.referata.com/wiki/WikiLit:_Collecting_the_Wiki_and_Wikipe…
[2] http://wikipapers.referata.com
[3] http://wikipapers.referata.com/wiki/List_of_survey_papers
[4] http://wikipapers.referata.com/wiki/List_of_publications
[5] http://wikipapers.referata.com/wiki/WikiPapers:RSS_feeds
Because this subject was previously mentioned on Foundation-l and Research-l, I thought that those who are interested in research budgets might want to look at the information that’s now available here. Thanks to Steven Walling (WMF) for following up. I’m hoping that we’ll see more of this kind of information in the future so that the community can compare ROIs of proposed and completed projects.
http://meta.wikimedia.org/wiki/Research_talk:Wikimedia_Summer_of_Research_2…
Pine
I invite you to the yearly Berlin hackathon.
This is the premier event for the MediaWiki and Wikimedia technical
community. We'll be hacking, designing, and socialising.
Our goals for the event are to bring 100-150 people together, with
lots of people who have not attended such events before. User
scripts, gadgets, API use, Toolserver, Wikimedia Labs, mobile,
structured data, templates -- if you are into any of these things, we
want you to come!
Some financial assistance will be available -- more details soon.
This event will be hosted by Wikimedia Germany (WMDE) and supported by
the Wikimedia Foundation. Thank you, WMDE!
Dates: June 1-3 2012. Barely-started wiki page, no registration details
yet: https://www.mediawiki.org/wiki/Berlin_Hackathon_2012 . Organizers:
me and WMDE's Nicole Ebber with assistance from Lydia Pintscher and
Daniel Kinzler.
Mark your calendars!
--
Sumana Harihareswara
Volunteer Development Coordinator
Wikimedia Foundation
Hi -
Do you happen to have data on English Wikipedia article quality around mid-late 2008? I need the article name and the quality rating (by one WikiProject or more), I have some data on this from 2009 and I would like to get a sense how fast article quality evolves.
Thank you very much!
Best,
Andreea
____________________________________________________________
Andreea D. Gorbatai | Doctoral Candidate
PhD Program in Organizational Behavior/Sociology
Harvard Business School/ Harvard Graduate School of Arts & Sciences
Hello,
Does the English Wikipedia currently track pageviews?
I'm doing a study looking at the page ratings, and how that is (or
isn't) affected by a reader's understanding of the discussion process
that went on behind the scenes. We'd really like to be able to know if
the rater saw the talk page before they rated the article. As secondary
goals, we'd like to see if they edited the article and/or talk page, and
as a tertiary goal, we'd like to measure how familiar they are with
Wikipedia and talk pages in general (e. g. do they even know Talk pages
exist, are they a frequent discussant on them, etc.).
If it is possible to get the information about ratings and pageviews
(esp. common fields/links between them), can somebody guide me on how to?
If the data is currently not collected but there is a way to start doing
so (i. e. no philosophical objection or significant tech/performance
issue b/c of the caching layers), who's the right person to work with
for that?
Thanks!
Grace and peace,
Ben
--
W. Ben Towne
wbt+wiki(a)cs.cmu.edu
Computation, Organizations, & Society <http://www.cos.cs.cmu.edu/>
Carnegie Mellon University
> Date: Tue, 12 Jul 2011 21:51:56 -0700
> From: Dario Taraborelli<dtaraborelli(a)wikimedia.org>
> Subject: [Wiki-research-l] New data dumps available
> To: Research into Wikimedia content and communities
> <wiki-research-l(a)lists.wikimedia.org>
> Message-ID:<461E101F-0005-443C-96F8-7382E8463BD4(a)wikimedia.org>
> Content-Type: text/plain; charset="windows-1252"
>
> As part of its product development program, the Wikimedia Foundation's Tech Department will be releasing regular data dumps for all the features that are currently being implemented. The first weekly dumps from the Article Feedback Tool ? an experimental feature to engage readers to interact with Wikipedia's contents via a quality rating system [1] ? are available since this afternoon [2]. The latest datasets contain raw ratings data collected each week from a random sample of 100K articles of the English Wikipedia. More datasets will be released in the coming weeks as we deploy new features.
>
> Over the summer a new series of datasets produced by the participants in the Wikimedia Summer of Research [3] will be released and an open data repository will be announced to host and permanently identify these datasets. Further details on this program and WMF's open data policy will follow on the Foundation's blog and on this list.
>
> Dario
>
> [1]http://www.mediawiki.org/wiki/Article_feedback
> [2]http://www.mediawiki.org/wiki/Article_feedback/Data
> [3]http://meta.wikimedia.org/wiki/Research:Wikimedia_Summer_of_Research_2011
>
>
> --
> Dario Taraborelli, PhD
> Senior Research Analyst
> Wikimedia Foundation
>
> http://wikimediafoundation.org
> http://nitens.org/taraborelli
>
Hello,
Does the English Wikipedia currently track pageviews?
I'm doing a study looking at the page ratings, and how that is (or
isn't) affected by a reader's understanding of the discussion process
that went on behind the scenes. We'd really like to be able to know if
the rater saw the talk page before they rated the article. As secondary
goals, we'd like to see if they edited the article and/or talk page, and
as a tertiary goal, we'd like to measure how familiar they are with
Wikipedia and talk pages in general (e. g. do they even know Talk pages
exist, are they a frequent discussant on them, etc.).
If it is possible to get the information about ratings and pageviews
(esp. common fields/links between them), can somebody guide me on how to?
If the data is currently not collected but there is a way to start doing
so (i. e. no philosophical objection or significant tech/performance
issue b/c of the caching layers), who's the right person to work with
for that?
Thanks!
Grace and peace,
Ben
--
W. Ben Towne
wbt+wiki(a)cs.cmu.edu
Computation, Organizations, & Society <http://www.cos.cs.cmu.edu/>
Carnegie Mellon University
> Date: Tue, 12 Jul 2011 21:51:56 -0700
> From: Dario Taraborelli<dtaraborelli(a)wikimedia.org>
> Subject: [Wiki-research-l] New data dumps available
> To: Research into Wikimedia content and communities
> <wiki-research-l(a)lists.wikimedia.org>
> Message-ID:<461E101F-0005-443C-96F8-7382E8463BD4(a)wikimedia.org>
> Content-Type: text/plain; charset="windows-1252"
>
> As part of its product development program, the Wikimedia Foundation's Tech Department will be releasing regular data dumps for all the features that are currently being implemented. The first weekly dumps from the Article Feedback Tool ? an experimental feature to engage readers to interact with Wikipedia's contents via a quality rating system [1] ? are available since this afternoon [2]. The latest datasets contain raw ratings data collected each week from a random sample of 100K articles of the English Wikipedia. More datasets will be released in the coming weeks as we deploy new features.
>
> Over the summer a new series of datasets produced by the participants in the Wikimedia Summer of Research [3] will be released and an open data repository will be announced to host and permanently identify these datasets. Further details on this program and WMF's open data policy will follow on the Foundation's blog and on this list.
>
> Dario
>
> [1]http://www.mediawiki.org/wiki/Article_feedback
> [2]http://www.mediawiki.org/wiki/Article_feedback/Data
> [3]http://meta.wikimedia.org/wiki/Research:Wikimedia_Summer_of_Research_2011
>
>
> --
> Dario Taraborelli, PhD
> Senior Research Analyst
> Wikimedia Foundation
>
> http://wikimediafoundation.org
> http://nitens.org/taraborelli
>
Magnus Manske just sent this message to another list and I thought you'd
want to know. Baglama is a tool to "View counts for pages using Commons
images in certain category trees." It's especially useful for GLAM
institutions: https://outreach.wikimedia.org/wiki/GLAM
--
Sumana Harihareswara
Volunteer Development Coordinator
Wikimedia Foundation
-- original message --
Due to popular demand, my "baglama" tool now has charts for all
projects on its main page, and a project chart on the project page,
with the selected month highlighted. The bars are relative to the
maximum monthly view counter in the project. Enjoy.
http://toolserver.org/~magnus/baglama.php
Magnus
FYI
-----Ursprüngliche Nachricht-----
Von: air-l-bounces(a)listserv.aoir.org
[mailto:air-l-bounces@listserv.aoir.org] Im Auftrag von Eduard Aibar Puentes
Gesendet: Mittwoch, 1. Februar 2012 12:58
An: air-l(a)listserv.aoir.org
Betreff: [Air-L] Call for Papers : Academic Research into Wikipedia
The Journal Digithum: The Humanities in the Digital Age has published a
call for papers on Academic Research into Wikipedia:
http://cv.uoc.edu/~debats/digithum-call-for-papers-november11-en.pdf
Deadline for submission of originals: 1 March 2012 Publication date: May
2012
Eduard Aibar
Estudis d'Arts i Humanitats
Universitat Oberta de Catalunya
Av. Tibidabo, 39-43
08035 Barcelona
Spain
http://www.uoc.edu/webs/eaibarhttp://arts-humanitats.uoc.edu
Tel. (00 34) 93 253 7569
Fax (00 34) 93 417 6495