Aaron and Max! Thanks so much for your work on this. I have sent over
the methods and data to Stanford and they're excited about
incorporating this into a full study. You are awesome. Jake (Ocaasi)
On 10/22/14, wiki-research-l-request(a)lists.wikimedia.org
<wiki-research-l-request(a)lists.wikimedia.org> wrote:
> Send Wiki-research-l mailing list submissions to
> wiki-research-l(a)lists.wikimedia.org
>
> To subscribe or unsubscribe via the World Wide Web, visit
> https://lists.wikimedia.org/mailman/listinfo/wiki-research-l
> or, via email, send a message with subject or body 'help' to
> wiki-research-l-request(a)lists.wikimedia.org
>
> You can reach the person managing the list at
> wiki-research-l-owner(a)lists.wikimedia.org
>
> When replying, please edit your Subject line so it is more specific
> than "Re: Contents of Wiki-research-l digest..."
>
>
> Today's Topics:
>
> 1. Re: Extracting PMIDs (Maximilian Klein)
> 2. Re: Extracting PMIDs (Aaron Halfaker)
> 3. Re: Extracting PMIDs (Maximilian Klein)
>
>
> ----------------------------------------------------------------------
>
> Message: 1
> Date: Wed, 22 Oct 2014 11:27:09 -0700
> From: Maximilian Klein <isalix(a)gmail.com>
> To: Research into Wikimedia content and communities
> <wiki-research-l(a)lists.wikimedia.org>, Jake Orlowitz
> <jorlowitz(a)gmail.com>
> Subject: Re: [Wiki-research-l] Extracting PMIDs
> Message-ID:
> <CAKbmofi_RDgq+vh2MTwWpBObahrwLpw5dXBS9y+XmUga2VPybg(a)mail.gmail.com>
> Content-Type: text/plain; charset="utf-8"
>
> Jake,
> I have script that does this already for DOIs, Its was one-line change to
> make. These files should answer what you were looking for.
>
> https://raw.githubusercontent.com/notconfusing/listiness/pmc/pmc_list.txt
> https://raw.githubusercontent.com/notconfusing/listiness/pmc/pmid_list.txt
>
> In the future you can tell them to use halfak's
> https://pythonhosted.org/mediawiki-utilities/
> This is the code I used to get those lists.
> https://github.com/notconfusing/listiness/commit/e140ce9202b9c1098dec40ca1d…
>
> Make a great day,
> Max Klein ‽ http://notconfusing.com/
>
> On Mon, Oct 20, 2014 at 9:20 PM, Andrew G. West <west.andrew.g(a)gmail.com>
> wrote:
>
>> Jake,
>>
>> Yes, its a rather straightforward parse based on the citation format which
>> Jeremy described. Doc James and I already have this coded up for a soon to
>> be published [[WP:MED]] readership/editorship paper.
>>
>> Searching for PMID's in the entirety of the Wikipedia article base would
>> be a bit time consuming -- but if one needs to pull down only articles in
>> WikiProject Medicine, for example, I am also able to help on that front.
>>
>> Perhaps we'll take this offline, but if anyone else is interested in the
>> dirty details, feel free to contact one of us off-list. -AW
>>
>> --
>> Andrew G. West, PhD
>> http://www.andrew-g-west.com
>>
>>
>>
>> On 10/20/2014 11:57 PM, Jake Orlowitz wrote:
>>
>>> Hi folks,
>>>
>>> Relaying a question from a Stanford medical researcher:
>>>
>>> "Do you know if it is possible to extract PubMed ID (PMID) or PMCIDs
>>> from Wiki references? Furthermore, could you dump those IDs out into a
>>> list for analysis?"
>>>
>>> Best,
>>> Jake Orlowitz (Ocaasi)
>>>
>>>
>>> _______________________________________________
>>> Wiki-research-l mailing list
>>> Wiki-research-l(a)lists.wikimedia.org
>>> https://lists.wikimedia.org/mailman/listinfo/wiki-research-l
>>>
>>>
>>
>> _______________________________________________
>> Wiki-research-l mailing list
>> Wiki-research-l(a)lists.wikimedia.org
>> https://lists.wikimedia.org/mailman/listinfo/wiki-research-l
>>
>
Hey, saw this going around, might be of interest to some people on this
list:
The Software Sustainability Institute have set up a petition to recognise
the importance of software and software engineers to the research
community: http://bit.ly/SoftwareIsFundamental
*Edward Saperia*
Conference Director Wikimania London <http://www.wikimanialondon.org>
email <ed(a)wikimanialondon.org> • facebook
<http://www.facebook.com/edsaperia> • twitter
<http://www.twitter.com/edsaperia> • 07796955572
133-135 Bethnal Green Road, E2 7DG
I also can help in extracting PMID's from the Wikipedia articles. What else
do you need? Just PMIDs?
On Tue, Oct 21, 2014 at 2:00 PM, <
wiki-research-l-request(a)lists.wikimedia.org> wrote:
> Send Wiki-research-l mailing list submissions to
> wiki-research-l(a)lists.wikimedia.org
>
> To subscribe or unsubscribe via the World Wide Web, visit
> https://lists.wikimedia.org/mailman/listinfo/wiki-research-l
> or, via email, send a message with subject or body 'help' to
> wiki-research-l-request(a)lists.wikimedia.org
>
> You can reach the person managing the list at
> wiki-research-l-owner(a)lists.wikimedia.org
>
> When replying, please edit your Subject line so it is more specific
> than "Re: Contents of Wiki-research-l digest..."
>
>
> Today's Topics:
>
> 1. Extracting PMIDs (Jake Orlowitz)
> 2. Re: [OpenAccess] Extracting PMIDs (Jeremy Baron)
> 3. Re: Extracting PMIDs (Andrew G. West)
>
>
> ----------------------------------------------------------------------
>
> Message: 1
> Date: Mon, 20 Oct 2014 23:57:49 -0400
> From: Jake Orlowitz <jorlowitz(a)gmail.com>
> To: wiki-research-l(a)lists.wikimedia.org, Open Access discussions
> <openaccess(a)lists.wikimedia.org>
> Subject: [Wiki-research-l] Extracting PMIDs
> Message-ID:
> <CABL4yajbL9FuL-vEL=
> cOBqzXQR_6jVDYrxH99vLkthJ_kM8hRg(a)mail.gmail.com>
> Content-Type: text/plain; charset="utf-8"
>
> Hi folks,
>
> Relaying a question from a Stanford medical researcher:
>
> "Do you know if it is possible to extract PubMed ID (PMID) or PMCIDs from
> Wiki references? Furthermore, could you dump those IDs out into a list for
> analysis?"
>
> Best,
> Jake Orlowitz (Ocaasi)
>
[ Please feel free to forward this on to any other people or lists you
think might be interested! ]
Greetings Fellow Wiki Researchers!
Along with fellow Wikimedians Jonathan Morgan and Frances Hocutt (and
many others), I am helping run three and a half day-long workshops in
November for anyone interested in learning how to use programming and
data science tools to ask and answer questions about online
communities with a strong focus on doing research on Wikipedia. This
will be a new and improved version of the workshops we ran
successfully earlier this year.
You will need to be in Seattle to participate or volunteer in our
workshops. That said, you can /totally/ build on our curriculum to run
your own workshops. One group in Waterloo already has! Details on how
to do that are online (http://mako.cc/go/03) and we'd be happy to
help!
If you /are/ in Seattle...
The workshops are for people with no previous programming experience
and will be free of charge and open to anyone.
Our goal is that, after the three workshops, participants will be able
to use data to produce numbers, hypothesis tests, tables, and
graphical visualizations to answer questions like:
- Are new contributors to an article in Wikipedia sticking around
longer or contributing more than people who joined last year?
- Who are the most active or influential users of a particular
Twitter hashtag?
- Are people who participated in a Wikipedia outreach event staying
involved? How do they compare to people that joined the project
outside of the event?
Details and dates are online here:
http://networkcollectiv.es/2014/10/16/cdsw-november-2014/
If you are interested in participating, fill out our registration at
the link above before October 30th.
If you already know how to program in Python, it would be really
awesome if you would volunteer as a mentor! Being a mentor will
involve working with participants and talking them through the
challenges they encounter in programming. No special preparation is
required. If you’re interested, send me an email.
Regards,
Mako
--
Benjamin Mako Hill
http://mako.cc/
Creativity can be a social contribution, but only in so far
as society is free to use the results. --GNU Manifesto
Dear colleagues,
As you may have heard,
https://meta.wikimedia.org/wiki/Grants:IEG/Editor_Interaction_Data_Extracti…
is an individual engagement grant proposal. I am working on this proposal
with volunteer assistance and advice from Aaron Halfaker (WMF), Haitham
Shammaa (WMF), and Fabian Flöck (Karlsruhe Institute of Technology
<https://en.wikipedia.org/wiki/Karlsruhe_Institute_of_Technology>).
We are still developing this proposal, and plan to have it finalized in the
next few days.
We would greatly appreciate your comments on whether you support or oppose
the general concept of this project, and any suggestions about how to
refine the proposal.
Additionally, we would like to hear from you about which sets of editor
interaction data, and what visualizations of editor interaction data, would
be most relevant to your interests. We intend to prioritize our outputs
with your comments in mind.
Please comment on the proposal talk page. Questions and feedback, both
positive and critical, are helpful to us as the proposers, and also help
the Individual Engagement Grants Committee [1] to assess the proposal.
Regards,
Pine
[1] I am a member of the Individual Engagement Grants Committee. I am
recusing from reviewing proposals in this funding round.
Thanks Rachel. I'm forwarding this invite to other lists.
Pine
On Wed, Oct 8, 2014 at 9:17 AM, Rachel Farrand <rfarrand(a)wikimedia.org>
wrote:
> Hi Pine,
>
> The streaming youtube link is public. Feel free to distribute it however
> you like. People can also ask questions on the google+ page during the talk
> if they don't have access to IRC. I will not be monitoring it quite as
> closely, but I still will check it during the talk.
>
> As you probably already know, the youtube video will also be public after
> the talk so anyone can watch or rewatch the talk at any point after it is
> over as well.
>
> I hope this answers your question,
>
> Rachel
>
>
>
> On Tue, Oct 7, 2014 at 11:08 PM, Pine W <wiki.pine(a)gmail.com> wrote:
>
>> Hi Rachel,
>>
>> Would it be appropriate to invite people who are outside of the Wikimedia
>> universe to watch on Youtube and participate on IRC? This talk in
>> particular may interest outsiders who are designers, PMs, researchers, or
>> coders.
>>
>> Pine
>> On Oct 7, 2014 3:40 PM, "Rachel Farrand" <rfarrand(a)wikimedia.org> wrote:
>>
>>> Please join us for the following tech talk:
>>>
>>> *Tech Talk**:* Design Research in Product Development
>>> *Presenter:* Abbey Ripstra, Design & Usability Research Analyst on The UX
>>> team at the Wikimedia Foundation
>>> *Date:* October 22
>>> *Time:* 1900 UTC
>>> <
>>> http://www.timeanddate.com/worldclock/fixedtime.html?msg=Tech+Talk%3A+Desig…
>>> >
>>> Link to live YouTube stream <http://www.youtube.com/watch?v=jYMTzzosUIw>
>>> *IRC channel for questions:* #wikimedia-office
>>> Google+ page
>>> <
>>> https://plus.google.com/u/0/b/103470172168784626509/events/caiiagf75bvddr09…
>>> >,
>>> another
>>> place for questions
>>>
>>> Talk description: The value of design research in product development is
>>> being recognized more frequently these days. This talk will quickly
>>> describe the innovation process, and how, when and why design research
>>> fits
>>> into the different parts of the innovation process. For most of the talk,
>>> Abbey will focus in on the product development part of innovation and
>>> describe how, when and why to best utilize the various methodologies of
>>> design research toward building intuitive, easy to use products that meet
>>> the needs of users. Abbey will also talk about, and want to collaborate
>>> on,
>>> the best ways to integrate design research, specifically, into product
>>> development at Wikimedia Foundation.
>>> _______________________________________________
>>> Wikitech-l mailing list
>>> Wikitech-l(a)lists.wikimedia.org
>>> https://lists.wikimedia.org/mailman/listinfo/wikitech-l
>>
>>
>
After a break in September, we’re resuming our monthly Research and Data showcase. The next showcase will be live-streamed tomorrow Wednesday October 15 at 11.30 PT. As usual you can join the conversation via IRC on freenode.net by joining the #wikimedia-research channel.
We look forward to seeing you there,
Dario
This month:
Emotions under Discussion: Gender, Status and Communication in Wikipedia
By David Laniado: I will present a large-scale analysis of emotional expression and communication style of editors in Wikipedia discussions. The talk will focus especially on how emotion and dialogue differ depending on the status, gender, and the communication network of the about 12000 editors who have written at least 100 comments on the English Wikipedia's article talk pages. The analysis is based on three different predefined lexicon-based methods for quantifying emotions: ANEW, LIWC and SentiStrength. The results unveil significant differences in the emotional expression and communication style of editors according to their status and gender, and can help to address issues such as gender gap and editor stagnation.
Wikipedia as a socio-technical system
By Aaron Halfaker: Wikipedia is a socio-technical system. In this presentation, I'll explain how the integration of human collective behavior ("social") and information technology ("technical") has lead to a phenomena that, while being massively productive, is poorly understood due to lack of precedence. Based on my work in this area, I'll describe five critical functions that healthy, Wikipedia-like socio-technical systems must serve in order to continue to function: allocation, regulation, quality control, community management and reflection. Next I'll argue the Wikimedia Foundation's analytics strategy currently focuses on outcomes related to a relatively narrow aspect of system health and all but completely ignores productivity. Finally, I'll conclude with an overview of three classes of new projects that should provide critical opportunities to both practically and academically understand the maintenance of Wikipedia's socio-technical fitness.
Ada Lovelace Day is celebrated on October 14 this year.
Augusta Ada King, Countess of Lovelace (born in the year 1815) was a
mathematician and computer programmer who worked on Charles Babbage's
Analytical Engine. She foresaw how computers could evolve into devices that
perform tasks more sophisticated than simple calculations. She is
controversially credited with authoring the world's first computer
program, and certainly worked extensively with Babbage. [1]
Ada Lovelace Day celebrates women's contributions to science, technology,
engineering, and mathematics.
Wikimedia Commons, English Wikipedia, and Persian Wikipedia have designated
a watercolor portrait of Lovelace as a featured picture. [2]
Happy Ada Lovelace Day,
Pine
[1] https://en.wikipedia.org/wiki/Ada_Lovelace
[2] https://commons.m.wikimedia.org/wiki/File:Ada_Lovelace_portrait.jpg
Thanks for this. I'm forwarding to the Analytics and Research lists.
Pine
---------- Forwarded message ----------
From: Rachel Farrand <rfarrand(a)wikimedia.org>
Date: Mon, Oct 6, 2014 at 1:12 PM
Subject: Re: [Wikitech-l] Tech Talk: The Dashboarding Problem: October 6
To: Wikimedia developers <wikitech-l(a)lists.wikimedia.org>
Thank you for the great turnout today!
If you would like to view the recording of the talk, here is the link:
http://www.youtube.com/watch?v=hzMwwLfvh5g
If you have any questions about today's talk please feel free to get in
touch with Dan Andreescu <dandreescu(a)wikimedia.org> and Nuria Ruiz <
nuria(a)wikimedia.org>
You can check out past tech talk recondrings at the MediaWiki YouTube page
here: http://www.youtube.com/channel/UCg4wlhlN8RjP6_e_vMC4CTA
If you would like to nominate future tech talks or see what we have coming
up, go here:
https://www.mediawiki.org/wiki/Project:Calendar/How_to_schedule_an_event/Te…
Thanks!
On Mon, Oct 6, 2014 at 11:03 AM, Rachel Farrand <rfarrand(a)wikimedia.org>
wrote:
> Reminder: This tech talk starts in 1 hour
>
> On Wed, Oct 1, 2014 at 12:01 PM, Rachel Farrand <rfarrand(a)wikimedia.org>
> wrote:
>
>> Please join us for the following tech talk:
>>
>> Tech Talk: *The Dashboarding Problem*
>> Date: October 6
>> Time: 1900 UTC
>> <
http://www.timeanddate.com/worldclock/fixedtime.html?msg=Tech+Talk%3A+The+D…
>
>> Link to live YouTube stream <http://www.youtube.com/watch?v=hzMwwLfvh5g>
>> IRC channel for questions: #wikimedia-office
>> Google+ page
>> <
https://plus.google.com/u/0/b/103470172168784626509/events/ch8uuivq05nqejql…>,
another
>> place for questions
>>
>> Talk description:
>> The Analytics team has been busy exploring dashboarding and visualizing
>> editor engagement data. We found that while most people focus on
>> visualization, data access and information architecture are just as
>> important and separate problems.
>> Mike Bostock solved visualization and the design team took care of
>> information architecture, so we built a dashboard around their work.
>> In this talk we share our learnings from developing dashiki, our new
>> dashboard stack. We will talk about why we believe a server-less
javascript
>> app was the right architecture for the problem, how with about 900 lines
of
>> javascript we transform data into Vega grammar, and how knockout
components
>> helped us stay modular.
>>
>> While we'll look at some javascript, the talk is high level, about 30
>> minutes long, and everyone that is interested in dashboarding,
>> visualization, and modularity is welcome to attend.
>>
>> Dashiki Code: https://github.com/wikimedia/analytics-dashiki
>>
>> Editor Dashboard: https://metrics.wmflabs.org/static/public/dash/
>>
>
>
_______________________________________________
Wikitech-l mailing list
Wikitech-l(a)lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikitech-l