Hi all,
For all Hive users using stat1002/1004, you might have seen a deprecation
warning when you launch the hive client - that claims it's being replaced
with Beeline. The Beeline shell has always been available to use, but it
required supplying a database connection string every time, which was
pretty annoying. We now have a wrapper
<https://github.com/wikimedia/operations-puppet/blob/production/modules/role…>
script
setup to make this easier. The old Hive CLI will continue to exist, but we
encourage moving over to Beeline. You can use it by logging into the
stat1002/1004 boxes as usual, and launching `beeline`.
There is some documentation on this here:
https://wikitech.wikimedia.org/wiki/Analytics/Cluster/Beeline.
If you run into any issues using this interface, please ping us on the
Analytics list or #wikimedia-analytics or file a bug on Phabricator
<http://phabricator.wikimedia.org/tag/analytics>.
(If you are wondering stat1004 whaaat - there should be an announcement
coming up about it soon!)
Best,
--Madhu :)
We’re glad to announce the release of an aggregate clickstream dataset extracted from English Wikipedia
http://dx.doi.org/10.6084/m9.figshare.1305770 <http://dx.doi.org/10.6084/m9.figshare.1305770>
This dataset contains counts of (referer, article) pairs aggregated from the HTTP request logs of English Wikipedia. This snapshot captures 22 million (referer, article) pairs from a total of 4 billion requests collected during the month of January 2015.
This data can be used for various purposes:
• determining the most frequent links people click on for a given article
• determining the most common links people followed to an article
• determining how much of the total traffic to an article clicked on a link in that article
• generating a Markov chain over English Wikipedia
We created a page on Meta for feedback and discussion about this release: https://meta.wikimedia.org/wiki/Research_talk:Wikipedia_clickstream <https://meta.wikimedia.org/wiki/Research_talk:Wikipedia_clickstream>
Ellery and Dario
Hello all,
My name is Andrew Hall and I’m going to be working with Aaron Halfaker over the coming months on a project looking to understand how Wikidata is used in wikis such as Wikipedia and the value that Wikidata provides them. We would also like to investigate Wikidata's use in other applications (e.g. Google Knowledge Graph). For more information on the project, check out the research proposal that we have created [1]. Definitely feel free to reach out to me with any questions or suggestions as well.
[1] https://meta.wikimedia.org/wiki/Research:Understanding_Wikidata%27s_Value <https://meta.wikimedia.org/wiki/Research:Understanding_Wikidata's_Value>
Thanks,
Andrew
>
> All tracks, keynotes and workshops are directly related to Wikimedia
> movement
Interesting. How does this differ from Wikimania or WikiConference?
Are you targeting studies that just happen to use Wikipedia data for
measurement (e.g. [1]) or would you like to limit studies that target a
Wikimedia movement priority (e.g. [2])?
Generally, I aim my scientific output regarding Wikimedia stuff to big
academic conferences where other researchers are studying similar phenomena
in other field sites. For example, you can find lots of work covering
OpenStreetMap, Zooniverse, Open source development, and Open scientific
practices at ACM's OpenSym, CSCW, and GROUP conferences.
With this conference operate with formal peer review and archived
publication (like in computer science/ACM/IEEE) or will be more like a
pre-publication outlet (like conferences are in basically all the other
disciplines)?
1. Halfaker, A., Keyes, O., Kluver, D., Thebault-Spieker, J., Nguyen, T.,
Shores, K., ... & Warncke-Wang, M. (2015, May). User session identification
based on strong regularities in inter-activity time. In *Proceedings of the
24th International Conference on World Wide Web* (pp. 410-418). ACM.
https://arxiv.org/pdf/1411.2878.pdf
2. Halfaker, A., Geiger, R. S., Morgan, J. T., & Riedl, J. (2013). The rise
and decline of an open collaboration system: How Wikipedia’s reaction to
popularity is causing its decline. *American Behavioral Scientist*, *57*(5),
664-688.
-Aaron
On Thu, Apr 27, 2017 at 10:06 PM, Rodrigo Padula <
rodrigopadula(a)wikimedia.org.br> wrote:
> Hello Stian,
>
> The focus of the event is exclusively Wikipedia and Wikimedia projects.
>
> All tracks, keynotes and workshops are directly related to Wikimedia
> movement and the general idea is to motivate other countries to organize
> local editions of the event based in our model started last year.
>
> This year, the international edition will be in Niteroi - Rio de Janeiro
> and probably the next edition will be in Porto - Portugal, moving the event
> each year to a different country with support from a local wikimedia
> chapter and a local university, stimulating the development of scientific
> research on Wikipedia in every corner of the planet.
>
> Best regards!
>
> Rodrigo Padula
> Coordenador de Projetos
> Wiki Educação Brasil
> http://www.wikimedia.org.br
> 21 99326-0558
>
>
>
> ---- On Thu, 27 Apr 2017 14:26:03 -0300 *Stian Håklev<shaklev(a)gmail.com
> <shaklev(a)gmail.com>>* wrote ----
>
> Sounds exciting, how is this different from WikiSym?
> Stian
>
> On Thu, Apr 27, 2017 at 6:13 PM, Rodrigo Padula <
> rodrigopadula(a)wikimedia.org.br> wrote:
>
> Hello my friends,
>
> After a very productive week attendin Wikimedia Conference in Berlin and 3
> days visiting our fellows from Portugal and many Portuguese universities in
> Lisboa, Porto and Coimbra we finally confirmed the 1st International
> Wikipedia Scientific Conference (2nd Brazilian edition).
>
> That project was created by the Wiki Education Brasil last year and was
> discussed with Wikimedia Portugal, Wikimedia Spain and many other chapters
> in Berlin.
>
> We are finishing the composition of the international scientific committee
> and the general organization committee during the next weeks.
>
> The event will be in Niterói - Rio de Janeiro in November 8-10 2017,
> organized in partnership with the Federal Fluminense University.
>
> To have global coverage, we will need your help to promote the conference
> in your country and universities.
>
> Our team is producing press releases and printed materials (A4 and A3)
> with information regarding the event and the call for papers.
>
> Translation help will be appreciated!
>
> Here you can see some pictures of the last year's edition
> https://commons.wikimedia.org/wiki/Category:Congresso_Cient%
> C3%ADfico_Brasileiro_da_Wikip%C3%A9dia
>
> Best regards
>
> Rodrigo Padula
> Wiki Educação Brasil - User Group
> http://facebook.com/wikiedubr/
>
> _______________________________________________
> open-science mailing list
> open-science(a)lists.okfn.org
> https://lists.okfn.org/mailman/listinfo/open-science
> Unsubscribe: https://lists.okfn.org/mailman/options/open-science
>
>
>
>
> --
> http://reganmian.net/blog -- Random Stuff that Matters
>
>
>
>
> _______________________________________________
> open-science mailing list
> open-science(a)lists.okfn.org
> https://lists.okfn.org/mailman/listinfo/open-science
> Unsubscribe: https://lists.okfn.org/mailman/options/open-science
>
>
Hello everyone,
I am currently working with Aaron Halfaker and Dario Taraborelli at the
Wikimedia Foundation on a project exploring automated classification of
article importance. Our goal is to characterize the importance of an
article within a given context and design a system to predict a relative
importance rank. We have a project page on meta[1] and welcome comments or
thoughts on our talk page. You can of course also respond here on
wiki-research-l, or send me an email.
Before moving on to model-building I did a fairly thorough literature
review, finding a myriad of papers spanning several disciplines. We have a
draft literature review also up on meta[2], which should give you a
reasonable introduction to the topic. Again, comments or thoughts (e.g.
papers we’ve missed) on the talk page, mailing list, or through email are
welcome.
Links:
1. https://meta.wikimedia.org/wiki/Research:Automated_
classification_of_article_importance
<https://meta.wikimedia.org/wiki/Research:Automated_classification_of_articl…>
2. https://meta.wikimedia.org/wiki/Research:Studies_of_Importance
Regards,
Morten
[[User:Nettrom]] aka [[User:SuggestBot]]
Open Data Movements in the Age of Big Data Capitalism
Tue 16 May 2017
17:00 – 19:00
Organised by the Westminster Institute for Advanced Studies
309 Regent Street
University of Westminster
London W1B 2HW
Registration:
https://www.eventbrite.co.uk/e/open-data-movements-in-the-age-of-big-data-c…
A WIAS seminar with International Research Fellow Dr Arwid Lund and Open
Knowledge Activist Dr Jonathan Gray
Big data has received a lot of attention in recent years, open
data/knowledge less so, and the relation between open data/knowledge and
the predominantly commercial big data sector even less so. This seminar
aims at critically discussing and shedding light on the under-theorised
field of open data/knowledge and its relation to capitalism.
In this WIAS seminar, Dr Arwid Lund reflects on his study of the
ideological landscape underpinning the open data/knowledge movement
(Open Knowledge London). Dr Jonathan Gray focuses on his own involvement
in this movement and his forthcoming book Data Worlds: The new politics
of information. The aim of the seminar is to introduce critical
perspectives on open data/knowledge’s relation to capitalism, as well as
a critical understanding of the political character that informs its
advocates.
We will round the event off with a wine reception.
Dr Arwid Lund is a Lecturer at the Department of Arts and Cultural
Sciences, Lund University, Sweden. Arwid is completing the second part
of his WIAS fellowship from 3 April 2017 to 2 June 2017. During his
fellowship, he will be working on how ‘openness’ is understood
ideologically by advocates within the Open Knowledge Network. His aim is
to identify the ideological landscape within this movement.
Dr Jonathan Gray is a Prize Fellow at the Institute for Policy Research,
University of Bath. He is also Research Associate at the médialab of
Sciences Po and Tow Fellow at the Tow Center for Digital Journalism,
Columbia University. As Director of Policy and Research at the global
civil society organisation Open Knowledge, Jonathan has founded and
co-founded numerous initiatives, including the Data Journalism Handbook,
Europe’s Energy, Open Data for Tax Justice, OpenSpending, Open Trials,
The Public Domain Review and Where Does My Money Go?.
Hello my friends,
After a very productive week in Berlin and 3 days visiting our fellows from Portugal and many Portuguese universities in Lisboa, Porto and Coimbra we finally confirmed the 1st International Wikipedia Scientific Conference (2nd Brazilian edition).
That project was created by the Wiki Education Brasil last year and discussed with Wikimedia Portugal in Berlin and many other chapters.
We are finishing the composition of the international scientific committee and the general organization committee during the next weeks.
The event will be in Niterói - Rio de Janeiro organized in partnership with the Federal Fluminense University in November 8-10 2017.
To have global coverage, we will need your help to promote the conference in your country.
Our team is producing press releases and printed materials (A4 and A3) with information regarding the event and the call for papers.
Translation help will be appreciated!
Here you can see some pictures of the last year's edition https://commons.wikimedia.org/wiki/Category:Congresso_Cient%C3%ADfico_Brasi…
Best regards
Rodrigo Padula
Wiki Educação Brasil - User Group
http://facebook.com/wikiedubr/
Following the process described in the Code of Conduct for Wikimedia
technical spaces <https://www.mediawiki.org/wiki/Code_of_Conduct>, the
Wikimedia Foundation’s Technical Collaboration team has selected five
candidates to form the first Code of Conduct Committee and five candidates
to become auxiliary members.
Here you have their names in alphabetical order. For details about each
candidate, please check
https://www.mediawiki.org/wiki/Code_of_Conduct/Committee_members
Committee member candidates:
-
Amir Sarabadani (Ladsgroup)
-
Lucie-Aimée Kaffee (Frimelle)
-
Nuria Ruiz (NRuiz-WMF)
-
Sébastien Santoro (Dereckson)
-
Tony Thomas (01tonythomas)
Auxiliary member candidates:
-
Ariel Glenn (ArielGlenn)
-
Caroline Becker (Léna)
-
Florian Schmidt (Florianschmidtwelzow)
-
Huji
-
Matanya
This list of candidates is subject to a community review period of two
weeks starting today. If no major objections are presented about any
candidate, they will be appointed in six weeks.
You can provide feedback on these candidates, via private email to
techconductcandidates(a)wikimedia.org. This feedback will be received by
the Community
Health
<https://meta.wikimedia.org/wiki/Technical_Collaboration/Community_health>
group handling this process, and will be treated with confidentiality.
We want to thank all the people who has considered the possibility to
support the Code of Conduct with their participation in this Committee. 77
persons have been contacted during the selection process, counting
self-nominations and recommendations. From these, 21 made it to a short
list of candidates confirmed and (according to our estimation) a potential
good fit for the Committee. Selecting the five candidates for the Committee
has been hard, as we have tried to form a diverse group that could work
together effectively in the consolidation of the Code of Conduct. Selecting
the five auxiliary members has been even harder, and we know that we have
left out candidates who could have contributed just as much. Being the
first people assuming these roles, we have tended a bit towards more
technical profiles with good knowledge of our technical spaces. We believe
that future renewals will offer better chances to other profiles (not so
technical and/or not so Wikimedia veteran), adding a higher diversity and
variety of perspectives to the mix.
On Thu, Mar 9, 2017 at 12:30 PM, Quim Gil <qgil(a)wikimedia.org> wrote:
> Dear Wikimedia technical community members,
>
> https://www.mediawiki.org/wiki/Code_of_Conduct
>
> The review of the Code of Conduct for Wikimedia technical spaces has been
> completed and now it is time to bootstrap its first committee. The
> Technical Collaboration team is looking for five candidates to form the
> Committee plus five additional auxiliary members. One of them could be you
> or someone you know!
>
> You can propose yourself as a candidate and you can recommend others
> *privately* at
> techconductcandidates AT wikimedia DOT org
>
> We want to form a very diverse list of candidates reflecting the variety
> of people, activities, and spaces in the Wikimedia technical community. We
> are also open to other candidates with experience in the field. Diversity
> in the Committee is also a way to promote fairness and independence in
> their decisions. This means that no matter who you are, where you come
> from, what you work on, or for how long, you are a potential good member of
> this Committee.
>
> The main requirements to join the Committee are a will to foster an open
> and welcoming community and a commitment to making participation in
> Wikimedia technical projects a respectful and harassment-free experience
> for everyone. The committee will handle reports of unacceptable behavior,
> will analyze the cases, and will resolve on them according to the Code of
> Conduct. The Committee will also handle proposals to amend the Code of
> Conduct for the purpose of increasing its efficiency. The term of this
> first Committee will be one year.
>
> Once we have a list of 5 + 5 candidates, we will announce it here for
> review. You can learn more about the Committee and its selection process at
> https://www.mediawiki.org/wiki/Code_of_Conduct/Committee and you can ask
> questions in the related Talk page (preferred) or here.
>
> You can also track the progress of this bootstrapping process at
> https://www.mediawiki.org/wiki/Talk:Code_of_Conduct#
> Bootstrapping_the_Code_of_Conduct_Committee
>
> PS: We have many technical spaces and reaching to all people potentially
> interested is hard! Please help spreading this call.
>
> --
> Quim Gil
> Engineering Community Manager @ Wikimedia Foundation
> http://www.mediawiki.org/wiki/User:Qgil
>
--
Quim Gil
Engineering Community Manager @ Wikimedia Foundation
http://www.mediawiki.org/wiki/User:Qgil