Hi Jan,
During my PhD, I did some research about how online uses evolve/develop
over time in online communities using topic modeling / clustering. Topic
modeling / mixture models are very powerful tools to learn the hidden
knowledge and structures underlying the data.
These two papers may be what you are looking for:
1. Xiangju Qin, Pádraig Cunningham, Michael Salter-Townshend. “Online
Trans-dimensional Von Mises-Fisher Mixture Models for User Profiles”.
Journal of Machine Learning Research, 17(200): 1-51, 2016.
http://www.jmlr.org/papers/volume17/15-454/source/15-454.pdf
2. Xiangju Qin, Derek Greene, Pádraig Cunningham. “A Latent Space Analysis
of Editor Lifecycles in Wikipedia”. In: M. Atzmueller et al. (eds): MSM,
MUSE, SenseML 2014. LNAI 9546, pp. 46–69, 2016.
http://link.springer.com/chapter/10.1007/978-3-319-29009-6_3
A minus point of these two papers is that we didn't consider user activity
in a very specific/niched level that you just described. Yes, considering
user activity in these context would shape more insights about the
potential challenges underlying Wikipedia platform (e.g. lost of different
types of editors, changes in edits to different Wiki pages over time)
Hope this helpful.
Best, Xiangju
On 20 March 2017 at 14:00, <wiki-research-l-request(a)lists.wikimedia.org>
wrote:
> Send Wiki-research-l mailing list submissions to
> wiki-research-l(a)lists.wikimedia.org
>
> To subscribe or unsubscribe via the World Wide Web, visit
> https://lists.wikimedia.org/mailman/listinfo/wiki-research-l
> or, via email, send a message with subject or body 'help' to
> wiki-research-l-request(a)lists.wikimedia.org
>
> You can reach the person managing the list at
> wiki-research-l-owner(a)lists.wikimedia.org
>
> When replying, please edit your Subject line so it is more specific
> than "Re: Contents of Wiki-research-l digest..."
>
>
> Today's Topics:
>
> 1. Editors: research on transitions, learning over time, leaving
> (Jan Dittrich)
> 2. Re: Editors: research on transitions, learning over time,
> leaving (Piscopo A.)
>
>
> ----------------------------------------------------------------------
>
> Message: 1
> Date: Mon, 20 Mar 2017 10:34:31 +0100
> From: Jan Dittrich <jan.dittrich(a)wikimedia.de>
> To: Wiki Research-l <wiki-research-l(a)lists.wikimedia.org>
> Subject: [Wiki-research-l] Editors: research on transitions, learning
> over time, leaving
> Message-ID:
> <CAC2423MxFb86WZODsO7FF487whmfkRqyFhi0apSFAQAMAX9crg(a)mail.gm
> ail.com>
> Content-Type: text/plain; charset=UTF-8
>
> Hello,
>
> I am looking for research on how editors transition through various levels
> of involvement in their time as editors. The questions I ask myself are:
>
> - How many people to come each month?
> - How many editors leave?
>
> …those are not too difficult to answer but…
>
> - How many people become more involved over time? E.g. How many each month
> come to a level where they are interested in handling many pages on the
> watchlist, learn the less obvious aspects of wiki culture etc.
>
> In my work as designer I am often involved in features for intermediate
> and/or very involved users and I’m wondering if there are any ballpark
> estimates of how many people learn these features each month.
>
> Jan
>
> --
> Jan Dittrich
> UX Design/ User Research
>
> Wikimedia Deutschland e.V. | Tempelhofer Ufer 23-24 | 10963 Berlin
> Phone: +49 (0)30 219 158 26-0
> http://wikimedia.de
>
> Imagine a world, in which every single human being can freely share in the
> sum of all knowledge. That‘s our commitment.
>
> Wikimedia Deutschland - Gesellschaft zur Förderung Freien Wissens e. V.
> Eingetragen im Vereinsregister des Amtsgerichts Berlin-Charlottenburg unter
> der Nummer 23855 B. Als gemeinnützig anerkannt durch das Finanzamt für
> Körperschaften I Berlin, Steuernummer 27/029/42207.
>
>
> ------------------------------
>
> Message: 2
> Date: Mon, 20 Mar 2017 10:02:38 +0000
> From: Piscopo A. <A.Piscopo(a)soton.ac.uk>
> To: Research into Wikimedia content and communities
> <wiki-research-l(a)lists.wikimedia.org>
> Subject: Re: [Wiki-research-l] Editors: research on transitions,
> learning over time, leaving
> Message-ID: <09B04529-8837-437B-BD09-FDBE0318FF99(a)soton.ac.uk>
> Content-Type: text/plain; charset="utf-8"
>
> Hi Jan
>
> Together with other people in my group, we wrote a paper about how
> Wikidata editors change their behaviour as they become more experienced.
> It might be not precisely what you are looking for, as our study has a
> qualitative approach, while it seems that you are looking for something
> more quantitative, but you might have a look if you want.
> This is the link: https://eprints.soton.ac.uk/401755/
>
> Please let me know if you have any questions,
> Alessandro
>
> –––
> Alessandro Piscopo
> Web and Internet Science Group
> School of Electronics and Computer Science
> University of Southampton
> email: A.Piscopo(a)soton.ac.uk<mailto:A.Piscopo@soton.ac.uk>
>
> On 20 Mar 2017, at 09:34, Jan Dittrich <jan.dittrich(a)wikimedia.de<mailto:
> jan.dittrich(a)wikimedia.de>> wrote:
>
> Hello,
>
> I am looking for research on how editors transition through various levels
> of involvement in their time as editors. The questions I ask myself are:
>
> - How many people to come each month?
> - How many editors leave?
>
> …those are not too difficult to answer but…
>
> - How many people become more involved over time? E.g. How many each month
> come to a level where they are interested in handling many pages on the
> watchlist, learn the less obvious aspects of wiki culture etc.
>
> In my work as designer I am often involved in features for intermediate
> and/or very involved users and I’m wondering if there are any ballpark
> estimates of how many people learn these features each month.
>
> Jan
>
> --
> Jan Dittrich
> UX Design/ User Research
>
> Wikimedia Deutschland e.V. | Tempelhofer Ufer 23-24 | 10963 Berlin
> Phone: +49 (0)30 219 158 26-0
> http://wikimedia.de
>
> Imagine a world, in which every single human being can freely share in the
> sum of all knowledge. That‘s our commitment.
>
> Wikimedia Deutschland - Gesellschaft zur Förderung Freien Wissens e. V.
> Eingetragen im Vereinsregister des Amtsgerichts Berlin-Charlottenburg unter
> der Nummer 23855 B. Als gemeinnützig anerkannt durch das Finanzamt für
> Körperschaften I Berlin, Steuernummer 27/029/42207.
> _______________________________________________
> Wiki-research-l mailing list
> Wiki-research-l(a)lists.wikimedia.org
> https://lists.wikimedia.org/mailman/listinfo/wiki-research-l
>
>
> ------------------------------
>
> Subject: Digest Footer
>
> _______________________________________________
> Wiki-research-l mailing list
> Wiki-research-l(a)lists.wikimedia.org
> https://lists.wikimedia.org/mailman/listinfo/wiki-research-l
>
>
> ------------------------------
>
> End of Wiki-research-l Digest, Vol 139, Issue 18
> ************************************************
>
--
Xiangju Qin, PhD
Postdoctoral researcher
Department of Computer Science, Aalto University
Email: qinxiangju(a)gmail.com or xiangju.qin(a)aalto.fi
<xiangju.qin(a)ucdconnect.ie>
Address: Department of Computer Science, Aalto University, P.O. Box 15400,
FI-00076 Aalto, Espoo, Finland
Hi John,
the Wiki-research-l mailing list (CCed) is usually a better place to
ask such questions than Wikimedia-l.
Without having taken a look at the book you mention, here are two
pointers to research that might be related:
https://meta.wikimedia.org/wiki/Research:Newsletter/2012/April#cite_ref-27
("Wikipedia as a thermodynamic system - becoming more efficient over
time")
https://meta.wikimedia.org/wiki/Research:Newsletter/2015/September#More_new…
On Thu, Mar 16, 2017 at 9:29 AM, John Erling Blad <jeblad(a)gmail.com> wrote:
> Has anyone tried to use termodynamics on social capital within Wikipedia?
> Over investment in social capital and negative specific heat might create
> unstable systems, that is people will leave the community.
>
> There is a book on the topic; A Dynamic Balance: Social Capital and
> Sustainable Community Development
> _______________________________________________
> Wikimedia-l mailing list, guidelines at: https://meta.wikimedia.org/wiki/Mailing_lists/Guidelines and https://meta.wikimedia.org/wiki/Wikimedia-l
> New messages to: Wikimedia-l(a)lists.wikimedia.org
> Unsubscribe: https://lists.wikimedia.org/mailman/listinfo/wikimedia-l, <mailto:wikimedia-l-request@lists.wikimedia.org?subject=unsubscribe>
--
Tilman Bayer
Senior Analyst
Wikimedia Foundation
IRC (Freenode): HaeB
Greetings!
Have you already seen that? It's pure magic! Just take a look here http://smart.parkviewers.com/3233
Pardon my monkey thumbs, alexandre.rosado
* Extended Submission Deadlines *
Call for Papers
10th Conference on Intelligent Computer Mathematics
- CICM 2017 -
July 17-21, 2017
University of Edinburgh, Scotland
http://www.cicm-conference.org/2017
----------------------------------------------------------------------
* Extended deadlines:
Abstract submission deadline : 22. March 2017
Submission deadline : 29. March 2017
* Invited Speakers *
- Alan Bundy (University of Edinburgh)
- Przemysław Chojecki (Polish Academy of Sciences)
- Grant Olney Passmore (University of Cambridge)
----------------------------------------------------------------------
Digital and computational solutions are becoming the prevalent means
for the generation, communication, processing, storage and curation of
mathematical information. Separate communities have developed to
investigate and build computer based systems for computer algebra,
automated deduction, and mathematical publishing as well as novel user
interfaces. While all of these systems excel in their own right, their
integration can lead to synergies offering significant added
value. The Conference on Intelligent Computer Mathematics (CICM)
offers a venue for discussing and developing solutions to the great
challenges posed by the integration of these diverse areas.
CICM has been held annually as a joint meeting since 2008, co-locating
related conferences and workshops to advance work in these
subjects. Previous meetings have been held in Birmingham (UK 2008),
Grand Bend (Canada 2009), Paris (France 2010), Bertinoro (Italy 2011),
Bremen (Germany 2012), Bath (UK 2013), Coimbra (Portugal 2014),
Washington DC (USA 2015) and Bialystok (Poland 2016).
This is a call for papers for CICM 2017, which will be held in
Edinburgh, Scotland, July 17-21, 2017. CICM 2017 also invites work-in-
progress papers.
The principal tracks of the conference will be:
* Track: Calculemus (chair: Matthew England)
All topics in the intersection of computer algebra systems and
automated reasoning systems including:
- Automated theorem proving in computer algebra systems.
- Computer algebra and symbolic computation in theorem proving
systems.
- Theory, design and implementation of interdisciplinary systems for
computer mathematics.
- Case studies and applications that involve a mix of computation
and reasoning.
- Case studies in formalization of mathematical theories that include
non-trivial computations.
- Representation of mathematics in computer algebra systems.
- Input languages, programming languages, types and constraint
languages, and modeling languages for mathematical assistant systems.
* Track: Digital Mathematical Libraries (DML) (chair: Olaf Teschke)
All topics related to the formation of a Global Digital Mathematics
Library (GDML) network, ranging from experiences from existing DMLs,
policies and standards facilitating interoperability, to development
and integration of new techniques for content creation,
preservation, enhancement and retrieval of the corpus, including:
- DML creation and maintenance (content aggregation, validation,
curation, enhancement).
- DML architecture and representations (organization, workflows,
policies, standards).
- DML access and applications (retrieval, interfaces, interoperability).
- DML collections and systems (experiences from various existing DMLs).
* Track: Mathematical Knowledge Management (MKM) (chair: Florian Rabe)
- Knowledge representation using, e.g., formal logics, computational
systems, narrative document formats, or databases
- Solutions to create, store, disseminate, discover, or manipulate
mathematical knowledge
- Corpora of knowledge inlcuding documents, theories, theorems, proofs,
models, algorithms, exercises, or examples
- Methods, systems, frameworks, case studies, challenges, benchmarks,
or applications for mathematical knowledge
- Comparisons, evaluations, or integrations of MKM solutions
* Track: Systems & Projects (chair: Osman Hasan)
- Systems: Stand-alone; plugins, libraries, or extensions of
existing systems; or integrations of existing systems
- Data: Formalizations; harvests or new processing of existing data;
or case studies, test cases, or benchmark suites for systems
- Projects: finished, ongoing or new
- Survey papers
* Track: Doctoral Programme (chair: TBD)
The overall programme is organized by the General Program Chair Herman
Geuvers. The local arrangements will be coordinated by Jacques
Fleuriot. The publicity chair is Serge Autexier.
We plan to have proceedings of the conference as in previous years
with Springer Verlag as a volume in Lecture Notes in Artificial
Intelligence (LNAI).
*Important Dates*
Conference submissions
- Abstract submission deadline: 22. March 2017 (extended)
- Submission deadline: 29. March 2017 (extended)
- Reviews sent to authors: 26. April 2017
- Rebuttals due: 30. April 2017
- Notification of acceptance: 12. May 2017
- Camera ready copies due: 26. May 2017
- Conference: 17.-21. July 2017
More details on the conference are available from
http://www.cicm-conference.org/2017
Dear all,
As some of you may already know I have been working on a tool to
visualize the etymological tree of words using data extracted from
Wiktionary.
This work involved the development of a software to extract
information from Wiktionary textual etymology sections using regular
expressions and a context free grammar.
A first version of the tool can be tested at
http://tools.wmflabs.org/etytree/etymology/resources/html/index.html
I have also set up a sparql endpoint at http://etytree-virtuoso.wmflabs.org/
I only had six months to work on this project and I am now asking for
a renewal. The main aspect I want to improve is the visualization,
which currently uses graphs instead of trees (for the nature of the
current data trees could not be used). Also the data extraction method
needs to be tailored for specific languages that use special
structures and are currently incorrectly extracted.
Please leave your feedback in the endorsement section (end of page) of
the renewal
https://meta.wikimedia.org/wiki/Grants:IEG/A_graphical_and_interactive_etym…
as only projects that have enough support from the community will be funded.
Best,
Ester
Is there a tool or framework forgetting article lifecycle stats in an
automated fashion. Is anyone aware of something like that? Things like
creator (+ their basic stats), total # of edits, who's edited the article
(+ their basic stats), article age, article flags, etc.
I'm reasonably platform / language agnostic. I'll only need stats on dozens
of articles an hour, so no need for a weaponised platform.
cheers
stuart
--
...let us be heard from red core to black sky
I'll suggest Wikihistory, e.g.
https://tools.wmflabs.org/xtools/wikihistory/wh.php?page_title=Tulip_mania
which gives all the editors (ranked by number of edits), article
size(?) and edits (per year, month, or even weeks).
There's a bit more at
https://en.wikipedia.org/w/index.php?title=Tulip_mania&action=info#mw-pagei…
which includes info on page watchers, recent edits and a wikidata
link.
Page views at https://tools.wmflabs.org/pageviews/?project=en.wikipedia.org&platform=all-…
only goes back a couple of years. Before that is an inconsistent
series (somewhere)
All these are available from the history tab on the article page
The only other thing that I'd want is the ORES scores (AI quality
prediction for any individual version given the permid).
Is this best place to get these at
https://ores.wmflabs.org/v2/scores/enwiki/wp10/?revids=769824240 ?
Is there an easy way to get a regular-interval time series of these?
(I wouldn't expect a complete time series for 1,000s of versions!)
Hope this helps.
Peete
=====
Message: 1
Date: Wed, 15 Mar 2017 21:18:59 +1300
From: "Stuart A. Yeates" <syeates(a)gmail.com>
To: Research into Wikimedia content and communities
<wiki-research-l(a)lists.wikimedia.org>
Subject: [Wiki-research-l] tool / framework for article lifecycle
stats ?
Message-ID:
<CAC_Lu0bjKgdcRYeKVr2U9Cr_hzatW10jiVi8=bDFcTXE4rs9Nw(a)mail.gmail.com>
Content-Type: text/plain; charset="utf-8"
Is there a tool or framework forgetting article lifecycle stats in an
automated fashion. Is anyone aware of something like that? Things like
creator (+ their basic stats), total # of edits, who's edited the article
(+ their basic stats), article age, article flags, etc.
I'm reasonably platform / language agnostic. I'll only need stats on dozens
of articles an hour, so no need for a weaponised platform.
cheers
stuart
--
...let us be heard from red core to black sky
Apologies for cross-posting
Call for Papers, Posters & Workshops and Tutorials
SEMANTiCS 2017 - The Linked Data Conference
13th International Conference on Semantic Systems
Amsterdam, Netherlands
September 11 -14, 2017
http://2017.semantics.cc
For details please go to: https://2017.semantics.cc/calls
Important Dates (Research & Innovation Track):
*Abstract Submission Deadline: May 17, 2017 (11:59 pm, Hawaii time)
*Paper Submission Deadline: May 24, 2017 (11:59 pm, Hawaii time)
*Notification of Acceptance: July 3, 2017 (11:59 pm, Hawaii time)
*Camera-Ready Paper: August 14, 2017 (11:59 pm,
Hawaii time)
Important Dates (Posters & Demos Track):
*Submission Deadline: July 10, 2017 (11:59 pm, Hawaii time)
*Notification of Acceptance: August 10, 2017 (11:59 pm,
Hawaii time)
*Camera-Ready Paper: August 18, 2017 (11:59 pm, Hawaii time)
Important Dates (Workshops & Tutorials):
*Proposals for Workshops with Call for Papers: March 31, 2017 (23:59
Hawaii Time)
*Proposals for Tutorials and Workshops without Call for Papers: June
30, 2017 (23:59 Hawaii Time)
*Notification of Acceptance: April 13, 2017 (23:59 Hawaii Time)
*Workshop Website/Call for Papers Online: April 30, 2017 (23:59 Hawaii Time)
*Camera-Ready Proceedings: September 4, 2017 (23:59 Hawaii Time)
*SEMANTiCS 2017 Workshop & Tutorial Days: September 11 and 14, 2017
Important Dates (Industry & Use Case Presentations):
*Submission Deadline: June 23, 2017
*Notification of Acceptance: July 4, 2017
*Presentation Ready: August 15, 2017
As in the previous years, SEMANTiCS’17 proceedings will be published by
ACM ICPS (pending) and CEUR WS proceedings.
SEMANTiCS 2017 will especially welcome submissions for the following hot
topics:
*Data Science (special track, see below)
*Web Semantics, Linked (Open) Data & schema.org
*Corporate Knowledge Graphs
*Knowledge Integration and Language Technologies
*Data Quality Management
*Economics of Data, Data Services and Data Ecosystems
Following the success of previous years, the ‘horizontals’ (research)
and ‘verticals’ (industries) below are of interest for the conference:
Horizontals:
*Enterprise Linked Data & Data Integration
*Knowledge Discovery & Intelligent Search
*Business Models, Governance & Data Strategies
*Semantics in Big Data
*Text Analytics
*Data Portals & Knowledge Visualization
*Semantic Information Management
*Document Management & Content Management
*Terminology, Thesaurus & Ontology Management
*Smart Connectivity, Networking & Interlinking
*Smart Data & Semantics in IoT
*Semantics for IT Safety & Security
*Semantic Rules, Policies & Licensing
*Community, Social & Societal Aspects
Data Science Special Track Horizontals:
*Large-Scale Data Processing (stream processing, handling large-scale
graphs)
*Data Analytics (Machine Learning, Predictive Analytics, Network Analytics)
*Communicating Data (Data Visualization, UX & Interaction Design,
Crowdsourcing)
*Cross-cutting Issues (Ethics, Privacy, Security, Provenance)
Verticals:
*Industry & Engineering
*Life Sciences & Health Care
*Public Administration
*e-Science
*Digital Humanities
*Galleries, Libraries, Archives & Museums (GLAM)
*Education & eLearning
*Media & Data Journalism
*Publishing, Marketing & Advertising
*Tourism & Recreation
*Financial & Insurance Industry
*Telecommunication & Mobile Services
*Sustainable Development: Climate, Water, Air, Ecology
*Energy, Smart Homes & Smart Grids
*Food, Agriculture & Farming
*Safety, Security & Privacy
*Transport, Environment & Geospatial
Second Call for Papers
10th Conference on Intelligent Computer Mathematics
- CICM 2017 -
July 17-21, 2017
University of Edinburgh, Scotland
http://www.cicm-conference.org/2017
----------------------------------------------------------------------
NEW * Invited Speakers: Alan Bundy (University of Edinburgh) and
Grant Olney Passmore (University of Cambridge)
----------------------------------------------------------------------
Digital and computational solutions are becoming the prevalent means
for the generation, communication, processing, storage and curation of
mathematical information. Separate communities have developed to
investigate and build computer based systems for computer algebra,
automated deduction, and mathematical publishing as well as novel user
interfaces. While all of these systems excel in their own right, their
integration can lead to synergies offering significant added
value. The Conference on Intelligent Computer Mathematics (CICM)
offers a venue for discussing and developing solutions to the great
challenges posed by the integration of these diverse areas.
CICM has been held annually as a joint meeting since 2008, co-locating
related conferences and workshops to advance work in these
subjects. Previous meetings have been held in Birmingham (UK 2008),
Grand Bend (Canada 2009), Paris (France 2010), Bertinoro (Italy 2011),
Bremen (Germany 2012), Bath (UK 2013), Coimbra (Portugal 2014),
Washington DC (USA 2015) and Bialystok (Poland 2016).
This is a call for papers for CICM 2017, which will be held in
Edinburgh, Scotland, July 17-21, 2017. CICM 2017 also invites work-in-
progress papers.
The principal tracks of the conference will be:
* Track: Calculemus (chair: Matthew England)
All topics in the intersection of computer algebra systems and
automated reasoning systems including:
- Automated theorem proving in computer algebra systems.
- Computer algebra and symbolic computation in theorem proving
systems.
- Theory, design and implementation of interdisciplinary systems for
computer mathematics.
- Case studies and applications that involve a mix of computation
and reasoning.
- Case studies in formalization of mathematical theories that include
non-trivial computations.
- Representation of mathematics in computer algebra systems.
- Input languages, programming languages, types and constraint
languages, and modeling languages for mathematical assistant systems.
* Track: Digital Mathematical Libraries (DML) (chair: Olaf Teschke)
All topics related to the formation of a Global Digital Mathematics
Library (GDML) network, ranging from experiences from existing DMLs,
policies and standards facilitating interoperability, to development
and integration of new techniques for content creation,
preservation, enhancement and retrieval of the corpus, including:
- DML creation and maintenance (content aggregation, validation,
curation, enhancement).
- DML architecture and representations (organization, workflows,
policies, standards).
- DML access and applications (retrieval, interfaces, interoperability).
- DML collections and systems (experiences from various existing DMLs).
* Track: Mathematical Knowledge Management (MKM) (chair: Florian Rabe)
- Knowledge representation using, e.g., formal logics, computational
systems, narrative document formats, or databases
- Solutions to create, store, disseminate, discover, or manipulate
mathematical knowledge
- Corpora of knowledge inlcuding documents, theories, theorems, proofs,
models, algorithms, exercises, or examples
- Methods, systems, frameworks, case studies, challenges, benchmarks,
or applications for mathematical knowledge
- Comparisons, evaluations, or integrations of MKM solutions
* Track: Systems & Projects (chair: Osman Hasan)
- Systems: Stand-alone; plugins, libraries, or extensions of
existing systems; or integrations of existing systems
- Data: Formalizations; harvests or new processing of existing data;
or case studies, test cases, or benchmark suites for systems
- Projects: finished, ongoing or new
- Survey papers
* Track: Doctoral Programme (chair: TBD)
The overall programme is organized by the General Program Chair Herman
Geuvers. The local arrangements will be coordinated by Jacques
Fleuriot. The publicity chair is Serge Autexier.
We plan to have proceedings of the conference as in previous years
with Springer Verlag as a volume in Lecture Notes in Artificial
Intelligence (LNAI).
*Important Dates*
Conference submissions
- Abstract submission deadline: 15. March 2017
- Submission deadline: 22. March 2017
- Reviews sent to authors: 26. April 2017
- Rebuttals due: 30. April 2017
- Notification of acceptance: 12. May 2017
- Camera ready copies due: 26. May 2017
- Conference: 17.-21. July 2017
Workshop Proposals
- Submission deadline: 10. February 2017
- Notification of acceptance: 15. February 2017
More details on the conference are available from
http://www.cicm-conference.org/2017