I was interested to know whether there has been any research done around
the use of the Mass Message mediawiki extension and in particular about
impact of using it.
By extension, I am interested in any research that might be related to
the impact of posting a "template" message (as opposed to an individual
targetted) on a user talk page.
I know the SignPost did a poll in 2017 to evaluate the interest of
switching to the Newsletter extension system. And I remember reading
about impact of notifications. But are there studies related to the
measure of impact in terms of engagement to mass posting on user talk page ?
Thanks for any insight you could provide
moving research-internal to Bcc, + wiki-research-l which is the better
place for this kind of discussio
If you want to respond to this email, please make sure you're
subscribed to https://lists.wikimedia.org/mailman/listinfo/wiki-research-l
, and please find my comments below.
On Tue, Nov 27, 2018 at 6:54 AM Rebecca Petras
> Over the past five years, we have worked with various Wikimedia communities focused on simplified health articles being translated into non-English Wikipedia sites. The biggest area where we have focused is east Africa, especially for Swahili. We would like to expand this work, as it is our mission to increase the ability to get critical information openly in marginalized languages. However, we really feel we need to ask two main questions:
> When translating medical articles from English to Swahili, is the translation of higher quality if it is simplified first?
Can you expand what you mean? I am not sure if I understand your question.
> Can a health-specific machine translation engine, used by community who are translating articles, increase the ability to translate medical articles while maintaining quality?
The quality of many of the standard machine translation engines will
rely on the source and destination languages you choose. While some
language pairs give high quality results, some may not. Which machine
translation tool do you have in mind? And for what language pair?
Also, do you use Content Translation?
> I am would like to know your thoughts on whether a research project addressing one or both of those questions would be acceptable – they are key questions for our community.
Any research project that can address the needs of your community can
be great. :) If you expand some more what you want to use the results
of this line of research for, we can try to help you with figuring out
whether the answer to some of your questions is already in the
literature, and through that save some of your time. :)
> Thanks very much,
> Rebecca Petras
> Deputy Director
> Translators without Borders
One of the main problems of under-resourced Wikipedias is the lack of
content in their languages. We want to tackle this problem by supporting
editors in creating more content in their languages easily.
We are working on a tool to support editors in creating new articles. The
editing tool (Scribe) will display a structure of the new article and
references (with their most important points) for each section, supported
by the information on Wikidata. The project is based on recent research in
document planning, reference discovery and collection, and document
summarization. One of the emphasizes of our project is to keep the
community involved in every step of the development to ensure that we are
serving their needs.
We are currently applying for a Wikimedia project grant and would like to
hear your feedback:
Looking forward to hearing from you!
Web and Internet Science Group
School of Electronics and Computer Science
University of Southampton
as FYI today I rebooted stat1007 due to unexpected maintenance (an error
from my side) while investigating a Spark2 issue (that is now fixed).
Apologies if this has impacted your work!
We’re preparing for the November 2018 research newsletter and looking for contributors. Please take a look at https://etherpad.wikimedia.org/p/WRN201811 and add your name next to any paper you are interested in covering. Our target publication date is on December 2 UTC although actual publication might happen several days later. As usual, short notes and one-paragraph reviews are most welcome.
Highlights from this month:
• Bot Detection in Wikidata Using Behavioral and Other Informal Cues
• Integrating Wikipedia Articles and Images into an Information Resource for Radiology Patients
• It Was Fun, but Did It Last?: The Dynamic Interplay Between Fun Motives and Contributors' Activity in Peer Production
• Operationalizing Conflict and Cooperation between Automated Software Agents in Wikipedia: A Replication and Expansion of 'Even Good Bots Fight'
• Reciprocity and Donation: How Article Topic, Quality and Dwell Time Predict Banner Donation on Wikipedia
• Toward an abstract Wikipedia
• With Few Eyes, All Hoaxes Are Deep
Masssly, Tilman Bayer and Dario Taraborelli
as part of https://phabricator.wikimedia.org/T205846 we are going to ask to
all the stat1005's users to move to stat1007 during the next two weeks. The
deadline is November 14th, by which time ssh access to stat1005 will be
Background: on stat1005 we have a GPU (more details in
https://phabricator.wikimedia.org/T148843) that has been sitting there for
almost two years, and it would be great to try to make it work during the
next months. This effort will require a lot of tests/reboots/etc.. that can
of course impact ongoing work of all of you, so we prefer to move everybody
to another identical machine beforehand.
Please reach out to me or to the analytics team in T205846 or IRC
(#wikimedia-analytics on Freenode) if you have any
questions/doubts/blocker/etc.., we are not going to enforce the deadline if
anybody will raise concerns or blockers of course. It would be great to
move everybody by Nov 14th but we surely don't want to disrupt any ongoing
I am going to update the Wikitech documentation about stat1005 and stat1007
as soon as possible, for the moment keep in mind that stat1007 will take
over completely everything that stat1005 currently does.
I have already copied over all the stat1005 directories to stat1007, and
I'll periodically sync them during the following days. If you don't find
anything important, please add a note in T205846.
Thanks a lot and sorry for the trouble,
Luca (on behalf of the Analytics team)
[Apologies for cross-postings]
Call for Papers
formal papers - informal papers - doctoral programme - workshops - tutorials
12th Conference on Intelligent Computer Mathematics
- CICM 2019 -
July 8-12, 2019
CIIRC, Prague, Czech Republic
Digital and computational solutions are becoming the prevalent means
for the generation, communication, processing, storage and curation of
CICM brings together the many separate communities that have developed
theoretical and practical solutions for mathematical applications such as
computation, deduction, knowledge management, and user interfaces.
It offers a venue for discussing problems and solutions in each of these
areas and their integration.
CICM 2018 invites submissions in all topics relating to intelligent computer
mathematics, in particular but not limited to
* theorem proving and computer algebra
* mathematical knowledge management
* digital mathematical libraries
CICM appreciates the varying nature of the relevant research in this area and
invites submissions of very different forms:
1) Formal submissions will be reviewed rigorously and accepted papers will be
published in a volume of Springer LNAI (pending approval):
* regular papers (up to 15 pages) present novel research results
* project and survey papers (up to 15 pages + bibliography) summarize
* system and dataset descriptions (up to 5 pages) present digital artifacts
* system entry (1 page according to the given LaTeX template) provides metadata
and a quick overview of a new tool or a new release of an existent tool
2) Informal submissions will be reviewed with a positive bias and selected for
presentation based on their relevance for the community.
* informal papers may present work-in-progress, project announcements,
position statements, etc.
* posters and system demos will be presented in parallel in special sessions
3) The doctoral programme provides PhD students a forum to present early results
receive constructive feedback and mentoring.
4) Workshops allow smaller groups to self-organize focused discussions.
5) Tutorials allow presenting a particular system in depth.
* Important Dates *
- Abstract deadline: March 01
- Full paper deadline: March 08
- Reviews sent to authors: April 06
- Rebuttals due: April 10
- Notification of acceptance: April 15
- Camera-ready copies due: May 01
- Conference: July 08-12
Informal submissions and doctoral programme
Two separate submission rounds are offered so that some authors can make early
travel plans while other authors submit spontaneously.
- First round submission deadline: April 01
- Second round submission deadline: May 15
Workshop and Tutorial proposals
- Submission deadline: February 01
All submissions should be made via easychair at https://easychair.org/conferences/?conf=cicm2019
Dr. Ravindra Pogaku
Research Professor of Chemical Engineering
Center of Catalysis for Renewable Fuels
The University of South Carolina
541 Main Street Office, H013
Columbia, SC 29201
Phone: 803-576-6069, Cell: 803-849-9623
the Analytics team will shutdown completely the Hadoop cluster for a couple
of hours on Monday Nov 12th at 14:00 CEST to upgrade the Cloudera
distribution to 5.15 (currently 5.10). No big updates but only a collection
of small/medium fixes that (hopefully) will improve the reliability of our
cluster. For more info, please check
This means that tools like HDFS, Hive, Oozie, etc.. will not be available
during the maintenance window, so if this impacts your work please reach
out to us so we can chat about it and possibly re-schedule if needed (in
the task or #wikimedia-analytics on Freenode IRC).
Thanks a lot for the patience, we are trying to do our best to keep all our
systems as up to date as possible :)
Luca (on behalf of the Analytics team)
Hello fellow researchers!
We are conducting a research about "mortality in wikis" and we are looking
for a good definition to determine when a wiki is considered "death",
"inactive" or "abandoned".
So far, I've only found this definition from Haiyi Zhu, Robert E. Kraut and
Aniket Kittur in their paper: "The impact of membership overlap on the
survival of online communities"
We define a community to be dormant (the inverse of active) in a given
month if the community did not have any activity (including discussion
pages and community pages) in the given month and the preceding two months.
Any other references you could point me out? any better ideas?
Thank you in advance!