This is K. Kaushik Reddy from India. I'm keen into Data science & good with
python. I am looking to participate in GSoC'19 for your organisation. I
also have gone through the ideas page for GSoC'19 , but looking for few
data science projects with python as the code base this year.
In future, I'm planning to take Research & Development as my career. I
always wanted to know how you people plan and execute a project in action.
Secondly, Could you please guide me with relevant bugs that need to be
fixed, so as to get involved in GSoC'19 along with you guys.
Hoping to learning more and have fun with you people.
K. Kaushik Reddy.
IC2S2 2019 – 5th INTERNATIONAL CONFERENCE ON COMPUTATIONAL SOCIAL SCIENCE
July 17-20, 2019
University of Amsterdam, The Netherlands
Twitter:@IC2S2 <https://twitter.com/IC2S2>/ #IC2S2
Deadline for abstracts: 5 February 2019
This summer the fifth edition of IC2S2 is hosted by the University of
Amsterdam, The Netherlands. IC2S2 brings together researchers in
computational science, complexity, and social science, and provides a
platform for new work in the field of computational social science.
Following a day of tutorials on July 17, contributed abstracts are
presented orally in parallel thematic sessions or as posters at the
three day conference, which takes place at the University of Amsterdam
in the Netherlands from July 18 to 20.
Kenneth Benoit (London School of Economics and Political Science)
Jana Diesner (University of Illinois at Urbana Champaign)
Deen Freelon (University of North Carolina)
Mirta Galesic (Santa Fe Institute)
Cesar Hidalgo (MIT Media Lab)
Devra Moehler (Facebook Research)
Scott Page (University of Michigan)
Leto Peel (Université Catholique de Louvain)
Emma Spiro (University of Washington)
Lukas Vermeer (Booking.com)
Claudia Wagner (GESIS Leibniz Institute for the Social Sciences)
SPECIAL SESSION “5 YEARS OF IC2S2"
After an opening talk by Duncan Watts (Microsoft Research), several
renowned computational social scientists from the field will engage in a
panel discussion on the future of the field of computational social
science. Among confirmed panelists are:
Ulrik Brandes (ETH Zürich)
Sandra González-Bailón (University of Pennsylvania)
Helen Margetts (University of Oxford, The Alan Turing Institute)Markus
Strohmaier (RWTH Aachen University)
Duncan Watts (Microsoft Research)
CALL FOR PAPERS (ABSTRACTS)
IC2S2 solicits abstracts from researchers in the social sciences with a
clear component of computation, simulation or data analysis or data
science. This includes for example sociology, psychology, communication
science, anthropology, media studies, political science, public health,
and economics. In addition, contributions from computer science, data
science, and computational science with real-world applications in the
social sciences or related fields, are welcome. We emphatically welcome
abstracts that try to integrate both components. This is not limited to
empirical studies; more general theoretical contributions are also welcome.
Topics of interest include, but are not limited to, the following:
Network analysis of social systems
Large-scale social experiments
Agent-based or other simulations of social phenomena
Text analysis and natural language processing (NLP) of social phenomena
Cultural patterns and dynamics
Computational science studies (sociology of science)
Social news curation and collaborative filtering
Social media studies
Theoretical discussions in computational social science
Causal inference and computational methods for social science
Ethics in computational social sciences
Reproducibility in computational social science
Large scale infrastructure in computational social science
Novel digital data sources
Computational analyses for addressing societal challenges
Methods and analyses of observational social data
Computational social science research in industry
Contributions to the conference should be submitted via EasyChair
Please follow the extended abstract template guidelines for Word
and LaTeX (ic2s2-latex-template.zip
for formatting instructions. Note that abstracts should be submitted as
a PDFfile no larger than 20MB. Submissions that exceed the 2-page limit
(including figures and references) will be automatically rejected.
The extended abstract should include a title and a list of 5 keywords,
but no authors’ names or affiliations. The abstract should outline the
main contribution, data and methods used, results, and the impact of the
work. Authors are encouraged to include one figure in their submission
(the figure counts towards the page limit).
Please do not include authors’ names and affiliations in the submitted
document, as peer review will be double blind. Each extended abstract
will be reviewed by multiple members of the Program Committee, composed
of experts in computational social science.
When submitting on EasyChair you will be asked to provide information
about the authors and their affiliations and to include a one-sentence
summaryof the extended abstract (20-50 words). The summary will be used
for assigning reviewers. You can indicate a preference for an oral
presentation or a poster presentation, but your preference may not be
honored in the final decision.
Submissions will be non-archival, and the presented work can be already
published, in preparation for publication elsewhere, or ongoing
research. Submission implies willingness to present a talk or poster at
The deadline for abstract submission is February 5, 2019.
February 5, 2019 – Regular abstract submission deadline
March 6, 2019 – Acceptance notification
April 30, 2019 – Early bird registration deadline
June 30, 2019 – Registration deadline
July 17-20, 2019 – Conference
Do you want to affiliate your organisation to IC2S2? Bespoke sponsorship
packages are available on request. Contact the local chairs Eelke
Heemskerk and Frank Takes at ic2s2(a)uva.nl
IC2S2 2019 is hosted byCORPNET <https://corpnet.uva.nl/>, University of
Amsterdam and so far it is sponsored by Microsoft Research, Statistics
Netherlands, ODISSEI, University of Amsterdam, Leiden University,
Municipality of Amsterdam, Amsterdam Centre for Inequality Studies and
Please watch the video “IC2S2 comes to Amsterdam
Hello fellow wiki investigators!
I have observed that, very often in wikis, users not in the bot groups are
actually behaving like bots. Since the mediawiki api doesn't restrict
normal users to automatize tasks through its API, you might have a "normal"
user, actually doing bot things. I would like to identify those and
consider them as bots.
Is anyone aware if there's any implemented model already to classify
whether an user is a bot or not?
Thanks and nice weekend!
"Anonymous calling: The WikiScanner scandals and anonymity on the
Japanese Wikipedia" from yesterday's call for reviews was very good; I
like this quote:
> “If there is a user ID attached to a user, discussion tends to become a criticizing game. On the other hand, under an anonymous system, even though your opinion/information is criticized, you don’t know with whom to be upset. Also, with a user ID, those who participate in the site for a long time end to have authority, and it becomes difficult for a user to disagree with them. Under a perfectly anonymous system, you can say ‘it’s boring’, if it is actually boring. All information is treated equally; only an accurate argument will work.” [Andrew Lih. The Wikipedia revolution: How a bunch of nobodies created the world’s greatest encyclopedia. New York: Hyperion, 2009, p. 145.]
Most of the research on the ways anonymity helps productivity went on
during the design of double-blind experiments, and I hope more of that
is done in the context of Wikipedia.
I am too busy this month to do any reviews, and will be disappointed
if someone else doesn't:
As some of you know, we started a line of research back in 2016 to
understand Wikipedia readers better. We published the first taxonomy
of Wikipedia readers and we studied and characterized the reader types
in English Wikipedia . During the past 1+ year, we focused on
learning about the potential differences of Wikipedia readers across
languages based on the taxonomy built in . We've learned a lot, and
today we're sharing what we learnt with you.
* Publication: https://arxiv.org/abs/1812.00474
* Data: https://figshare.com/articles/Why_the_World_Reads_Wikipedia/7579937/1
* (under continuous improvement) Research page on meta:
* Research showcase presentation:
* A series of presentations to WMF teams and community: Look for tasks
under https://phabricator.wikimedia.org/T201699 with title "Present
the results of WtWRW" for link to slides and more info when available.
* We will send out a blog post about it hopefully soon. A blog post
about the intermediate results is at
In a nutshell:
* We ran the taxonomy of Wikipedia readers in 14 languages and
measured the prevalence of Wikipedia use-cases and characterized
Wikipedia readers in these languages.
* While we observe similarities in terms of the prevalence of the use
cases as well as the way we can characterize readers, we can see that
Wikipedia languages lend themselves to different distributions of
readership and characteristics. In many cases, the one-size-fit-all
solutions may simply not work for readers.
* Intrinsic learning remains as the number one motivation for people
to come to Wikipedia in the majority of the languages, followed by
* In-depth reading and the reading of scientific oriented topics is
highly and negatively correlated with the socio-economic status and
Human Development Index of countries the readers in these languages
are coming from. Long articles that may seem just too long for the
bulk of our audience in US, Japan, and the Netherlands is in high
demand in India, Bolivia, Argentina, Panamá, México, …
This research was not possible without the extensive contributions by
our formal collaborators: Florian Lemmerich (RWTH Aachen University)
and Bob West (EPFL). On the WMF end, I was fortunate to work with
Diego Saez on this project as well as more recently, Isaac Johnson.
And all those in the Reading Web and Legal team who supported us
throughout the process. I also want to underline the amazing work that
the volunteers in the languages in the study did to support us heavily
to learn more about their languages, not only through help with
communications within their communities but also with the translation
task which was not an easy one as they were asked to offer their time
not only to translate but also do in-person meetings with us for us to
make sure the intent of the question is translated the same way across
the languages. Usernames Strainu, Tgr, Amire80, Awossink, Antanana,
Lyzzy, Shangkuanlc, Whym, Kaganer, عباد_ديرانية, Satdeep_Gill, Racso,
Hasive: Thank you!
Next we are going to extend this study to include demographics
information. More information about it coming out in the next few
weeks. (And I will send out a separate email to wikimedia-l about this
topic and future research over the weekend. I need some time to
finalize the message to make the message most useful for that
Senior Research Scientist, Lead
We’re preparing for the January 2019 research newsletter and looking for contributors. Please take a look at https://etherpad.wikimedia.org/p/WRN201901 and add your name next to any paper you are interested in covering. Our target publication date is on January 31 UTC although actual publication might happen several days later. As usual, short notes and one-paragraph reviews are most welcome.
Highlights from this month:
• "Anonymous calling": The WikiScanner scandals and anonymity on the Japanese Wikipedia
• A Historical Perspective on Information Systems: A Tool and Methodology for Studying the Evolution of Social Representations on Wikipedia
• An inferior source? Quantitatively analyzing the production and revision of five technology-enhanced learning-related terms on Wikipedia
• Asthma Information Seeking via Wikipedia Between 2015 and 2018: Implications for Awareness Promotion
• Can deep learning techniques improve classification performance of vandalism detection in Wikipedia?
• Feature Analysis for Assessing the Quality of Wikipedia Articles through Supervised Classification
• Following the Fukushima Disaster on (and against) Wikipedia: A Methodological Note about STS Research and Online Platforms
• Hacking History: Redressing Gender Inequities on Wikipedia Through an Editathon
• Improving New Editor Retention on Wikipedia
• Let’s Talk About Refugees: Network Effects Drive Contributor Attention to Wikipedia Articles About Migration-Related Topics
• Pharmacy students can improve access to quality medicines information by editing Wikipedia articles
• The digital knowledge economy index: mapping content production
• Towards Compiling Textbooks from Wikipedia
• Trolls, bans and reverts: simulating Wikipedia
• Why the World Reads Wikipedia: Beyond English Speakers
• Wizard of Wikipedia: Knowledge-Powered Conversational agents
Masssly, Tilman Bayer and Dario Taraborelli
We’re thrilled to announce the release of WikiConv—a multilingual corpus
reconstructing the complete conversational history of multiple Wikipedia
The corpus—a collaboration between Jigsaw, Cornell and Wikimedia
foundation—includes over 100M individual conversation threads and 300M
conversational actions extracted from the English, Chinese, German, Greek,
and Russian Wikipedia talk pages.
WikiConv can be used to understand and model conversational turns in online
collaborative spaces, as we showed in an earlier study, predicting when
conversations go awry <https://arxiv.org/abs/1805.05345>.
The reconstruction methodology, as well as its possible applications, are
described in a paper by Hua et al. recently presented at EMNLP 2018
<https://arxiv.org/abs/1810.13181>. You can also watch a video presentation
of this work from the Wikimedia Research showcase
The corpus is released under CC0 (CC BY SA for individual comments). All
the underlying code is available in this Github repository
If you have any questions about the dataset, feel free to contact us at