Wikitech-l January 2018

wikitech-l@lists.wikimedia.org

64 participants
70 discussions

Syncing data between infoboxes was Fwd: [Wikidata] GlobalFactSync
by Sebastian Hellmann 15 Jan '18

15 Jan '18

Hi all, I am forwarding you this email, because we have a specific technical question. With DBpedia as middleware, we can create a global view on all the data that is in Wikipedias Infoboxes and Wikidata and compare them (for details see the email below and also the proposal). We were wondering what is the latest and most appropriate tech to interface with the editors of infoboxes. VisualEditor seems appropriate, but I checked here for example: https://en.wikipedia.org/wiki/Fulda It seems to be possible to edit some values, but there is no Wikidata support and also population does have a reference which does not show up in the VisualEditor. Do you think it would be a good way to provide comparative facts from other language versions in the VisualEditor? Or would you choose something else? All the best, Sebastian -------- Forwarded Message -------- Subject: [Wikidata] GlobalFactSync Date: Mon, 15 Jan 2018 19:57:04 +0100 From: Magnus Knuth <knuth(a)informatik.uni-leipzig.de> Reply-To: Discussion list for the Wikidata project. <wikidata(a)lists.wikimedia.org> To: wikidata(a)lists.wikimedia.org Dear all, last year, we applied for a Wikimedia grant to feed qualified data from Wikipedia infoboxes (i.e. missing statements with references) via the DBpedia software into Wikidata. The evaluation was already quite good, but some parts were still missing and we would like to ask for your help and feedback for the next round. The new application is here: https://meta.wikimedia.org/wiki/Grants:Project/DBpedia/GlobalFactSync The main purpose of the grant is: - Wikipedia infoboxes are quite rich, are manually curated and have references. DBpedia is already extracting that data quite well (i.e. there is no other software that does it better). However, extracting references is not a priority on our agenda. They would be very useful to Wikidata, but there are no user requests for this from DBpedia users. - DBpedia also has all the infos of all infoboxes of all Wikipedia editions (>10k pages), so we also know quite well, where Wikidata is used already and where information is available in Wikidata or one language version and missing in another. - side-goal: bring the Wikidata, Wikipedia and DBpedia communities closer together Here is a diff between the old an new proposal: - extraction of infobox references will still be a goal of the reworked proposal - we have been working on the fusion and data comparison engine (the part of the budget that came from us) for a while now and there are first results: 6823 birthDate_gain_wiki.nt 3549 deathDate_gain_wiki.nt 362541 populationTotal_gain_wiki.nt 372913 total We only took three properties for now and showed the gain where no Wikidata statement was available. birthDate/deathDate is already quite good. Details here: https://drive.google.com/file/d/1j5GojhzFJxLYTXerLJYz3Ih-K6UtpnG_/view?usp=… Our plan here is to map all Wikidata properties to the DBpedia Ontology and then have the info to compare coverage of Wikidata with all infoboxes across languages. - we will remove the text extraction part from the old proposal (which is here for you reference: https://meta.wikimedia.org/wiki/Grants:Project/DBpedia/CrossWikiFact). This will still be a focus during our work in 2018, together with Diffbot and the new DBpedia NLP department, but we think that it distracted from the core of the proposal. Results from the Wikipedia article text extraction can be added later once they are available and discussed separately. - We proposed to make an extra website that helps to synchronize all Wikipedias and Wikidata with DBpedia as its backend. While the external website is not an ideal solution, we are lacking alternatives. The Primary Sources Tool is mainly for importing data into Wikidata, not so much synchronization. The MediaWiki instances of the Wikipedias do not seem to have any good interfaces to provide suggestions and pinpoint missing info. Especially to this part, we would like to ask for your help and suggestions, either per mail to the list or on the talk page: https://meta.wikimedia.org/wiki/Grants_talk:Project/DBpedia/GlobalFactSync We are looking forward to a fruitful collaboration with you and we thank you for your feedback! All the best Magnus -- Magnus Knuth Universität Leipzig Institut für Informatik Abt. Betriebliche Informationssysteme, AKSW/KILT Augustusplatz 10 04109 Leipzig DE mail: knuth(a)informatik.uni-leipzig.de tel: +49 177 3277537 webID: http://magnus.13mm.de/ _______________________________________________ Wikidata mailing list Wikidata(a)lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikidata

1 0

Re: [Wikitech-l] Create aliases for sister projects wikilinks
by יגאל חיטרון 15 Jan '18

15 Jan '18

Hello, and thank you for your answer. Yes, he likes local aliases indeed. They can't be done for all wikis, because it's on wiki own language. About templates - we can create them, of course, but it's about wikilinks. Isn't there any where to create them, even in phabricator, as namespaces aliases were? Igal On Jan 15, 2018 09:32, "MZMcBride" <z(a)mzmcbride.com> wrote: יגאל חיטרון wrote: >We just added a lot of aliases for namespaces, for example WP: for >Wikipedia: and U: for User:. Is there a way to do the same thing for the >sister projects? For example, adding a local name for n: or wict:. Are you familiar with <https://meta.wikimedia.org/wiki/Interwiki_map>? It sounds similar to what you want, except interwiki prefixes defined on that page apply to all public Wikimedia wikis. Do you want local-only prefixes? Would templates (i.e., {{wict|hello}} instead of [[wict:hello]]) work? MZMcBride _______________________________________________ Wikitech-l mailing list Wikitech-l(a)lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikitech-l

2 2

Create aliases for sister projects wikilinks
by יגאל חיטרון 15 Jan '18

15 Jan '18

Hello. One of tech masters on our wiki asked me to check this. He is thousands times clever than me, but has thouthands times less time, so I do this for him. We just added a lot of aliases for namespaces, for example WP: for Wikipedia: and U: for User:. Is there a way to do the same thing for the sister projects? For example, adding a local name for n: or wict:. Thank you. Igal (User:IKhitron)

2 1

TechCom Radar, 2018-01-10
by Daniel Kinzler 13 Jan '18

13 Jan '18

Hello all! Here are the minutes from this week's meeting: * Services team experimenting with Minikube for providing a Kubernetes based development environment. * RFC approved after last call: Migrate WMF production to PHP 7 <https://phabricator.wikimedia.org/T176370> * For dropping support for PHP 5.6 with MW 1.31, a few bits of code still need to be fixed that are not compatible with PHP 7: <https://phabricator.wikimedia.org/T172165#3758081> * Last Call: TechCom proposes to decline “MediaWiki Action API is a unique API specification” because it does not offer an actionable solution to discuss and does not address the issue of migrating existing client code. <https://phabricator.wikimedia.org/T180096>. * IRC discussion next week: MediaWiki platform architecture topics for the dev summit. Please let use know what you think the most important topics for the MediaWiki platform are, with a one to five year horizon. Discussion before the meeting is encouraged, see <https://phabricator.wikimedia.org/T183313> * Please also provide input on the other sessions planned for the Developer Summit: <https://phabricator.wikimedia.org/project/view/3119/>. Keep in mind that this year, the summit is about strategic direction, not about concrete engineering issues. * TechCom had a triage session. We went over 20 of about 100 RFCs in the “backlog” and “under discussion” columns of the RFC board to check on the status. You can also find our meeting minutes at <https://www.mediawiki.org/wiki/Wikimedia_Technical_Committee/Minutes> See also the TechCom RFC board <https://phabricator.wikimedia.org/tag/mediawiki-rfcs/>. -- Daniel Kinzler Principal Platform Engineer Wikimedia Deutschland Gesellschaft zur Förderung Freien Wissens e.V.

1 0

Wednesday morning SWAT moved 1 hour earlier
by Greg Grossmeier 12 Jan '18

12 Jan '18

Hello, A recent incident post-mortem improvement was to create a one hour buffer between the last Service/SWAT deploys and the MediaWiki Train[0]. We addressed that on Tuesday already by removing the Morning SWAT window (was at 11am Pacific). Now we are addressing that on Wednesday by moving the Morning SWAT window one hour earlier (to 10am Pacific). This is effective as of next week (Jan 15th). As always, please see the Deployments calendar[1] for the canonical source of what is scheduled when. Best, Greg [0] https://phabricator.wikimedia.org/T182733 [1] https://wikitech.wikimedia.org/wiki/Deployments -- | Greg Grossmeier GPG: B2FA 27B1 F7EB D327 6B8E | | Release Team Manager A18D 1138 8E47 FAC8 1C7D |

1 0

1.31.0-wmf.16 MediaWiki train blocker - Rollback
by Greg Grossmeier 12 Jan '18

12 Jan '18

We just had to rollback 1.31.0-wmf.16 from group1 (eg: all non-wikipedias), so -wmf.16 is only on group0 at this time. The issue is a data corruption bug[0]. The relevant people are aware. Reminder that we have a task for each train rollout to track (potential) blockers; the one for this week is [1]. You can follow that task to track future progress. Additional reminder that you can use the "Wikimedia MediaWiki versions"[2] tool on Toolforge to know which wikis have which version at any time. Greg [0] https://phabricator.wikimedia.org/T184749 [1] https://phabricator.wikimedia.org/T180749 [2] https://tools.wmflabs.org/versions/ -- | Greg Grossmeier GPG: B2FA 27B1 F7EB D327 6B8E | | Release Team Manager A18D 1138 8E47 FAC8 1C7D |

1 1

(no subject)
by Sonali Patro 12 Jan '18

12 Jan '18

Hi Everyone, I am an undergraduate student from India, and I plan on taking part in the google summer of code through your organization. I am interested in frontend development, also I like working in C++. Can you please help me out in moving forward. Regards, Sonali Patro Contact: 00917381809123/0097455579707

2 1

TechOps update
by Victoria Coleman 12 Jan '18

12 Jan '18

Hi everyone. We are making some exciting changes in TechOps! The Technical Operations team in the Technology department is possibly the oldest team in the organization. Originating from a group of volunteers (Mark being one of them) that enjoyed building and maintaining this up-and-coming, soon to become global top-10 web site as a hobby, the team has always focused on the challenge of keeping Wikimedia’s sites, services, and infrastructure working as well as possible. They did this at first on what can only be described as a shoestring budget, and still with modest resources today (more on this later). Over time the team has grown to a professional staff of currently 18, with a pretty flat structure. Besides the other two sub teams (Traffic and Data Center Ops) that do have a clearly defined scope, most of the team’s members as well as the majority of TechOps’s responsibilities still reside in the “Core Ops” sub team. To strengthen the team as it continues to grow in responsibilities and membership we’ve decided to make some changes to the team’s structure, its leadership and its public profile. Starting with the latter, we've decided to rename the team from Technical Operations to Site Reliability Engineering (SRE). SRE is a relatively modern term that more accurately describes the type of work the Technical Operations team has been doing for the past few years to some extent, as well as the path where it needs to grow into. Coined by Ben Treynor of Google, it’s now widely used across the industry. SRE describes a discipline where the emphasis is on the software engineering aspects of the work, with a focus on tools development and automation rather than human labor. Our hope is that this name change will more accurately represent the work and will help with recruiting into the team. Second, we will increase the team’s management capacity. As the responsibilities and management/coordination/planning needs of the team kept growing, Faidon has stepped up and increased his involvement significantly. For example he covered for Mark during his paternity leave, and he has played a key leadership role in our efforts in the lawsuit against the NSA. In my time at the Foundation, I have come to rely on Faidon’s judgement, his ability to execute, and most of all on his leadership. So in recognition of Faidon’s important leadership role and responsibilities in the team, he is promoted to Director of Site Reliability Engineering. Well done Faidon! Mark and Faidon both will now be “Director of Site Reliability Engineering”, reporting to me. They will share some of the responsibilities of the team, such as its roadmap, and CapEx and OpEx planning and execution, as they have been doing for some time now. Each will lead one of two new sub-groups, “Service Operations” led by Mark, and “Infrastructure Foundations”, led by Faidon. The team will continue to operate as a single group responsible for the organization’s broader Site Reliability Engineering function, with both Mark and Faidon as leaders of the respective groups. I also want to offer a few words about Mark. Mark exemplifies our values and we wouldn’t be the same without him. pubFrom driving servers around in the trunk of his car at the earliest days of the projects to building and running an exemplary team that has consistently delivered 99.98% uptime for the world’s fifth-most popular website, his work has been nothing short of heroic. He has done this with a team of 18 people, which many in our industry find incomprehensible. Both Katherine and our Board have recognized that delivering this level of performance with our radically efficient team is not sustainable as we continue to grow and make steps towards our strategic directions of knowledge equity and knowledge as a service. Katherine has asked, and the Board has unanimously recommended, that we step up our investment in the team. I am thrilled at their support which will enable our SRE team to have access to additional resources within the current fiscal year. Last but not least, and in an effort to return to his earlier days in the projects (and, in his words, an attempt to gain back some respect from his technical colleagues :-), Mark will dedicate two days a week to individual technical contributions in addition to his managerial work. Mark, thank you for your remarkable contributions! Finally, I wanted to share more detail on our new sub team structure and scope. Data Center Operations The existing Data Center Operations sub team continues as-is but will now be managed by Faidon. The team, consisting of Rob, Chris, and Papaul, is responsible for all of Wikimedia’s data center deployments and logistics as well as maintaining our presence in 8 locations across the world. They perform on-site work and maintain the full 5-year life cycle (specs, purchasing, physical install, break/fix and decommissioning) for all hardware. Infrastructure Foundations This new sub team will focus on building and maintaining our base platform (“metal cloud”) that forms the foundations upon which nearly everything else in our infrastructure builds upon. On top of our bare metal deployments, their responsibilities include (but are not limited to) configuration management systems, infrastructure automation, orchestration tooling, logging, metrics and monitoring as well as infrastructure security. This team consists of Riccardo, Filippo, Keith and Moritz, who will report to Faidon. Traffic The current Traffic sub team remains unchanged in membership, scope, and management. They are responsible for the critical first layer of high-traffic infrastructure which now spans much of the globe, including our TLS termination and caching layers, load balancing, DNS and our own network. The members of this team are Brandon, Emanuele and Arzhel as well as Valentin Gutierrez, our newly hired Traffic Security Engineer who will be starting on February 12th. They report to the team’s technical lead and manager, Brandon, who in turn will continue to report to Mark. Data Persistence The new Data Persistence sub team will focus on Wikimedia’s persistent data storage and retrieval systems, including (No)SQL databases, (distributed) object storage, file storage and backup systems. Today, this team will start with just our two database administrators, Jaime and Manuel, but the expectation is that this team will be built out in the near future with additional hands and expertise. They will report to Mark. Service Operations Finally, the Service Operations sub team will take care of public and “user-visible” services alongside Technology and Audiences teams. This includes, for example, our big MediaWiki platform, but also the newer (micro)services that comprise our stack. It also includes miscellaneous services and components that we rely upon (think Phabricator, mail systems, OTRS, etc…). The team will continue building our new SOA service infrastructure based on Kubernetes. Its membership will consist of Alexandros, Giuseppe, Ariel and Daniel, reporting to Mark. Please welcome our new SRE team! Victoria (with a lot of help from Mark, Faidon and the SRE team)

2 1

Open Web Application Security Project presentation on January 10
by Lani Goto 11 Jan '18

11 Jan '18

Hi everyone - We're excited to have a special guest present on the Open Web Application Security Project <https://www.owasp.org/index.php/About_The_Open_Web_Application_Security_Pro…> . Information security expert Dave Wichers <https://www.owasp.org/index.php/User:Wichers> will will discuss the new OWASP Top 10 - 2017 <https://www.owasp.org/images/7/72/OWASP_Top_10-2017_%28en%29.pdf.pdf>, which encompasses the ten most serious web application security risks from last year. The OWASP Top 10 is a powerful awareness document for web application security. It represents a broad consensus about the most critical security risks to web applications. Project members include a variety of security experts from around the world who have shared their expertise to produce this list. Please join us for this presentation on Wednesday, January 10 at 11:30am PT (19:30 UTC). You can join remotely via the following: Youtube: https://www.youtube.com/watch?v=wf1SfipLLzE Google Hangout: https://hangouts.google.com/hangouts/_/wikimedia. org/hold-owasp-talk Feel free to use the #wikimedia-office channel on IRC to ask any questions. Hope to see you then! -- Lani Goto Project Assistant, Engineering Admin

2 3

Readers Monthly update for December 2017
by Chris Koerner 11 Jan '18

11 Jan '18

Howdy, This is the monthly update from the Readers team at the foundation for December 2017. As always, feedback and questions are welcome. == Discussions == === Apps=== * Android made 2 releases: a major release introduced (amongst other things) a fully customisable Explore feed, a new and improved Randomiser, and by popular request a new Black theme that optimises battery life on AMOLED displays. A subsequent minor release fixed a few bugs/performance issues. [0] === Web === * Performance testing of the new PDF renderer is giving hopeful results [1] * New summary endpoint, created to provide better page previews is ready for deployment to all projects * Initial bugs with download to PDF button on mobile now resolved === Reading Infrastructure === * Deployed the ReadingLists extension and Reading List Service to production. [2] [3] === New Readers === * New Reader updates can be found at Meta. [4] === Multimedia === * Data Analysts Chelsy and Mikhail have completed baseline metrics for Wikimedia Commons to understand and measure the impact the Structured Data project will have. [5] === Discovery === The Discovery team provides weekly updates. Here are their updates for the month. * 2017-12-04 [6] * 2017-12-11 [7] [0] https://play.google.com/store/apps/details?id=org.wikipedia&hl=en_GB [1] https://phabricator.wikimedia.org/T178278 [2] https://www.mediawiki.org/wiki/Extension:ReadingLists [3] https://www.mediawiki.org/wiki/Reading/Reading_List_Service [4] https://meta.wikimedia.org/wiki/New_Readers/Updates [5] https://meta.wikimedia.org/wiki/Research:Baseline_Metrics_for_Structured_Da… [6] https://www.mediawiki.org/wiki/Discovery/Status_updates/2017-12-04 [7] https://www.mediawiki.org/wiki/Discovery/Status_updates/2017-12-11 --- Subscribe to receive theses updates as on-wiki notifications or opt-in email. https://www.mediawiki.org/wiki/Newsletter:Readers_Monthly The archive of all past updates can be found on MediaWiki.org: https://www.mediawiki.org/wiki/Reading/Status_updates Yours, Chris Koerner Community Liaison Wikimedia Foundation

2 2

2024

2023

2022

2021

2020

2019

2018

2017

2016

2015

2014

2013

2012

2011

2010

2009

2008

2007

2006

2005

2004

2003

2002

Wikitech-l January 2018