Wikitech-l April 2017

wikitech-l@lists.wikimedia.org

81 participants
82 discussions

Discovery Weekly Update for the week starting 2017-04-03
by Chris Koerner 07 Apr '17

07 Apr '17

Hello, Here is this week's update from the Discovery department. As always, feedback and questions are welcome. == Highlights == * Jan presented the Wiktionary widget during the CREDIT showcase [0] * Translations were asked for, in order to post a message on the top language Village Pumps about the upcoming production release of sister projects snippets being shown on the search results pages [1] * Posted about the upcoming sister projects snippets in search results on various Village Pumps and to a few email lists [2] [3] * The Interactive team has asked for feedback on a new map style [4] [5] == Discussions == === Analysis === * Fixed an issue with the retrieval scripts not using correct data on the portal dashboard [6] * Removed regex in ZRR breakdown by type on the Search dashboard [7] [8] === Portal === * We got some help fixing a deployment bug on the Portals - yay! [9] == Did you know? == * Amberjack served sashimi style is pretty good! [10] [0] https://www.youtube.com/watch?v=Jn_3CT6GR9o [1] https://www.mediawiki.org/wiki/Cross-wiki_Search_Result_Improvements/villag… [2] https://phabricator.wikimedia.org/T162064#3161941 [3] https://phabricator.wikimedia.org/T162064#3161951 [4] https://lists.wikimedia.org/pipermail/maps-l/2017-April/001565.html [5] https://phabricator.wikimedia.org/T153282 [6] https://phabricator.wikimedia.org/T162178 [7] https://discovery.wmflabs.org/metrics/#failure_breakdown [8] https://phabricator.wikimedia.org/T161876 [9] https://phabricator.wikimedia.org/T161832 [10] https://commons.wikimedia.org/wiki/File:Amberjack_fish_served_sashimi_style… --- The archive of all past updates can be found on MediaWiki.org: https://www.mediawiki.org/wiki/Discovery/Status_updates Interested in getting involved? See tasks marked as "Easy" or "Volunteer needed" in Phabricator. [1] https://phabricator.wikimedia.org/maniphest/query/qW51XhCCd8.7/#R [2] https://phabricator.wikimedia.org/maniphest/query/5KEPuEJh9TPS/#R Yours, Chris Koerner Community Liaison - Discovery Wikimedia Foundation

1 0

Re: [Wikitech-l] [Potential Spoof] Question about wikidata dump bz2 file
by Trung Dinh 07 Apr '17

07 Apr '17

Sorry, I hit enter early by accident. I realized the dump file for wikidata is no longer in the format wikidatawiki-2017XXXX-pages-articles.xml.bz2 anymore. Now, it is split in to different dumps: https://dumps.wikimedia.org/wikidatawiki/latest/wikidatawiki-latest-md5sums… I am wondering when did this happen and the rationale behind it. Will it be permanent or we will switch back to the original format soon ? Thank you, Best regards, Trung On 4/5/17, 9:57 PM, "Wikitech-l on behalf of Trung Dinh" <wikitech-l-bounces(a)lists.wikimedia.org on behalf of trd(a)fb.com> wrote: Hi everyone, I realized the dump file for wikidata is no longer in the format wikidatawiki-2017XXXX-pages-articles.xml.bz2 anymore. _______________________________________________ Wikitech-l mailing list Wikitech-l(a)lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikitech-l

3 3

Migrated Reportcard with Updated Data
by Nuria Ruiz 07 Apr '17

07 Apr '17

Hello! The Analytics team would like to announce that we have migrated the reportcard to a new domain: https://analytics.wikimedia.org/dashboards/reportcard/# pageviews-july-2015-now The migrated reportcard includes both legacy and current pageview data, daily unique devices and new editors data. Pageview and devices data is updated daily but editor data is still updated ad-hoc. The team is working at this time on revamping the way we compute edit data and we hope to be able to provide monthly updates for the main edit metrics this quarter. Some of those will be visible in the reportcard but the new wikistats will have more detailed reports. You can follow the new wikistats project here: https://phabricator. wikimedia.org/T130256 Thanks, Nuria

1 0

Parsoid: node 0.1x deprecated now; node 0.1x support will end March 31st, 2017
by Subramanya Sastry 07 Apr '17

07 Apr '17

The Parsing team at the Wikimedia Foundation that develops the Parsoid service is deprecating support for node 0.1x. Parsoid is the service that powers VisualEditor, Content Translation, and Flow. If you don't run a MediaWiki install that uses VisualEditor, then this announcement does not affect you. Node 0.10 has reached end of life on October 31st, 2016 [1] and node 0.12 is scheduled to reach end of life December 31st, 2016 [1]. Yesterday, we released a 0.6.1 debian package [2] and a 0.6.1 npm version of Parsoid [3]. This will be the last release that will have node 0.1x support. We'll continue to provide any necessary critical bug fixes and security fixes for the 0.6.1 release till March 31st 2017 and will be completely dropping support for all node versions before node v4.x starting April 2017. If you are running a Parsoid service on your wiki and are still using node 0.1x, please upgrade your node version by April 2017. The Wikimedia cluster runs node v4.6 right now and will soon be upgraded to node v6.x [4]. Parsoid has been tested with node 0.1x, node v4.x and node v6.x and works with all these versions. However, we are dropping support for node 0.1x right away from the master branch of Parsoid. Going forward, the Parsoid codebase will adopt ES6 features available in node v4.x and higher which aren't supported in node 0.1x and will constitute a breaking change. Subramanya Sastry (Subbu), Technical Lead and Manager, Parsing Team, Wikimedia Foundation. [1] Node.js Long Term Support schedule @ https://github.com/nodejs/LTS [2] https://www.mediawiki.org/wiki/Parsoid/Releases [3] https://www.npmjs.com/package/parsoid [4] https://phabricator.wikimedia.org/T149331

1 1

Wikimedia REST API hits v1.0
by Gabriel Wicke 07 Apr '17

07 Apr '17

It is official: The Wikimedia REST API <https://www.mediawiki.org/wiki/REST_API>, your scalable and fresh source of Wikimedia content and data in machine-readable formats, is now ready for full production use. The 1.0 release means that you can now fully rely on the stability guarantees set out in the API versioning policy <https://www.mediawiki.org/wiki/API_versioning>. Read more about the stability levels, use cases, as well as technical background on how the REST API integrates with our caching layers in our blog post: https://blog.wikimedia.org/2017/04/06/wikimedia-rest-api/ We are looking forward to your feedback at https://www.mediawiki.org/wiki/Talk:REST_API, or here on-list. This release was made possible by the hard work of many. First of all, the Services team <https://www.mediawiki.org/wiki/Wikimedia_Services> (Marko Obrovac, Petr Pchelko and Eric Evans), created the general API proxy and storage functionality, and curated the API documentation <https://en.wikipedia.org/api/rest_v1/>. The actual end points are co-designed with, and largely backed, by services developed by the following WMF teams: Editing (Parsing <https://www.mediawiki.org/wiki/Parsing> and citoid <https://www.mediawiki.org/wiki/Citoid>), Reading <https://www.mediawiki.org/wiki/Reading> (Infrastructure <https://www.mediawiki.org/wiki/Wikimedia_Reading_Infrastructure_team> and Web <https://www.mediawiki.org/wiki/Reading/Web>), and Analytics <https://www.mediawiki.org/wiki/Analytics>. Volunteer Moritz Schubotz and the MathJax <https://www.mathjax.org/> community contributed the math end points, and the PDF end point is powered by the open source electron-render-service <https://github.com/msokk/electron-render-service> project. Finally, the WMF techops team <https://www.mediawiki.org/wiki/Wikimedia_Technical_Operations> runs the excellent caching infrastructure that makes this API scale so well, and have helped with many aspects from hardware procurement to firewalling. Thank you all for your hard work! We are looking forward to continuing to work with you all on making this API an even better platform for building user experiences, services, and tools. Cheers, Gabriel and the Services team -- Gabriel Wicke Principal Engineer, Wikimedia Foundation

3 2

[MediaWiki-announce] Security Release: 1.28.1 / 1.27.2 / 1.23.16
by Chad Horohoe 06 Apr '17

06 Apr '17

Hello! I would like to announce the release of MediaWiki 1.28.1, 1.27.2 and 1.23.16! These releases fix five security issues in core and one for the extension SyntaxHighlight_GeSHi. Download links are given at the end of this email. Please note that next month is the End-Of-Life date for MediaWiki 1.23. This means that MediaWiki 1.23.16 will be the last security release for that version, barring any unforeseen issues. We would strongly encourage users of MediaWiki 1.23 to upgrade to MediaWiki 1.27, released in June 2016, or a yet newer version as soon as possible. MediaWiki 1.27 will be supported until June 2019. See <https://www.mediawiki.org/wiki/Version_lifecycle> for more information. This release also serves as a maintenance release for these branches. == Security fixes == * (T109140) (T122209) Special:UserLogin and Special:Search allow redirect to interwiki links. (CVE-2017-0363, CVE-2017-0364) * (T144845) XSS in SearchHighlighter::highlightText() when $wgAdvancedSearchHighlighting is true. (CVE-2017-0365) * (T125177) API parameters may now be marked as "sensitive" to keep their values out of the logs. (CVE-2017-0361) * (T150044) "Mark all pages visited" on the watchlist now requires a CSRF token. (CVE-2017-0362) * (T156184) Escape content model/format url parameter in message. (CVE-2017-0368) * (T151735) SVG filter evasion using default attribute values in DTD declaration. (CVE-2017-0366) * (T48143) Spam blacklist ineffective on encoded URLs inside file inclusion syntax's link parameter. (CVE-2017-0370) * (T108138) Sysops can undelete pages, although the page is protected against it. (CVE-2017-0369) The following only affects 1.27 and above and is not included in the 1.23 upgrade: * (T161453) LocalisationCache will no longer use the temporary directory in its fallback chain when trying to work out where to write the cache. (CVE-2017-0367) The following fix is for the SyntaxHighlight extension: * (T158689) Parameters injection in SyntaxHighlight results in multiple vulnerabilities. (CVE-2017-0372) == Links to all mentioned tasks == https://phabricator.wikimedia.org/T109140 https://phabricator.wikimedia.org/T122209 https://phabricator.wikimedia.org/T144845 https://phabricator.wikimedia.org/T125177 https://phabricator.wikimedia.org/T150044 https://phabricator.wikimedia.org/T156184 https://phabricator.wikimedia.org/T151735 https://phabricator.wikimedia.org/T161453 https://phabricator.wikimedia.org/T48143 https://phabricator.wikimedia.org/T108138 https://phabricator.wikimedia.org/T158689 == Release notes == Full release notes for 1.28.1: <https://www.mediawiki.org/wiki/Release_notes/1.28> Full release notes for 1.27.2: <https://www.mediawiki.org/wiki/Release_notes/1.27> Full release notes for 1.23.16: <https://www.mediawiki.org/wiki/Release_notes/1.23> For information about how to upgrade, see <https://www.mediawiki.org/wiki/Manual:Upgrading> ********************************************************************** 1.23.16 ********************************************************************** Download: https://releases.wikimedia.org/mediawiki/1.23/mediawiki-1.23.16.tar.gz Download without bundled extensions: https://releases.wikimedia.org/mediawiki/1.23/mediawiki-core-1.23.16.tar.gz Patch to previous version (1.23.15), without interface text: https://releases.wikimedia.org/mediawiki/1.23/mediawiki-1.23.16.patch.gz Interface text changes: https://releases.wikimedia.org/mediawiki/1.23/mediawiki-i18n-1.23.16.patch.… GPG signatures: https://releases.wikimedia.org/mediawiki/1.23/mediawiki-core-1.23.16.tar.gz… https://releases.wikimedia.org/mediawiki/1.23/mediawiki-1.23.16.tar.gz.sig https://releases.wikimedia.org/mediawiki/1.23/mediawiki-1.23.16.patch.gz.sig https://releases.wikimedia.org/mediawiki/1.23/mediawiki-i18n-1.23.16.patch.… Public keys: https://www.mediawiki.org/keys/keys.html ********************************************************************** 1.27.2 ********************************************************************** Download: https://releases.wikimedia.org/mediawiki/1.27/mediawiki-1.27.2.tar.gz Download without bundled extensions: https://releases.wikimedia.org/mediawiki/1.27/mediawiki-core-1.27.2.tar.gz Patch to previous version (1.27.1), without interface text: https://releases.wikimedia.org/mediawiki/1.27/mediawiki-1.27.2.patch.gz Interface text changes: https://releases.wikimedia.org/mediawiki/1.27/mediawiki-i18n-1.27.2.patch.gz GPG signatures: https://releases.wikimedia.org/mediawiki/1.27/mediawiki-core-1.27.2.tar.gz.… https://releases.wikimedia.org/mediawiki/1.27/mediawiki-1.27.2.tar.gz.sig https://releases.wikimedia.org/mediawiki/1.27/mediawiki-1.27.2.patch.gz.sig https://releases.wikimedia.org/mediawiki/1.27/mediawiki-i18n-1.27.2.patch.g… Public keys: https://www.mediawiki.org/keys/keys.html ********************************************************************** 1.28.1 ********************************************************************** Download: https://releases.wikimedia.org/mediawiki/1.28/mediawiki-1.28.1.tar.gz Download without bundled extensions: https://releases.wikimedia.org/mediawiki/1.28/mediawiki-core-1.28.1.tar.gz Patch to previous version (1.28.0), without interface text: https://releases.wikimedia.org/mediawiki/1.28/mediawiki-1.28.1.patch.gz Interface text changes: https://releases.wikimedia.org/mediawiki/1.28/mediawiki-i18n-1.28.1.patch.gz GPG signatures: https://releases.wikimedia.org/mediawiki/1.28/mediawiki-core-1.28.1.tar.gz.… https://releases.wikimedia.org/mediawiki/1.28/mediawiki-1.28.1.tar.gz.sig https://releases.wikimedia.org/mediawiki/1.28/mediawiki-1.28.1.patch.gz.sig https://releases.wikimedia.org/mediawiki/1.28/mediawiki-i18n-1.28.1.patch.g… Public keys: https://www.mediawiki.org/keys/keys.html _______________________________________________ MediaWiki announcements mailing list To unsubscribe, go to: https://lists.wikimedia.org/mailman/listinfo/mediawiki-announce

3 2

Update on Discovery search efforts and upcoming releases
by Deborah Tankersley 06 Apr '17

06 Apr '17

tl;dr: Search continues to expand functionality by displaying more information on the search results page Ever started searching for something on Wikipedia and wondered—*really*, is that all that there is? Does it feel like you’re somehow playing hide and seek with all the knowledge that’s out there? And...wouldn’t it be great to see articles or categories that are similar to your search query and maybe some related images or links to other languages in which to read that article? Or, maybe you just want to read and contribute to projects other than Wikipedia but need a jump start with a few short summaries from sister projects. The Discovery Search team has been testing out some really cool new features that will enable some fun and fascinating clicking—down the rabbit hole of Wikipedia.[1] But first, let’s recap what we’ve been doing recently. We've been doing tons of work creating, updating, and finessing the search back end to enhance search queries. There have been many complex things that have happened, things like: adding ascii-folding and stemming, detecting when a visitor might be typing in a language that is different than the Wikipedia that they are on, switching from tf-idf to BM25, dropping trailing question marks, and updating to ElasticSearch version 5. [2][3][4][5][6][7] Whew! We have much more planned in the coming months—machine learning with ‘learning to rank’, investigating and deploying new language analyzers, and, after exhaustive analysis, removing quotes within queries by default.[8][9][10][11] We’ll also be working closely with the new Structured Data team in their brand new work on Commons.[12][13] We also want to improve the part that our readers and editors interface with: the search results page! We started brainstorming during the late summer of 2016 on what we could do to make search results better—to easily find interesting, relevant content and to create a more intuitive viewing experience.[14] We designed and refined numerous ideas on how to improve the search results page and received lots of good feedback from the community.[15] Empowered by the feedback, we began testing starting with a display of results from the Wikimedia sister projects next to the regular search results.[16] The idea for this test was to enable discovery into other projects—projects that our visitors might not have known about—by displaying interesting results in small snippets. The sidebar display of the sister projects borrows from a similar feature in use on the Italian, Catalan and French Wikipedias. We've run two A/B tests on the sister project search results with detailed analysis and, after a bit of final touches to the code, we will release the new functionality into production on all Wikipedias near the end of April 2017. Our next A/B test will be to add additional information and related results for each search query. This will be in the form of an ‘explore similar’ link that, when someone interacts with the link, an expanded display will appear with related pages, categories and links to the article in other languages—all of which might lead to further knowledge discovery.[17] We know that not every search query will return exactly what folks were looking for, but we feel that adding links to similar, but related information would be helpful and, possibly, super interesting! We also plan on doing a few more A/B tests in the coming year: * Test a new display that will show the pronunciation of a word with its definition and part of speech—all from existing data in Wiktionary. Initially this will be in English only. * Test placing a small image (from the article) next to each search result that is displayed on the page. * Test an additional future using a new auto completion metadata display in the search box that is located on the top right of most pages in Wikipedia, similar to what happens on the Wikipedia.org portal.[18] For the more technical minded, there is a way to test out these new features in your own browser. To display the sister project search results, it will require a bit of URL manipulation; but for the explore similar and Wiktionary widget, you can modify your common.js file to test an early version of the features. Detailed information is available on MediaWiki.org.[19] Once the testing, analysis and feedback cycle is done for each new feature, we’d like to slowly implement them into production on all Wikipedias throughout the rest of the year. We’re really hoping that these enhancements to how search works will further the usefulness of search and make our readers and editors more productive. Cheers from the Discovery Search team! [1] https://xkcd.com/214/ [2] https://www.mediawiki.org/wiki/User:TJones_(WMF)/Notes/R e-Ordering_Stemming_and_Ascii-Folding_on_English_Wikipedia [3] https://blog.wikimedia.org/2016/07/27/wikipedia-language-search/ [4] https://en.wikipedia.org/wiki/Tf%E2%80%93idf [5] https://en.wikipedia.org/wiki/Okapi_BM25 [6] https://www.mediawiki.org/wiki/User:TJones_(WMF)/Notes/Drop ping_Final_Question_Marks_in_the_Top_10_Wikipedias [7] https://phabricator.wikimedia.org/T154501 [8] https://en.wikipedia.org/wiki/Learning_to_rank [9] https://phabricator.wikimedia.org/T154511 [10] https://commons.wikimedia.org/wiki/File:From_Zero_to_ Hero_-_Anticipating_Zero_Results_From_Query_Features,_Ignoring_Content.pdf [11] https://www.mediawiki.org/wiki/User:TJones_(WMF)/Notes/ Quotes_and_Questions [12] https://commons.wikimedia.org/wiki/Commons:Structured_data [13] https://blog.wikimedia.org/2017/01/09/sloan-foundation-structured-data/ [14] https://www.mediawiki.org/wiki/Cross-wiki_Search_Result_Improvements [15] https://www.mediawiki.org/wiki/Talk:Cross-wiki_Search_ Result_Improvements [16] https://www.mediawiki.org/wiki/Cross-wiki_Search_Result _Improvements/Testing#A.2FB_test:_Add_cross-wiki_search_ results_in_a_right_hand_sidebar [17] https://www.mediawiki.org/wiki/Cross-wiki_Search_Result _Improvements/Testing#A.2FB_test:_Add_.27explore_similar. 27_pages_and_categories_for_search_results [18] https://www.wikipedia.org/ [19] https://www.mediawiki.org/wiki/Cross-wiki_Search_Result _Improvements/self-guided_testing -- deb tankersley irc: debt Product Manager, Discovery Wikimedia Foundation

1 0

Question about wikidata dump bz2 file
by Trung Dinh 06 Apr '17

06 Apr '17

Hi everyone, I realized the dump file for wikidata is no longer in the format wikidatawiki-2017XXXX-pages-articles.xml.bz2 anymore.

1 0

Improvements to Phabricator Search Deployed
by Mukunda Modell 06 Apr '17

06 Apr '17

Hello Wikimedia developers! I've just deployed the latest batch of Phabricator updates. Normally I wouldn't write an announcement for routine upgrades, however, this update is different. This week's update includes notable improvements to Phabricator's global search functionality which I have been working on for the past week. *Bugs Fixed:* Several minor bugs have been resolved, most notably, longstanding bug which prevented viewing results numbered 100+ has been resolved [1]. *Better Search Results:* There have been many small improvements to search query parsing, performance & reliability in the past few weeks. A few of these are launching today but the most visible change is a significantly improved search results page with document body highlighting[2]. This feature shows a snippet of documents with the matching search terms highlighted in bold. Previously, Phabricator only displayed the title of each result with matching terms highlighted only if they appeared within the title. With today's release, the matching terms are highlighted from the body of the document as well and this takes advantage of an Elasticsearch feature[3] to accurately highlight the terms which actually lead to the result being included in the search result. *Welcome to The Future:* Some of you might be thinking that this is just too much. Such unnecessary features are just extravagant and wasteful. To that I say: why should we let advanced technologies like cascading style sheets sit idle, neglected. We can do better than a 1970s search experience. We deserve to have our search terms rendered as stylized hypertext with bold, beautiful letters and contextually accurate emphasis. We deserve modern conveniences and I don't feel the least bit guilty about that. It's the 90s[4], after all. *Upstream Status:* This new functionality has been submitted upstream for inclusion in Phabricator, however, as of today it remains in differential pending code review. The feature is likely to evolve further before finally making it into the upstream. It is a fairly large patch which adds a new "Engine Extension" infrastructure to phabricator. This foundation can be used to add various enhancements to the search results views (e.g. customized views for each object type.) This also lays the foundation for resolving https://secure.phabricator.com/T8646, although that bug doesn't really affect Wikimedia's developers because we have disabled Phabricator's integrated wiki. 1. https://phabricator.wikimedia.org/T92960 2. https://phabricator.wikimedia.org/T162284 3. https://www.elastic.co/guide/en/elasticsearch/reference/current/search-requ… 4. https://vimeo.com/29455771 That's all for now, I hope you enjoy these improvements to Phabricator search experience! Mukunda Modell Release Engineer & Phabricator Admin Wikimedia Foundation, Inc.

1 0

[MediaWiki-announce] Security pre-release announcement: 1.28.1 / 1.27.2 / 1.23.16
by Chad Horohoe 05 Apr '17

05 Apr '17

Hi, Tomorrow, April 6th we will be performing a security release of MediaWiki for all supported branches. The new versions will be 1.28.1, 1.27.2 and 1.23.16. This will resolve 9 issues in MediaWiki core, and one in a bundled extension. Have a great day, -- Chad Horohoe & Sam Reed _______________________________________________ MediaWiki announcements mailing list To unsubscribe, go to: https://lists.wikimedia.org/mailman/listinfo/mediawiki-announce

1 0

← Newer
1
2
3
4
5
6
7
8
9
Older →

2024

2023

2022

2021

2020

2019

2018

2017

2016

2015

2014

2013

2012

2011

2010

2009

2008

2007

2006

2005

2004

2003

2002

Wikitech-l April 2017