Greetings, Quite a few updates from across Discovery this week. As always, feedback and suggestions welcome.
== Highlights == * Our wikis now implement the Open Graph standard, so sharing links on social media now include the appropriate imagery. Thanks to Ladsgroup for this work! [0] [1] [2] * For those interested in how we test changes to search, there's now a page on Testing Search.[3] * Requested more text to be translated on the wikipedia.org portal page via translatewiki. [4] * "Hiring a data scientist", a post on Wikimedia Blog that provides an in-depth look into Discovery's hiring process for the Analysis team. [5]
== Discussions ==
=== Search === * Work continues to upgrade to Elasticsearch 5 [6] [7] * A lot of work on TextCat (language identification) has been deployed; configuration to enable it in production should go out next week. [8] [9] * Added document content model into the search index and contentmodel: keyword. [10] [11] * Added more aliases for filetype: keyword [12] * Nearly done with getting things ready for a new A/B test to be launched on a few wikipedias for sister project search results [13] [14] * Fixed a timeout issue with advanced searches [15] [16] (not yet deployed, will be deployed with Elasticsearch 5 upgrade) * Delayed updates from previous weeks: ** Created a list of languages for which we want to investigate analysers [17] ** After analysis, decided to use Stempel as our new Polish language analyser [18]; analysis of Stempel is underway [19] ** Fixed issue with ICU folding that caused problems with the search index [20] [21]
=== Analysis === * Wrapping up migrating a significant amount of data using the ReportUpdater infrastructure - almost done! Updating dashboards now to use the new datasets, including NEW datasets (like LDF endpoint usage for WDQS) [22]
=== Portal === * Added new text to translatewiki for wikipedia.org portal page for app links and legal language in footer [23] [24]
=== Other Noteworthy Stuff === * San Francisco by Maxime Le Forestier [25] [26]
[0] https://en.wikipedia.org/wiki/Open_Graph [1] https://www.mediawiki.org/wiki/User:Ladsgroup [2] https://phabricator.wikimedia.org/T142048 [3] https://www.mediawiki.org/wiki/Wikimedia_Discovery/Search/Testing_Search [4] https://lists.wikimedia.org/pipermail/translators-l/2017-January/003810.html [5] https://blog.wikimedia.org/2017/02/02/hiring-data-scientist/ [6] https://phabricator.wikimedia.org/T155671 [7] https://phabricator.wikimedia.org/T151224 [8] https://www.mediawiki.org/w/index.php?title=User:TJones_(WMF)/Notes/TextCat_... [9] https://phabricator.wikimedia.org/T149324 [10] https://phabricator.wikimedia.org/T156371 [11] https://www.mediawiki.org/wiki/Help:CirrusSearch#contentmodel [12] https://phabricator.wikimedia.org/T156413 [13] https://phabricator.wikimedia.org/T149806 [14] https://phabricator.wikimedia.org/T156299 [15] https://phabricator.wikimedia.org/T152895 [16] https://phabricator.wikimedia.org/T134157 [17] https://phabricator.wikimedia.org/T155549 [18] https://phabricator.wikimedia.org/T154516 [19] https://phabricator.wikimedia.org/T154517 [20] https://www.elastic.co/guide/en/elasticsearch/plugins/current/analysis-icu-f... [21] https://phabricator.wikimedia.org/T156234 [22] https://phabricator.wikimedia.org/T150915 [23] https://phabricator.wikimedia.org/T154350 [24] https://phabricator.wikimedia.org/T153764 [25] https://www.youtube.com/watch?v=tDtXXlD98kw&feature=youtu.be&t=1m30s [26] https://en.wikipedia.org/wiki/Maxime_Le_Forestier
----
The archive of all past updates can be found on MediaWiki.org:
https://www.mediawiki.org/wiki/Discovery/Status_updates
Interested in getting involved? See tasks marked as "Easy" or "Volunteer needed" in Phabricator.
[1] https://phabricator.wikimedia.org/maniphest/query/qW51XhCCd8.7/#R [2] https://phabricator.wikimedia.org/maniphest/query/5KEPuEJh9TPS/#R
Yours, Chris Koerner Community Liaison - Discovery Wikimedia Foundation