Greetings,
Quite a few updates from across Discovery this week. As always, feedback
and suggestions welcome.
== Highlights ==
* Our wikis now implement the Open Graph standard, so sharing links on
social media now include the appropriate imagery. Thanks to Ladsgroup for
this work! [0] [1] [2]
* For those interested in how we test changes to search, there's now a page
on Testing Search.[3]
* Requested more text to be translated on the
wikipedia.org portal page via
translatewiki. [4]
* "Hiring a data scientist", a post on Wikimedia Blog that provides an
in-depth look into Discovery's hiring process for the Analysis team. [5]
== Discussions ==
=== Search ===
* Work continues to upgrade to Elasticsearch 5 [6] [7]
* A lot of work on TextCat (language identification) has been deployed;
configuration to enable it in production should go out next week. [8] [9]
* Added document content model into the search index and contentmodel:
keyword. [10] [11]
* Added more aliases for filetype: keyword [12]
* Nearly done with getting things ready for a new A/B test to be launched
on a few wikipedias for sister project search results [13] [14]
* Fixed a timeout issue with advanced searches [15] [16] (not yet deployed,
will be deployed with Elasticsearch 5 upgrade)
* Delayed updates from previous weeks:
** Created a list of languages for which we want to investigate analysers
[17]
** After analysis, decided to use Stempel as our new Polish language
analyser [18]; analysis of Stempel is underway [19]
** Fixed issue with ICU folding that caused problems with the search index
[20] [21]
=== Analysis ===
* Wrapping up migrating a significant amount of data using the
ReportUpdater infrastructure - almost done! Updating dashboards now to use
the new datasets, including NEW datasets (like LDF endpoint usage for WDQS)
[22]
=== Portal ===
* Added new text to translatewiki for
wikipedia.org portal page for app
links and legal language in footer [23] [24]
=== Other Noteworthy Stuff ===
* San Francisco by Maxime Le Forestier [25] [26]
[0]
https://en.wikipedia.org/wiki/Open_Graph
[1]
https://www.mediawiki.org/wiki/User:Ladsgroup
[2]
https://phabricator.wikimedia.org/T142048
[3]
https://www.mediawiki.org/wiki/Wikimedia_Discovery/Search/Testing_Search
[4]
https://lists.wikimedia.org/pipermail/translators-l/2017-January/003810.html
[5]
https://blog.wikimedia.org/2017/02/02/hiring-data-scientist/
[6]
https://phabricator.wikimedia.org/T155671
[7]
https://phabricator.wikimedia.org/T151224
[8]
https://www.mediawiki.org/w/index.php?title=User:TJones_(WMF)/Notes/TextCat…
[9]
https://phabricator.wikimedia.org/T149324
[10]
https://phabricator.wikimedia.org/T156371
[11]
https://www.mediawiki.org/wiki/Help:CirrusSearch#contentmodel
[12]
https://phabricator.wikimedia.org/T156413
[13]
https://phabricator.wikimedia.org/T149806
[14]
https://phabricator.wikimedia.org/T156299
[15]
https://phabricator.wikimedia.org/T152895
[16]
https://phabricator.wikimedia.org/T134157
[17]
https://phabricator.wikimedia.org/T155549
[18]
https://phabricator.wikimedia.org/T154516
[19]
https://phabricator.wikimedia.org/T154517
[20]
https://www.elastic.co/guide/en/elasticsearch/plugins/current/analysis-icu-…
[21]
https://phabricator.wikimedia.org/T156234
[22]
https://phabricator.wikimedia.org/T150915
[23]
https://phabricator.wikimedia.org/T154350
[24]
https://phabricator.wikimedia.org/T153764
[25]
https://www.youtube.com/watch?v=tDtXXlD98kw&feature=youtu.be&t=1m30s
[26]
https://en.wikipedia.org/wiki/Maxime_Le_Forestier
----
The archive of all past updates can be found on
MediaWiki.org:
https://www.mediawiki.org/wiki/Discovery/Status_updates
Interested in getting involved? See tasks marked as "Easy" or "Volunteer
needed" in Phabricator.
[1]
https://phabricator.wikimedia.org/maniphest/query/qW51XhCCd8.7/#R
[2]
https://phabricator.wikimedia.org/maniphest/query/5KEPuEJh9TPS/#R
Yours,
Chris Koerner
Community Liaison - Discovery
Wikimedia Foundation