Привет!
Another update from the Search Platform team for the week starting 2018-05-07
**Programming note:** Due to the upcoming Wikimedia Hackathon and some (personal) holiday time, the next update will be the week of 2018-05-28. Until then, and as always, feedback and questions are welcome.
== Highlights== * Map internationalization launched everywhere, and embedded maps (mapframe) are now live on 276 Wikipedias [0] * ''"Hello, my name is _____"'' is an in-depth blog post by Trey that was published earlier this week where he details the irony that searching for names is not always as straightforward as you might think. [1]
== Discussions ==
=== Search === * Erik updated a script that was populating lots of 500 errors in the logs [2] * Erik also did a lot of research to evaluate impact of adding ~2700 new shards to production cluster (there is a pdf attached to the last comment in the ticket that contains more information) [3] There is a follow-up ticket as well for the next steps [4] * Trey worked on the analysis config for the new Slovak stemmer that was deployed this week—but the plugin still needs to be deployed and the wikis re-indexed. [5] * Stas and others worked on looking up entities by external identifiers - the work is done for now, but it needs a re-index to be fully ready [6] * David worked on externalizing the parsing logic from SimpleKeywordFeature and FullTextQueryStringQueryBuilder and it was pushed into production in April 2018 [7]
== Other Noteworthy Stuff == * Trey's most recent updates to transliteration on the Crimean Tatar Wikipedia are live; after a year of part-time 10% project work, the transliteration infrastructure for Crimean Tatar is done and the accuracy is in the high 90% range. [8]
== Did you know? == * The English word “dove”, as the past tense of “dive”, is one of the rare cases where a conjugation has become more irregular over time. The verb “dive” picked up the strong conjugation [9] by analogy with other strong verbs, particularly “drive/drove”. [10] Going in the more typical direction of regularization, Swedish strong verbs slowly lost some of their distinctive plural forms. [11] The change started in the 16th century, and was still in progress as late as the 1940s. From the search perspective, regular forms are easier to deal with—so, way to go Swedish!
[0] https://lists.wikimedia.org/pipermail/wikitech-l/2018-May/089964.html [1] https://blog.wikimedia.org/2018/05/08/searching-for-names-is-not-always-stra... [2] https://phabricator.wikimedia.org/T179266 [3] https://phabricator.wikimedia.org/T192972 [4] https://phabricator.wikimedia.org/T193654 [5] https://phabricator.wikimedia.org/T191544 [6] https://phabricator.wikimedia.org/T99899 [7] https://phabricator.wikimedia.org/T188530 [8] https://phabricator.wikimedia.org/T188321 [9] https://en.wikipedia.org/wiki/Germanic_strong_verb [10] https://en.wiktionary.org/wiki/dove#Etymology_2 [11] https://en.wikipedia.org/wiki/Swedish_grammar#Historical_plural_forms
---
Subscribe to receive on-wiki (or opt-in email) notifications of the Discovery weekly update.
https://www.mediawiki.org/wiki/Newsletter:Discovery_Weekly
The archive of all past updates can be found on MediaWiki.org:
https://www.mediawiki.org/wiki/Discovery/Status_updates
Interested in getting involved? See tasks marked as "Easy" or "Volunteer needed" in Phabricator.
[1] https://phabricator.wikimedia.org/maniphest/query/qW51XhCCd8.7/#R [2] https://phabricator.wikimedia.org/maniphest/query/5KEPuEJh9TPS/#R
Yours, Chris Koerner Community Liaison Wikimedia Foundation