Belated greetings,
This is the weekly update from the Search Platform team for the weeks starting '''2018-09-03''' and '''2018-09-10'''.
== Highlights ==
* New blog post: The anatomy of search: Variation under nature - "This is the second in a series of blog posts giving a basic overview of how full-text search engines work, with a bit of extra attention for some interesting language-specific issues. Our topic for today is how we normalize the words we found in our text during the previous step, tokenization." [0]
== Discussions ==
=== Search ===
* Trey finished up the Esperanto analysis, stemmer updates, and reindexing of the wikis [1], [2], [3]
* Erik tackled a lot of small bugs; the fixes will be in production the week of Sep 18th:
** redirect showing irrelevant info [4]
** strings starting with a # sign were redirecting to the main page [5]
** error when using double quotes in the category or title [6]
** "close redirects" in error messages [7]
** UI issue where 'results 1 of 2' is displayed when only one result is found [8]
* Mathew fixed an issue with alerting for when Elasticsearch has shards larger than the maximum size [9]
* Erik worked on getting CirrusSearch to gracefully handle missing plugin sections in the Elasticsearch response [10]
* Gehel and Erik did some Mjolnir query daemon monitor work [11], [12]
* Trey fixed an issue where empty strings caused by ICU folding in Elasticsearch were getting indexed [13]
* The work on using Kafka for communication between the analytics cluster and Elasticsearch has been finished [14]
=== Wikidata Query Service ===
* The disk space upgrade has been finished
* Categories are now updated daily in WDQS from daily diffs [15]
[0] https://wikimediafoundation.org/2018/09/13/anatomy-search-variation-under-na...
[1] https://phabricator.wikimedia.org/T202173
[2] https://phabricator.wikimedia.org/T202662
[3] https://phabricator.wikimedia.org/T203005
[4] https://phabricator.wikimedia.org/T190010
[5] https://phabricator.wikimedia.org/T182452
[6] https://phabricator.wikimedia.org/T73123
[7] https://phabricator.wikimedia.org/T191485
[8] https://phabricator.wikimedia.org/T71382
[9] https://phabricator.wikimedia.org/T203546
[10] https://phabricator.wikimedia.org/T191493
[11] https://phabricator.wikimedia.org/T199732
[12] https://phabricator.wikimedia.org/T200740
[13] https://phabricator.wikimedia.org/T192502
[14] https://phabricator.wikimedia.org/T198490
[15] https://phabricator.wikimedia.org/T201217
----
Subscribe to receive on-wiki (or opt-in email) notifications of the Discovery weekly update.
https://www.mediawiki.org/wiki/Newsletter:Discovery_Weekly
The archive of all past updates can be found on MediaWiki.org:
https://www.mediawiki.org/wiki/Discovery/Status_updates
Interested in getting involved? See tasks marked as "Easy" or "Volunteer needed" in Phabricator.
[1] https://phabricator.wikimedia.org/maniphest/query/qW51XhCCd8.7/#R
[2] https://phabricator.wikimedia.org/maniphest/query/5KEPuEJh9TPS/#R
Yours,
Chris Koerner
Community Relations Specialist
Wikimedia Foundation