Hello,
This is the weekly update from the Search Platform team for the week
starting 2018-08-13.

As always, feedback and questions welcome.

== Discussions ==

=== Search ===
* We re-indexed wikis in Malay, Indonesian, and Polish, adding stemming to Malay and ICU normalization to Indonesian [0], and fixing the worst pathological stemming errors in Polish [1]. 
** Reindexing tickets are [2] and [3]. 
* Trey wrote a blog post about the difficulties of tokenization (breaking text up into words, more or less) [4]. It's the first in a series covering the basics of search.

== Did you know? ==
* Auto-antonyms (or contronyms, among other names) are words that have contradictory meanings [5]. Some cases arise from words that were originally distinct but came to sound the same through normal sound change, such as “cleave”, which means both to hold fast (from Old English “clifian”) and to cut in two (from Old English “clēofan”). In other cases, a single word acquires a novel but contradictory sense, such as “fast”, which means both securely attached and able to move quickly. Often the sense is quite clear from context, but when it is not, words can become so confusing that careful writers avoid them; these are sometimes called “skunked” words [6].

---

[0] https://phabricator.wikimedia.org/T196780
[1] https://phabricator.wikimedia.org/T186046
[2] https://phabricator.wikimedia.org/T200037
[3] https://phabricator.wikimedia.org/T200204
[4] https://wikimediafoundation.org/2018/08/07/anatomy-search-token-affection/
[5] https://en.wikipedia.org/wiki/Auto-antonym
[6] https://en.wikipedia.org/wiki/Skunked_term

Subscribe to receive on-wiki (or opt-in email) notifications of the Discovery weekly update.

https://www.mediawiki.org/wiki/Newsletter:Discovery_Weekly

The archive of all past updates can be found on MediaWiki.org:

https://www.mediawiki.org/wiki/Discovery/Status_updates

Interested in getting involved? See tasks marked as "Easy" or "Volunteer needed" in Phabricator [7] [8].

[7] https://phabricator.wikimedia.org/maniphest/query/qW51XhCCd8.7/#R
[8] https://phabricator.wikimedia.org/maniphest/query/5KEPuEJh9TPS/#R


Yours,
Chris Koerner
Community Relations Specialist
Wikimedia Foundation