In case you haven't seen it already, I wrote a blog post about "unpacking" and updating our default language analyzers used for search. It's a project (made of many little projects) that I've been working on over the last year or two. The blog post is a review of the project and some of the fun language facts and computational complexities I've encountered.
Hope you enjoy it.*
Trey JonesStaff Computational Linguist, Search Platform
UTC–4 / EDT
* Read the footnotes!