On Wed, Feb 19, 2014 at 11:02 AM, C. Scott Ananian <cananian@wikimedia.org> wrote:
> Or use structured search instead of hiding things, so that you can search (say) just the titles of images on commons without corrupting the title index with license text.
We don't lump the title in with the text. The problem arises on a wiki where a lot of the same text is repeated across pages: it becomes harder to search for terms that also happen to appear in that repeated text. Two good examples so far are welcome-style templates in Talk namespaces and license templates on files.
Another way of summarizing this is: we want to index all of the page text, except when we don't.
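
To make that concrete, here is a rough sketch of the idea in Python. It is a toy illustration, not how the actual search backend works. The `[[BOILERPLATE]]` markers, the `build_index` and `search` helpers, and the sample pages are all made up for the example. It keeps titles and text in separate indexes and drops marked template text before indexing the body.

```python
import re
from collections import defaultdict

# Hypothetical convention: text expanded from a boilerplate template is
# wrapped in [[BOILERPLATE]] ... [[/BOILERPLATE]] before it reaches the indexer.
BOILERPLATE = re.compile(r"\[\[BOILERPLATE\]\].*?\[\[/BOILERPLATE\]\]", re.DOTALL)


def tokenize(s):
    """Lowercase and split into alphanumeric tokens."""
    return re.findall(r"[a-z0-9]+", s.lower())


def build_index(pages):
    """pages maps title -> body. Returns (title_index, text_index)."""
    title_index = defaultdict(set)
    text_index = defaultdict(set)
    for title, body in pages.items():
        for tok in tokenize(title):
            title_index[tok].add(title)
        # Index all of the page text, except the marked boilerplate.
        for tok in tokenize(BOILERPLATE.sub(" ", body)):
            text_index[tok].add(title)
    return title_index, text_index


def search(index, query):
    """Return the titles that contain every token of the query."""
    toks = tokenize(query)
    if not toks:
        return set()
    result = set(index.get(toks[0], set()))
    for tok in toks[1:]:
        result &= index.get(tok, set())
    return result


# Made-up sample pages: both carry the same license template text.
LICENSE = "[[BOILERPLATE]]This file is licensed under a free license.[[/BOILERPLATE]]"
PAGES = {
    "File:Sunset.jpg": "A sunset over the bay. " + LICENSE,
    "File:License plate.jpg": "Photo of a car license plate. " + LICENSE,
}

title_index, text_index = build_index(PAGES)
```

With the boilerplate dropped, searching the text index for "license" only matches the page whose own description mentions it, rather than every file that carries the license template.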
-Chad