Hi all,
Now search results of "commodity" changes:
* Commodities http://en.wikipedia.org/wiki/Commodities Relevance: 100.0% - - * Commodity http://en.wikipedia.org/wiki/Commodity Relevance: 95.4% - - * Commodate http://en.wikipedia.org/wiki/Commodate Relevance: 94.7% - - * Commode http://en.wikipedia.org/wiki/Commode Relevance: 94.6% - -
I suggest that you may want to index "Title" with StandardAnalyzer and "Content" with SnowballAnalyzer, since the title field of Wikipedia is almost all named entities that should not be modified at all. IMHO, to have a mixture of original words and stemmed forms is a good heuristic rule though, but it is only suitable for content field.
Sincerely, /Mike "b6s" Jiang/