Robert Stojnic said:
Comes the question, when/how other languages will have stemming as well?
The new search engine has stemmers for these languages: English, Danish, Dutch, Finnish, German, Italian, Norwegian, Portuguese, Russian, Spanish, Swedish and Esperanto. It also has a filter for Thai, so that words are properly separated.
r.
For example: https://svn.apache.org/repos/asf/lucene/java/trunk/contrib/snowball/src/java...
private Among a_1[] = { new Among ( "la", -1, -1, "", this), new Among ( "sela", 0, -1, "", this), new Among ( "le", -1, -1, "", this), new Among ( "me", -1, -1, "", this), new Among ( "se", -1, -1, "", this), new Among ( "lo", -1, -1, "", this), new Among ( "selo", 5, -1, "", this), new Among ( "las", -1, -1, "", this), new Among ( "selas", 7, -1, "", this), new Among ( "les", -1, -1, "", this), new Among ( "los", -1, -1, "", this), new Among ( "selos", 10, -1, "", this), new Among ( "nos", -1, -1, "", this) };
Regards, /Mike/