Why not just use Lucene
) or one of its many
It's mature, stable, open-source, actively developed and widely
considered to be a very fast and very high-quality full-text
searching and indexing engine started by an expert in full-text
And it does Unicode just fine.
Krzysztof Kowalczyk | http://blog.kowalczyk.info