I just about have this extension running. It sure would benefit from MWSearch's improved unicode handling:
*** WARNING: Funny characters in title SchrauwenD???HaeneVerstraetenEtAl07 ************************ 14000 row(s) processed ************************ 14100 row(s) processed ************************ 14200 row(s) processed *** WARNING: Funny characters in title SerinoGiovagnoliL??davas09 ************************ 14300 row(s) processed ************************ 14400 row(s) processed ************************ 14500 row(s) processed ************************ 14600 row(s) processed ************************ 14700 row(s) processed ************************ 14800 row(s) processed ************************ 14900 row(s) processed ************************ 15000 row(s) processed ************************ 15100 row(s) processed *** WARNING: Funny characters in title StruffertK??hrmannEngelhornEtAl09 ************************ 15200 row(s) processed ************************ 15300 row(s) processed *** WARNING: Funny characters in title TamosiunaiteAsfourW??rg??tter09 ************************ 15400 row(s) processed *** WARNING: Funny characters in title TanakaBalleineO???Doherty08 ************************ 15500 row(s) processed ************************ 15600 row(s) processed ************************ 15700 row(s) processed ************************ 15800 row(s) processed ************************ 15900 row(s) processed *** WARNING: Funny characters in title UrbanoLeznikLlin??s07 *** WARNING: Funny characters in title ValentinDickinsonO???Doherty07 ************************ 16000 row(s) processed ************************ 16100 row(s) processed *** WARNING: Funny characters in title VerstraetenSchrauwenD??HaeneEtAl07 ************************ 16200 row(s) processed ************************ 16300 row(s) processed ************************ 16400 row(s) processed ************************ 16500 row(s) processed ************************ 16600 row(s) processed ************************ 16700 row(s) processed *** WARNING: Funny characters in title WikiPapers/log/Kov????csMehler09 ************************ 16800 row(s) processed *** WARNING: Funny characters in title WikiPapers/log/SerinoGiovagnoliL??davas09 *** WARNING: Funny characters in title WikiPapers/log/SotoFunesGuzm????n-Garc????aEtAl09 *** WARNING: Funny characters in title WikiPapers/log/TanakaBalleineO???Doherty08 *** WARNING: Funny characters in title WikiPapers/log/ValentinDickinsonO???Doherty07 *** WARNING: Funny characters in title WikiPapers/log/WinklerH??denLadinigEtAl *** WARNING: Funny characters in title WinklerH??denLadinigEtAl ************************ 16900 row(s) processed ************************ 17000 row(s) processed ************************ 17100 row(s) processed ************************ 17200 row(s) processed ************************ 17300 row(s) processed ************************ 17400 row(s) processed ************************ 17500 row(s) processed *** WARNING: Funny characters in title BrouilletCond??BealEtAl99.pdf ************************ 17600 row(s) processed *** WARNING: Funny characters in title Carrillo-ReidTecuapetlaIb????ez-SandovalEtAl09.pdf *** WARNING: Funny characters in title CepedaWuAndr??EtAl07.pdf
On Thu, May 7, 2009 at 1:33 PM, Chris Reigrut chris@reigrut.net wrote:
I'd like to announce the first release of EzMwLucene. This project provides a simplified Lucene search to Mediawiki. It is designed to be easy to install, configure, and run. It provides real-time, multiple field indexing and searching as well as text indexing of standard attachment types (pdf, xls, doc, ppt, vsd). The server is a self contained Java application (no application server needed), and the client portion is a standard Mediawiki extension. It is currently in production on an internal site with over 1000 users running on Mediawiki 1.13.
https://sourceforge.net/projects/ezmwlucene/
I welcome all feedback: questions, suggestions and offers to help improve it!
MediaWiki-l mailing list MediaWiki-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/mediawiki-l