Thank you, Thomas,
older versions of Joda are working since 1996 as news paper archive for
the Rhein-Zeitung (Koblenz and Mainz, Germany). It's also used for
archive and newsdesk purposes from several other european newspapers. At
the moment it is going into action as full text index for europeans
biggest magazine. It is also in use for the public index of the state
archive of Rheinland-Pfalz (Germany).
Last year I created two mirrors of WikiPedia, one using MediaWiki for
demonstration purposes and another - our public one - using our own
read-only web frontend. Joda is integrated into both mirrors:
http://wikipedia.rhein-zeitung.de/index.php/Hauptseite (MediaWiki)
http://lexikon.rhein-zeitung.de/ (our special Wikipedia interface)
At the suggestion of Magnus Manske (not related :-) I published Joda
under LGPL and made serveral improvements for the Wikipedia task. I
wrote tools for indexing a whole cur table either from MySQL or from a
SQL dump (which is twice faster). Indexing the german Wikipedia cur
table (>210.000 articles, 36 million words) lasts approx. 45 minutes. An
optional database optimization lasts additional 25 minutes. Both on a
dual Athlon 2800+ machine with 1 GB RAM (the indexer is a multi threaded
perl program).
Joda can erase or update entries on the fly and can handle queries with
parantheses and word distance operators like
http://lexikon.rhein-zeitung.de/?((Albert OR Alfred) AND.1 Einstein)
NEAR Quant*) NOT Gravitation. See more features under
http://ioda.sourceforge.net/
Joda kernel is written with the Free Pascal compiler
(
http://sourceforge.net/projects/freepascal/). The tools are written in
Perl. There a libraries for using joda directly from C, Perl, Python and
PHP, all published under LGPL. The joda binaries are: command line
program, TCP socket driven server and CGI.
Yours
jo