On 10/04/07, Platonides Platonides@gmail.com wrote:
David Gerard wrote:
Sounds good. e.g. Antiword is a quick way to turn a Word document into indexable text.
AbiWord --to=txt http://wvware.sourceforge.net/
There's lots of ways, yes :-) Something to extract indexable text from any document there's a filter for, and feed it to the indexer. That'd be just what we need to index Word documents added to a MediaWiki. Does anything like this exist already, or is it a Simple Matter Of Programming?
- d.