David Gerard wrote:
Ian Smith wrote:
So, in an attempt to take the heat out of this and get to facts, what I think you're looking for is an extension to:
- allow admins to configure decoders for specific document types
- run the right decoder (if any) when a document is uploaded
- add the resulting plain text to the "searchindex" table.
You would then have to find, install and configure decoders for your most-used document types.
Sounds good. e.g. Antiword is a quick way to turn a Word document into indexable text.
AbiWord --to=txt