[Mediawiki-l] MediaWiki extension for searching uploaded files?

Brion Vibber brion at pobox.com
Fri Apr 27 18:20:38 UTC 2012


On Fri, Apr 27, 2012 at 11:02 AM, Chad <innocentkiller at gmail.com> wrote:

> On Fri, Apr 27, 2012 at 1:59 PM, Yaron Koren <yaron at wikiworks.com> wrote:
> > Are there any MediaWiki extensions that allow for searching through the
> > text of uploaded files, such as Word documents, PDFs, etc. - whether it's
> > part of the standard search results or in a separate interface? Or is
> > anyone working on such a thing?
> >
>
> Aren't the contents of PDFs indexed in Lucene?
>

Nope, they're extracted and stuffed in metadata but not yet stored for
search -- see https://bugzilla.wikimedia.org/show_bug.cgi?id=21061 and
related.

I have seen a couple of one-off extensions for indexing .doc files or such,
but don't recall specifically what they are; they should be floating around
somewhere on www.mediawiki.org, but I don't know how up to date or reliable
they are.

-- brion

