[Mediawiki-l] Lucene-search stops indexing page contents

Robert Stojnic rainmansr at gmail.com
Thu Sep 29 23:25:04 UTC 2011


./build will make an .xml file with the dump of the wiki. Check if this 
file contains the new text. If it doesn't something might be broken at 
mediawiki side, otherwise look at logs for any warning messages.

r.

On 09/29/2011 10:22 PM, Ross Xu wrote:
> For sure, the index has been rebuilt by running ./build without any error, and its cron is also working well.
> I can see all related folders/files have been updated under the indexes directory after each ./build.
> As I said, I can get correct result if the search key words are from the new page title.
>   
> Thank you, Jim.
> Ross Xu
>
> From: James King<jking at metrohealth.org>
> To: "mediawiki-l at lists.wikimedia.org"<mediawiki-l at lists.wikimedia.org>
> Sent: Thursday, September 29, 2011 4:00:42 PM
> Subject: Re: [Mediawiki-l] Lucene-search stops indexing page contents
>
> Hmm,
>
> Try running ./build instead and see if you have any errors. Also check
> your cron or equivalent that it has been running this script
> periodically.
>
> Jim King
>
>>>> Ross Xu<rossxunix at yahoo.ca>  9/29/2011 3:45 PM>>>
> I haven't tried the manual updater (./update), because it is supposed
> to be used for "larger wikis" and my wiki database is just about 30MB.
> In addition, the manual updater requires the Extension:OAIRepository
> which is very hard to install and it "will fail" as mentioned in its
> instructions.
> Thanks Zach.
> Ross Xu
>
> From: Zach H.<luckenbach at gmail.com>
> To: Ross Xu<rossxunix at yahoo.ca>; MediaWiki announcements and site
> admin list<mediawiki-l at lists.wikimedia.org>
> Sent: Thursday, September 29, 2011 1:26:12 PM
> Subject: Re: [Mediawiki-l] Lucene-search stops indexing page contents
>
>
> If you run the manual updater (./update) from Lucene's working
> directory do you get any errors? I have seen the Lucene daemon be up but
> have an error and unable to process new requests.
>
>
> On Thu, Sep 29, 2011 at 10:39 AM, Ross Xu<rossxunix at yahoo.ca>  wrote:
>
> Yes, the Lucene daemon is running, and I have restarted it for a couple
> of times.
>> The "old pages" are still valid, and I have no problem to search
> them.
>> I have no problem to search any new pages' titles either.
>> For example,
>> if a new page's title is "abcd", and its content contains "efgh", and
> the index has been rebuilt, if I search for "abcd", I can get the page
> directly, but if I search for "efgh", I get nothing.
>> Thanks a bunch, Olivier!
>> Any more ideas?
>> Ross Xu
>>
>> From: Olivier Beaton<olivier.beaton at gmail.com>
>> To: Ross Xu<rossxunix at yahoo.ca>; MediaWiki announcements and site
> admin list<mediawiki-l at lists.wikimedia.org>
>> Sent: Thursday, September 29, 2011 11:05:13 AM
>> Subject: Re: [Mediawiki-l] Lucene-search stops indexing page contents
>>
>>
>> I'm sure you already checked this... but is the lucense indexing
>> daemon still running? If not, then that would explain why old page
>> content is still indexed in it's database, but new pages are not
>> getting added. That happened to me with Sphinx and it was a bit
>> confusing until I ran a ps waxu.
>>
>> Olivier Finlay Beaton
>>
>> On Thu, Sep 29, 2011 at 10:49 AM, Ross Xu<rossxunix at yahoo.ca>  wrote:
>>> One thing to add ...
>>> The only thing changed is the Short_URL name. I changed it from
> "mywiki1" to "mywiki2".
>>> But I have tried to change it back. It's still the same problem.
>>>
>>> Is it possible that its MySQL database is corrupted? But anything
> else is working Okay. Again, I can search key words from a newly added
> page title, but if the key words are only from the page contents, the
> search gets nothing.
>>> Thanks for any ideas.
>>> Ross Xu
>>>
>>> From: Ross Xu<rossxunix at yahoo.ca>
>>> To: "mediawiki-l at lists.wikimedia.org"
> <mediawiki-l at lists.wikimedia.org>
>>> Sent: Tuesday, September 27, 2011 11:28:05 PM
>>> Subject: [Mediawiki-l] Lucene-search stops indexing page contents
>>>
>>> Hi there,
>>> I am using Lucene-search 2.1.3 as the search engine for my MediaWiki
> 1.16, and using MWSearch to fetch the results.
>>> It has been working well, but all of sudden, for newly added pages,
> I can only search for keywords from their page titles. If I search for
> keywords only from those newly added pages' conntents, I get 0 result.
>>> There is no problem if I search for any old pages.
>>> It seems the Lucene-search only indexes page titles, and doesn't
> index the new page contents any more.
>>> Any idea would be appreciated,
>>> Ross Xu
>>> _______________________________________________
>>> MediaWiki-l mailing list
>>> MediaWiki-l at lists.wikimedia.org
>>> https://lists.wikimedia.org/mailman/listinfo/mediawiki-l
>>> _______________________________________________
>>> MediaWiki-l mailing list
>>> MediaWiki-l at lists.wikimedia.org
>>> https://lists.wikimedia.org/mailman/listinfo/mediawiki-l
>>>
>> _______________________________________________
>> MediaWiki-l mailing list
>> MediaWiki-l at lists.wikimedia.org
>> https://lists.wikimedia.org/mailman/listinfo/mediawiki-l
>>
> _______________________________________________
> MediaWiki-l mailing list
> MediaWiki-l at lists.wikimedia.org
> https://lists.wikimedia.org/mailman/listinfo/mediawiki-l
>
> MetroHealth is an Academic Health Care System committed to Our
> Communities by Saving Lives, Restoring Health, Promoting Wellness, and
> providing Outstanding, Life-long Care Accessible to All. Visit us at
> http://www.metrohealth.org. See remarkable patient stories at
> http://www.thecomeback.org or follow us on Facebook:
> http://www.facebook.com/MetroHealthCLE, Twitter:
> http://twitter.com/MetroHealthCLE, and YouTube:
> http://www.youtube.com/user/MetroHealthCLE. This email and all
> attachments that may have been included are intended only for the use of
> the party to whom/which the email is addressed and may contain
> information that is privileged, confidential, or exempt from disclosure
> under applicable law. If you are not the addressee or the employee or
> agent of the intended recipient, you are hereby notified that you are
> strictly prohibited from printing, storing, disseminating, distributing,
> or copying this communication. If you have received this notification in
> error, please contact the Director of Risk/Privacy Management at
> (216)778-5728. For a copy of our Notice of Privacy Practices, please
> visit: http://www.metrohealth.org/general/privacy.asp
> _______________________________________________
> MediaWiki-l mailing list
> MediaWiki-l at lists.wikimedia.org
> https://lists.wikimedia.org/mailman/listinfo/mediawiki-l
> _______________________________________________
> MediaWiki-l mailing list
> MediaWiki-l at lists.wikimedia.org
> https://lists.wikimedia.org/mailman/listinfo/mediawiki-l
>




More information about the MediaWiki-l mailing list