Hi there, I am using Lucene-search 2.1.3 as the search engine for my MediaWiki 1.16, and using MWSearch to fetch the results. It has been working well, but all of sudden, for newly added pages, I can only search for keywords from their page titles. If I search for keywords only from those newly added pages' conntents, I get 0 result. There is no problem if I search for any old pages. It seems the Lucene-search only indexes page titles, and doesn't index the new page contents any more. Any idea would be appreciated, Ross Xu
One thing to add ... The only thing changed is the Short_URL name. I changed it from "mywiki1" to "mywiki2". But I have tried to change it back. It's still the same problem. Is it possible that its MySQL database is corrupted? But anything else is working Okay. Again, I can search key words from a newly added page title, but if the key words are only from the page contents, the search gets nothing. Thanks for any ideas. Ross Xu
From: Ross Xu rossxunix@yahoo.ca To: "mediawiki-l@lists.wikimedia.org" mediawiki-l@lists.wikimedia.org Sent: Tuesday, September 27, 2011 11:28:05 PM Subject: [Mediawiki-l] Lucene-search stops indexing page contents
Hi there, I am using Lucene-search 2.1.3 as the search engine for my MediaWiki 1.16, and using MWSearch to fetch the results. It has been working well, but all of sudden, for newly added pages, I can only search for keywords from their page titles. If I search for keywords only from those newly added pages' conntents, I get 0 result. There is no problem if I search for any old pages. It seems the Lucene-search only indexes page titles, and doesn't index the new page contents any more. Any idea would be appreciated, Ross Xu _______________________________________________ MediaWiki-l mailing list MediaWiki-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/mediawiki-l
I'm sure you already checked this... but is the lucense indexing daemon still running? If not, then that would explain why old page content is still indexed in it's database, but new pages are not getting added. That happened to me with Sphinx and it was a bit confusing until I ran a ps waxu.
Olivier Finlay Beaton
On Thu, Sep 29, 2011 at 10:49 AM, Ross Xu rossxunix@yahoo.ca wrote:
One thing to add ... The only thing changed is the Short_URL name. I changed it from "mywiki1" to "mywiki2". But I have tried to change it back. It's still the same problem.
Is it possible that its MySQL database is corrupted? But anything else is working Okay. Again, I can search key words from a newly added page title, but if the key words are only from the page contents, the search gets nothing.
Thanks for any ideas. Ross Xu
From: Ross Xu rossxunix@yahoo.ca To: "mediawiki-l@lists.wikimedia.org" mediawiki-l@lists.wikimedia.org Sent: Tuesday, September 27, 2011 11:28:05 PM Subject: [Mediawiki-l] Lucene-search stops indexing page contents
Hi there, I am using Lucene-search 2.1.3 as the search engine for my MediaWiki 1.16, and using MWSearch to fetch the results. It has been working well, but all of sudden, for newly added pages, I can only search for keywords from their page titles. If I search for keywords only from those newly added pages' conntents, I get 0 result. There is no problem if I search for any old pages. It seems the Lucene-search only indexes page titles, and doesn't index the new page contents any more.
Any idea would be appreciated, Ross Xu _______________________________________________ MediaWiki-l mailing list MediaWiki-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/mediawiki-l _______________________________________________ MediaWiki-l mailing list MediaWiki-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/mediawiki-l
Yes, the Lucene daemon is running, and I have restarted it for a couple of times. The "old pages" are still valid, and I have no problem to search them. I have no problem to search any new pages' titles either. For example, if a new page's title is "abcd", and its content contains "efgh", and the index has been rebuilt, if I search for "abcd", I can get the page directly, but if I search for "efgh", I get nothing. Thanks a bunch, Olivier! Any more ideas? Ross Xu
From: Olivier Beaton olivier.beaton@gmail.com To: Ross Xu rossxunix@yahoo.ca; MediaWiki announcements and site admin list mediawiki-l@lists.wikimedia.org Sent: Thursday, September 29, 2011 11:05:13 AM Subject: Re: [Mediawiki-l] Lucene-search stops indexing page contents
I'm sure you already checked this... but is the lucense indexing daemon still running? If not, then that would explain why old page content is still indexed in it's database, but new pages are not getting added. That happened to me with Sphinx and it was a bit confusing until I ran a ps waxu.
Olivier Finlay Beaton
On Thu, Sep 29, 2011 at 10:49 AM, Ross Xu rossxunix@yahoo.ca wrote:
One thing to add ... The only thing changed is the Short_URL name. I changed it from "mywiki1" to "mywiki2". But I have tried to change it back. It's still the same problem.
Is it possible that its MySQL database is corrupted? But anything else is working Okay. Again, I can search key words from a newly added page title, but if the key words are only from the page contents, the search gets nothing.
Thanks for any ideas. Ross Xu
From: Ross Xu rossxunix@yahoo.ca To: "mediawiki-l@lists.wikimedia.org" mediawiki-l@lists.wikimedia.org Sent: Tuesday, September 27, 2011 11:28:05 PM Subject: [Mediawiki-l] Lucene-search stops indexing page contents
Hi there, I am using Lucene-search 2.1.3 as the search engine for my MediaWiki 1.16, and using MWSearch to fetch the results. It has been working well, but all of sudden, for newly added pages, I can only search for keywords from their page titles. If I search for keywords only from those newly added pages' conntents, I get 0 result. There is no problem if I search for any old pages. It seems the Lucene-search only indexes page titles, and doesn't index the new page contents any more.
Any idea would be appreciated, Ross Xu _______________________________________________ MediaWiki-l mailing list MediaWiki-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/mediawiki-l _______________________________________________ MediaWiki-l mailing list MediaWiki-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/mediawiki-l
If you run the manual updater (./update) from Lucene's working directory do you get any errors? I have seen the Lucene daemon be up but have an error and unable to process new requests.
On Thu, Sep 29, 2011 at 10:39 AM, Ross Xu rossxunix@yahoo.ca wrote:
Yes, the Lucene daemon is running, and I have restarted it for a couple of times. The "old pages" are still valid, and I have no problem to search them. I have no problem to search any new pages' titles either. For example, if a new page's title is "abcd", and its content contains "efgh", and the index has been rebuilt, if I search for "abcd", I can get the page directly, but if I search for "efgh", I get nothing.
Thanks a bunch, Olivier! Any more ideas? Ross Xu
From: Olivier Beaton olivier.beaton@gmail.com To: Ross Xu rossxunix@yahoo.ca; MediaWiki announcements and site admin list mediawiki-l@lists.wikimedia.org Sent: Thursday, September 29, 2011 11:05:13 AM Subject: Re: [Mediawiki-l] Lucene-search stops indexing page contents
I'm sure you already checked this... but is the lucense indexing daemon still running? If not, then that would explain why old page content is still indexed in it's database, but new pages are not getting added. That happened to me with Sphinx and it was a bit confusing until I ran a ps waxu.
Olivier Finlay Beaton
On Thu, Sep 29, 2011 at 10:49 AM, Ross Xu rossxunix@yahoo.ca wrote:
One thing to add ... The only thing changed is the Short_URL name. I changed it from "mywiki1"
to "mywiki2".
But I have tried to change it back. It's still the same problem.
Is it possible that its MySQL database is corrupted? But anything else is
working Okay. Again, I can search key words from a newly added page title, but if the key words are only from the page contents, the search gets nothing.
Thanks for any ideas. Ross Xu
From: Ross Xu rossxunix@yahoo.ca To: "mediawiki-l@lists.wikimedia.org" mediawiki-l@lists.wikimedia.org Sent: Tuesday, September 27, 2011 11:28:05 PM Subject: [Mediawiki-l] Lucene-search stops indexing page contents
Hi there, I am using Lucene-search 2.1.3 as the search engine for my MediaWiki
1.16, and using MWSearch to fetch the results.
It has been working well, but all of sudden, for newly added pages, I can
only search for keywords from their page titles. If I search for keywords only from those newly added pages' conntents, I get 0 result.
There is no problem if I search for any old pages. It seems the Lucene-search only indexes page titles, and doesn't index
the new page contents any more.
Any idea would be appreciated, Ross Xu _______________________________________________ MediaWiki-l mailing list MediaWiki-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/mediawiki-l _______________________________________________ MediaWiki-l mailing list MediaWiki-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/mediawiki-l
MediaWiki-l mailing list MediaWiki-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/mediawiki-l
I haven't tried the manual updater (./update), because it is supposed to be used for "larger wikis" and my wiki database is just about 30MB. In addition, the manual updater requires the Extension:OAIRepository which is very hard to install and it "will fail" as mentioned in its instructions. Thanks Zach. Ross Xu
From: Zach H. luckenbach@gmail.com To: Ross Xu rossxunix@yahoo.ca; MediaWiki announcements and site admin list mediawiki-l@lists.wikimedia.org Sent: Thursday, September 29, 2011 1:26:12 PM Subject: Re: [Mediawiki-l] Lucene-search stops indexing page contents
If you run the manual updater (./update) from Lucene's working directory do you get any errors? I have seen the Lucene daemon be up but have an error and unable to process new requests.
On Thu, Sep 29, 2011 at 10:39 AM, Ross Xu rossxunix@yahoo.ca wrote:
Yes, the Lucene daemon is running, and I have restarted it for a couple of times.
The "old pages" are still valid, and I have no problem to search them. I have no problem to search any new pages' titles either. For example, if a new page's title is "abcd", and its content contains "efgh", and the index has been rebuilt, if I search for "abcd", I can get the page directly, but if I search for "efgh", I get nothing. Thanks a bunch, Olivier! Any more ideas? Ross Xu
From: Olivier Beaton olivier.beaton@gmail.com To: Ross Xu rossxunix@yahoo.ca; MediaWiki announcements and site admin list mediawiki-l@lists.wikimedia.org Sent: Thursday, September 29, 2011 11:05:13 AM Subject: Re: [Mediawiki-l] Lucene-search stops indexing page contents
I'm sure you already checked this... but is the lucense indexing daemon still running? If not, then that would explain why old page content is still indexed in it's database, but new pages are not getting added. That happened to me with Sphinx and it was a bit confusing until I ran a ps waxu.
Olivier Finlay Beaton
On Thu, Sep 29, 2011 at 10:49 AM, Ross Xu rossxunix@yahoo.ca wrote:
One thing to add ... The only thing changed is the Short_URL name. I changed it from "mywiki1" to "mywiki2". But I have tried to change it back. It's still the same problem.
Is it possible that its MySQL database is corrupted? But anything else is working Okay. Again, I can search key words from a newly added page title, but if the key words are only from the page contents, the search gets nothing.
Thanks for any ideas. Ross Xu
From: Ross Xu rossxunix@yahoo.ca To: "mediawiki-l@lists.wikimedia.org" mediawiki-l@lists.wikimedia.org Sent: Tuesday, September 27, 2011 11:28:05 PM Subject: [Mediawiki-l] Lucene-search stops indexing page contents
Hi there, I am using Lucene-search 2.1.3 as the search engine for my MediaWiki 1.16, and using MWSearch to fetch the results. It has been working well, but all of sudden, for newly added pages, I can only search for keywords from their page titles. If I search for keywords only from those newly added pages' conntents, I get 0 result. There is no problem if I search for any old pages. It seems the Lucene-search only indexes page titles, and doesn't index the new page contents any more.
Any idea would be appreciated, Ross Xu _______________________________________________ MediaWiki-l mailing list MediaWiki-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/mediawiki-l _______________________________________________ MediaWiki-l mailing list MediaWiki-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/mediawiki-l
MediaWiki-l mailing list MediaWiki-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/mediawiki-l
Hmm,
Try running ./build instead and see if you have any errors. Also check your cron or equivalent that it has been running this script periodically.
Jim King
Ross Xu rossxunix@yahoo.ca 9/29/2011 3:45 PM >>>
I haven't tried the manual updater (./update), because it is supposed to be used for "larger wikis" and my wiki database is just about 30MB. In addition, the manual updater requires the Extension:OAIRepository which is very hard to install and it "will fail" as mentioned in its instructions. Thanks Zach. Ross Xu
From: Zach H. luckenbach@gmail.com To: Ross Xu rossxunix@yahoo.ca; MediaWiki announcements and site admin list mediawiki-l@lists.wikimedia.org Sent: Thursday, September 29, 2011 1:26:12 PM Subject: Re: [Mediawiki-l] Lucene-search stops indexing page contents
If you run the manual updater (./update) from Lucene's working directory do you get any errors? I have seen the Lucene daemon be up but have an error and unable to process new requests.
On Thu, Sep 29, 2011 at 10:39 AM, Ross Xu rossxunix@yahoo.ca wrote:
Yes, the Lucene daemon is running, and I have restarted it for a couple of times.
The "old pages" are still valid, and I have no problem to search
them.
I have no problem to search any new pages' titles either. For example, if a new page's title is "abcd", and its content contains "efgh", and
the index has been rebuilt, if I search for "abcd", I can get the page directly, but if I search for "efgh", I get nothing.
Thanks a bunch, Olivier! Any more ideas? Ross Xu
From: Olivier Beaton olivier.beaton@gmail.com To: Ross Xu rossxunix@yahoo.ca; MediaWiki announcements and site
admin list mediawiki-l@lists.wikimedia.org
Sent: Thursday, September 29, 2011 11:05:13 AM Subject: Re: [Mediawiki-l] Lucene-search stops indexing page contents
I'm sure you already checked this... but is the lucense indexing daemon still running? If not, then that would explain why old page content is still indexed in it's database, but new pages are not getting added. That happened to me with Sphinx and it was a bit confusing until I ran a ps waxu.
Olivier Finlay Beaton
On Thu, Sep 29, 2011 at 10:49 AM, Ross Xu rossxunix@yahoo.ca wrote:
One thing to add ... The only thing changed is the Short_URL name. I changed it from
"mywiki1" to "mywiki2".
But I have tried to change it back. It's still the same problem.
Is it possible that its MySQL database is corrupted? But anything
else is working Okay. Again, I can search key words from a newly added page title, but if the key words are only from the page contents, the search gets nothing.
Thanks for any ideas. Ross Xu
From: Ross Xu rossxunix@yahoo.ca To: "mediawiki-l@lists.wikimedia.org"
mediawiki-l@lists.wikimedia.org
Sent: Tuesday, September 27, 2011 11:28:05 PM Subject: [Mediawiki-l] Lucene-search stops indexing page contents
Hi there, I am using Lucene-search 2.1.3 as the search engine for my MediaWiki
1.16, and using MWSearch to fetch the results.
It has been working well, but all of sudden, for newly added pages,
I can only search for keywords from their page titles. If I search for keywords only from those newly added pages' conntents, I get 0 result.
There is no problem if I search for any old pages. It seems the Lucene-search only indexes page titles, and doesn't
index the new page contents any more.
Any idea would be appreciated, Ross Xu _______________________________________________ MediaWiki-l mailing list MediaWiki-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/mediawiki-l _______________________________________________ MediaWiki-l mailing list MediaWiki-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/mediawiki-l
MediaWiki-l mailing list MediaWiki-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/mediawiki-l
_______________________________________________ MediaWiki-l mailing list MediaWiki-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/mediawiki-l
MetroHealth is an Academic Health Care System committed to Our Communities by Saving Lives, Restoring Health, Promoting Wellness, and providing Outstanding, Life-long Care Accessible to All. Visit us at http://www.metrohealth.org. See remarkable patient stories at http://www.thecomeback.org or follow us on Facebook: http://www.facebook.com/MetroHealthCLE, Twitter: http://twitter.com/MetroHealthCLE, and YouTube: http://www.youtube.com/user/MetroHealthCLE. This email and all attachments that may have been included are intended only for the use of the party to whom/which the email is addressed and may contain information that is privileged, confidential, or exempt from disclosure under applicable law. If you are not the addressee or the employee or agent of the intended recipient, you are hereby notified that you are strictly prohibited from printing, storing, disseminating, distributing, or copying this communication. If you have received this notification in error, please contact the Director of Risk/Privacy Management at (216)778-5728. For a copy of our Notice of Privacy Practices, please visit: http://www.metrohealth.org/general/privacy.asp
For sure, the index has been rebuilt by running ./build without any error, and its cron is also working well. I can see all related folders/files have been updated under the indexes directory after each ./build. As I said, I can get correct result if the search key words are from the new page title. Thank you, Jim. Ross Xu
From: James King jking@metrohealth.org To: "mediawiki-l@lists.wikimedia.org" mediawiki-l@lists.wikimedia.org Sent: Thursday, September 29, 2011 4:00:42 PM Subject: Re: [Mediawiki-l] Lucene-search stops indexing page contents
Hmm,
Try running ./build instead and see if you have any errors. Also check your cron or equivalent that it has been running this script periodically.
Jim King
Ross Xu rossxunix@yahoo.ca 9/29/2011 3:45 PM >>>
I haven't tried the manual updater (./update), because it is supposed to be used for "larger wikis" and my wiki database is just about 30MB. In addition, the manual updater requires the Extension:OAIRepository which is very hard to install and it "will fail" as mentioned in its instructions. Thanks Zach. Ross Xu
From: Zach H. luckenbach@gmail.com To: Ross Xu rossxunix@yahoo.ca; MediaWiki announcements and site admin list mediawiki-l@lists.wikimedia.org Sent: Thursday, September 29, 2011 1:26:12 PM Subject: Re: [Mediawiki-l] Lucene-search stops indexing page contents
If you run the manual updater (./update) from Lucene's working directory do you get any errors? I have seen the Lucene daemon be up but have an error and unable to process new requests.
On Thu, Sep 29, 2011 at 10:39 AM, Ross Xu rossxunix@yahoo.ca wrote:
Yes, the Lucene daemon is running, and I have restarted it for a couple of times.
The "old pages" are still valid, and I have no problem to search
them.
I have no problem to search any new pages' titles either. For example, if a new page's title is "abcd", and its content contains "efgh", and
the index has been rebuilt, if I search for "abcd", I can get the page directly, but if I search for "efgh", I get nothing.
Thanks a bunch, Olivier! Any more ideas? Ross Xu
From: Olivier Beaton olivier.beaton@gmail.com To: Ross Xu rossxunix@yahoo.ca; MediaWiki announcements and site
admin list mediawiki-l@lists.wikimedia.org
Sent: Thursday, September 29, 2011 11:05:13 AM Subject: Re: [Mediawiki-l] Lucene-search stops indexing page contents
I'm sure you already checked this... but is the lucense indexing daemon still running? If not, then that would explain why old page content is still indexed in it's database, but new pages are not getting added. That happened to me with Sphinx and it was a bit confusing until I ran a ps waxu.
Olivier Finlay Beaton
On Thu, Sep 29, 2011 at 10:49 AM, Ross Xu rossxunix@yahoo.ca wrote:
One thing to add ... The only thing changed is the Short_URL name. I changed it from
"mywiki1" to "mywiki2".
But I have tried to change it back. It's still the same problem.
Is it possible that its MySQL database is corrupted? But anything
else is working Okay. Again, I can search key words from a newly added page title, but if the key words are only from the page contents, the search gets nothing.
Thanks for any ideas. Ross Xu
From: Ross Xu rossxunix@yahoo.ca To: "mediawiki-l@lists.wikimedia.org"
mediawiki-l@lists.wikimedia.org
Sent: Tuesday, September 27, 2011 11:28:05 PM Subject: [Mediawiki-l] Lucene-search stops indexing page contents
Hi there, I am using Lucene-search 2.1.3 as the search engine for my MediaWiki
1.16, and using MWSearch to fetch the results.
It has been working well, but all of sudden, for newly added pages,
I can only search for keywords from their page titles. If I search for keywords only from those newly added pages' conntents, I get 0 result.
There is no problem if I search for any old pages. It seems the Lucene-search only indexes page titles, and doesn't
index the new page contents any more.
Any idea would be appreciated, Ross Xu _______________________________________________ MediaWiki-l mailing list MediaWiki-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/mediawiki-l _______________________________________________ MediaWiki-l mailing list MediaWiki-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/mediawiki-l
MediaWiki-l mailing list MediaWiki-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/mediawiki-l
_______________________________________________ MediaWiki-l mailing list MediaWiki-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/mediawiki-l
MetroHealth is an Academic Health Care System committed to Our Communities by Saving Lives, Restoring Health, Promoting Wellness, and providing Outstanding, Life-long Care Accessible to All. Visit us at http://www.metrohealth.org. See remarkable patient stories at http://www.thecomeback.org or follow us on Facebook: http://www.facebook.com/MetroHealthCLE, Twitter: http://twitter.com/MetroHealthCLE, and YouTube: http://www.youtube.com/user/MetroHealthCLE. This email and all attachments that may have been included are intended only for the use of the party to whom/which the email is addressed and may contain information that is privileged, confidential, or exempt from disclosure under applicable law. If you are not the addressee or the employee or agent of the intended recipient, you are hereby notified that you are strictly prohibited from printing, storing, disseminating, distributing, or copying this communication. If you have received this notification in error, please contact the Director of Risk/Privacy Management at (216)778-5728. For a copy of our Notice of Privacy Practices, please visit: http://www.metrohealth.org/general/privacy.asp _______________________________________________ MediaWiki-l mailing list MediaWiki-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/mediawiki-l
./build will make an .xml file with the dump of the wiki. Check if this file contains the new text. If it doesn't something might be broken at mediawiki side, otherwise look at logs for any warning messages.
r.
On 09/29/2011 10:22 PM, Ross Xu wrote:
For sure, the index has been rebuilt by running ./build without any error, and its cron is also working well. I can see all related folders/files have been updated under the indexes directory after each ./build. As I said, I can get correct result if the search key words are from the new page title.
Thank you, Jim. Ross Xu
From: James Kingjking@metrohealth.org To: "mediawiki-l@lists.wikimedia.org"mediawiki-l@lists.wikimedia.org Sent: Thursday, September 29, 2011 4:00:42 PM Subject: Re: [Mediawiki-l] Lucene-search stops indexing page contents
Hmm,
Try running ./build instead and see if you have any errors. Also check your cron or equivalent that it has been running this script periodically.
Jim King
Ross Xurossxunix@yahoo.ca 9/29/2011 3:45 PM>>>
I haven't tried the manual updater (./update), because it is supposed to be used for "larger wikis" and my wiki database is just about 30MB. In addition, the manual updater requires the Extension:OAIRepository which is very hard to install and it "will fail" as mentioned in its instructions. Thanks Zach. Ross Xu
From: Zach H.luckenbach@gmail.com To: Ross Xurossxunix@yahoo.ca; MediaWiki announcements and site admin listmediawiki-l@lists.wikimedia.org Sent: Thursday, September 29, 2011 1:26:12 PM Subject: Re: [Mediawiki-l] Lucene-search stops indexing page contents
If you run the manual updater (./update) from Lucene's working directory do you get any errors? I have seen the Lucene daemon be up but have an error and unable to process new requests.
On Thu, Sep 29, 2011 at 10:39 AM, Ross Xurossxunix@yahoo.ca wrote:
Yes, the Lucene daemon is running, and I have restarted it for a couple of times.
The "old pages" are still valid, and I have no problem to search
them.
I have no problem to search any new pages' titles either. For example, if a new page's title is "abcd", and its content contains "efgh", and
the index has been rebuilt, if I search for "abcd", I can get the page directly, but if I search for "efgh", I get nothing.
Thanks a bunch, Olivier! Any more ideas? Ross Xu
From: Olivier Beatonolivier.beaton@gmail.com To: Ross Xurossxunix@yahoo.ca; MediaWiki announcements and site
admin listmediawiki-l@lists.wikimedia.org
Sent: Thursday, September 29, 2011 11:05:13 AM Subject: Re: [Mediawiki-l] Lucene-search stops indexing page contents
I'm sure you already checked this... but is the lucense indexing daemon still running? If not, then that would explain why old page content is still indexed in it's database, but new pages are not getting added. That happened to me with Sphinx and it was a bit confusing until I ran a ps waxu.
Olivier Finlay Beaton
On Thu, Sep 29, 2011 at 10:49 AM, Ross Xurossxunix@yahoo.ca wrote:
One thing to add ... The only thing changed is the Short_URL name. I changed it from
"mywiki1" to "mywiki2".
But I have tried to change it back. It's still the same problem.
Is it possible that its MySQL database is corrupted? But anything
else is working Okay. Again, I can search key words from a newly added page title, but if the key words are only from the page contents, the search gets nothing.
Thanks for any ideas. Ross Xu
From: Ross Xurossxunix@yahoo.ca To: "mediawiki-l@lists.wikimedia.org"
mediawiki-l@lists.wikimedia.org
Sent: Tuesday, September 27, 2011 11:28:05 PM Subject: [Mediawiki-l] Lucene-search stops indexing page contents
Hi there, I am using Lucene-search 2.1.3 as the search engine for my MediaWiki
1.16, and using MWSearch to fetch the results.
It has been working well, but all of sudden, for newly added pages,
I can only search for keywords from their page titles. If I search for keywords only from those newly added pages' conntents, I get 0 result.
There is no problem if I search for any old pages. It seems the Lucene-search only indexes page titles, and doesn't
index the new page contents any more.
Any idea would be appreciated, Ross Xu _______________________________________________ MediaWiki-l mailing list MediaWiki-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/mediawiki-l _______________________________________________ MediaWiki-l mailing list MediaWiki-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/mediawiki-l
MediaWiki-l mailing list MediaWiki-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/mediawiki-l
MediaWiki-l mailing list MediaWiki-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/mediawiki-l
MetroHealth is an Academic Health Care System committed to Our Communities by Saving Lives, Restoring Health, Promoting Wellness, and providing Outstanding, Life-long Care Accessible to All. Visit us at http://www.metrohealth.org. See remarkable patient stories at http://www.thecomeback.org or follow us on Facebook: http://www.facebook.com/MetroHealthCLE, Twitter: http://twitter.com/MetroHealthCLE, and YouTube: http://www.youtube.com/user/MetroHealthCLE. This email and all attachments that may have been included are intended only for the use of the party to whom/which the email is addressed and may contain information that is privileged, confidential, or exempt from disclosure under applicable law. If you are not the addressee or the employee or agent of the intended recipient, you are hereby notified that you are strictly prohibited from printing, storing, disseminating, distributing, or copying this communication. If you have received this notification in error, please contact the Director of Risk/Privacy Management at (216)778-5728. For a copy of our Notice of Privacy Practices, please visit: http://www.metrohealth.org/general/privacy.asp _______________________________________________ MediaWiki-l mailing list MediaWiki-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/mediawiki-l _______________________________________________ MediaWiki-l mailing list MediaWiki-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/mediawiki-l
Great idea, Robert! The good news is I can see the new text in the .xml file from the dumps. The bad news is I do see an error message from the http error log like this: "PHP Strict Standards: Declaration of Skindeveloper::initPage() should be compatible with that of Skin::initPage() in /var/www/html/wiki/includes/Skin.php on line 144, referer: http://www.mycompany.com/wiki/Special:Search?search=efgh", but it also gets the same kind of log even I search for old text and get correct result. If it's a skin problem, why I can search for old text? Any more ideas? Thanks a lot! Ross Xu
From: Robert Stojnic rainmansr@gmail.com To: Ross Xu rossxunix@yahoo.ca; MediaWiki announcements and site admin list mediawiki-l@lists.wikimedia.org Sent: Thursday, September 29, 2011 7:25:04 PM Subject: Re: [Mediawiki-l] Lucene-search stops indexing page contents
./build will make an .xml file with the dump of the wiki. Check if this file contains the new text. If it doesn't something might be broken at mediawiki side, otherwise look at logs for any warning messages.
r.
On 09/29/2011 10:22 PM, Ross Xu wrote:
For sure, the index has been rebuilt by running ./build without any error, and its cron is also working well. I can see all related folders/files have been updated under the indexes directory after each ./build. As I said, I can get correct result if the search key words are from the new page title. Thank you, Jim. Ross Xu
From: James Kingjking@metrohealth.org To: "mediawiki-l@lists.wikimedia.org"mediawiki-l@lists.wikimedia.org Sent: Thursday, September 29, 2011 4:00:42 PM Subject: Re: [Mediawiki-l] Lucene-search stops indexing page contents
Hmm,
Try running ./build instead and see if you have any errors. Also check your cron or equivalent that it has been running this script periodically.
Jim King
Ross Xurossxunix@yahoo.ca 9/29/2011 3:45 PM>>>
I haven't tried the manual updater (./update), because it is supposed to be used for "larger wikis" and my wiki database is just about 30MB. In addition, the manual updater requires the Extension:OAIRepository which is very hard to install and it "will fail" as mentioned in its instructions. Thanks Zach. Ross Xu
From: Zach H.luckenbach@gmail.com To: Ross Xurossxunix@yahoo.ca; MediaWiki announcements and site admin listmediawiki-l@lists.wikimedia.org Sent: Thursday, September 29, 2011 1:26:12 PM Subject: Re: [Mediawiki-l] Lucene-search stops indexing page contents
If you run the manual updater (./update) from Lucene's working directory do you get any errors? I have seen the Lucene daemon be up but have an error and unable to process new requests.
On Thu, Sep 29, 2011 at 10:39 AM, Ross Xurossxunix@yahoo.ca wrote:
Yes, the Lucene daemon is running, and I have restarted it for a couple of times.
The "old pages" are still valid, and I have no problem to search
them.
I have no problem to search any new pages' titles either. For example, if a new page's title is "abcd", and its content contains "efgh", and
the index has been rebuilt, if I search for "abcd", I can get the page directly, but if I search for "efgh", I get nothing.
Thanks a bunch, Olivier! Any more ideas? Ross Xu
From: Olivier Beatonolivier.beaton@gmail.com To: Ross Xurossxunix@yahoo.ca; MediaWiki announcements and site
admin listmediawiki-l@lists.wikimedia.org
Sent: Thursday, September 29, 2011 11:05:13 AM Subject: Re: [Mediawiki-l] Lucene-search stops indexing page contents
I'm sure you already checked this... but is the lucense indexing daemon still running? If not, then that would explain why old page content is still indexed in it's database, but new pages are not getting added. That happened to me with Sphinx and it was a bit confusing until I ran a ps waxu.
Olivier Finlay Beaton
On Thu, Sep 29, 2011 at 10:49 AM, Ross Xurossxunix@yahoo.ca wrote:
One thing to add ... The only thing changed is the Short_URL name. I changed it from
"mywiki1" to "mywiki2".
But I have tried to change it back. It's still the same problem.
Is it possible that its MySQL database is corrupted? But anything
else is working Okay. Again, I can search key words from a newly added page title, but if the key words are only from the page contents, the search gets nothing.
Thanks for any ideas. Ross Xu
From: Ross Xurossxunix@yahoo.ca To: "mediawiki-l@lists.wikimedia.org"
mediawiki-l@lists.wikimedia.org
Sent: Tuesday, September 27, 2011 11:28:05 PM Subject: [Mediawiki-l] Lucene-search stops indexing page contents
Hi there, I am using Lucene-search 2.1.3 as the search engine for my MediaWiki
1.16, and using MWSearch to fetch the results.
It has been working well, but all of sudden, for newly added pages,
I can only search for keywords from their page titles. If I search for keywords only from those newly added pages' conntents, I get 0 result.
There is no problem if I search for any old pages. It seems the Lucene-search only indexes page titles, and doesn't
index the new page contents any more.
Any idea would be appreciated, Ross Xu _______________________________________________ MediaWiki-l mailing list MediaWiki-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/mediawiki-l _______________________________________________ MediaWiki-l mailing list MediaWiki-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/mediawiki-l
MediaWiki-l mailing list MediaWiki-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/mediawiki-l
MediaWiki-l mailing list MediaWiki-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/mediawiki-l
MetroHealth is an Academic Health Care System committed to Our Communities by Saving Lives, Restoring Health, Promoting Wellness, and providing Outstanding, Life-long Care Accessible to All. Visit us at http://www.metrohealth.org. See remarkable patient stories at http://www.thecomeback.org or follow us on Facebook: http://www.facebook.com/MetroHealthCLE, Twitter: http://twitter.com/MetroHealthCLE, and YouTube: http://www.youtube.com/user/MetroHealthCLE. This email and all attachments that may have been included are intended only for the use of the party to whom/which the email is addressed and may contain information that is privileged, confidential, or exempt from disclosure under applicable law. If you are not the addressee or the employee or agent of the intended recipient, you are hereby notified that you are strictly prohibited from printing, storing, disseminating, distributing, or copying this communication. If you have received this notification in error, please contact the Director of Risk/Privacy Management at (216)778-5728. For a copy of our Notice of Privacy Practices, please visit: http://www.metrohealth.org/general/privacy.asp _______________________________________________ MediaWiki-l mailing list MediaWiki-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/mediawiki-l _______________________________________________ MediaWiki-l mailing list MediaWiki-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/mediawiki-l
On Fri, Sep 30, 2011 at 12:16 PM, Ross Xu rossxunix@yahoo.ca wrote:
Great idea, Robert! The good news is I can see the new text in the .xml file from the dumps. The bad news is I do see an error message from the http error log like this: "PHP Strict Standards: Declaration of Skindeveloper::initPage() should be compatible with that of Skin::initPage() in /var/www/html/wiki/includes/Skin.php on line 144, referer: http://www.mycompany.com/wiki/Special:Search?search=efgh", but it also gets the same kind of log even I search for old text and get correct result. If it's a skin problem, why I can search for old text?
This is likely unrelated to your search problem -- you should be able to fix it by adding an 'OutputPage' type hint to the declaration so it matches the updated declaration on the parent class.
-- brion
It looks like we are getting closer. But what does the "adding an 'OutputPage' type hint to the declaration" mean? Does it mean we need to edit the skin somewhere? Further hints would be appreciated. Ross Xu
From: Brion Vibber brion@pobox.com To: Ross Xu rossxunix@yahoo.ca; MediaWiki announcements and site admin list mediawiki-l@lists.wikimedia.org Sent: Friday, September 30, 2011 4:44:20 PM Subject: Re: [Mediawiki-l] Lucene-search stops indexing page contents
On Fri, Sep 30, 2011 at 12:16 PM, Ross Xu rossxunix@yahoo.ca wrote:
Great idea, Robert!
The good news is I can see the new text in the .xml file from the dumps. The bad news is I do see an error message from the http error log like this: "PHP Strict Standards: Declaration of Skindeveloper::initPage() should be compatible with that of Skin::initPage() in /var/www/html/wiki/includes/Skin.php on line 144, referer: http://www.mycompany.com/wiki/Special:Search?search=efgh", but it also gets the same kind of log even I search for old text and get correct result. If it's a skin problem, why I can search for old text?
This is likely unrelated to your search problem -- you should be able to fix it by adding an 'OutputPage' type hint to the declaration so it matches the updated declaration on the parent class.
-- brion
mediawiki-l@lists.wikimedia.org