HELP: MediaWiki & Lucene-Search+MWSearch = ZERO search results ??!
Can someone '''please''' assist me? Here's what I've got running;;
- Slackware 12.0 http://www.slackware.com/ on i686 Pentium III [Linux 2.6.21.5] - MediaWiki: 1.9.1 - PHP: 5.2.5 - MySQL: 5.0.37 - MediaWiki Extension(s): MWSearchhttp://www.mediawiki.org/wiki/Extension:MWSearch SVNhttp://svn.wikimedia.org/svnroot/mediawiki/trunk/extensions/MWSearch/(05122008), and Lucene-search http://www.mediawiki.org/wiki/Extension:Lucene-search SVNhttp://svn.wikimedia.org/svnroot/mediawiki/trunk/lucene-search-2/(05122008), + I downloaded & installed mwdumper.jar http://download.wikimedia.org/tools/mwdumper.jar into the Lucene2 lib dir - Other tools: jre-6u2-i586-1, jdk-1_5_0_09-i586-1, apache-ant-1.7.0-i586-1bj, rsync-2.6.9-i486-1
I've followed the steps per MWSearchhttp://www.mediawiki.org/wiki/Extension:MWSearchand Lucene-search http://www.mediawiki.org/wiki/Extension:Lucene-search pages, to the T - I've gone over and over them several times, I've tried to search these forums http://www.mwusers.com/%20MediaWiki%20Forums, and the MediaWiki-L mailing list http://lists.wikimedia.org/pipermail/mediawiki-l ... please help me! =)
*;; My Local LuceneSearch configuration ;;* LuceneSearch SVNhttp://svn.wikimedia.org/svnroot/mediawiki/trunk/lucene-search-2Install dir: /usr/local/search/lucene-search-2svn05112008 Indexes stored: /usr/local/search/indexes
* ;; my /etc/lsearch.conf ;;* <pre> MWConfig.global=file:///etc/lsearch-global.conf MWConfig.lib=/usr/local/search/lucene-search-2svn05112008/lib Indexes.path=/usr/local/search/indexes Search.updateinterval=1 Search.updatedelay=0 Search.checkinterval=30 Index.snapshotinterval=5 Index.maxqueuecount=5000 Index.maxqueuetimeout=12 Storage.master=localhost Storage.username=wikiuser Storage.password=mypass Storage.useSeparateDBs=false Storage.defaultDB=wikidb Storage.lib=/usr/local/search/lucene-search-2svn05112008/sql Localization.url=file:///var/www/htdocs/wiki/languages/messages Logging.logconfig=/etc/lsearch.log4j Logging.debug=true </pre>
*;; my /etc/lsearch-global.conf ;;* <pre> [Database] wikidb : (single) (language,en) (warmup,10) [Search-Group] nen-tftp : wikidb [Index] nen-tftp : wikidb [Index-Path] <default> : /usr/local/search/indexes [OAI] wiktionary : http://$lang.wiktionary.org/w/index.php wikilucene : http://localhost/wiki-lucene/phase3/index.php <default> : http://$lang.wikipedia.org/w/index.php [Properties] Database.suffix=wiki wiktionary wikidb KeywordScoring.suffix=wikidb wiki wikilucene wikidev ExactCase.suffix=wikidb wiktionary wikilucene [Namespace-Prefix] all : <all> [0] : 0 [1] : 1 [2] : 2 [3] : 3 [4] : 4 [5] : 5 [6] : 6 [7] : 7 [8] : 8 [9] : 9 [10] : 10 [11] : 11 [12] : 12 [13] : 13 [14] : 14 [15] : 15 </pre>
* ;; my /etc/lsearch.log4j ;;* <pre> log4j.rootLogger=INFO, A1 log4j.appender.A1=org.apache.log4j.ConsoleAppender log4j.appender.A1.layout=org.apache.log4j.PatternLayout log4j.appender.A1.layout.ConversionPattern=%-4r [%t] %-5p %c %x - %m%n </pre>
* ;; relevant /var/www/htdocs/wiki/LocalSettings.php settings ;;* <pre> $wgSearchType = 'LuceneSearch'; $wgLuceneHost = 'localhost'; $wgLucenePort = 8123; require_once("extensions/MWSearch/MWSearch.php"); </pre>
*;; Building the index works running dumpBackup(Init).php ;;* <pre>
php maintenance/dumpBackupInit.php --current --quiet > wikidb.xml && java
-cp /usr/local/search/lucene-search-2svn05112008/LuceneSearch.jar org.wikimedia.lsearch.importer.Importer -s /var/www/htdocs/wiki/wikidb.xml wikidb MediaWiki Lucene search indexer - index builder from xml database dumps.
Trying config file at path /root/.lsearch.conf Trying config file at path /var/www/htdocs/wiki/lsearch.conf Trying config file at path /etc/lsearch.conf log4j: Trying to find [log4j.xml] using context classloader sun.misc.Launcher$AppClassLoader@133056f. log4j: Trying to find [log4j.xml] using sun.misc.Launcher$AppClassLoader@133056f class loader. log4j: Trying to find [log4j.xml] using ClassLoader.getSystemResource(). log4j: Trying to find [log4j.properties] using context classloader sun.misc.Launcher$AppClassLoader@133056f. log4j: Trying to find [log4j.properties] using sun.misc.Launcher$AppClassLoader@133056f class loader. log4j: Trying to find [log4j.properties] using ClassLoader.getSystemResource(). log4j: Could not find resource: [null]. log4j: Parsing for [root] with value=[INFO, A1]. log4j: Level token is [INFO]. log4j: Category root set to INFO log4j: Parsing appender named "A1". log4j: Parsing layout options for "A1". log4j: Setting property [conversionPattern] to [%-4r [%t] %-5p %c %x - %m%n]. log4j: End of parsing for "A1". log4j: Parsed "A1" options. log4j: Finished configuring. 0 [main] INFO org.wikimedia.lsearch.util.UnicodeDecomposer - Loaded unicode decomposer 18 [main] INFO org.wikimedia.lsearch.util.Localization - Reading localization for En 434 [main] INFO org.wikimedia.lsearch.ranks.RankBuilder - First pass, getting a list of valid articles... 94 pages (99.576/sec), 94 revs (99.576/sec) 1527 [main] INFO org.wikimedia.lsearch.ranks.RankBuilder - Second pass, calculating article links... 94 pages (326.389/sec), 94 revs (326.389/sec) 1928 [main] INFO org.wikimedia.lsearch.importer.Importer - Third pass, indexing articles... 94 pages (24.588/sec), 94 revs (24.588/sec) 6005 [main] INFO org.wikimedia.lsearch.importer.Importer - Closing/optimizing index... Finished indexing in 5s, with final index optimization in 0s Total time: 6s 6530 [main] INFO org.wikimedia.lsearch.index.IndexThread - Making snapshot for wikidb 6582 [main] INFO org.wikimedia.lsearch.index.IndexThread - Made snapshot /usr/local/search/indexes/snapshot/wikidb/20080512024654</pre>
*The above creates a 277KB file @ /var/www/htdocs/wiki/wikidb.xml , which looks just fine to me...*
*;; Starting the lsearch daemon is working ;;* When I run my script */usr/local/search/lucene-search-2svn05112008* - which starts the lsearch deamon, I get the following, which *ALSO* *looks fine* ; <pre> java -Djava.rmi.server.codebase=file:///usr/local/search/lucene-search-2svn05112008/LuceneSeah.jar -Djava.rmi.server.hostname=nen-tftp -jar /usr/local/search/lucene-search-2svn05112008/LuceneSearch.jar $* RMI registry started. Trying config file at path /root/.lsearch.conf Trying config file at path /usr/local/search/lucene-search-2svn05112008/lsearch.conf log4j: Parsing for [root] with value=[INFO, A1]. log4j: Level token is [INFO]. log4j: Category root set to INFO log4j: Parsing appender named "A1". log4j: Parsing layout options for "A1". log4j: Setting property [conversionPattern] to [%-4r [%t] %-5p %c %x - %m%n]. log4j: End of parsing for "A1". log4j: Parsed "A1" options. log4j: Finished configuring. 0 [main] INFO org.wikimedia.lsearch.util.Localization - Reading localization for En 2351 [main] INFO org.wikimedia.lsearch.util.UnicodeDecomposer - Loaded unicode decomposer 2600 [main] INFO org.wikimedia.lsearch.interoperability.RMIServer - RMIMessenger bound 2882 [main] INFO org.wikimedia.lsearch.interoperability.RMIServer - RemoteSearchable<wikidb> bound 2914 [main] INFO org.wikimedia.lsearch.search.Warmup - Warming up index wikidb ... 2928 [Thread-2] INFO org.wikimedia.lsearch.frontend.HTTPIndexServer - Started server at port 8321 2929 [Thread-3] INFO org.wikimedia.lsearch.frontend.SearchServer - Binding server to port 8123 4246 [main] INFO org.wikimedia.lsearch.search.Warmup - Warmed up wikidb in 1331 ms 4246 [main] INFO org.wikimedia.lsearch.search.Warmup - Warming up index wikidb ... 5079 [main] INFO org.wikimedia.lsearch.search.Warmup - Warmed up wikidb in 833 ms 5079 [main] INFO org.wikimedia.lsearch.search.Warmup - Warming up index wikidb ... 5861 [main] INFO org.wikimedia.lsearch.search.Warmup - Warmed up wikidb in 782 ms </pre>
*From here*, I pull up my normal wiki, which has been working fine ALL along - but now, I get ZERO search results, no matter what I do! I know I am searching correctly, I just type in 1 single word for searching (that I '''know''' is on several pages in the wiki) I've even tried to ''edit'' the file before and after building the index, and starting/stoping the lsearch daemon, yet I get this error in my MediaWiki search results page; <pre> Search results
From AgentDcooper's Wiki
You searched for wiki
For more information about searching AgentDcooper's Wiki, see Searching AgentDcooper's Wiki.
Showing below 0 results starting with #1. No page text matches
Note: Unsuccessful searches are often caused by searching for common words like "have" and "from", which are not indexed, or by specifying more than one search term (only pages containing all of the search terms will appear in the result). </pre>
*;; I notice that the lsearch daemon console output shows the following, just after doing each search within the wiki ;;* <pre> 293744 [pool-2-thread-1] INFO org.wikimedia.lsearch.frontend.HttpHandler - query:/search/wikidb/wiki?namespaces=0%2C1%2C2%2C3%2C4%2C5%2C6%2C7%2C8%2C9%2C10%2C11%2C12%2C13%2C14%2C15&offset=0&limit=100&version=2&iwlimit=10 what:search dbname:wikidb term:wiki 293759 [pool-2-thread-1] INFO org.wikimedia.lsearch.search.SearchEngine - Using NamespaceFilterWrapper wrap: {0, 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15} 293786 [pool-2-thread-1] INFO org.wikimedia.lsearch.search.SearchEngine - search wikidb: query=[wiki] parsed=[contents:wiki (title:wiki^6.0 stemtitle:wiki^2.0) (alttitle1:wiki^4.0 alttitle2:wiki^4.0 alttitle3:wiki^4.0) (keyword1:wiki^0.02 keyword2:wiki^0.01 keyword3:wiki^0.0066666664 keyword4:wiki^0.0050 keyword5:wiki^0.0039999997)] hit=[27] in 16ms using IndexSearcherMul:1210585609666 </pre>
;; *Aannnndddd, since I have [How_to_debug|Debuging] enabled, my /var/log/mediawiki/debug_log.txt shows this* ;; <pre> Start request GET /wiki/index.php/Special:Search?search=wiki&fulltext=Search Host: nen-tftp.techiekb.com User-Agent: Mozilla/5.0 (Windows; U; Windows NT 5.1; en-US; rv:1.8.1.14) Gecko/20080404 Firefox/2.0.0.14 Accept: text/xml,application/xml,application/xhtml+xml,text/html;q=0.9,text/plain;q=0.8,image/png,*/*;q=0. 5 Accept-Language: en-us,en;q=0.5 Accept-Encoding: gzip,deflate Accept-Charset: ISO-8859-1,utf-8;q=0.7,*;q=0.7 Keep-Alive: 300 Connection: keep-alive Referer: http://nen-tftp.techiekb.com/wiki/index.php/Special:Version Cookie: wikidb_session=3jptdli2pf3nkuq924tq1ihlt0 Authorization: Basic ZGNvb3Blcjp0ZXN0cGFzcw==
Main cache: FakeMemCachedClient Message cache: MediaWikiBagOStuff Parser cache: MediaWikiBagOStuff Unstubbing $wgParser on call of $wgParser->setHook from require_once Fully initialised Unstubbing $wgContLang on call of $wgContLang->checkTitleEncoding from WebRequest::getGPCVal Language::loadLocalisation(): got localisation for en from source Unstubbing $wgUser on call of $wgUser->isAllowed from Title::userCanRead Cache miss for user 2 Unstubbing $wgLoadBalancer on call of $wgLoadBalancer->getConnection from wfGetDB Logged in from session Unstubbing $wgMessageCache on call of $wgMessageCache->getTransform from wfMsgGetKey Unstubbing $wgLang on call of $wgLang->getCode from MessageCache::get MessageCache::load(): got from global cache Unstubbing $wgOut on call of $wgOut->setPageTitle from SpecialSearch::setupPage Fetching search data from http://localhost:8123/search/wikidb/wiki?namespaces=0%2C1%2C2%2C3%2C4%2C5%2C... 7%2C8%2C9%2C10%2C11%2C12%2C13%2C14%2C15&offset=0&limit=100&version=2&iwlimit=10 total [0] hits OutputPage::sendCacheControl: private caching; ** Request ended normally </pre>
*Now get this*, if I goto the link in the log from above = http://localhost:8123/search/wikidb/wiki?namespaces=0%2C1%2C2%2C3%2C4%2C5%2C..., I see this ;; <pre> 3 1.0 0 Main_Page 0.9577699303627014 0 EFFICIENT%2FCISCO%2FNETSCREEN%2FNETOPIA_Router_Command_Matrix 0.7121278643608093 0 DBU_-_DialBackUp </pre>
*Which leads me to my question* <sorry bout the long post!>: what am I doing *wrong*?? I have tried everything I can think of, I just cannot get my search within my mediawiki to work proplery, yet it seems like the search itself is working, somehow the "total hits" in the log as weel as the wiki, are showing ZERO --- yet manually going to the link in the log, shows me the CORRECT hits as well as search results! Why is MediaWiki not showing this?
*Anyhelp would be kindly appreciated!* -peace-
mediawiki-l@lists.wikimedia.org