HELP: MediaWiki & Lucene-Search+MWSearch = ZERO search results ??!
Can someone '''please''' assist me? Here's what I've got
running;;
- Slackware 12.0 <http://www.slackware.com/> on i686 Pentium III
[Linux 2.6.21.5]
- MediaWiki: 1.9.1
- PHP: 5.2.5
- MySQL: 5.0.37
- MediaWiki Extension(s):
MWSearch<http://www.mediawiki.org/wiki/Extension:MWSearch>
SVN<http://svn.wikimedia.org/svnroot/mediawiki/trunk/extensions/MWSearch…22008),
and
Lucene-search <http://www.mediawiki.org/wiki/Extension:Lucene-search>
SVN<http://svn.wikimedia.org/svnroot/mediawiki/trunk/lucene-search-2/>…22008),
+ I downloaded & installed
mwdumper.jar <http://download.wikimedia.org/tools/mwdumper.jar> into
the Lucene2 lib dir
- Other tools: jre-6u2-i586-1, jdk-1_5_0_09-i586-1,
apache-ant-1.7.0-i586-1bj, rsync-2.6.9-i486-1
I've followed the steps per
MWSearch<http://www.mediawiki.org/wiki/Extension:MWSearch>and
Lucene-search <http://www.mediawiki.org/wiki/Extension:Lucene-search> pages,
to the T - I've gone over and over them several times, I've tried to search
these forums <http://www.mwusers.com/%20MediaWiki%20Forums>, and the
MediaWiki-L
mailing list <http://lists.wikimedia.org/pipermail/mediawiki-l> ... please
help me! =)
*;; My Local LuceneSearch configuration ;;*
LuceneSearch
SVN<http://svn.wikimedia.org/svnroot/mediawiki/trunk/lucene-search-2>…
dir: /usr/local/search/lucene-search-2svn05112008
Indexes stored: /usr/local/search/indexes
* ;; my /etc/lsearch.conf ;;*
<pre>
MWConfig.global=file:///etc/lsearch-global.conf
MWConfig.lib=/usr/local/search/lucene-search-2svn05112008/lib
Indexes.path=/usr/local/search/indexes
Search.updateinterval=1
Search.updatedelay=0
Search.checkinterval=30
Index.snapshotinterval=5
Index.maxqueuecount=5000
Index.maxqueuetimeout=12
Storage.master=localhost
Storage.username=wikiuser
Storage.password=mypass
Storage.useSeparateDBs=false
Storage.defaultDB=wikidb
Storage.lib=/usr/local/search/lucene-search-2svn05112008/sql
Localization.url=file:///var/www/htdocs/wiki/languages/messages
Logging.logconfig=/etc/lsearch.log4j
Logging.debug=true
</pre>
*;; my /etc/lsearch-global.conf ;;*
<pre>
[Database]
wikidb : (single) (language,en) (warmup,10)
[Search-Group]
nen-tftp : wikidb
[Index]
nen-tftp : wikidb
[Index-Path]
<default> : /usr/local/search/indexes
[OAI]
wiktionary : http://$lang.wiktionary.org/w/index.php
wikilucene :
http://localhost/wiki-lucene/phase3/index.php
<default> : http://$lang.wikipedia.org/w/index.php
[Properties]
Database.suffix=wiki wiktionary wikidb
KeywordScoring.suffix=wikidb wiki wikilucene wikidev
ExactCase.suffix=wikidb wiktionary wikilucene
[Namespace-Prefix]
all : <all>
[0] : 0
[1] : 1
[2] : 2
[3] : 3
[4] : 4
[5] : 5
[6] : 6
[7] : 7
[8] : 8
[9] : 9
[10] : 10
[11] : 11
[12] : 12
[13] : 13
[14] : 14
[15] : 15
</pre>
* ;; my /etc/lsearch.log4j ;;*
<pre>
log4j.rootLogger=INFO, A1
log4j.appender.A1=org.apache.log4j.ConsoleAppender
log4j.appender.A1.layout=org.apache.log4j.PatternLayout
log4j.appender.A1.layout.ConversionPattern=%-4r [%t] %-5p %c %x - %m%n
</pre>
* ;; relevant /var/www/htdocs/wiki/LocalSettings.php settings ;;*
<pre>
$wgSearchType = 'LuceneSearch';
$wgLuceneHost = 'localhost';
$wgLucenePort = 8123;
require_once("extensions/MWSearch/MWSearch.php");
</pre>
*;; Building the index works running dumpBackup(Init).php ;;*
<pre>
php maintenance/dumpBackupInit.php --current --quiet
> wikidb.xml && java
-cp
/usr/local/search/lucene-search-2svn05112008/LuceneSearch.jar
org.wikimedia.lsearch.importer.Importer -s /var/www/htdocs/wiki/wikidb.xml
wikidb
MediaWiki Lucene search indexer - index builder from xml database dumps.
Trying config file at path /root/.lsearch.conf
Trying config file at path /var/www/htdocs/wiki/lsearch.conf
Trying config file at path /etc/lsearch.conf
log4j: Trying to find [log4j.xml] using context classloader
sun.misc.Launcher$AppClassLoader@133056f.
log4j: Trying to find [log4j.xml] using
sun.misc.Launcher$AppClassLoader@133056f class loader.
log4j: Trying to find [log4j.xml] using ClassLoader.getSystemResource().
log4j: Trying to find [log4j.properties] using context classloader
sun.misc.Launcher$AppClassLoader@133056f.
log4j: Trying to find [log4j.properties] using
sun.misc.Launcher$AppClassLoader@133056f class loader.
log4j: Trying to find [log4j.properties] using
ClassLoader.getSystemResource().
log4j: Could not find resource: [null].
log4j: Parsing for [root] with value=[INFO, A1].
log4j: Level token is [INFO].
log4j: Category root set to INFO
log4j: Parsing appender named "A1".
log4j: Parsing layout options for "A1".
log4j: Setting property [conversionPattern] to [%-4r [%t] %-5p %c %x -
%m%n].
log4j: End of parsing for "A1".
log4j: Parsed "A1" options.
log4j: Finished configuring.
0 [main] INFO org.wikimedia.lsearch.util.UnicodeDecomposer - Loaded
unicode decomposer
18 [main] INFO org.wikimedia.lsearch.util.Localization - Reading
localization for En
434 [main] INFO org.wikimedia.lsearch.ranks.RankBuilder - First pass,
getting a list of valid articles...
94 pages (99.576/sec), 94 revs (99.576/sec)
1527 [main] INFO org.wikimedia.lsearch.ranks.RankBuilder - Second pass,
calculating article links...
94 pages (326.389/sec), 94 revs (326.389/sec)
1928 [main] INFO org.wikimedia.lsearch.importer.Importer - Third pass,
indexing articles...
94 pages (24.588/sec), 94 revs (24.588/sec)
6005 [main] INFO org.wikimedia.lsearch.importer.Importer -
Closing/optimizing index...
Finished indexing in 5s, with final index optimization in 0s
Total time: 6s
6530 [main] INFO org.wikimedia.lsearch.index.IndexThread - Making snapshot
for wikidb
6582 [main] INFO org.wikimedia.lsearch.index.IndexThread - Made snapshot
/usr/local/search/indexes/snapshot/wikidb/20080512024654</pre>
*The above creates a 277KB file @ /var/www/htdocs/wiki/wikidb.xml , which
looks just fine to me...*
*;; Starting the lsearch daemon is working ;;*
When I run my script */usr/local/search/lucene-search-2svn05112008* - which
starts the lsearch deamon, I get the following, which *ALSO* *looks fine* ;
<pre>
java
-Djava.rmi.server.codebase=file:///usr/local/search/lucene-search-2svn05112008/LuceneSeah.jar
-Djava.rmi.server.hostname=nen-tftp -jar
/usr/local/search/lucene-search-2svn05112008/LuceneSearch.jar $*
RMI registry started.
Trying config file at path /root/.lsearch.conf
Trying config file at path
/usr/local/search/lucene-search-2svn05112008/lsearch.conf
log4j: Parsing for [root] with value=[INFO, A1].
log4j: Level token is [INFO].
log4j: Category root set to INFO
log4j: Parsing appender named "A1".
log4j: Parsing layout options for "A1".
log4j: Setting property [conversionPattern] to [%-4r [%t] %-5p %c %x -
%m%n].
log4j: End of parsing for "A1".
log4j: Parsed "A1" options.
log4j: Finished configuring.
0 [main] INFO org.wikimedia.lsearch.util.Localization - Reading
localization for En
2351 [main] INFO org.wikimedia.lsearch.util.UnicodeDecomposer - Loaded
unicode decomposer
2600 [main] INFO org.wikimedia.lsearch.interoperability.RMIServer -
RMIMessenger bound
2882 [main] INFO org.wikimedia.lsearch.interoperability.RMIServer -
RemoteSearchable<wikidb> bound
2914 [main] INFO org.wikimedia.lsearch.search.Warmup - Warming up index
wikidb ...
2928 [Thread-2] INFO org.wikimedia.lsearch.frontend.HTTPIndexServer -
Started server at port 8321
2929 [Thread-3] INFO org.wikimedia.lsearch.frontend.SearchServer - Binding
server to port 8123
4246 [main] INFO org.wikimedia.lsearch.search.Warmup - Warmed up wikidb in
1331 ms
4246 [main] INFO org.wikimedia.lsearch.search.Warmup - Warming up index
wikidb ...
5079 [main] INFO org.wikimedia.lsearch.search.Warmup - Warmed up wikidb in
833 ms
5079 [main] INFO org.wikimedia.lsearch.search.Warmup - Warming up index
wikidb ...
5861 [main] INFO org.wikimedia.lsearch.search.Warmup - Warmed up wikidb in
782 ms
</pre>
*From here*, I pull up my normal wiki, which has been working fine ALL along
- but now, I get ZERO search results, no matter what I do! I know I am
searching correctly, I just type in 1 single word for searching (that I
'''know''' is on several pages in the wiki) I've even tried to
''edit'' the
file before and after building the index, and starting/stoping the lsearch
daemon, yet I get this error in my MediaWiki search results page;
<pre>
Search results
From AgentDcooper's Wiki
You searched for wiki
For more information about searching AgentDcooper's Wiki, see Searching
AgentDcooper's Wiki.
Showing below 0 results starting with #1.
No page text matches
Note: Unsuccessful searches are often caused by searching for common words
like "have" and "from", which are not indexed, or by specifying more
than
one search term (only pages containing all of the search terms will appear
in the result).
</pre>
*;; I notice that the lsearch daemon console output shows the following,
just after doing each search within the wiki ;;*
<pre>
293744 [pool-2-thread-1] INFO org.wikimedia.lsearch.frontend.HttpHandler -
query:/search/wikidb/wiki?namespaces=0%2C1%2C2%2C3%2C4%2C5%2C6%2C7%2C8%2C9%2C10%2C11%2C12%2C13%2C14%2C15&offset=0&limit=100&version=2&iwlimit=10
what:search dbname:wikidb term:wiki
293759 [pool-2-thread-1] INFO org.wikimedia.lsearch.search.SearchEngine -
Using NamespaceFilterWrapper wrap: {0, 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11,
12, 13, 14, 15}
293786 [pool-2-thread-1] INFO org.wikimedia.lsearch.search.SearchEngine -
search wikidb: query=[wiki] parsed=[contents:wiki (title:wiki^6.0
stemtitle:wiki^2.0) (alttitle1:wiki^4.0 alttitle2:wiki^4.0
alttitle3:wiki^4.0) (keyword1:wiki^0.02 keyword2:wiki^0.01
keyword3:wiki^0.0066666664 keyword4:wiki^0.0050 keyword5:wiki^0.0039999997)]
hit=[27] in 16ms using IndexSearcherMul:1210585609666
</pre>
;; *Aannnndddd, since I have [How_to_debug|Debuging] enabled, my
/var/log/mediawiki/debug_log.txt shows this* ;;
<pre>
Start request
GET /wiki/index.php/Special:Search?search=wiki&fulltext=Search
Host:
nen-tftp.techiekb.com
User-Agent: Mozilla/5.0 (Windows; U; Windows NT 5.1; en-US; rv:1.8.1.14)
Gecko/20080404 Firefox/2.0.0.14
Accept:
text/xml,application/xml,application/xhtml+xml,text/html;q=0.9,text/plain;q=0.8,image/png,*/*;q=0.
5
Accept-Language: en-us,en;q=0.5
Accept-Encoding: gzip,deflate
Accept-Charset: ISO-8859-1,utf-8;q=0.7,*;q=0.7
Keep-Alive: 300
Connection: keep-alive
Referer:
http://nen-tftp.techiekb.com/wiki/index.php/Special:Version
Cookie: wikidb_session=3jptdli2pf3nkuq924tq1ihlt0
Authorization: Basic ZGNvb3Blcjp0ZXN0cGFzcw==
Main cache: FakeMemCachedClient
Message cache: MediaWikiBagOStuff
Parser cache: MediaWikiBagOStuff
Unstubbing $wgParser on call of $wgParser->setHook from require_once
Fully initialised
Unstubbing $wgContLang on call of $wgContLang->checkTitleEncoding from
WebRequest::getGPCVal
Language::loadLocalisation(): got localisation for en from source
Unstubbing $wgUser on call of $wgUser->isAllowed from Title::userCanRead
Cache miss for user 2
Unstubbing $wgLoadBalancer on call of $wgLoadBalancer->getConnection from
wfGetDB
Logged in from session
Unstubbing $wgMessageCache on call of $wgMessageCache->getTransform from
wfMsgGetKey
Unstubbing $wgLang on call of $wgLang->getCode from MessageCache::get
MessageCache::load(): got from global cache
Unstubbing $wgOut on call of $wgOut->setPageTitle from
SpecialSearch::setupPage
Fetching search data from
http://localhost:8123/search/wikidb/wiki?namespaces=0%2C1%2C2%2C3%2C4%2C5%2…
7%2C8%2C9%2C10%2C11%2C12%2C13%2C14%2C15&offset=0&limit=100&version=2&iwlimit=10
total [0] hits
OutputPage::sendCacheControl: private caching; **
Request ended normally
</pre>
*Now get this*, if I goto the link in the log from above =
http://localhost:8123/search/wikidb/wiki?namespaces=0%2C1%2C2%2C3%2C4%2C5%2…2&iwlimit=10,
I see this ;;
<pre>
3
1.0 0 Main_Page
0.9577699303627014 0
EFFICIENT%2FCISCO%2FNETSCREEN%2FNETOPIA_Router_Command_Matrix
0.7121278643608093 0 DBU_-_DialBackUp
</pre>
*Which leads me to my question* <sorry bout the long post!>: what am I doing
*wrong*?? I have tried everything I can think of, I just cannot get my
search within my mediawiki to work proplery, yet it seems like the search
itself is working, somehow the "total hits" in the log as weel as the wiki,
are showing ZERO --- yet manually going to the link in the log, shows me the
CORRECT hits as well as search results! Why is MediaWiki not showing this?
*Anyhelp would be kindly appreciated!*
-peace-