Hello,

Is there an issue with the search API? The returned results appear to be a bit off.


For example

http://www.mediawiki.org/w/api.php?action=query&list=search&srsearch=Country

returns

<?xml version="1.0"?>
<api>
  <query>
    <searchinfo totalhits="10" suggestion="Count" />
    <search>
      <p ns="0" title="Wikidata" snippet="A data-numbers table: page_id revision_id name value - 301 2044 &lt;span class=&#039;searchmatch&#039;&gt;country&lt;/span&gt;_population 80000000 301 2040 &lt;span class=&#039;searchmatch&#039;&gt;country&lt;/span&gt;_population 75000000 &lt;b&gt;...&lt;/b&gt; " size="13242" wordcount="1760" timestamp="2010-08-15T13:43:24Z" />
      <p ns="0" title="Hit stats aggregation" snippet="Page views : geoip lookup (&lt;span class=&#039;searchmatch&#039;&gt;country&lt;/span&gt; or city-resolution?)  Image views : What&#039;s available from each hit:  image name. thumbnail pixel width &lt;b&gt;...&lt;/b&gt; " size="2161" wordcount="311" timestamp="2010-01-03T08:04:51Z" />
      <p ns="0" title="WMDE contract offers" snippet="Your real name and &lt;span class=&#039;searchmatch&#039;&gt;country&lt;/span&gt; of residence. How you plan to go about implementing the desired function. Any experience working with MediaWiki &lt;b&gt;...&lt;/b&gt; " size="1483" wordcount="206" timestamp="2010-04-09T18:31:35Z" />
      <p ns="0" title="Sites using MediaWiki/en" snippet="com : The Unofficial ASEAN Tourism Encyclopedia, the reference guide to 10 Southeast Asian &lt;span class=&#039;searchmatch&#039;&gt;Countries&lt;/span&gt;. Asian Business Round Table - http:// &lt;b&gt;...&lt;/b&gt; " size="138820" wordcount="19314" timestamp="2010-10-30T19:15:23Z" />
      <p ns="0" title="Namespace manager" snippet="In the context of Wikidata, they are intended to segregate different types of structured content, such as person data, &lt;span class=&#039;searchmatch&#039;&gt;country&lt;/span&gt; information &lt;b&gt;...&lt;/b&gt; " size="11305" wordcount="1693" timestamp="2010-10-26T05:31:01Z" />
      <p ns="0" title="InstantCommons" snippet="It does not permit offline viewing, which is crucial in &lt;span class=&#039;searchmatch&#039;&gt;countries&lt;/span&gt; which have only intermittent network access. InstantCommons seeks to  &lt;b&gt;...&lt;/b&gt; " size="6505" wordcount="985" timestamp="2010-09-22T08:38:28Z" />
      <p ns="0" title="List of extensions to be merged to the core" snippet="There are laws in some &lt;span class=&#039;searchmatch&#039;&gt;countries&lt;/span&gt; forbidding logging of IPs, etc.  So it should be disabled by default. Soxred93  00:35, 26 January 2009 &lt;b&gt;...&lt;/b&gt; " size="8676" wordcount="857" timestamp="2010-09-22T12:42:56Z" />
      <p ns="0" title="Summer of Code 2009" snippet="Bot for automation interwikis adding for categories by &lt;span class=&#039;searchmatch&#039;&gt;countries&lt;/span&gt;, albums by artist, interwikis for categories based on interwikis for  &lt;b&gt;...&lt;/b&gt; " size="12337" wordcount="1677" timestamp="2010-03-12T12:14:19Z" />
      <p ns="0" title="Sites using MediaWiki/corporate" snippet="With more than 50,000 customers in 43 &lt;span class=&#039;searchmatch&#039;&gt;countries&lt;/span&gt;, Novell helps customers manage, simplify, secure and integrate their technology  &lt;b&gt;...&lt;/b&gt; " size="23008" wordcount="3174" timestamp="2010-10-21T16:40:06Z" />
    </search>
  </query>
</api>
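
In case it helps with reproducing this, here is a minimal Python sketch for pulling the hit count, the suggestion, and the result titles out of a response like the one above. The function names (build_search_url, summarize) are just illustrative, and format=xml is my assumption: the request above does not set a format explicitly.

```python
import xml.etree.ElementTree as ET
from urllib.parse import urlencode

# Endpoint taken from the request above.
API_URL = "http://www.mediawiki.org/w/api.php"


def build_search_url(term):
    """Build the search request URL (hypothetical helper).

    format=xml is an assumption; the original request did not set a format.
    """
    params = {
        "action": "query",
        "list": "search",
        "srsearch": term,
        "format": "xml",
    }
    return API_URL + "?" + urlencode(params)


def summarize(xml_text):
    """Return totalhits, suggestion, and the result titles from a response."""
    root = ET.fromstring(xml_text)
    info = root.find("query/searchinfo")
    titles = [p.get("title") for p in root.findall("query/search/p")]
    return {
        "totalhits": int(info.get("totalhits")),
        "suggestion": info.get("suggestion"),
        "titles": titles,
    }


# A shortened copy of the response above, with the attributes trimmed.
SAMPLE = """<?xml version="1.0"?>
<api>
  <query>
    <searchinfo totalhits="10" suggestion="Count" />
    <search>
      <p ns="0" title="Wikidata" wordcount="1760" />
      <p ns="0" title="Hit stats aggregation" wordcount="311" />
    </search>
  </query>
</api>"""

if __name__ == "__main__":
    print(build_search_url("Country"))
    print(summarize(SAMPLE))
```

Running summarize on the full response makes it easier to compare the reported totalhits and suggestion against the titles actually returned.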

Thanks

Prateek

--
Prateek Jain
PhD Student, Research Assistant
Kno.e.sis Center, Wright State University
Dayton,OH 45435
Web: http://wiki.knoesis.org/index.php/Prateek
Email: prateek@knoesis.org
Phone: (770) 406-6356