Hi folks, We have just launched a new Wikipedia-based search engine - http://www.seariki.com
The idea is to combine searching and browsing effectively. For example, we find paths from a category to top categories, and hopefully this makes navigating through category hierarchy easier. At this time, only English and Chinese are supported, but we are working on indexing more languages.
Any comments will be appreciated.
Thanks and happy holiday.
I think the fact that it brings up small snippets of multiple articles related to the keyword search is interesting.
On Dec 24, 2007 7:50 PM, J.H. Huang jinh.huang@gmail.com wrote:
Hi folks, We have just launched a new Wikipedia-based search engine - http://www.seariki.com
The idea is to combine searching and browsing effectively. For example, we find paths from a category to top categories, and hopefully this makes navigating through category hierarchy easier. At this time, only English and Chinese are supported, but we are working on indexing more languages.
Any comments will be appreciated.
Thanks and happy holiday. _______________________________________________ WikiEN-l mailing list WikiEN-l@lists.wikimedia.org To unsubscribe from this mailing list, visit: http://lists.wikimedia.org/mailman/listinfo/wikien-l
This is pretty slick. How does the search decide which image to use for the search-result-snippits? Or rather, can you give us some more details on the technical side of the search? -- Ned Scott
On Dec 24, 2007, at 8:50 PM, J.H. Huang wrote:
Hi folks, We have just launched a new Wikipedia-based search engine - http://www.seariki.com
The idea is to combine searching and browsing effectively. For example, we find paths from a category to top categories, and hopefully this makes navigating through category hierarchy easier. At this time, only English and Chinese are supported, but we are working on indexing more languages.
Any comments will be appreciated.
Thanks and happy holiday. _______________________________________________ WikiEN-l mailing list WikiEN-l@lists.wikimedia.org To unsubscribe from this mailing list, visit: http://lists.wikimedia.org/mailman/listinfo/wikien-l
There are some ways to measure the relevance between an image and the snippet, e.g. distance from the image to the snippet, and lexical similarity between the image description and the snippet, just to name a few.
At the time being, we only select an image from the same document from which the text snippet is extracted. Selecting an illustrating image from a set of annotated images for the text snippet remains our future work.
We know this is the very first step, but we are working to improve it incrementally.
J.H. Huang
On Dec 26, 2007 7:06 AM, Ned Scott ned@nedscott.com wrote:
This is pretty slick. How does the search decide which image to use for the search-result-snippits? Or rather, can you give us some more details on the technical side of the search? -- Ned Scott
On Dec 24, 2007, at 8:50 PM, J.H. Huang wrote:
Hi folks, We have just launched a new Wikipedia-based search engine - http://www.seariki.com
The idea is to combine searching and browsing effectively. For example, we find paths from a category to top categories, and hopefully this makes navigating through category hierarchy easier. At this time, only English and Chinese are supported, but we are working on indexing more languages.
Any comments will be appreciated.
Thanks and happy holiday. _______________________________________________ WikiEN-l mailing list WikiEN-l@lists.wikimedia.org To unsubscribe from this mailing list, visit: http://lists.wikimedia.org/mailman/listinfo/wikien-l
WikiEN-l mailing list WikiEN-l@lists.wikimedia.org To unsubscribe from this mailing list, visit: http://lists.wikimedia.org/mailman/listinfo/wikien-l
On 12/25/07, J.H. Huang jinh.huang@gmail.com wrote:
Any comments will be appreciated.
Ok, something weird with my first experience:
1. Search for "hawthorn" 2. A mangled version of the [[Hawthorn]] disambiguation page is shown (complete with asterisks instead of list indentation...) 3. Click on "Hawthorn Football Club" link... 4. Get taken to a new search for that term, including the disambiguation page as the second link...
Step 4 is weird - I just clicked on my preferred disambiguation link, why disambiguate me again?
Try again: 1. Search for "lizard" 2. Mangled disambiguation page is shown, including no links to the actual page [[Lizard]]. Apparently you need to click on the text "Or you can seach[sic] by keyword lizard ."
That behaviour is pretty weird: search for a term that matches an exact article, and you get everything but that article. Wouldn't it make more sense to show at least a snippet of that article first, before getting into the disambiguation?
Steve
First of all, thank you for your feedback. Your thought is very important to us.
Step 4 is weird - I just clicked on my preferred disambiguation link,
why disambiguate me again?
For the time being, we just index the disambiguation pages same as other pages, so their snippets will be shown in search result. Those disambiguation snippets will be removed from result set in the next indexing cycle.
That behaviour is pretty weird: search for a term that matches an exact article, and you get everything but that article. Wouldn't it make more sense to show at least a snippet of that article first, before getting into the disambiguation?
Yes. Quite a good idea. We've just improved it as you described. Try search for "lizard" ( http://www.seariki.com/s/?query=lizard&il=en ), the most relevant snippet will be shown, followed by the disambiguation.
As for the problem of mangled pages, we are developing a more powerful engine to support those wiki markups.
Thank you and please feel free to let us know if you have any further questions or suggestions.
J.H Huang