On 9/2/05, Angela beesley@gmail.com wrote:
Google miscategorizes the language of some of the Hungarian Wikipedia pages. E.g. it thinks that our Adolf Hitler article is in czech.
How do you know they are miscategorising the language? http://www.google.com/search?q=inurl%3A%22Adolf+Hitler%22+site%3Ahu.wikipedia.org
This makes it seem like they haven't indexed the page at all, not that they've marked it as the wrong language.
I believe this is correct. We've had problems in the past with overloading wikipedia so the crawl has been throttled way back.