Hi,
I noticed that some pages we crawled contain an error message like this:
<div id="mw-content-text" lang="zh-CN" dir="ltr" class="mw-content-ltr"><p class="error">Failed to render property P373: Wikibase\LanguageWithConversion::factory: given languages do not have the same parent language</p>
But when I open the URL in a browser, there is no such message. Fetching the page through index.php also returns normal content without the error message.
Here are examples you can retry:
bad $ wget 'http://zh.wikipedia.org/zh-cn/Google'
good $ wget 'http://zh.wikipedia.org/w/index.php?title=Google'
It looks like something is wrong on the Wikipedia side. Does anything need to be fixed?
Thanks
On 08/01/2013 03:08 AM, Jiang BIAN bianjiang@google.com wrote:
I checked with Jiang Bian and found out that this is still happening -- can anyone help Google out here? :-)
On Fri, Aug 23, 2013 at 7:06 AM, Sumana Harihareswara <sumanah@wikimedia.org> wrote:
-- Sumana Harihareswara Engineering Community Manager Wikimedia Foundation
Wikitech-l mailing list Wikitech-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikitech-l
There was a bug in a Wikibase version deployed in July that caused this error, but a fix was backported soon afterwards, and since then I haven't seen any similar error as a logged-in user. If you still see errors only when logged out, and only at particular URLs (like the ones you described), it's likely that those URLs were cached in Squid while the bug was live. In that case, purging those pages[1] should fix the issue.
[1] https://en.wikipedia.org/wiki/Wikipedia:Purge
-Liangent
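To tell which crawled pages are still served from a stale cache, one option is to scan the fetched HTML for the error paragraph quoted at the top of this thread. The following is a minimal sketch, not anything taken from the crawler itself: the function name `has_render_error` and the regular expression are assumptions based only on that one quoted snippet, so other error wordings would not be caught.

```python
import re

# Pattern derived from the single error snippet quoted in this thread.
# Assumption: affected pages contain the same '<p class="error">' wrapper
# followed by "Failed to render property P<number>:".
ERROR_MARKER = re.compile(r'<p\s+class="error">\s*Failed to render property P\d+:')


def has_render_error(html: str) -> bool:
    """Return True if the HTML contains the Wikibase render-error paragraph."""
    return ERROR_MARKER.search(html) is not None


# Sample built from the snippet in the original report.
sample = (
    '<div id="mw-content-text" lang="zh-CN" dir="ltr" class="mw-content-ltr">'
    '<p class="error">Failed to render property P373: '
    'Wikibase\\LanguageWithConversion::factory: given languages do not '
    'have the same parent language</p>'
)

print(has_render_error(sample))
print(has_render_error("<p>Normal article text</p>"))
```

Titles of pages for which this check returns true would form the list of affected pages to purge and recrawl.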
We are actually crawling the HTML with a bot, so the bug is not actually fixed for logged-out users, right?
Could you share a link to the bug?
Thanks
On Thu, Aug 22, 2013 at 4:38 PM, Liangent liangent@gmail.com wrote:
On Fri, Aug 23, 2013 at 8:13 AM, Jiang BIAN bianjiang@google.com wrote:
We are actually crawling the HTML via bot, so the bug is not actually fixed for non-login user, right?
I can't think of a good way to fix the problem from this side other than waiting for the old cached pages to expire, unless some sysadmin is happy to nuke all existing Squid-cached pages.
However, if you have a list of affected pages from your HTML crawl, which we don't have, you can simply purge them in batch and recrawl them.
Could you share the bug's link?
There was no bug created in Bugzilla. I submitted a patch[1] directly to fix the bug once it was spotted.
[1] https://gerrit.wikimedia.org/r/#/c/76060/
-Liangent
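The batch purge suggested above could be prepared roughly as follows. This is a sketch under stated assumptions, not a confirmed procedure from this thread: it assumes the MediaWiki web API's `action=purge` module with a pipe-separated `titles` parameter, sent as a POST to `https://zh.wikipedia.org/w/api.php`, and a batch size of 50 titles per request. The helper name `build_purge_requests` is hypothetical. Whether purging a title also clears variant paths such as `/zh-cn/` is not established here. The code only builds the request bodies and does not contact the network.

```python
from urllib.parse import urlencode

# Assumed API endpoint for the wiki discussed in this thread.
API_URL = "https://zh.wikipedia.org/w/api.php"


def build_purge_requests(titles, batch_size=50):
    """Group page titles into (url, POST body) pairs for action=purge.

    batch_size=50 is an assumed per-request title limit; adjust as needed.
    """
    requests = []
    for start in range(0, len(titles), batch_size):
        batch = titles[start:start + batch_size]
        body = urlencode({
            "action": "purge",
            "titles": "|".join(batch),  # pipe-separated list of titles
            "format": "json",
        })
        # urlencode percent-encodes non-ASCII titles, so ASCII is sufficient.
        requests.append((API_URL, body.encode("ascii")))
    return requests


# Example with a deliberately small batch size to show the grouping.
reqs = build_purge_requests(["Google", "Wikipedia", "Squid"], batch_size=2)
for url, body in reqs:
    print(url, body.decode("ascii"))
```

Each pair can then be sent with, for example, `urllib.request.urlopen(url, data=body)`, which issues a POST when `data` is given. The affected pages can be recrawled afterwards.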
-- Jiang BIAN
This email may be confidential or privileged. If you received this communication by mistake, please don't forward it to anyone else, please erase all copies and attachments, and please let me know that it went to the wrong person. Thanks.
Thanks for the link. But I think that patch only targets the language-variant issue.
We have actually observed stale cache across a wider range of pages; see the bug entry: https://bugzilla.wikimedia.org/show_bug.cgi?id=46014
On Thu, Aug 22, 2013 at 5:26 PM, Liangent liangent@gmail.com wrote:
On Fri, Aug 23, 2013 at 8:33 AM, Jiang BIAN bianjiang@google.com wrote:
Thanks for the link. But I think this is targeting the language variant related fix.
This is the root cause of the behavior you mentioned. (It only happens, or happened, on zhwiki and maybe also on some other wikis with variants, right?)
-Liangent