Thanks for the help; yes, adding a User-Agent seemed to do the trick. I have, however, come up with another somewhat related question. Looking at the source to the Main Page, I see the metadata referenced in the following line:
<link title="Creative Commons" type="application/rdf+xml" href="/w/wiki.phtml?title=Main_Page&action=creativecommons" rel="meta" />
However, when I attempt to retrieve the URL specified by href, I just get the main page again. However, if I replace "&" with a simple & (before action=...), I get the RDF. Is this by design for some reason, or a bug? Thanks.
Nathan R. Yergler
Brion Vibber wrote:
Nathan R. Yergler wrote:
I'm a software engineer for the Creative Commons and am working on our on-line license/RDF validator. We're currently working on adding
<link> support for RDF retrieval, due in large part to Wikipedia's decision to use CC metadata to describe the FDL license.
Neat!
We have intial support
working, but when we test with Wikipedia, we get a 403: Forbidden.
We do block some specific user-agent strings due to past robot abuse; be sure you're using a user-agent string that identifies your software rather than a generic one.
-- brion vibber (brion @ pobox.com)
Wikitech-l mailing list Wikitech-l@Wikipedia.org http://mail.wikipedia.org/mailman/listinfo/wikitech-l