QuotationsBook.com Webmaster/Support wrote:
Hello
I'm constructing a large literary resource, and would like to query
Wikipedia automatically for articles about authors, receiving the
articles back as RSS/XML documents to present on my website, with all
the relevant backlinks to Wikipedia.
Does Wikipedia have customised syndication of its content?
If you do some sort of [semi-]dynamic inclusion of Wikipedia content
into your site, please adhere to the following:
1) Take measures to ensure that your site does not violate robots.txt.
In case it is not apparent, this includes ensuring that bots which
access your site do not indirectly violate Wikipedia's robots.txt when
your site is acting as an intermediary.
2) Have your software clearly identify itself to Wikipedia, on each
request, with a user-agent string. There are some broad user-agent
blocks in place to combat the frequent abuse of the site by bots using
certain software/libraries. It's acceptable to alter the user-agent of
your software to get past these blocks, as long as you make sure that it
is well-behaved.
3) Know what your software is doing. Monitor the behaviour of your site
to ensure that it doesn't load Wikipedia's servers with rubbish such as
the requests shown at the following URL:
http://noc.wikimedia.org/~jeronim/mysic.txt
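As a rough illustration of points 1 and 2, here is a minimal Python sketch that checks a URL against robots.txt rules before building a request, and attaches an identifying user-agent to every request. The bot name, the contact URL and the helper names (make_robot_parser, build_request) are made up for the example and are not part of any Wikipedia interface; only the standard-library urllib calls are real.

```python
import urllib.request
import urllib.robotparser

# Hypothetical identifier: substitute your own bot name and a page or
# address where the Wikipedia admins can reach you.
USER_AGENT = "ExampleQuoteSiteBot/0.1 (+http://www.example.com/bot.html)"


def make_robot_parser(robots_txt):
    """Build a parser from the text of a robots.txt file."""
    parser = urllib.robotparser.RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return parser


def build_request(url, parser):
    """Return a Request carrying our User-Agent header.

    Raises PermissionError if the robots.txt rules disallow the URL, so
    a bot crawling your site cannot push a forbidden fetch through you.
    """
    if not parser.can_fetch(USER_AGENT, url):
        raise PermissionError("robots.txt disallows " + url)
    return urllib.request.Request(url, headers={"User-Agent": USER_AGENT})
```

In real use you would pass in the text of Wikipedia's current robots.txt (refreshed periodically) rather than a hand-written rule set, and hand the returned Request to urllib.request.urlopen. Running the same check on requests triggered by visitors to your own site is what covers the "intermediary" case in point 1.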
I'd also suggest that you list your site on the relevant subpage of
http://en.wikipedia.org/wiki/Wikipedia:Mirrors_and_forks
and describe its interaction with Wikipedia. This will help to conserve
the time of the volunteers who monitor sites which use Wikipedia
content, and hopefully prevent any misunderstandings.
Regards
Amit
quotationsbook.com
_______________________________________________
Wikitech-l mailing list
Wikitech-l(a)wikimedia.org
http://mail.wikipedia.org/mailman/listinfo/wikitech-l