[WikiEN-l] extracting protein target infobox information via page export

Rajarshi Guha rajarshi.guha at gmail.com
Wed Jan 19 14:29:13 UTC 2011


Hi, I was trying to extract some information from the protein target
infobox on protein target pages (eg
http://en.wikipedia.org/wiki/Calreticulin or
http://en.wikipedia.org/wiki/Hsp90).

However when I export the page via
http://en.wikipedia.org/w/api.php?action=query&pageids=7120&export=&exportnowrap=
the XML page does not seem to contain the information that I can see
when viewing the page in the browser. For example, the XML export for
Calreticulin does not contain the links to the rendering of the
structure or the PDB identifiers and so on.

Is my export URL wrong? Or is there a reason that the infobox
information is not exported and if so, is there a way to access it via
export?

Thanks,

-- 
Rajarshi Guha
NIH Chemical Genomics Center



More information about the WikiEN-l mailing list