On 9/1/13, Jean-Frédéric <jeanfrederic.wiki(a)gmail.com> wrote:
> [..]
>> The downside to this is in order to effectively get metadata out of
>> commons given the current practises, one essentially has to screen
>> scrape and do slightly ugly things
>
> This [1] looks quite acrobatic indeed. Can’t we make better use of the
> machine-readable markings provided by templates?
> <https://commons.wikimedia.org/wiki/Commons:Machine-readable_data>
>
> [1] https://gerrit.wikimedia.org/r/#/c/80403/4/CommonsMetadata_body.php
It is using the machine-readable data from that page. (Although it's
debatable how machine-readable "look for a <td> with this id, and then
look at the contents of the next sibling <td> you encounter" really is.)
I'm somewhat of a newb, though, with extracting microformat-style
metadata, so it's quite possible there is a better way, or some
higher-level parsing library I could use (something like XPath maybe,
although it's not really XML I'm looking at).
--
-bawolff