[Mediawiki-api] Broken YAML output?

Roan Kattouw roan.kattouw at home.nl
Thu Mar 13 15:26:55 UTC 2008


L. Cleophas wrote:
> Hi
>
> I've been trying to use the API to get output into YAML format for 
> processing in Ruby. Seems (see 
> http://bugzilla.wikimedia.org/show_bug.cgi?id=12120 ) that the API's 
> YAML output is broken, as it does not escape quotes, colons followed 
> by spaces etc. This causes output like
>
> title: Job: A Comedy of Justice
>
> when getting the Wikipedia page with that title; due to the unescaped 
> second colon followed by space, Ruby's YAML.load chokes on this. Seems 
> to me that this basically means that YAML output is not usable for 
> getting data from wikipedia, quite a few of whose articles use ": " in 
> titles or other fields.
I know, I should fix it. It's just that I'm not familiar with YAML at 
all, so I need to find some time to read the YAML specs and rewrite the 
YAML formatter.
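
For illustration, here is a minimal Ruby sketch of the failure described above, and of how quoting the scalar avoids it. It is not the API's formatter code. The helper name try_yaml is made up for this example, and it assumes a Ruby whose YAML library is Psych:

```ruby
require 'yaml'

# The line as quoted above: the second ": " is left unescaped.
broken = "title: Job: A Comedy of Justice\n"

# The same value wrapped in double quotes, which YAML accepts.
quoted = "title: \"Job: A Comedy of Justice\"\n"

# Hypothetical helper: returns the parsed document,
# or nil if the parser rejects the input.
def try_yaml(text)
  YAML.load(text)
rescue Psych::SyntaxError
  nil
end

p try_yaml(broken)           # parser rejects this, so nil
p try_yaml(quoted)['title']  # the full title, colon included
```

A formatter that quotes every string value (and escapes any embedded quotes and backslashes) should sidestep the ": " problem, though that is only one possible approach.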
>
> Any thoughts on how to circumvent this (without using XML as format; I 
> know I can parse this from Ruby using REXML, but it ain't nearly as 
> easy/clean as YAML), or on a time horizon for a fix/update?
XML is not that bad. There's also JSON, which is equally easy/clean.
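
As a rough sketch of the JSON route in Ruby: the sample response below is an assumption about what a format=json query result looks like (the page id is a placeholder, and the real structure may differ), and page_titles is a made-up helper name. In practice the body would come from an HTTP request to api.php rather than a literal string.

```ruby
require 'json'

# Assumed sample of a format=json response; the page id is a placeholder
# and the real structure may differ.
SAMPLE_RESPONSE = <<~EOS
  {"query": {"pages": {"12345": {"pageid": 12345, "ns": 0,
   "title": "Job: A Comedy of Justice"}}}}
EOS

# Hypothetical helper: collect the page titles from a response body.
def page_titles(body)
  data = JSON.parse(body)
  pages = data.fetch('query', {}).fetch('pages', {})
  pages.values.map { |page| page['title'] }
end

puts page_titles(SAMPLE_RESPONSE)
```

Since JSON strings are always quoted, a ": " inside a title needs no special handling on the client side.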

Roan Kattouw (Catrope)


