Hi!
Yes, you can grab it from the API. Here is an sample of request [1]. I don’t know if pywikipedia has a nice wrapper for it but you can do this request easily by directly using low level pywikipedia code.

Thomas

[1] https://commons.wikimedia.org/w/api.php?action=query&prop=imageinfo&format=jsonfm&iiprop=metadata&iilimit=5&titles=File%3A%E0%B4%9C%E0%B4%BE%E0%B4%A4%E0%B4%BF%E0%B4%95%E0%B5%8D%E0%B4%95%E0%B5%81%E0%B4%AE%E0%B5%8D%E0%B4%AE%E0%B4%BF.pdf

Le 6 déc. 2013 à 19:00, ബാലശങ്കർ സി <c.balasankar@gmail.com> a écrit :

Hi all,
I am from ml.wikisource.org and I am having a doubt regarding Mediawiki API and PDF files. I want to know if I can use Pywikipedia to grab the text layer of a pdf file (in the file namespace, obviously) . Is the mediawiki API handling any such functionality? Thanks in advance.

Regards,
Balasankar C
http://balasankarc.in
_______________________________________________
Wikisource-l mailing list
Wikisource-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikisource-l