Hi, we are working on a complex-script PDF rendering library named PyPDFLib (https://savannah.nongnu.org/projects/pypdflib/). One of its test cases (or use cases) is to render a wiki page in a complex script (any Indian language) to PDF. Currently the PDF export feature does not work for Indian-language wiki projects because of technical limitations of the Python ReportLab library.
I wanted to give an early preview of this library through an online interface: http://silpa.smc.org.in/Render You can try it with a Wikipedia page in your language and verify the generated PDF. You can also access it directly with a URL such as http://silpa.smc.org.in/Render?wiki=http://ta.wikipedia.org/wiki/%E0%AE%87%E... (replace the wiki URL with any other page address, in any language, not only Indian languages)
A lot of items are not implemented yet, but your feedback on the current version is requested. The library uses Pango for text rendering and Cairo for graphics and PDF features.
PS: Don't be surprised if you get a 500 error page for a random page you try. Just try another wiki page ;)
Thanks Santhosh Thottingal http://thottingal.in
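The URL pattern described in the message above can be sketched in Python. This is only a minimal sketch, assuming the service accepts the page address appended unescaped after `?wiki=`, as in the example URL; the helper names `wiki_page_url` and `render_url` are hypothetical and are not part of PyPDFLib or Silpa.

```python
from urllib.parse import quote

# Endpoint of the online preview, as given in the announcement.
RENDER_ENDPOINT = "http://silpa.smc.org.in/Render"


def wiki_page_url(lang, title):
    """Build a plain-http Wikipedia page address (hypothetical helper).

    Spaces become underscores, as in MediaWiki page addresses, and
    non-ASCII characters are percent-encoded as UTF-8.
    """
    return "http://%s.wikipedia.org/wiki/%s" % (lang, quote(title.replace(" ", "_")))


def render_url(page_url):
    """Append the page address to the Render endpoint (assumed pattern)."""
    return RENDER_ENDPOINT + "?wiki=" + page_url


if __name__ == "__main__":
    print(render_url(wiki_page_url("ta", "இலங்கை")))
```

Opening the printed address in a browser should return the generated PDF, or a 500 error page for pages the preview cannot handle yet.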
Hello Santhosh
Many thanks for the tool, and yes, it worked. I rendered a Bengali Wikipedia article with it. As far as I can see, the characters are okay, but there are problems with image placement: text is not shown next to the images but is placed above or below them. Also, the reference links aren't shown. Copying the URL from the browser address bar doesn't seem to work; I had to type it. Waiting for others' feedback.
Regards,
Tanvir
[[User:Wikitanvir]] On all Wikimedia Projects [[m:User:Wikitanvir]] For Wikimedia site matrix
Wikimediaindia-l mailing list Wikimediaindia-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikimediaindia-l
Hi Santhosh,
Cool! Just some feedback on the generated PDF (the same article, Ilangai, on Tamil Wikipedia). I see that line spacing is not uniform and some characters from adjacent lines overlap, but a slightly more important issue is spacing within words for certain characters (I am not sure if it is a repeatable pattern for a particular character). It certainly takes a toll on readability, since it puts a space within a word. I am not sure if it is a Tamil-specific issue either.
Tested on Windows XP with Firefox, where Tamil Wikipedia reads fine.
Regards Srikanth.L http://srik.me
I tried 5 different pages; all of them gave me 500 error messages.
For example: http://silpa.smc.org.in/Render?wiki=https://secure.wikimedia.org/wikipedia/h...
http://silpa.smc.org.in/Render?wiki=https://secure.wikimedia.org/wikipedia/h...
http://silpa.smc.org.in/Render?wiki=https://secure.wikimedia.org/wikipedia/h...
http://silpa.smc.org.in/Render?wiki=https://secure.wikimedia.org/wikipedia/h...
I'm just tired of copying again and again, as it is past my bedtime.
Anyway, perhaps it doesn't play nicely with https (secure) URLs, or it just does not want to play with Hindi or those particular pages.
If somebody finds a Hindi page that does not give a 500 Internal Server Error (preferably with https:// turned on), please let me know.
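-- Regards, Shirish Agarwal शिरीष अग्रवाल My quotes in this email licensed under CC 3.0 http://creativecommons.org/licenses/by-nc/3.0/ http://flossexperiences.wordpress.com 065C 6D79 A68C E7EA 52B3 8D70 950D 53FB 729A 8B17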
Does it have anything to do with accessing secure Wikipedia? Can you try with a normal link?
- Tinu Cherian
On Tue, February 15, 2011 3:44 am, shirish शिरीष wrote:
Anyways, perhaps it doesn't know to play nicely with the https (secured) or just does not want to play with hindi or those particular pages.
Shirish, secure access to Wikipedia is not needed here (and is not supported as of now). It is just an anonymous read of a wiki page from our server. Just try with the normal http URL: http://silpa.smc.org.in/Render?wiki=http://hi.wikipedia.org/wiki/%E0%A4%A7%E...
Thanks Santhosh
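The workaround in this reply (use the plain http address instead of the secure one) could be automated before building the Render URL. The following is a minimal sketch, assuming secure addresses have the layout `https://secure.wikimedia.org/wikipedia/<lang>/wiki/<title>`; the addresses in the thread are truncated, so that layout is an assumption, and `to_plain_wikipedia_url` is a hypothetical helper, not part of PyPDFLib or Silpa.

```python
from urllib.parse import urlsplit


def to_plain_wikipedia_url(url):
    """Rewrite a secure.wikimedia.org page address to plain http.

    Assumed input layout: https://secure.wikimedia.org/wikipedia/<lang>/wiki/<title>
    Any other address is returned unchanged. Query strings and fragments
    are dropped in this sketch.
    """
    parts = urlsplit(url)
    if parts.netloc != "secure.wikimedia.org":
        return url
    # '/wikipedia/hi/wiki/Title' -> ['', 'wikipedia', 'hi', 'wiki/Title']
    segments = parts.path.split("/", 3)
    if len(segments) < 4 or segments[1] != "wikipedia":
        return url
    return "http://%s.wikipedia.org/%s" % (segments[2], segments[3])


if __name__ == "__main__":
    print(to_plain_wikipedia_url("https://secure.wikimedia.org/wikipedia/hi/wiki/Example"))
```

The title `Example` is a placeholder. An already percent-encoded title passes through untouched, because only the host and the first path segments are rewritten.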
Santhosh,
Good initiative. I tried the Telugu main page by substituting its address into your URL. It works, though layout/rendering needs improvement.
At present I use Firefox's print command on Ubuntu and select print to file (PDF), and I get a fine PDF file, except for the romanized URL for the Indic text portion.
Cheers Arjun
Since the tool is in the early stages of development, I know it is not fair to expect a perfect PDF.
Some general comments:
- For Malayalam, even though the text rendering is almost perfect, I found some unwanted spaces inserted within a few words. I assume this is due to the *justification alignment* used. In general I feel it is better to keep paragraph text left-aligned. Maybe this could be added as an option in the PDF generation tool.
- By default, it is better to place images inline with the text. Maybe an additional option to remove images from the rendered PDF, or to place images below the corresponding paragraph, could be added to the tool.
- The first page of the PDF needs to have some information. Maybe the introduction paragraph could be included on the first page. It is not good to leave it blank.
- The references/href links and so on need to be handled in a better way.
Since the tool is still under development, I am not going into an in-depth analysis.
Shiju
On Mon, Feb 14, 2011 at 10:36 PM, Santhosh Thottingal santhosh.thottingal@gmail.com wrote:
Just wanted to give an early preview of this software library through an online interface: http://silpa.smc.org.in/Render You can try with a Wikipedia page in your language and verify the generated PDF. You can also access this using this URL http://silpa.smc.org.in/Render?wiki=http://ta.wikipedia.org/wiki/%E0%AE%87%E... (replace that wiki URL with other page addresses too - any language - not limited to Indian languages)
Nice work, Santhosh! Can I request that you provide a way for users to email their feedback to the developers using some template? This would allow you to obtain some test cases/test pages which can be used in further releases to check for regressions.
On Tue, Feb 15, 2011 at 9:43 AM, sankarshan foss.mailinglists@gmail.com wrote:
Nice work, Santhosh! Can I request that you provide a way for users to email their feedback to the developers using some template? This would allow you to obtain some test cases/test pages which can be used in further releases to check for regressions.
Sure. I just added a link to the bug tracker of the pypdflib project, along with the email address pypdflib-dev@nongnu.org (for those who are reluctant to file a bug report), to the online interface in the silpa project (http://silpa.smc.org.in/Render).
Thanks Santhosh
Santhosh,
Excellent effort. Kudos to you and the SMC team.
I feel that we should set up a wiki page or similar, close to the project page, for users to put up feedback.
Note: I checked with Kannada. The preview version works fine for the most part, which is good. There are some trivial bugs.
Hi Santhosh,
Good work with the library. I tried a few pages like பட்டாம்பூச்சி, மாந்தவுருவகம் etc., and I got an error. However, it worked for the Tamil Wiki example you cited, இலங்கை and the Malayalam wiki example Mango.
Some quirks:
1. Insertion of intra-word spacing (as Srikanth and Shiju mentioned).
2. Hyphenation is without the hyphen sign.
3. Image layout, as others observed.
4. I notice that the இ in இலங்கை appears to be larger and to start slightly below the other characters. Is it because of the font used? In fact, I just noticed that it is so in this email text as well. Is it due to my client-side fonts?
I'm sure that it'll be very useful once the issues are fixed. I can already dream of PediaPress for Indian languages.
- Sundar
"That language is an instrument of human reason, and not merely a medium for the expression of thought, is a truth generally admitted." - George Boole, quoted in Iverson's Turing Award Lecture
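When reporting the intra-word spacing and hyphenation quirks listed above, it may help to state where the stray space or line break falls relative to the character clusters of the word. The following is a minimal sketch, assuming a simplified rule in which every combining mark attaches to the preceding base character; it is not how Pango or PyPDFLib segments text (proper segmentation follows the Unicode text segmentation rules and also handles joiners), and `simple_clusters` is a hypothetical helper.

```python
import unicodedata


def simple_clusters(word):
    """Group each base character with the combining marks that follow it.

    Simplified rule: any code point whose Unicode general category starts
    with 'M' (Mn, Mc, Me) is attached to the previous cluster. Zero-width
    joiners and other format characters are not handled here.
    """
    clusters = []
    for ch in word:
        if clusters and unicodedata.category(ch).startswith("M"):
            clusters[-1] += ch
        else:
            clusters.append(ch)
    return clusters


if __name__ == "__main__":
    word = "இலங்கை"
    # Code points and clusters differ in count for complex scripts.
    print(len(word), simple_clusters(word))
```

A space or break reported inside one of these clusters points to a different kind of problem than one that falls between clusters, so noting which case applies makes a bug report easier to act on.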
Hi, thanks for all the comments. We will add bug reports for each of these comments, with a reference to this mail thread, in our bug/task tracker.
I will post updates on the project. If anybody is interested in joining our team, this is our mailing list: http://lists.nongnu.org/mailman/listinfo/pypdflib-dev You can also post your comments to pypdflib-dev@nongnu.org
Thanks Santhosh Thottingal