On Mon, Nov 15, 2010 at 11:33 AM, Murali Kumar pthooran@hotmail.com wrote:
Dear Wikimedia India,
As you are probably aware, the Govt. of India, immediately post Independence, started multiple Indian language encyclopedia projects to stream in Science and Technology. The Tamil language encyclopedia was completed [http://en.wikipedia.org/wiki/Tamil_Encyclopedia]. I'm pleased to report that Tamil Virtual University has scanned in the Tamil Kalaikalanjiam / Tamil Encyclopedia [Please see Reference 1 below]. I was able to download the material via the wonderful wget command and the 'convert' tool (from ImageMagick) in GNU/Linux. However, each of the 10 volumes is close to 700 MB without compression. I would imagine the people behind this mammoth task (pre-internet era) would have liked it to be merged into a wiki-type format, which would make it a truly living document in sync with the times. I do not have any experience with 1) Tamil OCR software and 2) automated updates to Wikipedia.
Can anyone take the lead on this project? It will help boost the number of quality articles in Indian languages. The Children's encyclopedia is also being scanned and has a lot of great visual content. I have uploaded a sample (10 MB) PDF file at https://sites.google.com/site/periasamythooran/kalaikalanjiam/kalaikalanjiam...
Do we have a license to adopt or use this material for Tamil Wikipedia? I am working on a similar plan with the Marathi Viswakosh project in Mumbai. -- GN
On 15 November 2010 11:40, Nagarjuna G nagarjun@gnowgi.org wrote:
Do we have a license to adopt or use this material for Tamil Wikipedia? I am working on a similar plan with the Marathi Viswakosh project in Mumbai.
Is this material out of copyright? Or is there some indication from the Government that they might be willing to release it under an open license?
Thank you.
Best,
Gautam ________ http://social.prathambooks.org/
Its license needs to be verified. (I don't know whether it includes other, earlier Tamil encyclopaedias that are in the public domain, i.e. PD-OLD.) Also, it may not go directly into Tamil Wikipedia; I think it would be more suitable for Tamil Wikisource. We can then pick appropriate material for Tamil Wikipedia.
- Sundar
"That language is an instrument of human reason, and not merely a medium for the expression of thought, is a truth generally admitted." - George Boole, quoted in Iverson's Turing Award Lecture
----- Original Message ----
From: Gautam John gautam@prathambooks.org
To: wikimediaindia-l@lists.wikimedia.org
Sent: Mon, November 15, 2010 11:53:57 AM
Subject: Re: [Wikimediaindia-l] Tamil Encyclopedia merge into Wikipedia.