[Wikimediaindia-l] Tamil Encyclopedia merge into Wikipedia.
BalaSundaraRaman
sundarbecse at yahoo.com
Mon Nov 15 07:35:26 UTC 2010
Yes, we need to get it under a suitable license. If the technical issue related
to OCR is resolved, we can talk to them about releasing the content into public
domain.
- Sundar
"That language is an instrument of human reason, and not merely a medium for
the expression of thought, is a truth generally admitted."
- George Boole, quoted in Iverson's Turing Award Lecture
>
>From: Shiju Alex <shijualexonline at gmail.com>
>To: wikimediaindia-l at lists.wikimedia.org
>Sent: Mon, November 15, 2010 12:02:42 PM
>Subject: Re: [Wikimediaindia-l] Tamil Encyclopedia merge into Wikipedia.
>
>I have a query.
>
>What is the license of Tamil Kalaikalanjiam? Did Tamil Nadu government or Tamil
>Virtual University had officially announced that this Encyclopedia is released
>in Public Domain or in some creative commons license so that we can reuse the
>content. If yes, we can very well reuse the content. Otherwise it will be
>copyright violation. So kindly verify this.
>
>
>Let us not assume that since it is published by Government it will be in pubic
>domain. In India that is not the case.
>
>In 2008 December, Kerala Government has officially announced that it is
>changing the license of similar encyclopedic project in Malayalam
>(sarvavijanakosam) to Free documentation license so that Malayalam wiki
>community can reuse its content to develop Malayalam wikipedia. Governmant has
>officially announced it. Kerala Government has also set up its own wiki (to help
>us) for Sarvavijanakosamand they are slowly digitizing the content and posting
>in its own wiki (http://mal.sarva.gov.in). They have completed some 2,900
>articles now. We are reusing this content to enhance many of the existing
>articles. But we are not copy pasting the entire content due to various
>reasons. The main reason is, the content need to rewritten as per the style of
>wikipedia.
>
>I really have doubt about the efficiency of current OCR softwares for Indian
>languages. It is still under development. The existing solutions are not good.
>I am not sure about Tamil OCR softwares.
>
>Shiju Alex
>
>
>On Mon, Nov 15, 2010 at 11:33 AM, Murali Kumar <pthooran at hotmail.com> wrote:
>
>Dear Wikimedia India,
>>
>>
>>As you probably aware the Govt. of India, immediately post Independence started
>>multiple Indian language encyclopedia projects to stream in Science and
>>Technology. The Tamil language encyclopedia was completed
>>[http://en.wikipedia.org/wiki/Tamil_Encyclopedia]
>>
>>
>>I'm pleased to report Tamil Virtual University has scanned in the Tamil
>>Kalaikalanjiam / Tamil Encyclopedia [Please see Reference 1 below].
>>
>>
>>I was able to download the material via the wonderful wget command and the
>>'convert' (from imagemagick lib) in GNU/Linux. However each of the 10 volumes
>>is close to 700 MB without compression.
>>
>>
>>I would imagine, the people behind this mammoth task (pre-internet era) would
>>have liked it to be merged into a Wiki type format, which would make it a truly
>>living document in-sync with the times.
>>
>>
>>I do not have any experience with 1) Tamil OCR software and 2) Automated updates
>>to Wikipedia.
>>
>>Can anyone take the lead on this project ? It will help boost the number of
>>quality, articles in Indian languages. The Children's encyclopedia is being
>>scanned and has a lot of great visual content.
>>
>>
>>I have uploaded a sample (10 MB) PDF file at
>>https://sites.google.com/site/periasamythooran/kalaikalanjiam/kalaikalanjiamWikiMergeAttempt.pdf
>> if you are interested to give it a spin.
>>
>>
>>Thanks,
>>
>>
>>Murali.
>>
>>
>>1. http://www.tamilvu.org/library/libindex.htm and click on Kalaikalanjiam /
>>Tamil Encyclopedia.
>>_______________________________________________
>>Wikimediaindia-l mailing list
>>Wikimediaindia-l at lists.wikimedia.org
>>https://lists.wikimedia.org/mailman/listinfo/wikimediaindia-l
>>
>>
>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: http://lists.wikimedia.org/pipermail/wikimediaindia-l/attachments/20101114/5e90162f/attachment-0001.htm
More information about the Wikimediaindia-l
mailing list