FYI. Yann
---------- Forwarded message ---------- From: Ravishankar ravidreams@gmail.com Date: 2014-08-30 20:10 GMT+05:30 Subject: [Wikimediaindia-l] 20 volumes (8376 pages) of Tamil Encylopedia released under Creative Commons To: Wikimedia India Community list wikimediaindia-l@lists.wikimedia.org
Hi,
Tamil Development Board (an autonomous institution under Government of Tamilnadu) releases its Encyclopedia (10 volumes, 7407 pages) and Children's Encyclopedia (10 volumes, 969 pages) under Creative Commons license. Tamil Wikipedians lead by Prof. C. R. Selvakumar and Prof. P. R. Nakkeeran, (Director, Tamil Virtual Academy) spearheaded this initiative coinciding with Tamil Wikipedia's 10 years celebrations.
An official confirmation (in Tamil) can be seen at
https://upload.wikimedia.org/wikipedia/commons/4/46/Letter_from_Tamil_Develo...
Scanned copies of these works are already available at
http://tamilvu.org/library/kulandaikal/lku00/html/lku00ind.htm
At Tamil Wikipedia, we are discussing how we can get this content typed and transferred to WikiSource. Doing so can be a good model to encourage more such works to be released in public domain.
Following are two options I can think of:
1. Volunteers type all the content. Besides taking years to complete, this won't do justice for the value of time of volunteers who can do more valuable work than typing mechanically.
A program like IT@School present in Kerala or a contest can encourage more people to join this effort but not all communities can't emulate this model successfully.
2. Request WMF to give a grant to the owner of the content and let them hand over the typed content to Wikisource volunteers who will upload and wikify the content.
This will ensure maintaining the spirit of volunteerism and yet getting the work done in a professional and time bound manner.
Numerous works in Wikisource are such ready made content uploaded already in the web through other projects like Project Gutenberg.
If providing grants to non-Wikimedia organizations is an issue, a grant towards this can be given to community / chapter who will then outsource the typing work.
I welcome community's input on any other model for this as India has vast amount of literature and works like this are waiting to be transfered to Wikisource. This is one area where we can add lot of content to Wiki projects at once.
Ravi
_______________________________________________ Wikimediaindia-l mailing list Wikimediaindia-l@lists.wikimedia.org To unsubscribe from the list / change mailing preferences visit https://lists.wikimedia.org/mailman/listinfo/wikimediaindia-l
Let me mention of yet another possible model that we are almost about to try in ml:
Apart from the already tried and tremendously successful models of school-wise student model and general community competition (for prizes in kind or just for credential certificates), we are now pondering upon a new idea:
Use secondary social network collaboration.
As an example, if you have formed (or if you could grow from now,) a very vibrant Wikimedia or other language-focused community network in Facebook, you can open up an appropriately scaled competition model within that. Many people are now seeking independent localization tools and input methods just to use Facebook.
(And BTW, thanks to Facebook, which I always detested, for making a great invisible revolution among mainstream local community members, that most other models are still struggling to achieve!)
In Malayalam, now we have several such large communities each one specializing in their own arenas (eg. Butterflies, Plants, Birds, Grammar, Films and Film Songs etc. etc.). They all share the idea that most of such shared knowledge and digital text and images shall ultimately end up in WM pools.
So, imagine a world where, A Tamil community in Facebook takes up this project. (May or may not be with an organized competition spirit). Each member picks up a few pages one at a time, and post it back to Facebook. A small team collects this and pipes to Wikisource. Eventually, the volume gets completely in.
Just imagine... and it shall happen!
PS. Why in FB and then why not directly in Wikisourse? You know why! The interface makes a big difference. Let's admit that!
-Viswam
On Sun, Aug 31, 2014 at 12:03 AM, Yann Forget yannfo@gmail.com wrote:
FYI. Yann
---------- Forwarded message ---------- From: Ravishankar ravidreams@gmail.com Date: 2014-08-30 20:10 GMT+05:30 Subject: [Wikimediaindia-l] 20 volumes (8376 pages) of Tamil Encylopedia released under Creative Commons To: Wikimedia India Community list wikimediaindia-l@lists.wikimedia.org
Hi,
Tamil Development Board (an autonomous institution under Government of Tamilnadu) releases its Encyclopedia (10 volumes, 7407 pages) and Children's Encyclopedia (10 volumes, 969 pages) under Creative Commons license. Tamil Wikipedians lead by Prof. C. R. Selvakumar and Prof. P. R. Nakkeeran, (Director, Tamil Virtual Academy) spearheaded this initiative coinciding with Tamil Wikipedia's 10 years celebrations.
An official confirmation (in Tamil) can be seen at
https://upload.wikimedia.org/wikipedia/commons/4/46/Letter_from_Tamil_Develo...
Scanned copies of these works are already available at
http://tamilvu.org/library/kulandaikal/lku00/html/lku00ind.htm
At Tamil Wikipedia, we are discussing how we can get this content typed and transferred to WikiSource. Doing so can be a good model to encourage more such works to be released in public domain.
Following are two options I can think of:
- Volunteers type all the content. Besides taking years to complete,
this won't do justice for the value of time of volunteers who can do more valuable work than typing mechanically.
A program like IT@School present in Kerala or a contest can encourage more people to join this effort but not all communities can't emulate this model successfully.
- Request WMF to give a grant to the owner of the content and let
them hand over the typed content to Wikisource volunteers who will upload and wikify the content.
This will ensure maintaining the spirit of volunteerism and yet getting the work done in a professional and time bound manner.
Numerous works in Wikisource are such ready made content uploaded already in the web through other projects like Project Gutenberg.
If providing grants to non-Wikimedia organizations is an issue, a grant towards this can be given to community / chapter who will then outsource the typing work.
I welcome community's input on any other model for this as India has vast amount of literature and works like this are waiting to be transfered to Wikisource. This is one area where we can add lot of content to Wiki projects at once.
Ravi
Wikimediaindia-l mailing list Wikimediaindia-l@lists.wikimedia.org To unsubscribe from the list / change mailing preferences visit https://lists.wikimedia.org/mailman/listinfo/wikimediaindia-l
Wikisource-l mailing list Wikisource-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikisource-l
What is the present status of Tamil OCR? 20 volumes (8376 pages) typing should not solution for this!
Jayanta
On Sun, Aug 31, 2014 at 1:07 AM, ViswaPrabha (വിശ്വപ്രഭ) < viswaprabha@gmail.com> wrote:
Let me mention of yet another possible model that we are almost about to try in ml:
Apart from the already tried and tremendously successful models of school-wise student model and general community competition (for prizes in kind or just for credential certificates), we are now pondering upon a new idea:
Use secondary social network collaboration.
As an example, if you have formed (or if you could grow from now,) a very vibrant Wikimedia or other language-focused community network in Facebook, you can open up an appropriately scaled competition model within that. Many people are now seeking independent localization tools and input methods just to use Facebook.
(And BTW, thanks to Facebook, which I always detested, for making a great invisible revolution among mainstream local community members, that most other models are still struggling to achieve!)
In Malayalam, now we have several such large communities each one specializing in their own arenas (eg. Butterflies, Plants, Birds, Grammar, Films and Film Songs etc. etc.). They all share the idea that most of such shared knowledge and digital text and images shall ultimately end up in WM pools.
So, imagine a world where, A Tamil community in Facebook takes up this project. (May or may not be with an organized competition spirit). Each member picks up a few pages one at a time, and post it back to Facebook. A small team collects this and pipes to Wikisource. Eventually, the volume gets completely in.
Just imagine... and it shall happen!
PS. Why in FB and then why not directly in Wikisourse? You know why! The interface makes a big difference. Let's admit that!
-Viswam
On Sun, Aug 31, 2014 at 12:03 AM, Yann Forget yannfo@gmail.com wrote:
FYI. Yann
---------- Forwarded message ---------- From: Ravishankar ravidreams@gmail.com Date: 2014-08-30 20:10 GMT+05:30 Subject: [Wikimediaindia-l] 20 volumes (8376 pages) of Tamil Encylopedia released under Creative Commons To: Wikimedia India Community list wikimediaindia-l@lists.wikimedia.org
Hi,
Tamil Development Board (an autonomous institution under Government of Tamilnadu) releases its Encyclopedia (10 volumes, 7407 pages) and Children's Encyclopedia (10 volumes, 969 pages) under Creative Commons license. Tamil Wikipedians lead by Prof. C. R. Selvakumar and Prof. P. R. Nakkeeran, (Director, Tamil Virtual Academy) spearheaded this initiative coinciding with Tamil Wikipedia's 10 years celebrations.
An official confirmation (in Tamil) can be seen at
https://upload.wikimedia.org/wikipedia/commons/4/46/Letter_from_Tamil_Develo...
Scanned copies of these works are already available at
http://tamilvu.org/library/kulandaikal/lku00/html/lku00ind.htm
At Tamil Wikipedia, we are discussing how we can get this content typed and transferred to WikiSource. Doing so can be a good model to encourage more such works to be released in public domain.
Following are two options I can think of:
- Volunteers type all the content. Besides taking years to complete,
this won't do justice for the value of time of volunteers who can do more valuable work than typing mechanically.
A program like IT@School present in Kerala or a contest can encourage more people to join this effort but not all communities can't emulate this model successfully.
- Request WMF to give a grant to the owner of the content and let
them hand over the typed content to Wikisource volunteers who will upload and wikify the content.
This will ensure maintaining the spirit of volunteerism and yet getting the work done in a professional and time bound manner.
Numerous works in Wikisource are such ready made content uploaded already in the web through other projects like Project Gutenberg.
If providing grants to non-Wikimedia organizations is an issue, a grant towards this can be given to community / chapter who will then outsource the typing work.
I welcome community's input on any other model for this as India has vast amount of literature and works like this are waiting to be transfered to Wikisource. This is one area where we can add lot of content to Wiki projects at once.
Ravi
Wikimediaindia-l mailing list Wikimediaindia-l@lists.wikimedia.org To unsubscribe from the list / change mailing preferences visit https://lists.wikimedia.org/mailman/listinfo/wikimediaindia-l
Wikisource-l mailing list Wikisource-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikisource-l
Wikisource-l mailing list Wikisource-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikisource-l
Just to note, just for the time being, there is even a blessing hidden in disguise for the lackof a proper OCR for Indic languages:
Although it may look ridiculous at first, the very reason that we do not have an OCR forced us to the conduct the 2014 Digitization contest in ml.wikisource. More than milking thousands of valuable and rare old pages, it served another purpose that was not originally intended for: thousands of school children as well as otherwise non-wikimedian volunteers learned and enhanced their typing skills in local languages!
OCR is a great and urgent necessity for all Indian languages. But that is not always an excuse too. We can still pursue our mission even through such constraints, leveraging every disadvantage into an advantage.
When you have a great amount of work such as 8376 pages, do not be overwhelmed! Think of the Moore's law. And in India, our strength is our population. We will never run out of work. But we will possibly also never run out of volunteers if sufficient environment is prepared and maintained for them.
The community power of India's digital future is yet to arrive. But it is just around the corner, for sure. We just need to set ourselves ready for that. So, keep thinking of possibilities and shake off limitations!
:)
-Viswam
On Sun, Aug 31, 2014 at 3:38 AM, Jayanta Nath jayantanth@gmail.com wrote:
What is the present status of Tamil OCR? 20 volumes (8376 pages) typing should not solution for this!
Jayanta
On Sun, Aug 31, 2014 at 1:07 AM, ViswaPrabha (വിശ്വപ്രഭ) < viswaprabha@gmail.com> wrote:
Let me mention of yet another possible model that we are almost about to try in ml:
Apart from the already tried and tremendously successful models of school-wise student model and general community competition (for prizes in kind or just for credential certificates), we are now pondering upon a new idea:
Use secondary social network collaboration.
As an example, if you have formed (or if you could grow from now,) a very vibrant Wikimedia or other language-focused community network in Facebook, you can open up an appropriately scaled competition model within that. Many people are now seeking independent localization tools and input methods just to use Facebook.
(And BTW, thanks to Facebook, which I always detested, for making a great invisible revolution among mainstream local community members, that most other models are still struggling to achieve!)
In Malayalam, now we have several such large communities each one specializing in their own arenas (eg. Butterflies, Plants, Birds, Grammar, Films and Film Songs etc. etc.). They all share the idea that most of such shared knowledge and digital text and images shall ultimately end up in WM pools.
So, imagine a world where, A Tamil community in Facebook takes up this project. (May or may not be with an organized competition spirit). Each member picks up a few pages one at a time, and post it back to Facebook. A small team collects this and pipes to Wikisource. Eventually, the volume gets completely in.
Just imagine... and it shall happen!
PS. Why in FB and then why not directly in Wikisourse? You know why! The interface makes a big difference. Let's admit that!
-Viswam
On Sun, Aug 31, 2014 at 12:03 AM, Yann Forget yannfo@gmail.com wrote:
FYI. Yann
---------- Forwarded message ---------- From: Ravishankar ravidreams@gmail.com Date: 2014-08-30 20:10 GMT+05:30 Subject: [Wikimediaindia-l] 20 volumes (8376 pages) of Tamil Encylopedia released under Creative Commons To: Wikimedia India Community list <wikimediaindia-l@lists.wikimedia.org
Hi,
Tamil Development Board (an autonomous institution under Government of Tamilnadu) releases its Encyclopedia (10 volumes, 7407 pages) and Children's Encyclopedia (10 volumes, 969 pages) under Creative Commons license. Tamil Wikipedians lead by Prof. C. R. Selvakumar and Prof. P. R. Nakkeeran, (Director, Tamil Virtual Academy) spearheaded this initiative coinciding with Tamil Wikipedia's 10 years celebrations.
An official confirmation (in Tamil) can be seen at
https://upload.wikimedia.org/wikipedia/commons/4/46/Letter_from_Tamil_Develo...
Scanned copies of these works are already available at
http://tamilvu.org/library/kulandaikal/lku00/html/lku00ind.htm
At Tamil Wikipedia, we are discussing how we can get this content typed and transferred to WikiSource. Doing so can be a good model to encourage more such works to be released in public domain.
Following are two options I can think of:
- Volunteers type all the content. Besides taking years to complete,
this won't do justice for the value of time of volunteers who can do more valuable work than typing mechanically.
A program like IT@School present in Kerala or a contest can encourage more people to join this effort but not all communities can't emulate this model successfully.
- Request WMF to give a grant to the owner of the content and let
them hand over the typed content to Wikisource volunteers who will upload and wikify the content.
This will ensure maintaining the spirit of volunteerism and yet getting the work done in a professional and time bound manner.
Numerous works in Wikisource are such ready made content uploaded already in the web through other projects like Project Gutenberg.
If providing grants to non-Wikimedia organizations is an issue, a grant towards this can be given to community / chapter who will then outsource the typing work.
I welcome community's input on any other model for this as India has vast amount of literature and works like this are waiting to be transfered to Wikisource. This is one area where we can add lot of content to Wiki projects at once.
Ravi
Wikimediaindia-l mailing list Wikimediaindia-l@lists.wikimedia.org To unsubscribe from the list / change mailing preferences visit https://lists.wikimedia.org/mailman/listinfo/wikimediaindia-l
Wikisource-l mailing list Wikisource-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikisource-l
Wikisource-l mailing list Wikisource-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikisource-l
Wikisource-l mailing list Wikisource-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikisource-l
(Not that this is not self-evident)
First steps have to be * Get the digital copies * Upload to commons and get all the copyright stuff done (so we can keep the deletion maggots away) * Get the index pages created, with pagelists
After that it would seem that there needs to be gamification, and what is the 'reward'.
Our Indian colleagues have given us some great examples of what has been done, and some of the benefits that have come from it. So what follows is some random thoughts just to put out there, and to be plus'd or minuse'd by the community as they see fit. This is just a stream of green hat thinking ...
As there is no OCR, we could look to a scoring system based on a combination of ... * characters added (filesize increase) * starting a page * advancing to proofread status * advancing to validate status
We could look to * individual scoring * team scoring
An issue for team scoring is that if we had this overt for schools, classes, age groups, then we have a point of identification, and possibly some problem for predatory behaviour on kids. So teams would need to have a level of self-ownership. Using the previous conversation in the thread about education tool, there could be a value in team scoring availability in that (through facebook) it takes the gamification outside of Wikisource, in that teams can self-organise. We would just need to organise to enable teams and a scoring system.
We could give 'awards' for individual scores, and acknowledge teams. That said, what scope is there for getting some recognition from the releasing agency of the works for any team achievement?
Also to note that this is going to be a long term project, we want to have lots of milestone awards. Recognition by month, and by year. Looking to celebrate 1k 10k, 100k edit +++ As WMF is trying to push into these markets, one would hope that we could look to getting a series of microgrants/microawards that could be utilised for some recognition as a final goal.
So we would need to find some vols in the broader space who can help us build some tools, or maybe that is part of a thought in a grant application. Any tool we would want to be generic so that all places can look to utilise for any other gamification.
Anyway, just my thought bubbles.
Regards, Billinghurst
On Sun, Aug 31, 2014 at 4:33 AM, Yann Forget yannfo@gmail.com wrote:
FYI. Yann
---------- Forwarded message ---------- From: Ravishankar ravidreams@gmail.com Date: 2014-08-30 20:10 GMT+05:30 Subject: [Wikimediaindia-l] 20 volumes (8376 pages) of Tamil Encylopedia released under Creative Commons To: Wikimedia India Community list wikimediaindia-l@lists.wikimedia.org
Hi,
Tamil Development Board (an autonomous institution under Government of Tamilnadu) releases its Encyclopedia (10 volumes, 7407 pages) and Children's Encyclopedia (10 volumes, 969 pages) under Creative Commons license. Tamil Wikipedians lead by Prof. C. R. Selvakumar and Prof. P. R. Nakkeeran, (Director, Tamil Virtual Academy) spearheaded this initiative coinciding with Tamil Wikipedia's 10 years celebrations.
An official confirmation (in Tamil) can be seen at
https://upload.wikimedia.org/wikipedia/commons/4/46/Letter_from_Tamil_Develo...
Scanned copies of these works are already available at
http://tamilvu.org/library/kulandaikal/lku00/html/lku00ind.htm
At Tamil Wikipedia, we are discussing how we can get this content typed and transferred to WikiSource. Doing so can be a good model to encourage more such works to be released in public domain.
Following are two options I can think of:
- Volunteers type all the content. Besides taking years to complete,
this won't do justice for the value of time of volunteers who can do more valuable work than typing mechanically.
A program like IT@School present in Kerala or a contest can encourage more people to join this effort but not all communities can't emulate this model successfully.
- Request WMF to give a grant to the owner of the content and let
them hand over the typed content to Wikisource volunteers who will upload and wikify the content.
This will ensure maintaining the spirit of volunteerism and yet getting the work done in a professional and time bound manner.
Numerous works in Wikisource are such ready made content uploaded already in the web through other projects like Project Gutenberg.
If providing grants to non-Wikimedia organizations is an issue, a grant towards this can be given to community / chapter who will then outsource the typing work.
I welcome community's input on any other model for this as India has vast amount of literature and works like this are waiting to be transfered to Wikisource. This is one area where we can add lot of content to Wiki projects at once.
Ravi
Wikimediaindia-l mailing list Wikimediaindia-l@lists.wikimedia.org To unsubscribe from the list / change mailing preferences visit https://lists.wikimedia.org/mailman/listinfo/wikimediaindia-l
Wikisource-l mailing list Wikisource-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikisource-l
wikisource-l@lists.wikimedia.org