FYI. Yann
---------- Forwarded message ----------
From: Ravishankar <ravidreams(a)gmail.com>
Date: 2014-08-30 20:10 GMT+05:30
Subject: [Wikimediaindia-l] 20 volumes (8376 pages) of Tamil
Encylopedia released under Creative Commons
To: Wikimedia India Community list <wikimediaindia-l(a)lists.wikimedia.org>
Hi,
Tamil Development Board (an autonomous institution under Government of
Tamilnadu) releases its Encyclopedia (10 volumes, 7407 pages) and
Children's Encyclopedia (10 volumes, 969 pages) under Creative Commons
license. Tamil Wikipedians lead by Prof. C. R. Selvakumar and Prof. P.
R. Nakkeeran, (Director, Tamil Virtual Academy) spearheaded this
initiative coinciding with Tamil Wikipedia's 10 years celebrations.
An official confirmation (in Tamil) can be seen at
https://upload.wikimedia.org/wikipedia/commons/4/46/Letter_from_Tamil_Devel…
Scanned copies of these works are already available at
http://tamilvu.org/library/kulandaikal/lku00/html/lku00ind.htm
At Tamil Wikipedia, we are discussing how we can get this content
typed and transferred to WikiSource. Doing so can be a good model to
encourage more such works to be released in public domain.
Following are two options I can think of:
1. Volunteers type all the content. Besides taking years to complete,
this won't do justice for the value of time of volunteers who can do
more valuable work than typing mechanically.
A program like IT@School present in Kerala or a contest can encourage
more people to join this effort but not all communities can't emulate
this model successfully.
2. Request WMF to give a grant to the owner of the content and let
them hand over the typed content to Wikisource volunteers who will
upload and wikify the content.
This will ensure maintaining the spirit of volunteerism and yet
getting the work done in a professional and time bound manner.
Numerous works in Wikisource are such ready made content uploaded
already in the web through other projects like Project Gutenberg.
If providing grants to non-Wikimedia organizations is an issue, a
grant towards this can be given to community / chapter who will then
outsource the typing work.
I welcome community's input on any other model for this as India has
vast amount of literature and works like this are waiting to be
transfered to Wikisource. This is one area where we can add lot of
content to Wiki projects at once.
Ravi
_______________________________________________
Wikimediaindia-l mailing list
Wikimediaindia-l(a)lists.wikimedia.org
To unsubscribe from the list / change mailing preferences visit
https://lists.wikimedia.org/mailman/listinfo/wikimediaindia-l
Hi, I have setup a new OCR service on tools.wmflabs.org, it provides
through some javascript hosted on wikisource.org location data of words
for djvu/pdf Index:. It can be used by adding
mw.loader.load('//wikisource.org/w/index.php?title=MediaWiki:Hocr.js&action=raw&ctype=text/javascript&dontcountme=s');
to your site wide MediaWiki:Common.js or to your own User Common.js, the script
works in Page: namespace, in edit or view mode. There is no user interface
except double click on a word should highlight the words on the scan. I found
it very useful for encyclopedia when it can be time consuming to retrieve the
possition of words on the image.
As the ocr and profread text are always different, the location of word is
often shifted by one or more word, location provided is only approximate.
--
Phe
Hi all,
Part of one of my promises at WM2014 was to better communicate components
of our actions at enWS to the broader community.
We have submitted bugs for ...
1. Turning webfonts back on for
enWS.https://bugzilla.wikimedia.org/show_bug.cgi?id=69655
Initially the request for fraktur fonts alone, so we could do blackletter.
However when having a conversation with WMF's Languages and
Internationalisation team they encouraged me to just request for a
reinstatement of the complete webfonts package. This has now had the patch
awaiting for review. This actually replicates what heWS has already done,
so it is not exactly trail-blazing.
One outcome that I have semi-committed to do is to create a help page for
some of the fonts that are not full language fonts, but are able to be
generated with this reinstalled process. At this stage it would have to
live on enWS (due to enablement), however, if there was a local
conversation at OLDWIKISOURCE for a similar function, it would make more
sense to host it there. Hint! hint! to that community, and really should
have all those fonts.
2. General interwiki language link to oldwikisource is required
https://bugzilla.wikimedia.org/show_bug.cgi?id=32189
This is the bug to get the mul: interlanguage prefix functioning. It is
blocked by another change that is waiting for review, and I am told to nag
Reedy, and/or others to progress it, which I am doing. Others may wish to
keep a lookout for those mentioned in the bug and see if they can get their
attention in IRC if they seem them active.
3. Increasing search weighting on namespaces other than main
namespacehttps://bugzilla.wikimedia.org/show_bug.cgi?id=69771
We have found that authors in our author namespace rank pretty low in a
search compared to their works, so we have asked that we increase the
weighting for our Author: ns (ns:102). I made comment in the bugzilla that
this may be of interest to other WSes. To note that I have previously
discussed this with Nik at WM2014, and reported at our WS meeting that I
would be lodging this bugzilla. {{done}}
4. Refining typeahead search results for subpages at English
Wikisourcehttps://bugzilla.wikimedia.org/show_bug.cgi?id=69658
For our biographical works where they are subpages we want to find them
when typing, not necessarily after a search. Another that was discussed at
WM2014, again which may have broader interest. {{done}}
5. Exploring tweaking of English Wikisource header templates for better
search results https://bugzilla.wikimedia.org/show_bug.cgi?id=69772
Question to Nik to see if we can use some of our 'metadata' in templates,
eg. microformat, or other bits, in smarter ways in CirrusSearch. Again
discussed at WM2014, and how asked to progress questions.
Regards, Billinghurst
PS. I still have other homework to submit, and as I get through the
backlog of tasks, it will get done. Just prioritising at the moment.
Maybe this is of interest?
Aubrey
---------- Forwarded message ----------
From: Seth Woodworth <seth(a)sethish.com>
Date: Tue, Aug 19, 2014 at 8:09 PM
Subject: [open-humanities] Forking Project Gutenberg to Github
To: A list for people interested in the use of open source tools and open
access in humanities teaching and research <open-humanities(a)lists.okfn.org>,
okfn-labs <okfn-labs(a)lists.okfn.org>
Hello Humanities!
I've been working on a project called GITenberg <http://gitenberg.github.io>
.
The aim is to move Project Gutenberg's books to github.
As you probably know, Project Gutenberg (PG) is an amazing organization
that has been digitizing public domain books since the 1970s. They have
around 45,000 books.
But PG is hesitant to upgrade their tools, and have limited resources to
work on new projects. But there are issues with the current collection.
There are some remaining typos and transcription errors. And many books
are using old encoding formats (PG predates unicode).
I want to help with that, and along the way, produce something that more
developers, OKFN hackers, digital humanists and other groups can readily
build upon.
Enter GITenberg.
GITenberg uses git and github to keep track of books. This adds a number
of features right out of the gate, including:
+ version control via git
+ public bug tracking (PG uses a private RT instance to track reported
issues)
+ public collaboration (pull requests under public review)
PG's metadata is provided in RDF/XML, in a 230mb zip file. While this is a
wonderful resource, RDF isn't the easiest format for most developers to
pick up and use. In fact, the .zip file has so many top-level folders, it
can't be completely unpacked on some filesystems (ext3).
I've created repos and included the book source files (often including
images!) for 43,000 of PG's books and put them on github.
There is a lot yet that I hope to do, but I would love to get OKFN's
feedback, requests, or assistance!
Uploading script <https://github.com/sethwoodworth/GITenberg>
Mailing list <https://groups.google.com/forum/#!forum/gitenberg-project>
All the best,
Seth
_______________________________________________
open-humanities mailing list
open-humanities(a)lists.okfn.org
https://lists.okfn.org/mailman/listinfo/open-humanities
Unsubscribe: https://lists.okfn.org/mailman/options/open-humanities
--
http://aubreymcfato.wordpress.com
A small detail: this beta feature won't be enabled on wikis that already have the other project sidebar enabled for everyone (like the french and italian wikisources). For these wikis nothing should change.
Thomas
Le 15 août 2014 21:56, David Cuenca <dacuetu(a)gmail.com> a écrit :
---------- Forwarded message ----------
From: Lydia Pintscher <lydia.pintscher(a)wikimedia.de>
Date: Fri, Aug 15, 2014 at 8:04 PM
Subject: [Wikitech-ambassadors] new beta feature: in other projects sidebar
To: wikitech-ambassadors <wikitech-ambassadors(a)lists.wikimedia.org>
Cc: "Discussion list for the Wikidata project." <
wikidata-l(a)lists.wikimedia.org>
Hey folks :)
We'll be rolling out a new beta feature on August 26th/28th to
Wikipedia, Wikisource and Wikiquote. It will add a new section to the
sidebar of an article. This section will contain links to related
articles in other sister projects. Which projects are shown can be
configured per-wiki. You can find out more about it at
https://www.mediawiki.org/wiki/Beta_Features/Other_projects_sidebar
If you have any questions about this please use the discussion page of
https://www.mediawiki.org/wiki/Beta_Features/Other_projects_sidebar
Cheers
Lydia
--
Lydia Pintscher - http://about.me/lydia.pintscher
Product Manager for Wikidata
Wikimedia Deutschland e.V.
Tempelhofer Ufer 23-24
10963 Berlin
www.wikimedia.de
Wikimedia Deutschland - Gesellschaft zur Förderung Freien Wissens e. V.
Eingetragen im Vereinsregister des Amtsgerichts Berlin-Charlottenburg
unter der Nummer 23855 Nz. Als gemeinnützig anerkannt durch das
Finanzamt für Körperschaften I Berlin, Steuernummer 27/681/51985.
_______________________________________________
Wikitech-ambassadors mailing list
Wikitech-ambassadors(a)lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikitech-ambassadors
--
Etiamsi omnes, ego non
_______________________________________________
Wikisource-l mailing list
Wikisource-l(a)lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikisource-l
---------- Forwarded message ----------
From: Lydia Pintscher <lydia.pintscher(a)wikimedia.de>
Date: Fri, Aug 15, 2014 at 8:04 PM
Subject: [Wikitech-ambassadors] new beta feature: in other projects sidebar
To: wikitech-ambassadors <wikitech-ambassadors(a)lists.wikimedia.org>
Cc: "Discussion list for the Wikidata project." <
wikidata-l(a)lists.wikimedia.org>
Hey folks :)
We'll be rolling out a new beta feature on August 26th/28th to
Wikipedia, Wikisource and Wikiquote. It will add a new section to the
sidebar of an article. This section will contain links to related
articles in other sister projects. Which projects are shown can be
configured per-wiki. You can find out more about it at
https://www.mediawiki.org/wiki/Beta_Features/Other_projects_sidebar
If you have any questions about this please use the discussion page of
https://www.mediawiki.org/wiki/Beta_Features/Other_projects_sidebar
Cheers
Lydia
--
Lydia Pintscher - http://about.me/lydia.pintscher
Product Manager for Wikidata
Wikimedia Deutschland e.V.
Tempelhofer Ufer 23-24
10963 Berlin
www.wikimedia.de
Wikimedia Deutschland - Gesellschaft zur Förderung Freien Wissens e. V.
Eingetragen im Vereinsregister des Amtsgerichts Berlin-Charlottenburg
unter der Nummer 23855 Nz. Als gemeinnützig anerkannt durch das
Finanzamt für Körperschaften I Berlin, Steuernummer 27/681/51985.
_______________________________________________
Wikitech-ambassadors mailing list
Wikitech-ambassadors(a)lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikitech-ambassadors
--
Etiamsi omnes, ego non