Wikisource-l September 2012

wikisource-l@lists.wikimedia.org

6 participants
5 discussions

Scripto, free software for transcribing documents
by Lars Aronsson 12 Sep '13

12 Sep '13

Scripto is an alternative to the ProofreadPage extension used by Wikisource. It is based on Mediawiki but also on OpenLayers, the software used to zoom and pan in OpenStreetMap. The only website I have seen that uses Scripto is the U.K. War Department papers, and in many ways it is more clumsy than ProofreadPage. But there might be a few ideas that could be worth picking up. Take a look. The software is described at http://scripto.org/ As for reference installations, they mention http://wardepartmentpapers.org/transcribe.php -- Lars Aronsson (lars(a)aronsson.se) Aronsson Datateknik - http://aronsson.se

2 1

Support of non-arabic page number in page namespace
by Thomas PT 27 Oct '12

27 Oct '12

Hi, For people that doesn't know me, I contribute to the French Wikisource and I try to improve the Proofread Page extension. Here is a description of some modification into Proofread Page that will be deployed sooner, I think next week. 1 I have made a patch in order to add support of not-arabic page number in the "page" namespace for languages that doesn't use arabic numerals: new pages of multi-pages books will now have a name with number in their own language. Existing pages can keep their names with arabic numerals or be renamed by hand or with a bot in order to use not-arabic numbers. The pagelist tag will links to pages with a localized number but, if the localized page doesn't exist and a page with an arabic number exist, pagelist will link to this page. The pages tag and the navigation inside of the page namespace will use the same system. So, with this system, nothing, I hope, will be broken in wikis. 2 I have added support of wiki links to proofreadpage_page_status message. 3 I have written a change that create prp-pagequality-[1-4] HTML classes and add them to all links to Page namespace. These classes are done in order to allow people to color all links to page namespace. The old quality[1-4] classes can of course be ever used in order to color these links only inside of Index namespace. So I hope that these improvments will increase ProofreadPage quality. Sorry for my poor English, Thomas (User:Tpt)

3 3

Toolserver stats
by Andrea Zanni 27 Sep '12

27 Sep '12

Hi all, does someone know what happened to toolserver stats? The froze 3 days ago: http://toolserver.org/~phe/statistics.php?diff=1 Aubrey

1 0

Which books to scan to support Wikipedia
by Lars Aronsson 25 Sep '12

25 Sep '12

After Wikimania, I took a trip to Toronto, where the Internet Archive has a large book scanning center. If you have worked on Wikisource, I think you know that many of the books there come from the Internet Archive, which provides scanned images and OCR text in the form of a Djvu file, which can be proofread and presented in Wikisource. I went there to learn more about how to scan books, but another possibility is to use their existing facilities, which currently are not used at full capacity. The way the collaboration is set up, the room is provided by the University of Toronto, but the equipment and staff belong to the Internet Archive. The participating libraries around Canada decide which books to digitize and pay a small fee to the Internet Archive for scanning them. This opens up an interesting opportunity. If we want a particular book to be digitized, and we can find it in the library catalogs of the University of Toronto, it might be possible (in theory, at least) for us to present this wish and perhaps provide the money needed, either directly from the Wikimedia Foundation, or its chapters. Canada is a country with many immigrants and the library system has books in many languages. This brings me to the question: Which books would be the most important to scan, to help Wikipedia? For the Swedish and Danish Wikipedia, I know this are the out-of-copyright encyclopedias from the early 20th century, that I have already digitized in 2003 and 2008, respectively. The Czech "Ottuv slovnik" has already been digitized by Google, as the Wikipedia article points out, http://cs.wikipedia.org/wiki/Ott%C5%AFv_slovn%C3%ADk_nau%C4%8Dn%C3%BD But are the black-and-white scans good enough, or would the Czech community be interested in Internet Archive quality scans in color? If so, we have to figure out if the Internet Archive will scan it for free, if the UofT library will pay, if the Czech chapter can pay (some chapters have money, but are not allowed to send donated money abroad, because of tax deductions), or perhaps the WMF. I saw the Ottuv slovnik on the shelves, in the same building as the scanning stations. It's just waiting for somebody to piece the puzzle together. But let's begin with a wish list of which books to scan. Of course they need to be out-of-copyright to fit in the Internet Archive, Wikimedia Commons, and in Wikisource. Perhaps illustrated works are more interesting than text? This would be a GLAM + wiki cooperation that cuts across national borders. -- Lars Aronsson (lars(a)aronsson.se) Project Runeberg - free Nordic literature - http://runeberg.org/

5 5

skewed statistics
by teak 06 Sep '12

06 Sep '12

Hi, does anyone know why the wikisource statistics are so skewed from what the local special:statistics shows? compare http://stats.wikimedia.org/wikiquote/EN/Sitemap.htm with, say, http://en.wikisource.org/wiki/Special:Statistics This seems to affect only wikisource, as wikipedia and wikiqoute are fine... And which one is more trustworthy? teak

3 3

2024

2023

2022

2021

2020

2019

2018

2017

2016

2015

2014

2013

2012

2011

2010

2009

2008

2007

2006

Wikisource-l September 2012