Dear all,
DPLA* is planning the first annual DPLAFest, in Boston this October 24-25.
It would be great to have groups in attendance from Wikisource,
Wikidata, and Commons. There are already some collaboration underway
on Commons:
http://commons.wikimedia.org/wiki/Commons:Digital_Public_Library_of_America
If groups from each project would be interested in organizing a tent
or sessions at the festival, space (both physical and on the agenda)
could be provided. This is a good opportunity to connect with the
heads of GLAM institutions, and the more technical curators and
archivists (who should be recruited to the generative side :-)
Sam.
* The Digital Public Library of America - a digital platform for
sharing digital collections, and metadata about physical collections,
of all sizes. Started in America, aiming to contribute to shared
standards for similar work everywhere in the world. Focused on
free-software toolchains, CC-0 metadata, and data APIs.
There's a reasonable need to get a shared, standard set of templates,
modules, js scripts for wikisource projects - all projects sharing a
common, identical goal and facing with the same, identical issues, and
needing the same set of international, standard metadata. Nevertheless it's
very difficult to synchronize efforts, while working into different
"boiling" projects; and I personally found very frustrating to admit that
some painful efforts to solve specific issues turned out to be simply a
"rediscovering the wheel". :-(
Oldwikisource, given its "neutral" character, could be IMHO the perfect
project to share the best of source projects, and there's a perfect kind of
works that could uploaded into oldwikisource and proofreaded using common,
shared styles & tools: they are multilanguage works.
Presently, we are going to upload and proofread a three-language (French,
German, Italian and some English too) magazine: Histoire des Alpes - Storia
delle Alpi - Geschichte der
Alpen<http://it.wikisource.org/wiki/Histoire_des_Alpes_-_Storia_delle_Alpi_-_Gesc…>.
It's released under CC-BY-SA-2.0 licence, My idea is, to upload it into
oldwikisource, transcluding it by Iwpage into any interested projecy - the
proofreading/formatting job being done into oldwikisource, with common
tools, common templates, common modules, common "styles". What do you think
about?
Alex (from it.wikisource)
Just to let you know what I'm doing: I'm exploring abbyy.xml (_abbyy.gz
file in Internet Archive file list).
The abbyy.xml file contains many data to go much ahead into
"self-formatting" of text - with details that can't be found into text
layer of djvu files. It contains the XCA_Extended version of xml output of
OCR: (http://www.abbyy-developers.com/en:tech:features:xml), and this is a
brief list of its useful features:
1. coordinates l,t,r,b of any element (from page to character )
2. three main "blockType": text, table, picture;
3. four level details of text areas: region, paragraph, line, character
(and a fifth one, word, can be calculated);
4. data about indenting, font size, word and character certainty of
recognition.
Using coordinates and original images, it's possible to extract images from
original page image; this could be useful both for a "wikiReCaptcha" engine
(extracting doubtful word text and their images) and to extract (or show
without extracting) pictures (the latter can be done showing a clone of
existing thumbnail of the page as the background of a div, and setting
appropriately div and overflow coordinates, with a very low server load).
In brief: all this stuff is extremely exciting, I'm going ahead with my
bold tries, but the matter deserves IMHO the interest of best source geeks
- I'm only playing with very limited skill with a rough layman programming
style.
Alex brollo (from it.wikisource)
I've been preparing a document that explains how the three GsoC-related
projects will affect Wikisource and how book metadata could be connected
with Wikidata
https://meta.wikimedia.org/wiki/User:Micru/Wikisource_across_projects
All the tools are supposed to be opt-in, so no community will be forced to
take any tool or way of working they don't want to.
I would appreciate your feedback about the draft because we would like to
send a message to the most active users in all wikisources and invite them
to join this mailing list and the proposed Wikisource User Group [1]
The tentative list that Andrea has been preparing is here. Please
expand/reduce as you feel convenient. You know better who could be
interested!
https://meta.wikimedia.org/wiki/Global_message_delivery/Targets/Wikisource_…
Usually we would have preferred to use the Central Discussions pages only,
but experience shows that this messages tend to be ignored, maybe there are
too many of them.
Since in this case the changes/improvements are quite big, we believe that
it is important to reach out to as many users as possible to give them the
opportunity to participate in the discussions and voice their opinion.
Would be anyone available to help to write the invitation or translate it
into other languages?
Cheers
David ---Micru
[1] https://meta.wikimedia.org/wiki/Wikisource_User_Group
Hi!
As part of the Google Summer of Code 2013, Aarti Kumari Dwivedi (User:Rtdwivedi), Thibaut Horel (User:Zaran) and I are working on a refactoring of the Proofread Page extension that will allow us to add the ability to edit Page: pages using the Visual Editor. For more information, see https://www.mediawiki.org/wiki/User:Rtdwivedi
We are currently rewriting a lot of code inside of the extension, changes that may cause bugs, like the {{{pagenum}}} one that will be fixed next monday. We are trying to do our best to avoid bugs by increasing the test coverage of the extension but some other ones may occur. Sorry in advance for the inconvenience.
Thomas PT
User:Tpt
PS: We have changed last monday the canonical namespaces names for Page: and Index: namespaces from internationalized ones to English ones ("Page" and "Index") in order to be consistent with MediaWiki core and the other extensions. This allows a more easily sharing of JavaScript gadgets (to test if a page is a Page: page, you just have now to do mw.config.get( "wgCanonicalNamespace" ) === "Index", test that will work in every wikis) but breaks some scripts that are based on the internationalized namespaces names. This change add also "Page" and "Index" as aliases for the Page: and Index: namespaces in every wikis.
Hi guys,
I'm in Geneva (with fellow wikimedians) at a OA conference and we are
talking *a lot* about Wikisource.
We have found a very high quality publisher of OA books (
http://www.openbookpublishers.com/, released in CC-BY), that would be
utmost happy to have their books in Wikisource.
I think the first issue is technical:
* do we have a tool that easily takes an EPUB/HTML and convert it in books
in Wikisource? I'm thinking now about ns0, not nspage.
I think that if we can take a HTML/EPUB index, and transform it in a draft
Wikisource index of links, and upload all the chapters, formatted, we would
have done the 90% of an upload of a book.
This would be really important to insert up to date, high quality OA
content in Wikisource, easily accessible for Wikipedians too.
And, moreover, Open Access books are more relevant to Wikisource than Open
Access articles (IMHO).
Aubrey