Hei,
I'm from the French chapter, Wikimédia France, and I want to annonce a
good news for Wikisource. Some months ago Wikimédia France signed a
partnership ("small" but significant in France) with the French National
Library (BnF) who gives to Wikisource 1400 texts with their OCR [1].
Today these books arrived on Commons [2] and are ready to be checked by
the Wikisource community. The list is available on [3], the range of books
covers many century, different levels of confidence in the OCR and
different subjects, but most of them are in French.
Now there is work for 800 years say somebody, 20 years for another, 'don't
know, we'll see :-)
Wikimedially,
Sébastien/Seb35 (I'm presently at Wikimania, Gdańsk, Poland)
[1]
http://www.wikimedia.fr/wikimédia-france-and-french-national-library-enter-…
[2] http://commons.wikimedia.org/wiki/Category:Books_provided_by_the_BNF
[3]
http://fr.wikisource.org/wiki/Wikisource:Dialogue_BnF/Liste_de_textes_fourn…
[[Please distribute widely to various language communities, projects,
and chapters]]
Hi All,
I'd like to begin a conversation about the 2010-2011 Fundraiser, which
isn't slated to launch for a few months, but for which we'd like to
get community involvement early and often. As you no doubt are aware,
the strategic plan calls for the "many small gifts" model to be the
centerpiece of our funding strategy, so our community fundraiser is
one of the key methods by which we finance and underwrite the
operations of the projects. The fundraiser this year will probably,
as in earlier years, be primarily banner driven. We're going to have
a strong emphasis on testing and iterating ideas, with a defined
methodological testing plan.
But the most important part of what we - all of us - are going to need
to do is what this community has always been good at: thinking,
researching, and iterating.
With that in mind, it's important to identify people who want to
help. Of course, anyone's welcome to join in and help at any time,
but there's a definite need for people who are willing to be deeply
involved from now to the wrap up... people who want to be creative but
rigorous, innovative but willing to learn from the past, and most of
all, to serve as an active part of the team working on this
fundraiser. There will, of course, be Foundation staff deeply
involved in this, but there's a real need for people from the
community to step up and help us design this thing.
If you're willing to help, would you add your name to http://meta.wikimedia.org/wiki/Fundraising_2010/Committee
? We'll be in contact - soon - to get things started.
Thanks,
Philippe
____________________
Philippe Beaudette
Head of Reader Relations
Wikimedia Foundation
philippe(a)wikimedia.org
ofc: +1 415 839 6885 (x 643)
mobile: 918 200 WIKI (9454)
Imagine a world in which every human being can freely share in
the sum of all knowledge. Help us make it a reality!
http://wikimediafoundation.org/wiki/Donate
Hello Aubrey,
On Thu, Jul 15, 2010 at 7:26 PM, Aubrey <zanni.andrea84(a)gmail.com> wrote:
> The issue of metadata is nontheless serious, because it's one of the most
> important flaws of Wikisource: not applying standards (i.e Dublin Core) and not
> having a proper tools for export/import and harvest metadata
Both good points. Are there proposals on wikisource to address these
two points in a way that's friendly to wikisource contributors?
>> I want us to get better, faster, less held up by the idea of
>> coordinating with other projects, because there are much larger
>> projects out there worthy of coordinating with. The annotators who
>> work on the Perseus Project come to mind... but that's truly a harder
>> problem than this one.
>
> The Perseus project is an *amazing* project, but I regard them far more ahead
> than us. The PP is actually a Virtual Research Environment, with tools for
> scholars and researcher for studying texts, (concordances and similar stuff).
<
> I would love to have PP people involved in collaboration with Wikisource, just
> don't know if this is possible.
Yes, PP is ahead of us in some ways. But in other ways they have run
into bottleneck and multilingual issues that a wiki environment can
resolve.
I believe that Prof. Greg Crane of the Perseus Project (cc:ed here) is
interested in starting to collaborate with Wikisource, even while
pursuing ideas about developing a larger framework for wiki-style
annotations and editions.
While it may be hard in the short term, in the long term that's what I
think we all want wikisource to become.
> It is interesting because a project similar to PGDP (it is Italian and started
> in 1993, emulating the glorious PG, just with Italian texts) is, right now,
> moving to a wiki. Although the scale is way smaller, Wikipedia and Wikisource
> showed them a system which tends to eliminate bottlenecks, and for them this is
> becoming crucial.
<
> Luckily, the relationships with the Italian Wikisource are really good, and
> they'll probably share an office with Wikimedia Italy, in October.
> The interesting fact is that the offices will be within a library ;-), so I
> really expect a collaboration there.
Wow. This is all great to hear -- can you include a link to the
project? I'd like to blog about it.
Warmly,
SJ
On Fri, Jul 16, 2010 at 9:26 AM, Aubrey <zanni.andrea84(a)gmail.com> wrote:
>...
>
> The issue of metadata is nontheless serious, because it's one of the most
> important flaws of Wikisource: not applying standards (i.e Dublin Core) and not
> having a proper tools for export/import and harvest metadata is still make us
> amateurs, at least for "real" digital libraries (who focus mainly on the
> metadata stuff, and sometimes provide either texts or images (it is really rare
> to have both)).
This is also a problem with Wikimedia Commons.
http://strategy.wikimedia.org/wiki/Proposal:Dublin_Core
> The Perseus project is an *amazing* project, but I regard them far more ahead
> than us. The PP is actually a Virtual Research Environment, with tools for
> scholars and researcher for studying texts, (concordances and similar stuff).
I agree. I would go further; PP will always be far more advanced than
a mediawiki system.
They store their data in TEI format, which is an extremely rich
standard. Wikisource can incorporate some of the TEI concepts by
using templates, but I doubt we could ever be a leader in this area,
nor do I think we want to.
http://en.wikipedia.org/wiki/Text_Encoding_Initiative
> It happens that I just finished my Master thesis about collaborative digital
> libraries for scholars (in the Italian context), and the outcome is quite clear:
> researcher do want collaborative tools in DLs, but wiki system are
> to simple and (right now) too naive to really help scholars in their work (and
> there's a lot of other issues I'm not going to explain here).
>
> I would love to have PP people involved in collaboration with Wikisource, just
> don't know if this is possible.
I agree. PP and Wikisource are too different, and have very little to
gain from the other. PP wants to improve/increase collaboration &
community, but not at the expense of loosing the quality of their
metadata. Wikisource wants to improve quality and metadata, but not
at the expense of the ability to collaboration and our simple editing
interface.
Again, interoperability is the first step towards useful
'collaboration'. i.e. Wikisource needs to export TEI. Then we could
feed our poorly annotated/described sources into PP, where the
academic community would then add the metadata.
TEI export would also be useful for wiktionary.
> Just one more thing: why this awesome thread has not been linked to the
> source-l? Probably that would have been the best place to discuss.
;-)
--
John Vandenberg
Did you see this interesting thread?
http://thread.gmane.org/gmane.org.wikimedia.foundation/46475
Nemo
-------- Messaggio Originale --------
Oggetto: [Foundation-l] Wikisource and reCAPTCHA
Data: Wed, 23 Jun 2010 21:39:09 +0100
Da: Michael Peel <email(a)mikepeel.net>
Rispondi-a: Wikimedia Foundation Mailing List
<foundation-l(a)lists.wikimedia.org>
A: Wikimedia Foundation Mailing List <foundation-l(a)lists.wikimedia.org>
Referenze: <815482.36310.qm(a)web45902.mail.sp1.yahoo.com>
(Renaming the subject as we've changed topic)
On 23 Jun 2010, at 21:31, Mariano Cecowski wrote:
> --- El mié 23-jun-10, Michael Peel <email(a)mikepeel.net> escribió:
>
>> I always think than not using reCaptcha is a shame, as it's
>> a nice way to get people to proofread text in a reasonably
>> efficient way. It would be really nice if someone could
>> create something similar that proofreads OCR'd text from
>> Wikisource... <hint, hint>.
>
> And how do you decide that what was entered is wrong or right?
>
> Better take a look at Project Gutemberg's Distributed Proofreaders[1].
>
> Cheers,
> MarianoC.-
>
> [1] http://pgdp.net
My understanding is that original text within the reCAPTCHA is shown to
several different people; if they agree then the word is counted as
correct. Looking at the Wikipedia article, it's a little more complex
than that:
http://en.wikipedia.org/wiki/ReCAPTCHA
There's a reason why there are two words to solve during a reCAPTCHA.
What Distributed Proofreaders can do, Wikisource can do - but in a Wiki
environment. If you haven't checked out the proofreading features that
Wikisource now has, I would encourage you to give them a go, e.g. at:
http://en.wikisource.org/wiki/Page:Frederic_Shoberl_-_Persia.djvu/92
Mike
_______________________________________________
foundation-l mailing list
foundation-l(a)lists.wikimedia.org
Unsubscribe: https://lists.wikimedia.org/mailman/listinfo/foundation-l
Firefox 4 and Chrome 6 have support for Websockets.
I wrote a websocket server that forwards Recent Changes to a web
browser, in order to visualize them dynamically.
Here is a list of pages using it:
*[http://toolserver.org/~thomasv/rcsound.html a page that plays a sound
everytime a page is proofread at the most active wikisources]
*[http://toolserver.org/~thomasv/wprc.html en.WP's recent changes] (it
scrolls kind of fast)
*[http://toolserver.org/~darkdadaah/wiktio/outils/rc/fr_wikt_rc_table.html
fr.wiktionary's RCs]
If you decide to write another page that uses this server, please add it
to the list.
It is possible to use this tool directly on any Wikimedia wiki ; here is
a script that turns the RC page of your wiki into a self-updating page:
importScriptURI('http://wikisource.org/w/index.php?title=MediaWiki:DynamicRC.js&action=raw&c…');
I hope you enjoy it. If you decide to write a page that uses this tool,
please add it to this list :
https://wiki.toolserver.org/view/RC_Websocket_server
Thomas
There is a lot of potential in Wikisource, but it depends
heavily on the ProofreadPage extension and it has several
bugs that are reported but don't get fixed.
ThomasV is the main developer and perhaps he is the only
maintainer? It would be in the interest of the Wikimedia
Foundation to assign a salaried developer or two into
developing a more robust framework for Wikisource, either
by improving the existing extension or by integrating
some or all of its functionality into MediaWiki proper.
People everywhere have a need to make some PDF (or Djvu)
document available on a website, page by page, with the
ability to add categories and talk pages. This ability
is what the ProofreadPage extension adds to MediaWiki.
In my mind, it is as essential as the support for uploading
JPEG images and automatically generating thumbnails.
Adding multipage documents to a wiki should be a far more
common need than adding mathematical equations.
--
Lars Aronsson (lars(a)aronsson.se)
Aronsson Datateknik - http://aronsson.se
de.ws is already not respecting the system, because it didn't marked the empty pages as "Empty". fr.ws has 17000 empty pages, en.ws has 15000, de.ws has 1000. I suppose that a great part of the missing 14000 or 15000 are marked as "Validated"... and de.ws has 54000 validated pages.
----- Message d'origine -----
De : ThomasV
Envoyés : 01.07.10 12:22
À : discussion list for Wikisource, the free library
Objet : Re: [Wikisource-l] [Wikitech-l] Wikisource bugs
* Thirdly, if the de.wikisource community decides, by vote or by
consensus, that they want IPs to be allowed to change the quality
status of pages, they can do this without destroying
ProofreadPage. They just need to step out of the ProofreadPage
quality system, and restore their previous system in place of it.
I am willing to explain how to do this to any technically skilled
person. Note that I already made the same proposal in bugzilla
8 months ago.
_______________________________________________ Wikisource-l mailing list Wikisource-l(a)lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikisource-l