This information interests Wikisouce.
Yann
-------- Original Message --------
Subject: [JIRA] Resolved: (TS-179) Install OCR software Tesseract globally
Date: Tue, 24 Feb 2009 04:43:06 +0000 (UTC)
From: River Tarnell (JIRA) <jira(a)toolserver.org>
To: yann(a)forget-me.net
https://jira.toolserver.org/browse/TS-179?page=com.atlassian.jira.plugin.sy…
River Tarnell resolved TS-179.
------------------------------
Resolution: Fixed
> Install OCR software Tesseract globally
> ---------------------------------------
>
> Key: TS-179
> URL: https://jira.toolserver.org/browse/TS-179
> Project: Toolserver
> Issue Type: Task
> Security Level: Public (all users)(Any user can view this issue (default))
> Components: Software installation
> Reporter: Yann Forget
> Assignee: River Tarnell
>
> Hello,
> Please install OCR software Tesseract globally. I use it for Wikisource.
> Thanks,
--
This message is automatically generated by JIRA.
-
If you think it was sent incorrectly contact one of the administrators:
https://jira.toolserver.org/secure/Administrators.jspa
-
For more information on JIRA, see: http://www.atlassian.com/software/jira
--
http://www.non-violence.org/ | Site collaboratif sur la non-violence
http://www.forget-me.net/ | Alternatives sur le Net
http://fr.wikisource.org/ | Bibliothèque libre
http://wikilivres.info | Documents libres
Apologies in advanced for the cross-posting. :-)
Please circulate this call among Wikimedia communities, researchers
and other people that may be interested! This call is also online at
http://wikimania2009.wikimedia.org/wiki/Call_for_Participation
== Call for Participation ==
Wikimania is an annual global event devoted to Wikimedia projects
around the globe (including Wikipedia, Wikibooks, Wikisource,
Wikinews, Wiktionary, Wikiversity, Wikiquote, Wikispecies, and
Wikimedia Commons). The conference is a community gathering, giving
the editors and users of Wikimedia projects an opportunity to meet
each other, exchange ideas, report on research and projects, and
collaborate on the future of the projects. The conference is open to
the public, and is a chance for educators, researchers, programmers
and free culture activists who are interested in the Wikimedia
projects to learn more and share ideas about the Wikimedia projects.
This year's conference will be held from '''August 26-28''' in Buenos
Aires, Argentina at '''San Martín Cultural Center'''.
For more information, please visit the official Wikimania 2009 site at
http://wikimania2009.wikimedia.org.
We are accepting submissions for presentations, workshops, panels,
posters, open space discussions, and artistic works related to the
Wikimedia projects or free content topics in general. Please carefully
follow the submission guidelines below.
=== Important dates ===
* '''Submissions will open on:''' March 1
* '''Deadline for submitting workshop, panel, and presentation
submissions:''' April 15
* '''Deadline for submitting posters, open space discussions, and
artistic works:''' April 30
* '''Notification of acceptance of workshops, panels, presentations:''' May 15
* '''Notification of acceptance of posters, discussions, and artistic
works:''' May 31
* '''Conference dates:''' August 26-28
=== Themes and tracks ===
There are two tracks for submission: the '''Casual Track''', for
members of wiki communities and interested observers to share their
own experiences and thoughts and to present new ideas; and the
'''Academic Track''', for research based on the methods of scientific
studies exploring the social, content or technical aspects of
Wikipedia, the other Wikimedia projects, or other massively
collaborative works, as well as open and free content creation and
community dynamics more generally.
Submissions to either track should address one or more of the following themes:
* '''"Wikimedia Communities,"''' including the topics of conflict
resolution and community dynamics; reputation and identity;
multi-lingualism and languages and cultures.
* '''"Free Knowledge,"''' including open access to information; ways
to gather and distribute free knowledge, use of the Wikimedia projects
in education, journalism, research; ways to improve content quality
and usability.
* '''"Latin American challenges,"''' centering on efforts and
limitations for expanding the reach of Wikimedia projects in Latin
America; promotion of projects in Native American languages; specific
problems of the Spanish and Portuguese-speaking Wikimedia communities.
* '''"Technical infrastructure,"''' including issues related to
MediaWiki development and extensions; Wikimedia's technical
infrastructure; and new ideas for development.
Papers should be of interest to members of the Wikimedia communities,
and fit within one of the themes above.
=== Types of Submissions ===
We are seeking submissions for:
* '''Presentations''' (10–30 minute talks with discussion afterwards)
:* This type of submission is appropriate for presenting substantial
research or community projects
* '''Workshops''' (60–120 minute session with a discussion leader and
more audience involvement)
:* This type of submission is appropriate for sessions designed to
teach a specific subject or explore it in depth
* '''Panels''' (group of 2-5 speakers to discuss aspects of a topic
with audience questions, 45-90 minute sessions)
:* This type of submission is appropriate for discussions on a topic
of wide interest among community members, with several participants
who may be presenting their work. For less formal discussions of
limited interest, consider an open space discussion instead.
* '''Open space discussions''' (informal discussion on a specific
topic; the discussion leader helps moderate the conversation but the
session is open to anyone interested to join in)
:* This type of submission is good for a topic that several
participants want to discuss or brainstorm about in an informal
setting
* '''Posters''' (printed visual displays that can stand on their own,
with no associated presentation)
:* This type of submission is good for presenting research in
progress, or smaller community projects
* '''Artistic works''' (plays, competitions, comedy, visualizations,
displays or other representations of some aspect of the projects)
:* This type of submission is good for showing creativity or
showcasing beautiful work about the projects.
In addition there will be the chance to give lightning talks, which
are 5-minute short presentations. Lightning talk sessions will be
organized on the Wikimania 2009 wiki shortly before the conference
begins, without any need to submit them via the submission system.
These talks are best for those who want to quickly present an idea or
project without giving a formal presentation. These are informal talks
that are open to everyone to participate in.
=== Submission Guidelines ===
Wikimania is organized by volunteers, so please help us minimize
wasted effort by submitting via the submission system and following
these guidelines. All submissions MUST include the following:
# '''Event title:''' an English or Spanish title.
# '''Abstract:''' a short English or Spanish abstract of your event in
50 to 100 words. The abstract will be used for the public schedule.
# '''Themes and track:''' list the track you wish to submit to (Casual
or Academic) and the single theme you think your submission fits in
best (Wikimedia Communities, Free Knowledge, Latin American
challenges, Technical infrastructure). Note that posters and artistic
works have their own track in the submission system.
# '''Information about the speaker:''' full name, email, and a short biography.
# '''Submission file:''' A plain text, PDF or OpenDocument file, in
English or Spanish, containing:
#* '''A long description of the submission''', in English or Spanish
that can be used for reviewing, not to exceed 1000 words. Please give
an overview of the areas to be covered or taught. State clearly the
relevance to the Wikimedia projects and whether submission concerns a
specific wiki project. You can also include links, Include graphics an
diagrams if they do not exceed one page.
#* '''Event type:''' please state if the event is a presentation,
workshop, panel, open space discussion, poster, or artistic work; if a
presentation or panel, whether the presentation is expected to be a
certain length.
#* '''For panel submissions only:''' name of a suggested moderator and
short biographies of each suggested panelist
#* '''Language:''' list the language you plan to present in. The
conference will be bilingual in English and Spanish.
#* '''Special requirements:''' list any special requirements,
including any equipment.
In the "Comments for conference director" field you should tell us
whether you will attend to Wikimania (a) surely, (b) probably, (c)
only if your submission is accepted, or (d) only if we provide travel
and/or accommodation. You can also add yourself to the public list of
attendees at the Wikimania 2009 wiki:
http://wikimania2009.wikimedia.org/wiki/Attendees
Please note that all submissions must be dual licensed under the GNU
Free Documentation License version 1.2 or later ''and'' the Creative
Commons Attribution-Share Alike 3.0 3.0! By submitting for Wikimania
2009 you agree to this condition.
===Submissions===
Once you are sure you have included all of the required information,
please send your submission before the respective deadline through our
'''submission system''':
http://wikimania2009.wikimedia.org/wiki/Submission
If you have further questions, email wikimania-program(a)wikimedia.org
(in English or Spanish).
Hi Erik, I'm crossposting this message to the wikisource-l, if anyone is
interested to give some inputs.
The http://stats.wikimedia.org/wikisource/EN/TablesDatabaseWords.htm seens
to be inaccurate. Apparently your tool compute only words in the main
namespace. It may works for projects like Wikipedia and theirs very long
talk pages at the namespace Project: on some subjects (such as deletion
requests). But it doens't work for Wikisource for two main reasons:
1) Some subdomains have custom namespaces for short biographies and list of
works by author (en, it, pt and others), some have it on the main namespace
(fr, de, es and others). This is a minor issue, since the amount of words on
those pages is small
2) Some Wikisources (de, fr and en, according to
http://wikisource.org/wiki/Wikisource:ProofreadPage_Statistics ) have large
amount of contents in a custom namespace devoted to the ProofreadPage
Extension ( http://www.mediawiki.org/wiki/Extension:Proofread_Page ). This
content is displayed on main namespace within page transclusion (see
http://en.wikisource.org/w/index.php?title=35_Sonnets&action=edit for an
example).
Is possible to include the custom namespaces for all Wikisources on your
automated calculation tool?
[[:m:User:555]]
> De: "John Vandenberg" <jayvdb(a)gmail.com>
> A: "discussion list for Wikisource, the free library" <wikisource-l(a)lists.wikimedia.org>
> Objet: Re: [Wikisource-l] Changing the Wikisource main page
> Date: Sun, 14 Sep 2008 06:03:36 +1000
> A Chinese "word" has more meaning than a Spanish "word". I dont have
> the numbers, but the word "word" is not the same in all languages.
> This makes words a very complex statistic.
>
> --
> John
>
> _______________________________________________
> Wikisource-l mailing list
> Wikisource-l(a)lists.wikimedia.org
> https://lists.wikimedia.org/mailman/listinfo/wikisource-l
I may have found a very simple solution : if we agree that a chinese sign is a word as we understand "word", than we have to found how many sign there are. I made a test, and found that a chinese sign is 3 octets. The very same statistics tells us that the average number of octets of an article on the chinese wikisource is 1957. So, there are 1957/3 = 652.3 words. The statistics counts (on may 31, 2008) 29084 articles for the chinese wikisource, and 652.3*29084 gives 18.9M words for total.
The only question remaining is : why the statistics page presents 29.3M as the number of words for the chinese wikisource ? Is that the number of "groups of letters" ?
Anyway, if we accept the figures, we would have : 1. English : 211M words - 2. French : 125M - 3. Spanish : 41.8M - 4. Russian : 22.2M - 5. Chinese : 18.9M - 6. Polish : 18.2M - 7. Portuguese : 15.5M - 8. Deutsch : 14.4M - 9. Italian : 12.0M - 10. Arabic : 10.6M.
---------- Forwarded message ----------
From: Brion Vibber <brion(a)wikimedia.org>
Date: Sat, Nov 22, 2008 at 11:49 AM
Subject: [Wikitech-l] Upload filesize limit bumped
To: Wikimedia developers <wikitech-l(a)lists.wikimedia.org>
-----BEGIN PGP SIGNED MESSAGE-----
Hash: SHA1
Now that we've got uploads running on our new, beefier file servers,
I've experimentally bumped the upload limit from 20 to 100 megabytes.
Files nearing the high end of that range might not actually succeed,
though, as it'll be hitting post-size limits etc.
As time goes on we'll be improving ways to upload large video files in
particular...
- -- brion
-----BEGIN PGP SIGNATURE-----
Version: GnuPG v1.4.8 (Darwin)
Comment: Using GnuPG with Mozilla - http://enigmail.mozdev.org
iEYEARECAAYFAkknVv4ACgkQwRnhpk1wk45HaQCgzH3rAHh9cvBUQtSt6FZtO0cF
s+0AoNNBOGGA9S8vFE4AQqALJuCHVtUZ
=YKTr
-----END PGP SIGNATURE-----
_______________________________________________
Wikitech-l mailing list
Wikitech-l(a)lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikitech-l
The subdomain coordination page[1] says that FlaggedRevs has been
requested for Hebrew[2] and Russian[3] Wikisource. Are there any
other projects that want to try it? The email below suggests that
they should be set up soon.
1. http://wikisource.org/wiki/Wikisource:Subdomain_coordination
2. https://bugzilla.wikimedia.org/show_bug.cgi?id=14648
3. https://bugzilla.wikimedia.org/show_bug.cgi?id=15006
---------- Forwarded message ----------
From: Brion Vibber <brion(a)wikimedia.org>
Date: Mon, Nov 17, 2008 at 11:56 AM
Subject: Re: [Wikitech-l] FlaggedRevs setups restarting
To: Wikimedia developers <wikitech-l(a)lists.wikimedia.org>
-----BEGIN PGP SIGNED MESSAGE-----
Hash: SHA1
Daniel ~ Leinad wrote:
>> Ok, Rob's starting to clear out some of the FlaggedRevs setup requests,
>> now that we've cleaned up some of the configuration files.
>>
>> For today we've set up en.wikibooks.org as requested:
>> https://bugzilla.wikimedia.org/show_bug.cgi?id=14618
>>
>> If everything's going smoothly, we'll start chugging through the rest
>> over the coming days.
>
> When you continue enable FalggedRevs on remaining projects?
Monday. :)
- -- brion
-----BEGIN PGP SIGNATURE-----
Version: GnuPG v1.4.8 (Darwin)
Comment: Using GnuPG with Mozilla - http://enigmail.mozdev.org
iEYEARECAAYFAkkgwTIACgkQwRnhpk1wk473CQCgvkfEETfaiI6QWvrbgCKsUIkB
kXgAoJuaGzIDaJsD5mTXy3gL/+fOiVRJ
=6340
-----END PGP SIGNATURE-----
_______________________________________________
Wikitech-l mailing list
Wikitech-l(a)lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikitech-l