For a long time, Indic-language Wikisource projects depended entirely
on manual proofreading, which cost a lot of time and energy. Recently
Google released OCR software for more than 20 Indic languages, along
with other Asian languages. This software is far better and more
accurate than previous OCRs, but it has many limitations. Uploading the
same large file twice (once for Google OCR and once to Commons) is not
an easy solution for most contributors, as Internet connections in
India are very slow. If we develop a tool that can feed the PDF or DjVu
files already uploaded to Commons directly to Google OCR, uploading
them twice can be avoided.
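A minimal sketch of the first step such a tool would need, assuming the
file already lives on Commons: building a direct URL to it, so that a
server-side tool can fetch the file itself instead of the contributor
uploading it a second time. The helper name commons_file_url is
hypothetical; Special:FilePath is the Commons page that redirects to the
original file. How the file is then handed to Google's OCR is not shown,
since that interface is not described here.

```python
from urllib.parse import quote

# Special:FilePath redirects to the original uploaded file on Commons.
COMMONS_FILEPATH = "https://commons.wikimedia.org/wiki/Special:FilePath/"


def commons_file_url(filename: str) -> str:
    """Return a URL that redirects to the original file on Commons.

    Hypothetical helper for illustration; accepts a title with or
    without the "File:" prefix.
    """
    name = filename.strip()
    # Drop an optional "File:" namespace prefix.
    if name.lower().startswith("file:"):
        name = name[len("file:"):]
    # MediaWiki titles use underscores in URLs instead of spaces.
    name = name.strip().replace(" ", "_")
    # Percent-encode everything else (e.g. Bengali characters) as UTF-8.
    return COMMONS_FILEPATH + quote(name)


if __name__ == "__main__":
    print(commons_file_url("File:Example book.djvu"))
```

A tool along these lines would only need the Commons file name from the
contributor; the large file itself would never travel over the
contributor's connection again.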
This was proposed in the 2015 Community Wishlist. Now that voting on
the wishlist has started, the proposal needs your support. Please
follow the link-
FYI, this proposal was also accepted as a highest-priority need at the
2015 Wikisource Conference in Vienna.
Administrator, Bengali Wikipedia
''Imagine a world in which every single person on the planet is given
free access to the sum of all human knowledge.''
I have checked it with Bengali images; it works fine, with 100% accuracy.
Can it somehow be implemented in the Proofread extension?
---------- Forwarded message ----------
From: Subhashish Panigrahi <subhashish(a)cis-india.org>
Date: Sat, Aug 29, 2015 at 3:22 PM
Subject: [Wikimediaindia-l] Google's Optical Character Recognition software
now works with all South Asian languages
Google's OCR, apparently the most accurate OCR we have seen so far,
works really well for all the major South Asian languages.
Here are test cases for many Indian scripts: https://goo.gl/3X75iR.
Except for Gurmukhi, most scripts work really well.
This could be really useful for Indian-language Wikimedians and will
come in handy for digitizing printed and scanned text. Here is an
animated tutorial for Wikimedians to use this tool for
Please write to me if anyone wants to localize this tutorial in your
Programme Officer, Access To Knowledge
Centre for Internet and Society
@subhapa / https://cis-india.org
Luis Villa, from WMF, asks if there is a list that wants to try Discourse:
Discourse is (great) discussion software, and it could be very useful
for bringing order to the many mailing lists that we have in the Wikimedia
This list is active, but not overly so, so I'm asking whether we want to
volunteer as guinea pigs :-)
Many great open source projects use Discourse, so we are far from being
pioneers, and from what I've seen we could really benefit from the many
features the software has.
Just take a look yourselves:
I need a template solution for the following proof-reading problem I faced
in Bengali Wikisource.
There is a reference in Page 1 which is continued in Page 2. I have used
the following code in Page 1:-
<ref> Text of Page 1 reference and <includeonly>text of Page 2
It looked fine while proofreading, but it does not work when transcluded
into mainspace: the text of the Page 2 reference is not shown there.
Can anyone help me with a solution? Do we need some more templates? Thanks.
in regards to https://meta.wikimedia.org/wiki/Wikilore
On Tue, Jan 19, 2016 at 2:55 AM, Johan Jönsson <brevlistor(a)gmail.com> wrote:
> 2016-01-18 16:37 GMT+01:00 Tanweer Morshed <wiki.tanweer(a)gmail.com>:
>> Yeah, I do think the same as well. Wikisource is a good option for that.
> It could be that the Wikisources I'm familiar with are the exceptions,
> but they tend to be very wary of material that has never been
> published, and this seems to very much be about writing down what's
> passed down orally.
> I also imagine the need to do this would be greater the smaller the
> language, or at least the less likely it is someone else would write
> them down, which probably would correlate with fewer persons who are
> potential Wikimedians. Wouldn't that require the creation of a fair
> number of Wikisources that would probably never have a decent chance
> of getting a community?
Wikisource could host most of it, if the oral history is recorded from
a native speaker and direct cultural participant.
There was a good talk about this at the Hong Kong Wikimania.
The recording would be uploaded to Wikimedia Commons, and then the
transcription can be created on the Multilingual Wikisource, or a more
appropriate Wikisource if one exists for the language, and it is a
valid source because it has appropriate provenance.
Some Wikisources might reject this type of work, especially if the
provenance is not of high quality, but if the provenance is of high
quality it is a valid type of source.
The Second International Conference on Green Computing, Intelligent and
Renewable Energies (GCIRE2016)
University of Perpetual Help System DALTA
Las Piñas-Manila, Philippines
February 24-26, 2016
The event will be held over three days with presentations delivered by
researchers from the international community, including presentations
from keynote speakers and state-of-the-art lectures. All registered
papers will be included in the SDIWC Digital Library.
RESEARCH TOPICS INCLUDE (BUT ARE NOT LIMITED TO):
- Benefits of, and barriers to, adopting greener IT practices
- Carbon metering and user feedback
- Climate and ecosystem monitoring
- Energy harvesting, storage, and recycling
- Energy-aware high performance computing and applications
- Energy-aware software
- Energy-efficient network services and operations
- Green IT metrics, maturity models, standards, and regulations
- Green computing models, methodologies and paradigms
- Green networking and communication
- Life-cycle analysis of IT equipment
- Management and profiling tools for energy efficient systems
- Modeling-representations, simulation and validation for energy
consumption optimization problems
- Online dynamic optimization for energy efficient systems
- Power-aware algorithms and protocols
- Power-efficient delivery and cooling
- Renewable energy models and prediction
- Smart buildings and urban development
- Smart homes, buildings, offices, streets
- Stability of smart energy systems
- Using IT to reduce carbon emissions
- Carbon management policies and ecology-related issues with ICT
- Characterization, metrics, and modeling
- Creating green awareness using IT
- Energy-aware computing
- Energy-aware large scale distributed systems, such as Grids, Clouds
and service computing
- Energy-efficient mass data storage and processing
- Governments’ roles in fostering and enforcing green initiatives
- Green business process reengineering and management
- Green design, manufacture, use, disposal, and recycling of computers
and communication systems
- Green software engineering
- Low-power electronics and systems
- Matching energy supply and demand
- Network design optimization
- Optimization of energy-efficient protocols
- Power-aware software and hardware
- Reliability, thermal behavior and control
- Robustness and performance guarantees
- Smart grid and microgrids
- Smart transportation and manufacturing
- Sustainable computing
- International Journal of New Computer Architectures and their
Applications (IJNCAA); EISSN 2220-9085, ISSN 2412-3587
- International Journal of Cyber-Security and Digital Forensics
(IJCSDF); EISSN 2225-658X, ISSN 2412-6551
- International Journal of Digital Information and Wireless
Communications (IJDIWC); EISSN 2305-0012
- International Journal of New Computer Architectures and their
Applications (IJNCAA); EISSN 2410-0439
- Researchers are encouraged to submit their work electronically in PDF
format, without author name(s).
- Full papers must be submitted (abstracts are not acceptable).
- Submitted papers should not exceed 15 pages, including illustrations,
and must be without page numbers.
- Paper submission link:
Submission Deadline: January 24, 2016
Acceptance Notification: 2-3 weeks from the submission date or Feb. 3,
Camera Ready Deadline: February 14, 2016
Registration Deadline: February 14, 2016
Conference Dates: February 24 - 26, 2016
Please see co-event conferences to be held in the Philippines:
* The International Conference on Innovations in Intelligent Systems and
Computing Technologies (ICIISCT2016)
* The Second International Conference on Electrical and Electronic
Engineering, Telecommunication Engineering, and Mechatronics
1) Is there any bot running that can use the IA upload tool to transfer
files from the Internet Archive to Commons? I see lots and lots of public
domain files on IA that are not present on Commons. It is next to
impossible to do this manually.
2) Is there any bot running that can create index pages on the respective
language Wikisources whenever PDF or DjVu files are uploaded from IA?
If no such bots exist, can such bot accounts be created?