Hi,
For a long time, Indic-language Wikisource projects depended entirely
on manual proofreading, which cost a great deal of time and energy.
Recently Google released OCR software for more than 20 Indic
languages, along with other Asian languages. This software is far more
accurate than previous OCR tools, but it has many limitations.
Uploading the same large file twice (once to Google OCR and once to
Commons) is not practical for most contributors, as Internet
connections in India are very slow. If we develop a tool that feeds
the PDF or DjVu files already uploaded to Commons directly to Google
OCR, uploading them twice can be avoided.
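As a rough illustration of the first step such a tool would need, here
is a minimal sketch that builds the request for the direct URL of a
file already on Commons. It assumes the standard MediaWiki API
(action=query with prop=imageinfo and iiprop=url); the helper name and
the example file name are hypothetical, and handing the file on to
Google OCR is not shown because the proposal does not specify how that
would work.

```python
from urllib.parse import urlencode

# Public MediaWiki API endpoint of Wikimedia Commons.
COMMONS_API = "https://commons.wikimedia.org/w/api.php"


def commons_file_query_url(file_name: str) -> str:
    """Build the API request that returns the direct URL of a Commons file.

    `file_name` is the title without the "File:" prefix. The helper name is
    hypothetical; only the query parameters come from the MediaWiki API.
    """
    params = {
        "action": "query",
        "titles": f"File:{file_name}",
        "prop": "imageinfo",
        "iiprop": "url",
        "format": "json",
    }
    return f"{COMMONS_API}?{urlencode(params)}"


if __name__ == "__main__":
    # Hypothetical file name, used only for illustration.
    print(commons_file_query_url("Example book.djvu"))
```

A tool on a server with a fast connection could fetch the file from the
returned URL, so the contributor uploads it only once.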
This was proposed in the 2015 Community Wishlist Survey. Now that
voting on the wishlist has started, the proposal needs your
support. Please follow the link:
https://meta.wikimedia.org/wiki/2015_Community_Wishlist_Survey/Wikisource#T…
FYI, this proposal was also accepted as a highest priority need at the
2015 Wikisource Conference in Vienna.
(https://etherpad.wikimedia.org/p/wscon2015needs)
Regards
--
Bodhisattwa Mandal
Administrator, Bengali Wikipedia
''Imagine a world in which every single person on the planet is given
free access to the sum of all human knowledge.''
Hi all,
Is there any tool or bot that moves an Index page together with its
corresponding Page: pages?
We need to rename/move some Index pages. How can we do this?
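For reference, the set of moves involved can be sketched as below. This
is only an illustration of the title mapping, assuming the usual
Wikisource convention that the pages of Index:NAME live at Page:NAME/1,
Page:NAME/2 and so on; the function name is hypothetical, the namespace
names may be localized on a given wiki, and the moves themselves would
still have to be carried out by a bot or an editor with the right
permissions.

```python
from typing import List, Tuple


def planned_moves(old_name: str, new_name: str, page_count: int) -> List[Tuple[str, str]]:
    """List (old title, new title) pairs for an Index page and its Page: pages.

    Assumes pages are numbered 1..page_count under "Page:<file name>/<n>".
    """
    moves = [(f"Index:{old_name}", f"Index:{new_name}")]
    for n in range(1, page_count + 1):
        moves.append((f"Page:{old_name}/{n}", f"Page:{new_name}/{n}"))
    return moves


if __name__ == "__main__":
    # Hypothetical file names, used only for illustration.
    for old, new in planned_moves("Old name.djvu", "New name.djvu", 3):
        print(f"{old} -> {new}")
```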
Regards,
Jayanta Nath
Hi All,
I have checked it with Bengali images; it works fine, with 100% accuracy.
Can it somehow be implemented in the Proofread Page extension?
Regards,
Jayanta
---------- Forwarded message ----------
From: Subhashish Panigrahi <subhashish(a)cis-india.org>
Date: Sat, Aug 29, 2015 at 3:22 PM
Subject: [Wikimediaindia-l] Google's Optical Character Recognition software
now works with all South Asian languages
To: wikimediaindia-l(a)lists.wikimedia.org
Google's OCR, which is apparently the most accurate OCR we have seen
so far, works really well for all the major South Asian scripts:
http://globalvoicesonline.org/2015/08/29/googles-optical-character-recognition-software-now-works-with-all-south-asian-languages
Here are test cases of many Indian scripts: https://goo.gl/3X75iR.
Except for Gurmukhi, most scripts work really well.
This could be really useful for Indian-language Wikimedians and will
come in handy for digitization of printed and scanned text. Here is an
animated tutorial for Wikimedians to use this tool for
Wikisource/Wikipedia:
https://commons.wikimedia.org/wiki/File:Tutorial_to_use_Google_Optical_Character_Recognition.gif
Please write to me if you want to localize this tutorial in your
language.
--
Best!
Subhashish Panigrahi
Programme Officer, Access To Knowledge
Centre for Internet and Society
@subhapa / https://cis-india.org
_______________________________________________
Wikimediaindia-l mailing list
Wikimediaindia-l(a)lists.wikimedia.org
To unsubscribe from the list / change mailing preferences visit
https://lists.wikimedia.org/mailman/listinfo/wikimediaindia-l
Luis Villa, from WMF, asks if there is a list that wants to try Discourse:
https://lists.wikimedia.org/pipermail/wikimedia-l/2016-January/081244.html
Discourse is (great) discussion software, and it could be very useful
for bringing order to the many mailing lists that we have in the Wikimedia
world.
This list is active, but not overly so, so I'm asking whether we want to
volunteer as guinea pigs :-)
Many great open source projects use Discourse, so we are far from being
pioneers, and from what I've seen we could really benefit from the many
features the software has.
Just look at it yourselves:
discourse.org
Aubrey
Hi,
I need a template solution for the following proofreading problem I faced
on Bengali Wikisource.
There is a reference on Page 1 that continues on Page 2. I used the
following code on Page 1:
<ref> Text of Page 1 reference and <includeonly>text of Page 2
reference</includeonly></ref>
It was okay while proofreading, but when the page was transcluded into
mainspace, it did not work: the text of the Page 2 reference is not shown.
Can anyone help me with a solution? Do we need some more templates? Thanks.
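One approach that may apply here is the "follow" attribute of the Cite
extension's <ref> tag: the footnote on Page 1 is given a name, and the
continuation on Page 2 is written as a <ref> that follows that name, so
the two parts are joined into one footnote on transclusion. The sketch
below only generates the two pieces of markup; the function name and
the reference name are hypothetical, and it assumes that the Cite
extension on Bengali Wikisource supports "follow".

```python
from typing import Tuple


def split_reference(name: str, first_part: str, second_part: str) -> Tuple[str, str]:
    """Return the <ref> markup for Page 1 and Page 2 of a split footnote.

    Page 1 carries a named reference; Page 2 carries the continuation,
    attached to it with follow="<name>".
    """
    page1 = f'<ref name="{name}">{first_part}</ref>'
    page2 = f'<ref follow="{name}">{second_part}</ref>'
    return page1, page2


if __name__ == "__main__":
    # Hypothetical reference name and text, used only for illustration.
    p1, p2 = split_reference("p1", "Text of Page 1 reference and",
                             "text of Page 2 reference")
    print(p1)
    print(p2)
```

With this approach, the Page 2 markup is placed at the very start of
the Page 2 text, and no <includeonly> tags are needed.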
--
Bodhisattwa
In regard to https://meta.wikimedia.org/wiki/Wikilore
On Tue, Jan 19, 2016 at 2:55 AM, Johan Jönsson <brevlistor(a)gmail.com> wrote:
> 2016-01-18 16:37 GMT+01:00 Tanweer Morshed <wiki.tanweer(a)gmail.com>:
>> Yeah, I do think the same as well. Wikisource is a good option for that.
>
> It could be that the Wikisources I'm familiar with are the exceptions,
> but they tend to be very wary of material that has never been
> published, and this seems to very much be about writing down what's
> passed down orally.
>
> I also imagine the need to do this would be greater the smaller the
> language, or at least the less likely it is someone else would write
> them down, which probably would correlate with fewer persons who are
> potential Wikimedians. Wouldn't that require the creation of a fair
> number of Wikisources that would probably never have a decent chance
> of getting a community?
Wikisource could host most of it, if the oral history is recorded from
a native speaker and direct cultural participant.
There was a good talk about this at the Hong Kong Wikimania.
The recording would be uploaded to Wikimedia Commons, and then the
transcription can be created on the Multilingual Wikisource, or a more
appropriate Wikisource if one exists for the language, and it is a
valid source because it has appropriate provenance.
Some Wikisources might reject this type of work, especially if the
provenance is not high quality, but if the provenance is high quality
it is a valid type of source.
--
John Vandenberg
The Second International Conference on Green Computing, Intelligent and
Renewable Energies (GCIRE2016)
University of Perpetual Help System DALTA
Las Piñas-Manila, Philippines
February 24-26, 2016
http://sdiwc.net/conferences/gcire2016/
=======================================
The event will be held over three days with presentations delivered by
researchers from the international community, including presentations
from keynote speakers and state-of-the-art lectures. All registered
papers will be included in the SDIWC Digital Library.
RESEARCH TOPICS INCLUDE (BUT ARE NOT LIMITED TO):
- Benefits of, and barriers to, adopting greener IT practices
- Carbon metering and user feedback
- Climate and ecosystem monitoring
- Energy harvesting, storage, and recycling
- Energy-aware high performance computing and applications
- Energy-aware software
- Energy-efficient network services and operations
- Green IT metrics, maturity models, standards, and regulations
- Green computing models, methodologies and paradigms
- Green networking and communication
- Life-cycle analysis of IT equipment
- Management and profiling tools for energy efficient systems
- Modeling-representations, simulation and validation for energy
consumption optimization problems
- Online dynamic optimization for energy efficient systems
- Power-aware algorithms and protocols
- Power-efficient delivery and cooling
- Renewable energy models and prediction
- Smart buildings and urban development
- Smart homes, buildings, offices, streets
- Stability of smart energy systems
- Using IT to reduce carbon emissions
- Carbon management policies and ecology-related issues with ICT
- Characterization, metrics, and modeling
- Creating green awareness using IT
- Energy-aware computing
- Energy-aware large scale distributed systems, such as Grids, Clouds
and service computing
- Energy-efficient mass data storage and processing
- Governments’ roles in fostering and enforcing green initiatives
- Green business process reengineering and management
- Green design, manufacture, use, disposal, and recycling of computers
and communication systems
- Green software engineering
- Low-power electronics and systems
- Matching energy supply and demand
- Network design optimization
- Optimization of energy-efficient protocols
- Power-aware software and hardware
- Reliability, thermal behavior and control
- Robustness and performance guarantees
- Smart grid and microgrids
- Smart transportation and manufacturing
- Sustainable computing
SPECIAL ISSUES:
- International Journal of New Computer Architectures and their
Applications (IJNCAA); EISSN 2220-9085, ISSN 2412-3587
- International Journal of Cyber-Security and Digital Forensics
(IJCSDF); EISSN 2225-658X, ISSN 2412-6551
- International Journal of Digital Information and Wireless
Communications (IJDIWC); EISSN 2305-0012
- International Journal of New Computer Architectures and their
Applications (IJNCAA); EISSN 2410-0439
SUBMISSION GUIDELINES:
- Researchers are encouraged to submit their work electronically in PDF
format without author name(s).
- Full paper must be submitted (abstracts are not acceptable).
- Submitted paper should not exceed 15 pages, including illustrations
and must be without page numbers.
- Paper submission link:
http://sdiwc.net/conferences/gcire2016/paper-submission/
IMPORTANT DATES:
Submission Deadline: January 24, 2016
Acceptance Notification: 2-3 weeks from the submission date or Feb. 3,
2016
Camera Ready Deadline: February 14, 2016
Registration Deadline: February 14, 2016
Conference Dates: February 24 - 26, 2016
Please see co-event conferences to be held in the Philippines:
* The International Conference on Innovations in Intelligent Systems and
Computing Technologies (ICIISCT2016)
* The Second International Conference on Electrical and Electronic
Engineering, Telecommunication Engineering, and Mechatronics
(EEETEM2016)
Hi,
Two questions:
1) Is there a bot running that can use the IA upload tool to transfer
files from the Internet Archive to Commons? I see lots and lots of public
domain files on IA that are not present on Commons. It is next to
impossible to do this manually.
2) Is there a bot running that can create index pages on the respective
language Wikisources whenever PDF or DjVu files are uploaded from IA?
If no such bots exist, can these bot accounts be created?
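To make the two steps concrete, here is a minimal sketch of the names
such a bot would work with. It assumes the Internet Archive's usual
download URL pattern (archive.org/download/<identifier>/<file name>)
and the Wikisource convention that the index page for a file is titled
"Index:" plus the file name; the function names and the example
identifier are hypothetical, and no upload or page creation is
performed.

```python
from urllib.parse import quote


def ia_download_url(identifier: str, file_name: str) -> str:
    """Build the download URL of one file in an Internet Archive item."""
    return f"https://archive.org/download/{quote(identifier)}/{quote(file_name)}"


def index_title(commons_file_name: str) -> str:
    """Title of the Wikisource index page for a file hosted on Commons."""
    return f"Index:{commons_file_name}"


if __name__ == "__main__":
    # Hypothetical item identifier and file name, used only for illustration.
    print(ia_download_url("example-item", "example book.pdf"))
    print(index_title("Example book.pdf"))
```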
Regards
--
Bodhisattwa