FYI \o/
On 03/03/2014 15:13, bugzilla-daemon@wikimedia.org wrote:
Rohit Dua 8ohit.dua@gmail.com changed bug 57813 https://bugzilla.wikimedia.org/show_bug.cgi?id=57813
What | Removed | Added
CC | | 8ohit.dua@gmail.com
Comment #2 https://bugzilla.wikimedia.org/show_bug.cgi?id=57813#c2 on bug 57813 https://bugzilla.wikimedia.org/show_bug.cgi?id=57813 from Rohit Dua 8ohit.dua@gmail.com
(In reply to vladjohn2013 from comment #0 https://bugzilla.wikimedia.org/show_bug.cgi?id=57813#c0)
> Google Books > Internet Archive > Commons upload cycle
Wikisources all around the world make heavy use of Google Books (GB) digitizations for transcription and proofreading. As GB provides just the PDF, the usual cycle is:
1. go to Google Books and look for a book
2. check if the book is already in the Internet Archive (IA)
2.1 if it's not, upload it there
3. get the djvu from IA
4. upload it on Commons
5. use it on Wikisource
For point 4, we have this awesome tool: https://toolserver.org/~tpt/iaUploadBot/step1.php
What we are missing right now is a tool for point 2.1, which would also serve many other users outside the Wikimedia movement. Eventually, we could think of a bot/script that does all the work in one go, notifying the user when their help is needed (e.g. metadata polishing, Commons categories, etc.)
Mentors: Aubrey is available for "design" mentorship, paired with a technical expert. We can maybe ask for help from an IA expert.
URL: https://www.mediawiki.org/wiki/Mentorship_programs/Possible_projects#Google_Books_.3E_Internet_Archive_.3E_Commons_upload_cycle
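As an illustration of point 2 of the cycle above (checking whether a book is already in IA), here is a minimal Python sketch. It is not the project's script. It assumes that matching on the book title through the Internet Archive advanced-search endpoint is good enough, which will often not be true in practice. The function names (`build_ia_search_url`, `identifiers_from_response`, `next_step`, `find_in_ia`) are hypothetical and chosen only for this example.

```python
import json
from urllib.parse import urlencode
from urllib.request import urlopen

# Internet Archive advanced-search endpoint (returns JSON when output=json).
IA_SEARCH = "https://archive.org/advancedsearch.php"


def build_ia_search_url(title, rows=5):
    """Build a search URL for text items whose title matches `title`.

    Assumption: a title match is a usable proxy for "already in IA".
    """
    # Escape double quotes so the title stays a single quoted phrase.
    phrase = title.replace('"', '\\"')
    params = [
        ("q", f'title:"{phrase}" AND mediatype:texts'),
        ("fl[]", "identifier"),  # only ask for the item identifier
        ("rows", str(rows)),
        ("output", "json"),
    ]
    return IA_SEARCH + "?" + urlencode(params)


def identifiers_from_response(payload):
    """Extract item identifiers from a decoded search response."""
    docs = payload.get("response", {}).get("docs", [])
    return [doc["identifier"] for doc in docs if "identifier" in doc]


def next_step(identifiers):
    """Map the search result onto the numbered cycle from the bug report."""
    if identifiers:
        return "3: get the djvu from IA item " + identifiers[0]
    return "2.1: upload the Google Books PDF to IA"


def find_in_ia(title):
    """Query IA over the network and return matching identifiers."""
    with urlopen(build_ia_search_url(title), timeout=30) as resp:
        return identifiers_from_response(json.load(resp))
```

Calling `next_step(find_in_ia("some book title"))` would then say whether to continue at point 2.1 or point 3. A real tool would probably need a stricter match (author, year, or an identifier) before skipping the upload.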
Hi
This is to inform you that I am working on Bug 57813 https://bugzilla.wikimedia.org/show_bug.cgi?id=57813 - Google Books > Internet Archive > Commons upload cycle, as a GSOC-2014 project. I have the outline of the Google Books download script ready.
--
Rohit Dua
8ohit.dua
New Delhi, India
You are receiving this mail because:
- You voted for the bug.
Rohit Dua just wrote to me regarding that project (I was one of the possible mentors). If any of you is technically skilled and wants to help, there is plenty of room for that :-)
Aubrey
On Mon, Mar 3, 2014 at 7:24 PM, Luiz Augusto lugusto@gmail.com wrote:
[...]
Wikisource-l mailing list Wikisource-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikisource-l
A suggestion: don't focus on Google only. Generalize the tool for other sources. There are plenty of libraries, such as Opal, that share much better images to grab. The quality of the scans shared by Google is often very poor.
(PS: I presume I could grab the whole Alma Mater library content... but at present I'm busy; Aubrey, "stai sereno" ("rest easy") :-) )
Alex
2014-03-04 10:21 GMT+01:00 Andrea Zanni zanni.andrea84@gmail.com:
[...]
Yes, please generalize the tool for other websites as well! As far as the Indic Wikisources are concerned, we depend heavily on the Digital Library of India and other sources!
Jayanta
Calcutta
On Tuesday, March 4, 2014, Alex Brollo alex.brollo@gmail.com wrote:
[...]