Hello everyone, can I ask whether you are currently using the IA Upload tool with IA books that *do not* have a DjVu file?
For a few weeks I have been trying to upload this book https://archive.org/details/ComeRuinareLAutoritaImage
with the tool. In theory IA Upload can now make the DjVu by itself, but in this case it's not working.
But maybe it's just this book. Have you had any issues?
Andrea
(also, BUB is not currently working)
Andrea Zanni, 26/09/2016 15:04:
(also, BUB is not currently working)
That's https://phabricator.wikimedia.org/T144554 , I don't know what to do about it.
Nemo
Time to develop an excellent multilingual OCR (better: hOCR) service on the wiki side, isn't it?
Alex
Wikisource-l mailing list Wikisource-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikisource-l
I submitted an issue on GitHub: https://github.com/Tpt/ia-upload/issues/13
I think that the IA Upload tool is a critical step in the Wikisource workflow, and I wonder if maybe Sam (as a Community Tech employee) could dedicate some time to it. Tpt can't maintain everything by himself...
For years, I've explained to *a lot* of GLAMs that uploading stuff on IA and then using Wikisource is the way to do things, and I'm sure this is the standard way in other places too.
Maybe it's just me, but I really think it's a priority for our community to have a functioning tool like this, which provides a good-quality image from the Archive.
What do you think? How should we solve the issue? Is it something that Tpt can solve, or should we find another solution?
Andrea
On Thu, 29 Sep 2016, at 07:36 PM, Andrea Zanni wrote:
I think that IA Upload tool is a critical step in the Wikisource workflow, and I wonder if maybe Sam (as a Community Tech employee) could dedicate some time to it. Tpt can't maintain everything by himself... For years, I've explained to *a lot* of GLAMs that uploading stuff on IA and then using Wikisource is the way to do things, and I'm sure this is the standard way in other places too.
Yes, I agree: the IA-Commons-Wikisource workflow is a thing that should be encouraged no end! :-)
As far as my work-programming time goes, you (I say 'you' but I just mean 'not-me', for CoI reasons) just need to get tickets onto the Community-Tech board, then I can perhaps look at them. Which basically means they have to contribute towards a Wishlist item.
There's Wishlist #44: https://phabricator.wikimedia.org/T120785 - *Implement an Internet Archive-like digitalization service.* I reckon it'd be great to be able to at least upload a PDF or Djvu with no text layer, and have it create one (either on the same file, or to upload a new derived file).
—Sam
Alex's idea is very bold and I like it: my only fear is that it's too bold and will never be implemented.
At the moment, we do not have a proper alternative, independent environment for working with scans: I wonder if that should be something related to CropTool [1] or Commons in general.
What I would like to see *now* (for the good is better than the best, if it's quicker) is a working IA-Upload with good support for DjVu (because the Internet Archive's PDF is often very low quality).
From what I understood from Tpt, the IA-upload tool should already do that, but evidently there are issues at the moment.
For people who know how to run a script from the command line, one way to generate a DjVu from IA is Alex Brollo's script [2].
Ideally, I would like to see this script integrated with the IA Upload tool, so we can use the existing IA > ia_upload tool > Commons > Wikisource workflow, as usual. Many librarians are still using it.
Aubrey
[1] https://tools.wmflabs.org/croptool/
[2] https://it.wikisource.org/wiki/Progetto:Bot/Programmi_in_Python_per_i_bot/djvuCl.py
Alex's idea is very bold and I like it: my only fear is that it's too bold and will never be implemented.
It has already been implemented as a WMF Tool Labs tool by Phe, and ia-upload has been modified to call it when no DjVu is available in IA. I'm planning to take some time to debug the current problems.
Cheers,
Thomas
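For anyone who wants to check by hand whether an item falls into the "no DjVu available in IA" case described above, here is a minimal Python sketch. It is not taken from ia-upload and is not a description of how that tool decides: it only assumes the public Internet Archive metadata endpoint (https://archive.org/metadata/<identifier>), which returns a JSON record with a "files" list. The function names are hypothetical, chosen for illustration.

```python
import json
from urllib.request import urlopen

# Public Internet Archive metadata endpoint; the identifier is the last
# part of an https://archive.org/details/<identifier> URL.
METADATA_URL = "https://archive.org/metadata/{identifier}"


def fetch_item_metadata(identifier):
    """Download the metadata record for an Internet Archive item."""
    with urlopen(METADATA_URL.format(identifier=identifier)) as response:
        return json.load(response)


def djvu_files(metadata):
    """Return the names of files in the record that end in .djvu.

    Derived text files such as *_djvu.txt or *_djvu.xml are deliberately
    not counted, since they are not DjVu scans themselves.
    """
    return [
        entry["name"]
        for entry in metadata.get("files", [])
        if entry.get("name", "").lower().endswith(".djvu")
    ]


def needs_conversion(metadata):
    """True when the item has no DjVu file (hypothetical helper)."""
    return not djvu_files(metadata)
```

With network access, `needs_conversion(fetch_item_metadata("ComeRuinareLAutoritaImage"))` would report whether the book discussed in this thread currently has a DjVu file on IA.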
BUB is back from its 503 error, but the queue is still stuck :-(