Hoi, This article is of both interest to Commons and Wikipedia.. It is awesome. Thanks, GerardM
Commons should do the same for images buried in PDF and DjVu file types
Sent from my iPad
On Aug 29, 2014, at 5:34 AM, Gerard Meijssen gerard.meijssen@gmail.com wrote:
Hoi, This article is of both interest to Commons and Wikipedia.. It is awesome. Thanks, GerardM
http://www.bbc.com/news/technology-28976849 _______________________________________________ Commons-l mailing list Commons-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/commons-l
On Fri, Aug 29, 2014 at 10:04 AM, Jane Darnell jane023@gmail.com wrote:
Commons should do the same for images buried in PDF and DjVu file types
not sure about DjVu but PDF is already covered:
https://commons.wikimedia.org/wiki/User:Open_Access_Media_Importer_Bot#Galle...
-Jeremy
Thanks, Gerard!
This seems like a great idea.
I believe that Liam Wyatt and Andrew Lih are reaching out to the project leader, to see if he needs help uploading some of that content to Commons.
Music to my ears :)
Fabrice
On Aug 29, 2014, at 2:34 AM, Gerard Meijssen gerard.meijssen@gmail.com wrote:
Hoi, This article is of both interest to Commons and Wikipedia.. It is awesome. Thanks, GerardM
http://www.bbc.com/news/technology-28976849 _______________________________________________ Commons-l mailing list Commons-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/commons-l
_______________________________
Fabrice Florin Product Manager, Multimedia Wikimedia Foundation
A note of caution: this material isn't really suitable for being dumped en masse into Commons just now. as it won't have much metadata beyond "an image, unidentified, from a book on subject X". See https://www.flickr.com/photos/internetarchivebookimages/14595431897/ for an example of what the automated labelling is like. It's certainly useful to keep an eye on, but we'll need to hold off until some of the identification work has been done :-)
We went through this with a similar collection from the British Library - https://commons.wikimedia.org/wiki/Commons:British_Library/Mechanical_Curato... - which is slowly being migrated, bit by bit.
Andrew.
On 29 August 2014 22:14, Fabrice Florin fflorin@wikimedia.org wrote:
Thanks, Gerard!
This seems like a great idea.
I believe that Liam Wyatt and Andrew Lih are reaching out to the project leader, to see if he needs help uploading some of that content to Commons.
Music to my ears :)
Fabrice
On Aug 29, 2014, at 2:34 AM, Gerard Meijssen gerard.meijssen@gmail.com wrote:
Hoi, This article is of both interest to Commons and Wikipedia.. It is awesome. Thanks, GerardM
http://www.bbc.com/news/technology-28976849 _______________________________________________ Commons-l mailing list Commons-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/commons-l
Fabrice Florin Product Manager, Multimedia Wikimedia Foundation
https://www.mediawiki.org/wiki/User:Fabrice_Florin_(WMF)
Commons-l mailing list Commons-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/commons-l
beyond "an image, unidentified, from a book on subject X". See https://www.flickr.com/photos/internetarchivebookimages/14595431897/ for an example of what the automated labelling is like. It's certainly
On that page it says "Taken on July 29, 2014". People are really going overboard with those Instagram filters! :-)
I think it would be better to have the whole books, rather than images out of context.
Regards,
Yann
2014-08-29 15:04 GMT+05:30 Gerard Meijssen gerard.meijssen@gmail.com:
Hoi, This article is of both interest to Commons and Wikipedia.. It is awesome. Thanks, GerardM
http://www.bbc.com/news/technology-28976849
Commons-l mailing list Commons-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/commons-l
The reason to have the images specifically is that the key part of their project was the great difficulty in identifying and extracting the images from the books. . .
On Sat, Aug 30, 2014 at 2:26 PM, Yann Forget yannfo@gmail.com wrote:
I think it would be better to have the whole books, rather than images out of context.
Regards,
Yann
2014-08-29 15:04 GMT+05:30 Gerard Meijssen gerard.meijssen@gmail.com:
Hoi, This article is of both interest to Commons and Wikipedia.. It is
awesome.
Thanks, GerardM
http://www.bbc.com/news/technology-28976849
Commons-l mailing list Commons-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/commons-l
Commons-l mailing list Commons-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/commons-l
I don't see any issue extracting the images from the books. However I see a problem identifying an image when it is out of context. And the text of the books is certainly as much of interest as the images.
Regards,
Yann
2014-08-31 1:01 GMT+05:30 David Goodman dggenwp@gmail.com:
The reason to have the images specifically is that the key part of their project was the great difficulty in identifying and extracting the images from the books. . .
On Sat, Aug 30, 2014 at 2:26 PM, Yann Forget yannfo@gmail.com wrote:
I think it would be better to have the whole books, rather than images out of context.
Regards,
Yann
2014-08-29 15:04 GMT+05:30 Gerard Meijssen gerard.meijssen@gmail.com:
Hoi, This article is of both interest to Commons and Wikipedia.. It is awesome. Thanks, GerardM
http://www.bbc.com/news/technology-28976849
Commons-l mailing list Commons-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/commons-l
Commons-l mailing list Commons-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/commons-l
-- David Goodman
DGG at the enWP http://en.wikipedia.org/wiki/User:DGG http://en.wikipedia.org/wiki/User_talk:DGG
Commons-l mailing list Commons-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/commons-l
2014-08-31 15:24 GMT+02:00 Yann Forget yannfo@gmail.com:
I don't see any issue extracting the images from the books. However I see a problem identifying an image when it is out of context. And the text of the books is certainly as much of interest as the images.
But every image in this Flickr project contains an excerpt of the text that wraps the image. Also in every image there is a link to Internet Archive for reading the full page/book.
I don't think that there is any issue with this. I guess you didn't explored an image example? :-)
https://www.flickr.com/photos/internetarchivebookimages/14598540509/
Regards,
Yann
2014-08-31 1:01 GMT+05:30 David Goodman dggenwp@gmail.com:
The reason to have the images specifically is that the key part of their project was the great difficulty in identifying and extracting the images from the books. . .
On Sat, Aug 30, 2014 at 2:26 PM, Yann Forget yannfo@gmail.com wrote:
I think it would be better to have the whole books, rather than images out of context.
Regards,
Yann
2014-08-29 15:04 GMT+05:30 Gerard Meijssen gerard.meijssen@gmail.com:
Hoi, This article is of both interest to Commons and Wikipedia.. It is awesome. Thanks, GerardM
http://www.bbc.com/news/technology-28976849
Commons-l mailing list Commons-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/commons-l
Commons-l mailing list Commons-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/commons-l
-- David Goodman
DGG at the enWP http://en.wikipedia.org/wiki/User:DGG http://en.wikipedia.org/wiki/User_talk:DGG
Commons-l mailing list Commons-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/commons-l
Commons-l mailing list Commons-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/commons-l
The simplest way to connect the material is to link it. The entire point of the project was to make the images not merely accessible, but easily findable., and easy to link to specifically.
On Sun, Aug 31, 2014 at 9:45 AM, Emilio J. Rodríguez-Posada < emijrp@gmail.com> wrote:
2014-08-31 15:24 GMT+02:00 Yann Forget yannfo@gmail.com:
I don't see any issue extracting the images from the books. However I
see a problem identifying an image when it is out of context. And the text of the books is certainly as much of interest as the images.
But every image in this Flickr project contains an excerpt of the text that wraps the image. Also in every image there is a link to Internet Archive for reading the full page/book.
I don't think that there is any issue with this. I guess you didn't explored an image example? :-)
https://www.flickr.com/photos/internetarchivebookimages/14598540509/
Regards,
Yann
2014-08-31 1:01 GMT+05:30 David Goodman dggenwp@gmail.com:
The reason to have the images specifically is that the key part of their project was the great difficulty in identifying and extracting the
images
from the books. . .
On Sat, Aug 30, 2014 at 2:26 PM, Yann Forget yannfo@gmail.com wrote:
I think it would be better to have the whole books, rather than images out of context.
Regards,
Yann
2014-08-29 15:04 GMT+05:30 Gerard Meijssen <gerard.meijssen@gmail.com
:
Hoi, This article is of both interest to Commons and Wikipedia.. It is awesome. Thanks, GerardM
http://www.bbc.com/news/technology-28976849
Commons-l mailing list Commons-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/commons-l
Commons-l mailing list Commons-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/commons-l
-- David Goodman
DGG at the enWP http://en.wikipedia.org/wiki/User:DGG http://en.wikipedia.org/wiki/User_talk:DGG
Commons-l mailing list Commons-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/commons-l
Commons-l mailing list Commons-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/commons-l
Commons-l mailing list Commons-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/commons-l
Hoi, The point is very much that where we did not have these images at all, we are now blessed with having them, When you read the article, you will read that the project was about going back to the scans and extract the images from it. Those scans were used before to get only the text out.
So in a round about way, they did have the meta data about the images and extracted them. They have the text and now they now have the images as well. What is left is stitching the images and the texts back together based on the existing meta data.. In essence everything to make this happen is available. Thanks, GerardM
On 30 August 2014 20:26, Yann Forget yannfo@gmail.com wrote:
I think it would be better to have the whole books, rather than images out of context.
Regards,
Yann
2014-08-29 15:04 GMT+05:30 Gerard Meijssen gerard.meijssen@gmail.com:
Hoi, This article is of both interest to Commons and Wikipedia.. It is
awesome.
Thanks, GerardM
http://www.bbc.com/news/technology-28976849
Commons-l mailing list Commons-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/commons-l
Commons-l mailing list Commons-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/commons-l
https://tools.wmflabs.org/fist/fist.php is capable of finding useful images in that Flickr stream; I've reported some issues at https://bitbucket.org/magnusmanske/magnustools/issues/ I asked the researcher to post suggestions in the tracker etc. but he replied that "broader conversations [...] with folks over at Wikipedia" are ongoing. Does someone know who those folks are? Any reason Commons-l can't be cc'd?
Nemo
Hello!
I think that this discussion is relevant for Commons-l. We (WMES) are actually currently very interested in this topic. We are getting for WLM14 more images via Flickr than directly via Commons, and are now wondering how to bring those over to Wikimedia Commons after the Flickrupload [1] bot didn't make it to WMFLabs.
Diego
[1] https://commons.wikimedia.org/wiki/User:Flickr_upload_bot
2014-09-02 17:05 GMT+02:00 Federico Leva (Nemo) nemowiki@gmail.com:
https://tools.wmflabs.org/fist/fist.php is capable of finding useful images in that Flickr stream; I've reported some issues at https://bitbucket.org/magnusmanske/magnustools/issues/ I asked the researcher to post suggestions in the tracker etc. but he replied that "broader conversations [...] with folks over at Wikipedia" are ongoing. Does someone know who those folks are? Any reason Commons-l can't be cc'd?
Nemo
Commons-l mailing list Commons-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/commons-l
@Diego Flickrripper might be an option https://www.mediawiki.org/wiki/Manual:Pywikibot/flickrripper.py
2014-09-02 18:31 GMT+02:00 Poco a Poco pocoapocowiki@gmail.com:
Hello!
I think that this discussion is relevant for Commons-l. We (WMES) are actually currently very interested in this topic. We are getting for WLM14 more images via Flickr than directly via Commons, and are now wondering how to bring those over to Wikimedia Commons after the Flickrupload [1] bot didn't make it to WMFLabs.
Diego
[1] https://commons.wikimedia.org/wiki/User:Flickr_upload_bot
2014-09-02 17:05 GMT+02:00 Federico Leva (Nemo) nemowiki@gmail.com:
https://tools.wmflabs.org/fist/fist.php is capable of finding useful
images in that Flickr stream; I've reported some issues at https://bitbucket.org/magnusmanske/magnustools/issues/ I asked the researcher to post suggestions in the tracker etc. but he replied that "broader conversations [...] with folks over at Wikipedia" are ongoing. Does someone know who those folks are? Any reason Commons-l can't be cc'd?
Nemo
Commons-l mailing list Commons-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/commons-l
Commons-l mailing list Commons-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/commons-l
Flickr2commons [1] would be good too.
[1]: http://tools.wmflabs.org/flickr2commons/
-Yena Hong (Revi) http://www.revi.pe.kr -- Sent from Android -- 2014. 9. 3. 오전 1:32에 "Poco a Poco" pocoapocowiki@gmail.com님이 작성:
Hello!
I think that this discussion is relevant for Commons-l. We (WMES) are actually currently very interested in this topic. We are getting for WLM14 more images via Flickr than directly via Commons, and are now wondering how to bring those over to Wikimedia Commons after the Flickrupload [1] bot didn't make it to WMFLabs.
Diego
[1] https://commons.wikimedia.org/wiki/User:Flickr_upload_bot
2014-09-02 17:05 GMT+02:00 Federico Leva (Nemo) nemowiki@gmail.com:
https://tools.wmflabs.org/fist/fist.php is capable of finding useful images in that Flickr stream; I've reported some issues at https://bitbucket.org/magnusmanske/magnustools/issues/ I asked the researcher to post suggestions in the tracker etc. but he replied that "broader conversations [...] with folks over at Wikipedia" are ongoing. Does someone know who those folks are? Any reason Commons-l can't be cc'd?
Nemo
Commons-l mailing list Commons-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/commons-l
Commons-l mailing list Commons-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/commons-l
Flickr2Commons is the best tool I have used till date for such purposes. On 03-Sep-2014 5:08 am, "Hong, Yena" lists@revi.pe.kr wrote:
Flickr2commons [1] would be good too.
-Yena Hong (Revi) http://www.revi.pe.kr -- Sent from Android -- 2014. 9. 3. 오전 1:32에 "Poco a Poco" pocoapocowiki@gmail.com님이 작성:
Hello!
I think that this discussion is relevant for Commons-l. We (WMES) are actually currently very interested in this topic. We are getting for WLM14 more images via Flickr than directly via Commons, and are now wondering how to bring those over to Wikimedia Commons after the Flickrupload [1] bot didn't make it to WMFLabs.
Diego
[1] https://commons.wikimedia.org/wiki/User:Flickr_upload_bot
2014-09-02 17:05 GMT+02:00 Federico Leva (Nemo) nemowiki@gmail.com:
https://tools.wmflabs.org/fist/fist.php is capable of finding useful images in that Flickr stream; I've reported some issues at https://bitbucket.org/magnusmanske/magnustools/issues/ I asked the researcher to post suggestions in the tracker etc. but he replied that "broader conversations [...] with folks over at Wikipedia" are ongoing. Does someone know who those folks are? Any reason Commons-l can't be cc'd?
Nemo
Commons-l mailing list Commons-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/commons-l
Commons-l mailing list Commons-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/commons-l
Commons-l mailing list Commons-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/commons-l
I think the best tool for uploading Flickr images is the Flickr option in standard Upload Wizard. See https://commons.wikimedia.org/wiki/Commons:Village_pump/Proposals#Activate_F...
Jarek T. (user:jarekthttp://commons.wikimedia.org/wiki/User:Jarekt)
From: commons-l-bounces@lists.wikimedia.org [mailto:commons-l-bounces@lists.wikimedia.org] On Behalf Of Srikanth Ramakrishnan Sent: Friday, September 05, 2014 10:21 AM To: Wikimedia Commons Discussion List Subject: Re: [Commons-l] images
Flickr2Commons is the best tool I have used till date for such purposes. On 03-Sep-2014 5:08 am, "Hong, Yena" <lists@revi.pe.krmailto:lists@revi.pe.kr> wrote:
Flickr2commons [1] would be good too.
[1]: http://tools.wmflabs.org/flickr2commons/
-Yena Hong (Revi) http://www.revi.pe.kr -- Sent from Android -- 2014. 9. 3. 오전 1:32에 "Poco a Poco" <pocoapocowiki@gmail.commailto:pocoapocowiki@gmail.com>님이 작성: Hello!
I think that this discussion is relevant for Commons-l. We (WMES) are actually currently very interested in this topic. We are getting for WLM14 more images via Flickr than directly via Commons, and are now wondering how to bring those over to Wikimedia Commons after the Flickrupload [1] bot didn't make it to WMFLabs.
Diego
[1] https://commons.wikimedia.org/wiki/User:Flickr_upload_bot
2014-09-02 17:05 GMT+02:00 Federico Leva (Nemo) <nemowiki@gmail.commailto:nemowiki@gmail.com>: https://tools.wmflabs.org/fist/fist.php is capable of finding useful images in that Flickr stream; I've reported some issues at https://bitbucket.org/magnusmanske/magnustools/issues/ I asked the researcher to post suggestions in the tracker etc. but he replied that "broader conversations [...] with folks over at Wikipedia" are ongoing. Does someone know who those folks are? Any reason Commons-l can't be cc'd?
Nemo
_______________________________________________ Commons-l mailing list Commons-l@lists.wikimedia.orgmailto:Commons-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/commons-l
_______________________________________________ Commons-l mailing list Commons-l@lists.wikimedia.orgmailto:Commons-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/commons-l
_______________________________________________ Commons-l mailing list Commons-l@lists.wikimedia.orgmailto:Commons-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/commons-l
Please be careful about uploading these images. I've found that many of them actually date to after 1923 and have unclear copyright status. In particular many are from periodical collections, but the date listed is the date for the beginning of the collection. For example, all the images from the *Highland Echo* collection are dated to 1915 even though the collection spans from 1915 to 1925.
Kaldari
On Fri, Sep 5, 2014 at 7:27 AM, Tuszynski, Jarek W. < JAROSLAW.W.TUSZYNSKI@leidos.com> wrote:
I think the best tool for uploading Flickr images is the Flickr option in standard Upload Wizard. See
https://commons.wikimedia.org/wiki/Commons:Village_pump/Proposals#Activate_F...
Jarek T.
(user:jarekt http://commons.wikimedia.org/wiki/User:Jarekt)
*From:* commons-l-bounces@lists.wikimedia.org [mailto: commons-l-bounces@lists.wikimedia.org] *On Behalf Of *Srikanth Ramakrishnan *Sent:* Friday, September 05, 2014 10:21 AM *To:* Wikimedia Commons Discussion List *Subject:* Re: [Commons-l] images
Flickr2Commons is the best tool I have used till date for such purposes.
On 03-Sep-2014 5:08 am, "Hong, Yena" lists@revi.pe.kr wrote:
Flickr2commons [1] would be good too.
-Yena Hong (Revi) http://www.revi.pe.kr -- Sent from Android --
- 오전 1:32에 "Poco a Poco" pocoapocowiki@gmail.com님이 작성:
Hello!
I think that this discussion is relevant for Commons-l.
We (WMES) are actually currently very interested in this topic. We are getting for WLM14 more images via Flickr than directly via Commons, and are now wondering how to bring those over to Wikimedia Commons after the Flickrupload [1] bot didn't make it to WMFLabs.
Diego
[1] https://commons.wikimedia.org/wiki/User:Flickr_upload_bot
2014-09-02 17:05 GMT+02:00 Federico Leva (Nemo) nemowiki@gmail.com:
https://tools.wmflabs.org/fist/fist.php is capable of finding useful images in that Flickr stream; I've reported some issues at https://bitbucket.org/magnusmanske/magnustools/issues/ I asked the researcher to post suggestions in the tracker etc. but he replied that "broader conversations [...] with folks over at Wikipedia" are ongoing. Does someone know who those folks are? Any reason Commons-l can't be cc'd?
Nemo
Commons-l mailing list Commons-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/commons-l
Commons-l mailing list Commons-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/commons-l
Commons-l mailing list Commons-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/commons-l
Commons-l mailing list Commons-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/commons-l