images

Hoi, This article is of both interest to Commons and Wikipedia.. It is awesome. Thanks, GerardM

http://www.bbc.com/news/technology-28976849 _______________________________________________ Commons-l mailing list Commons-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/commons-l

Jeremy Baron

9:38 p.m.

On Fri, Aug 29, 2014 at 10:04 AM, Jane Darnell jane023@gmail.com wrote:

...

Commons should do the same for images buried in PDF and DjVu file types

not sure about DjVu but PDF is already covered:

https://commons.wikimedia.org/wiki/User:Open_Access_Media_Importer_Bot#Galle...

-Jeremy

Fabrice Florin

9:14 p.m.

Thanks, Gerard!

This seems like a great idea.

I believe that Liam Wyatt and Andrew Lih are reaching out to the project leader, to see if he needs help uploading some of that content to Commons.

Music to my ears :)

Fabrice

On Aug 29, 2014, at 2:34 AM, Gerard Meijssen gerard.meijssen@gmail.com wrote:

...

Hoi, This article is of both interest to Commons and Wikipedia.. It is awesome. Thanks, GerardM

http://www.bbc.com/news/technology-28976849 _______________________________________________ Commons-l mailing list Commons-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/commons-l

_______________________________

Fabrice Florin Product Manager, Multimedia Wikimedia Foundation

https://www.mediawiki.org/wiki/User:Fabrice_Florin_(WMF)

Andrew Gray

11:07 p.m.

A note of caution: this material isn't really suitable for being dumped en masse into Commons just now. as it won't have much metadata beyond "an image, unidentified, from a book on subject X". See https://www.flickr.com/photos/internetarchivebookimages/14595431897/ for an example of what the automated labelling is like. It's certainly useful to keep an eye on, but we'll need to hold off until some of the identification work has been done :-)

We went through this with a similar collection from the British Library - https://commons.wikimedia.org/wiki/Commons:British_Library/Mechanical_Curato... - which is slowly being migrated, bit by bit.

Andrew.

On 29 August 2014 22:14, Fabrice Florin fflorin@wikimedia.org wrote:

...

Thanks, Gerard!

This seems like a great idea.

I believe that Liam Wyatt and Andrew Lih are reaching out to the project leader, to see if he needs help uploading some of that content to Commons.

Music to my ears :)

Fabrice

On Aug 29, 2014, at 2:34 AM, Gerard Meijssen gerard.meijssen@gmail.com wrote:

Hoi, This article is of both interest to Commons and Wikipedia.. It is awesome. Thanks, GerardM

http://www.bbc.com/news/technology-28976849 _______________________________________________ Commons-l mailing list Commons-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/commons-l

Fabrice Florin Product Manager, Multimedia Wikimedia Foundation

https://www.mediawiki.org/wiki/User:Fabrice_Florin_(WMF)

Commons-l mailing list Commons-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/commons-l

-- - Andrew Gray andrew.gray@dunelm.org.uk

Daniel Schwen

11:39 p.m.

...

beyond "an image, unidentified, from a book on subject X". See https://www.flickr.com/photos/internetarchivebookimages/14595431897/ for an example of what the automated labelling is like. It's certainly

On that page it says "Taken on July 29, 2014". People are really going overboard with those Instagram filters! :-)

Yann Forget

30 Aug 30 Aug

6:26 p.m.

I think it would be better to have the whole books, rather than images out of context.

Regards,

Yann

2014-08-29 15:04 GMT+05:30 Gerard Meijssen gerard.meijssen@gmail.com:

...

Hoi, This article is of both interest to Commons and Wikipedia.. It is awesome. Thanks, GerardM

http://www.bbc.com/news/technology-28976849

Commons-l mailing list Commons-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/commons-l

David Goodman

7:31 p.m.

The reason to have the images specifically is that the key part of their project was the great difficulty in identifying and extracting the images from the books. . .

On Sat, Aug 30, 2014 at 2:26 PM, Yann Forget yannfo@gmail.com wrote:

...

I think it would be better to have the whole books, rather than images out of context.

Regards,

Yann

2014-08-29 15:04 GMT+05:30 Gerard Meijssen gerard.meijssen@gmail.com:

...
Hoi, This article is of both interest to Commons and Wikipedia.. It is

awesome.

...
Thanks, GerardM

http://www.bbc.com/news/technology-28976849

Commons-l mailing list Commons-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/commons-l

Commons-l mailing list Commons-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/commons-l

-- David Goodman DGG at the enWP http://en.wikipedia.org/wiki/User:DGG http://en.wikipedia.org/wiki/User_talk:DGG

Yann Forget

31 Aug 31 Aug

1:24 p.m.

I don't see any issue extracting the images from the books. However I see a problem identifying an image when it is out of context. And the text of the books is certainly as much of interest as the images.

Regards,

Yann

2014-08-31 1:01 GMT+05:30 David Goodman dggenwp@gmail.com:

...

The reason to have the images specifically is that the key part of their project was the great difficulty in identifying and extracting the images from the books. . .

On Sat, Aug 30, 2014 at 2:26 PM, Yann Forget yannfo@gmail.com wrote:

...
I think it would be better to have the whole books, rather than images out of context.

Regards,

Yann

2014-08-29 15:04 GMT+05:30 Gerard Meijssen gerard.meijssen@gmail.com:

...
Hoi, This article is of both interest to Commons and Wikipedia.. It is awesome. Thanks, GerardM

http://www.bbc.com/news/technology-28976849

Commons-l mailing list Commons-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/commons-l

Commons-l mailing list Commons-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/commons-l

-- David Goodman

DGG at the enWP http://en.wikipedia.org/wiki/User:DGG http://en.wikipedia.org/wiki/User_talk:DGG

Commons-l mailing list Commons-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/commons-l

Emilio J. Rodríguez-Posada

1:45 p.m.

2014-08-31 15:24 GMT+02:00 Yann Forget yannfo@gmail.com:

...

I don't see any issue extracting the images from the books. However I see a problem identifying an image when it is out of context. And the text of the books is certainly as much of interest as the images.

But every image in this Flickr project contains an excerpt of the text that wraps the image. Also in every image there is a link to Internet Archive for reading the full page/book.

I don't think that there is any issue with this. I guess you didn't explored an image example? :-)

https://www.flickr.com/photos/internetarchivebookimages/14598540509/

...

Regards,

Yann

2014-08-31 1:01 GMT+05:30 David Goodman dggenwp@gmail.com:

...
The reason to have the images specifically is that the key part of their project was the great difficulty in identifying and extracting the images from the books. . .

On Sat, Aug 30, 2014 at 2:26 PM, Yann Forget yannfo@gmail.com wrote:

...
I think it would be better to have the whole books, rather than images out of context.

Regards,

Yann

2014-08-29 15:04 GMT+05:30 Gerard Meijssen gerard.meijssen@gmail.com:

...
Hoi, This article is of both interest to Commons and Wikipedia.. It is awesome. Thanks, GerardM

http://www.bbc.com/news/technology-28976849

Commons-l mailing list Commons-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/commons-l

Commons-l mailing list Commons-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/commons-l

-- David Goodman

DGG at the enWP http://en.wikipedia.org/wiki/User:DGG http://en.wikipedia.org/wiki/User_talk:DGG

Commons-l mailing list Commons-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/commons-l

Commons-l mailing list Commons-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/commons-l

David Goodman

4:39 p.m.

The simplest way to connect the material is to link it. The entire point of the project was to make the images not merely accessible, but easily findable., and easy to link to specifically.

On Sun, Aug 31, 2014 at 9:45 AM, Emilio J. Rodríguez-Posada < emijrp@gmail.com> wrote:

...

2014-08-31 15:24 GMT+02:00 Yann Forget yannfo@gmail.com:

I don't see any issue extracting the images from the books. However I

...
see a problem identifying an image when it is out of context. And the text of the books is certainly as much of interest as the images.

But every image in this Flickr project contains an excerpt of the text that wraps the image. Also in every image there is a link to Internet Archive for reading the full page/book.

I don't think that there is any issue with this. I guess you didn't explored an image example? :-)

https://www.flickr.com/photos/internetarchivebookimages/14598540509/

...
Regards,

Yann

2014-08-31 1:01 GMT+05:30 David Goodman dggenwp@gmail.com:

...
The reason to have the images specifically is that the key part of their project was the great difficulty in identifying and extracting the

images

...
from the books. . .

On Sat, Aug 30, 2014 at 2:26 PM, Yann Forget yannfo@gmail.com wrote:

...
I think it would be better to have the whole books, rather than images out of context.

Regards,

Yann

2014-08-29 15:04 GMT+05:30 Gerard Meijssen <gerard.meijssen@gmail.com

:

...
...
Hoi, This article is of both interest to Commons and Wikipedia.. It is awesome. Thanks, GerardM

http://www.bbc.com/news/technology-28976849

Commons-l mailing list Commons-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/commons-l

Commons-l mailing list Commons-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/commons-l

-- David Goodman

DGG at the enWP http://en.wikipedia.org/wiki/User:DGG http://en.wikipedia.org/wiki/User_talk:DGG

Commons-l mailing list Commons-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/commons-l

Commons-l mailing list Commons-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/commons-l

Commons-l mailing list Commons-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/commons-l

-- David Goodman DGG at the enWP http://en.wikipedia.org/wiki/User:DGG http://en.wikipedia.org/wiki/User_talk:DGG

Gerard Meijssen

30 Aug 30 Aug

8:14 p.m.

Hoi, The point is very much that where we did not have these images at all, we are now blessed with having them, When you read the article, you will read that the project was about going back to the scans and extract the images from it. Those scans were used before to get only the text out.

So in a round about way, they did have the meta data about the images and extracted them. They have the text and now they now have the images as well. What is left is stitching the images and the texts back together based on the existing meta data.. In essence everything to make this happen is available. Thanks, GerardM

On 30 August 2014 20:26, Yann Forget yannfo@gmail.com wrote:

...

I think it would be better to have the whole books, rather than images out of context.

Regards,

Yann

2014-08-29 15:04 GMT+05:30 Gerard Meijssen gerard.meijssen@gmail.com:

...
Hoi, This article is of both interest to Commons and Wikipedia.. It is

awesome.

...
Thanks, GerardM

http://www.bbc.com/news/technology-28976849

Commons-l mailing list Commons-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/commons-l

Commons-l mailing list Commons-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/commons-l

Federico Leva (Nemo)

2 Sep 2 Sep

3:05 p.m.

https://tools.wmflabs.org/fist/fist.php is capable of finding useful images in that Flickr stream; I've reported some issues at https://bitbucket.org/magnusmanske/magnustools/issues/ I asked the researcher to post suggestions in the tracker etc. but he replied that "broader conversations [...] with folks over at Wikipedia" are ongoing. Does someone know who those folks are? Any reason Commons-l can't be cc'd?

Nemo

Poco a Poco

4:31 p.m.

Hello!

I think that this discussion is relevant for Commons-l. We (WMES) are actually currently very interested in this topic. We are getting for WLM14 more images via Flickr than directly via Commons, and are now wondering how to bring those over to Wikimedia Commons after the Flickrupload [1] bot didn't make it to WMFLabs.

Diego

[1] https://commons.wikimedia.org/wiki/User:Flickr_upload_bot

2014-09-02 17:05 GMT+02:00 Federico Leva (Nemo) nemowiki@gmail.com:

...

https://tools.wmflabs.org/fist/fist.php is capable of finding useful images in that Flickr stream; I've reported some issues at https://bitbucket.org/magnusmanske/magnustools/issues/ I asked the researcher to post suggestions in the tracker etc. but he replied that "broader conversations [...] with folks over at Wikipedia" are ongoing. Does someone know who those folks are? Any reason Commons-l can't be cc'd?

Nemo

Commons-l mailing list Commons-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/commons-l

Pierre-Selim

4:52 p.m.

@Diego Flickrripper might be an option https://www.mediawiki.org/wiki/Manual:Pywikibot/flickrripper.py

2014-09-02 18:31 GMT+02:00 Poco a Poco pocoapocowiki@gmail.com:

...

Hello!

I think that this discussion is relevant for Commons-l. We (WMES) are actually currently very interested in this topic. We are getting for WLM14 more images via Flickr than directly via Commons, and are now wondering how to bring those over to Wikimedia Commons after the Flickrupload [1] bot didn't make it to WMFLabs.

Diego

[1] https://commons.wikimedia.org/wiki/User:Flickr_upload_bot

2014-09-02 17:05 GMT+02:00 Federico Leva (Nemo) nemowiki@gmail.com:

https://tools.wmflabs.org/fist/fist.php is capable of finding useful

...
images in that Flickr stream; I've reported some issues at https://bitbucket.org/magnusmanske/magnustools/issues/ I asked the researcher to post suggestions in the tracker etc. but he replied that "broader conversations [...] with folks over at Wikipedia" are ongoing. Does someone know who those folks are? Any reason Commons-l can't be cc'd?

Nemo

Commons-l mailing list Commons-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/commons-l

Commons-l mailing list Commons-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/commons-l

-- Pierre-Selim

Hong, Yena

11:37 p.m.

Flickr2commons [1] would be good too.

[1]: http://tools.wmflabs.org/flickr2commons/

-Yena Hong (Revi) http://www.revi.pe.kr -- Sent from Android -- 2014. 9. 3. 오전 1:32에 "Poco a Poco" pocoapocowiki@gmail.com님이 작성:

...

Hello!

I think that this discussion is relevant for Commons-l. We (WMES) are actually currently very interested in this topic. We are getting for WLM14 more images via Flickr than directly via Commons, and are now wondering how to bring those over to Wikimedia Commons after the Flickrupload [1] bot didn't make it to WMFLabs.

Diego

[1] https://commons.wikimedia.org/wiki/User:Flickr_upload_bot

2014-09-02 17:05 GMT+02:00 Federico Leva (Nemo) nemowiki@gmail.com:

...
https://tools.wmflabs.org/fist/fist.php is capable of finding useful images in that Flickr stream; I've reported some issues at https://bitbucket.org/magnusmanske/magnustools/issues/ I asked the researcher to post suggestions in the tracker etc. but he replied that "broader conversations [...] with folks over at Wikipedia" are ongoing. Does someone know who those folks are? Any reason Commons-l can't be cc'd?

Nemo

Commons-l mailing list Commons-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/commons-l

Commons-l mailing list Commons-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/commons-l

Srikanth Ramakrishnan

5 Sep 5 Sep

2:20 p.m.

Flickr2Commons is the best tool I have used till date for such purposes. On 03-Sep-2014 5:08 am, "Hong, Yena" lists@revi.pe.kr wrote:

...

Flickr2commons [1] would be good too.

-Yena Hong (Revi) http://www.revi.pe.kr -- Sent from Android -- 2014. 9. 3. 오전 1:32에 "Poco a Poco" pocoapocowiki@gmail.com님이 작성:

...
Hello!

I think that this discussion is relevant for Commons-l. We (WMES) are actually currently very interested in this topic. We are getting for WLM14 more images via Flickr than directly via Commons, and are now wondering how to bring those over to Wikimedia Commons after the Flickrupload [1] bot didn't make it to WMFLabs.

Diego

[1] https://commons.wikimedia.org/wiki/User:Flickr_upload_bot

2014-09-02 17:05 GMT+02:00 Federico Leva (Nemo) nemowiki@gmail.com:

...
https://tools.wmflabs.org/fist/fist.php is capable of finding useful images in that Flickr stream; I've reported some issues at https://bitbucket.org/magnusmanske/magnustools/issues/ I asked the researcher to post suggestions in the tracker etc. but he replied that "broader conversations [...] with folks over at Wikipedia" are ongoing. Does someone know who those folks are? Any reason Commons-l can't be cc'd?

Nemo

Commons-l mailing list Commons-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/commons-l

Commons-l mailing list Commons-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/commons-l

Commons-l mailing list Commons-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/commons-l

Tuszynski, Jarek W.

2:27 p.m.

I think the best tool for uploading Flickr images is the Flickr option in standard Upload Wizard. See https://commons.wikimedia.org/wiki/Commons:Village_pump/Proposals#Activate_F...

Jarek T. (user:jarekthttp://commons.wikimedia.org/wiki/User:Jarekt)

From: commons-l-bounces@lists.wikimedia.org [mailto:commons-l-bounces@lists.wikimedia.org] On Behalf Of Srikanth Ramakrishnan Sent: Friday, September 05, 2014 10:21 AM To: Wikimedia Commons Discussion List Subject: Re: [Commons-l] images

Flickr2Commons is the best tool I have used till date for such purposes. On 03-Sep-2014 5:08 am, "Hong, Yena" <lists@revi.pe.krmailto:lists@revi.pe.kr> wrote:

Flickr2commons [1] would be good too.

[1]: http://tools.wmflabs.org/flickr2commons/

-Yena Hong (Revi) http://www.revi.pe.kr -- Sent from Android -- 2014. 9. 3. 오전 1:32에 "Poco a Poco" <pocoapocowiki@gmail.commailto:pocoapocowiki@gmail.com>님이 작성: Hello!

Diego

[1] https://commons.wikimedia.org/wiki/User:Flickr_upload_bot

2014-09-02 17:05 GMT+02:00 Federico Leva (Nemo) <nemowiki@gmail.commailto:nemowiki@gmail.com>: https://tools.wmflabs.org/fist/fist.php is capable of finding useful images in that Flickr stream; I've reported some issues at https://bitbucket.org/magnusmanske/magnustools/issues/ I asked the researcher to post suggestions in the tracker etc. but he replied that "broader conversations [...] with folks over at Wikipedia" are ongoing. Does someone know who those folks are? Any reason Commons-l can't be cc'd?

Nemo

_______________________________________________ Commons-l mailing list Commons-l@lists.wikimedia.orgmailto:Commons-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/commons-l

Ryan Kaldari

6:38 p.m.

Please be careful about uploading these images. I've found that many of them actually date to after 1923 and have unclear copyright status. In particular many are from periodical collections, but the date listed is the date for the beginning of the collection. For example, all the images from the *Highland Echo* collection are dated to 1915 even though the collection spans from 1915 to 1925.

Kaldari

On Fri, Sep 5, 2014 at 7:27 AM, Tuszynski, Jarek W. < JAROSLAW.W.TUSZYNSKI@leidos.com> wrote:

...

I think the best tool for uploading Flickr images is the Flickr option in standard Upload Wizard. See

https://commons.wikimedia.org/wiki/Commons:Village_pump/Proposals#Activate_F...

Jarek T.

(user:jarekt http://commons.wikimedia.org/wiki/User:Jarekt)

*From:* commons-l-bounces@lists.wikimedia.org [mailto: commons-l-bounces@lists.wikimedia.org] *On Behalf Of *Srikanth Ramakrishnan *Sent:* Friday, September 05, 2014 10:21 AM *To:* Wikimedia Commons Discussion List *Subject:* Re: [Commons-l] images

Flickr2Commons is the best tool I have used till date for such purposes.

On 03-Sep-2014 5:08 am, "Hong, Yena" lists@revi.pe.kr wrote:

Flickr2commons [1] would be good too.

-Yena Hong (Revi) http://www.revi.pe.kr -- Sent from Android --

오전 1:32에 "Poco a Poco" pocoapocowiki@gmail.com님이 작성:

Hello!

I think that this discussion is relevant for Commons-l.

We (WMES) are actually currently very interested in this topic. We are getting for WLM14 more images via Flickr than directly via Commons, and are now wondering how to bring those over to Wikimedia Commons after the Flickrupload [1] bot didn't make it to WMFLabs.

Diego

[1] https://commons.wikimedia.org/wiki/User:Flickr_upload_bot

2014-09-02 17:05 GMT+02:00 Federico Leva (Nemo) nemowiki@gmail.com:

https://tools.wmflabs.org/fist/fist.php is capable of finding useful images in that Flickr stream; I've reported some issues at https://bitbucket.org/magnusmanske/magnustools/issues/ I asked the researcher to post suggestions in the tracker etc. but he replied that "broader conversations [...] with folks over at Wikipedia" are ongoing. Does someone know who those folks are? Any reason Commons-l can't be cc'd?

Nemo

Commons-l mailing list Commons-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/commons-l

Commons-l mailing list Commons-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/commons-l

Commons-l mailing list Commons-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/commons-l

Commons-l mailing list Commons-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/commons-l

3608

Age (days ago)

3615

Last active (days ago)

commons-l@lists.wikimedia.org

18 comments

16 participants

tags (0)

participants (16)

Andrew Gray
Daniel Schwen
David Goodman
Emilio J. Rodríguez-Posada
Fabrice Florin
Federico Leva (Nemo)
Gerard Meijssen
Hong, Yena
Jane Darnell
Jeremy Baron
Pierre-Selim
Poco a Poco
Ryan Kaldari
Srikanth Ramakrishnan
Tuszynski, Jarek W.
Yann Forget