[Mediawiki-l] Re: google and image pages

Jamie Bliss astronouth7303 at gmail.com
Wed Apr 6 00:32:38 UTC 2005


On Apr 5, 2005 7:33 PM, Ira Abramov <lists-MediaWiki-l at ira.abramov.org> wrote:
> Quoting Brion Vibber, from the post of Sat, 02 Apr:
> > Christof Damian wrote:
> > >does anyone know why google doesn't index description pages of images?
> > >if i do a google search like this: "site:wikipedia.org inurl:image" i
> > >just get about 650 results, some of which are media files, but none are
> > >images (jpg/gif/png).
> >
> > Probably Google is stupid enough to think '.jpg' at the end of a URL
> > means the resource is itself an image.
> 
> when you are a bot that has to slurp up millions of pages a day, it's
> safe to assume in 99.99% of the cases, that a jpg suffix will indeed
> lead you to an image. requesting that URL just to see that the header
> indeed gives one MIME type or the other means adding a considerable
> overhead. apart from mediawiki and a few rare CMS's, I'd risk a guess
> that practically nobody uses such suffixes in a URL..

Isn't that what the HTTP "HEAD" method is for? And shouldn't Googlebot
also be indexing images for the Google image search?
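
For illustration, here is a rough sketch of the kind of check I mean, in
Python. It is only an assumption about how a crawler could do it, not a
description of what Googlebot actually does. The helper names
(head_content_type, looks_like_image) and the example URL in the note
below are made up for this sketch. A HEAD request returns only the
response headers, so the crawler can read Content-Type without
downloading the body.

```python
import urllib.request


def head_content_type(url, timeout=10):
    """Send an HTTP HEAD request and return the media type from the
    Content-Type response header (e.g. "text/html"), without parameters.

    Hypothetical helper for illustration only.
    """
    req = urllib.request.Request(url, method="HEAD")
    with urllib.request.urlopen(req, timeout=timeout) as resp:
        # resp.headers is an email.message.Message subclass;
        # get_content_type() lowercases the value and drops "; charset=...".
        return resp.headers.get_content_type()


def looks_like_image(media_type):
    """Return True if the media type is in the image/* family."""
    return media_type.startswith("image/")
```

With this, a crawler could call head_content_type() on a URL such as
http://example.org/wiki/Image:Foo.jpg (a hypothetical address) and index
the page as HTML when looks_like_image() returns False, despite the .jpg
suffix. It still costs one round trip per URL, which is the overhead Ira
is talking about.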

-- Jamie
-------------------------------------------------------------------
http://endeavour.zapto.org/astro73/
Thank you to JosephM for inviting me to Gmail!
Have lots of invites. Gmail now has 2GB.


