Someone in the thread on friendliness mentioned that categories are
always in one language (usually english). Well still a long way from
fixing the issue, perhaps if we allowed unrestricted
{{DISPLAYTITLE:...}}, combined with the {{int: hack, that'd allow
better translatable categories. (of course you'd only be able to use
the actual category name in [[category:Foo]] links. I suppose one
could use a bot to automatically change links to redirect categories
to their canonical name, but then we're getting really really hacky).
Anyways, just a thought.
cheers,
bawolff
The Apple Lossless Audio Codec has been released under Apache license 2.0:
http://alac.macosforge.org/trac/wiki
Something to add to the Commons file format portfolio? Do we support
FLAC already?
Note: This foundation-l post is cross-posted to commons-l, since this discussion may be of interest there as well.
> From: Tobias Oelgarte <tobias.oelgarte(a)googlemail.com>
> It is a in house made problem, as i explained at brainstorming [1].
> To put it short: It is a self made problem, based on the fact that this
> images got more attention then others. Thanks to failed deletion
> requests they had many people caring about them. This results in more
> exact descriptions and file naming then in average images. Thats what
> search engines prefer; and now we have them at a top spot. Thanks for
> caring so much about this images and not treating them like anything else.
I don't think that is the case, actually. Brandon described how the search function works here:
http://www.quora.com/Why-is-the-second-image-returned-on-Wikimedia-Commons-…
To take an example, the file
http://commons.wikimedia.org/w/index.php?title=File:Golden_Shower.jpg&actio…
(a prominent search result in searches for "shower") has never had its name or description changed since it was uploaded from Flickr. My impression is that refinement of file names and descriptions following discussions has little to do with sexual or pornography-related media appearing prominently in search listings. The material is simply there, and the search function finds it, as it is designed to do.
> Andreas, you currently represent exactly that kind of argumentation that
> leads into anything, but not to a solution. I described it already in
> the post "Controversial Content vs Only-Image-Filter" [2], that single
> examples don't represent the overall thematic. It also isn't an addition
> to the discussion as an argument. It would be an argument if we would
> know the effects that occur. We have to clear the question:
It is hard to say how else to provide evidence of a problem, other than by giving multiple (not single) examples of it.
You could also search for blond, blonde, red hair, strawberry, or peach ...
What is striking is the crass sexism of some of the filenames and image descriptions: "blonde bombshell", "Blonde teenie sucking", "so, so sexy", "These two had a blast showing off" etc.
http://commons.wikimedia.org/w/index.php?title=Special%3ASearch&search=blon…
One of the images shows a young woman in the bathroom, urinating:
http://commons.wikimedia.org/wiki/File:Blonde_woman_urinating.jpg
Her face is fully shown, and the image, displayed in the Czech Wikipedia, carries no personality rights warning, nor is there evidence that she has consented to or is even aware of the upload.
And I am surprised how often images of porn actresses are found in search results, even for searches like "Barbie". Commons has 917 files in Category:Unidentified porn actresses alone. There is no corresponding Category:Unidentified porn actors (although there is of course a wealth of categories and media for gay porn actors).
> * Is it a problem that the search function displays sexual content? (A
> search should find anything related, by definition.)
I think the search function works as designed, looking for matches in file names and descriptions.
> * Is sexual content is overrepresented by the search?
I don't think so. The search function simply shows what is there. However, the sexual content that comes up for innocuous searches sometimes violates the principle of least astonishment, and thus may turn some users off using, contributing to, or recommending Commons as an educational resource.
> * If that is the case. Why is it that way?
> * Can we do something about it, without drastic changes, like
> blocking/excluding categories?
One thing that might help would be for the search function to privilege files that are shown in top-level categories containing the search term: e.g. for "cucumber", first display all files that are in category "cucumber", rather than those contained in subcategories, like "sexual penetrative use of cucumbers", regardless of the file name (which may not have the English word "cucumber" in it).
A second step would be to make sure that sexual content is not housed in the top categories, but in appropriately named subcategories. This is generally already established practice. Doing both would reduce the problem somewhat, at least in cases where there is a category that matches the search term.
Regards,
Andreas
[1]
>
>http://meta.wikimedia.org/w/index.php?title=Controversial_content%2FBrainst…
>[2]
>http://lists.wikimedia.org/pipermail/foundation-l/2011-October/069699.html
>
>Am 17.10.2011 02:56, schrieb Andreas Kolbe:
>> Personality conflicts aside, we're noting that non-sexual search terms in Commons can prominently return sexual images of varying explicitness, from mild nudity to hardcore, and that this is different from entering a sexual search term and finding that Google fails to filter some results.
>>
>> I posted some more Commons search terms where this happens on Meta; they include
>>
>> Black, Caucasian, Asian;
>>
>> Male, Female, Teenage, Woman, Man;
>>
>> Vegetables;
>>
>> Drawing, Drawing style;
>>
>> Barbie, Doll;
>>
>> Demonstration, Slideshow;
>>
>> Drinking, Custard, Tan;
>>
>> Hand, Forefinger, Backhand, Hair;
>>
>> Bell tolling, Shower, Furniture, Crate, Scaffold;
>>
>> Galipette – French for "somersault"; this leads to a collection of 1920s pornographic films which are undoubtedly of significant historical interest, but are also pretty much as explicit as any modern representative of the genre.
>>
>> Andreas
>
I thought it might be of interest to Wikimedia Commons folks that I
recently updated the Wikistream visualization of real time edits to
include updates from the Commons, and to automatically update the
background to the latest uploaded images:
http://wikistream.inkdroid.org
It's not perfect by any means, but I thought it was interesting to see
the updates that are coming in all the time to the Commons.
//Ed
> ------------------------------
>
> Message: 5
> Date: Tue, 11 Oct 2011 16:22:37 +0100 (BST)
> From: Andreas Kolbe <jayen466(a)yahoo.com>
> Subject: [Commons-l] Commons search function vs. Google
> To: Wikimedia Commons Discussion List <commons-l(a)lists.wikimedia.org>
> Message-ID:
> <1318346557.48784.YahooMailNeo(a)web29620.mail.ird.yahoo.com>
> Content-Type: text/plain; charset="iso-8859-1"
>
> We are wondering on Meta[1]?what criteria the Commons search function uses
> to establish the order of search results displayed.
>
> To give some examples, searching for "pearl necklace" in Commons shows a
> woman with sperm on her neck as the first image result:
>
>
> http://commons.wikimedia.org/w/index.php?title=Special%3ASearch&search=pear…
>
>
> The same image is way down in a Google search (with safe search off) for
> pearl necklace on Commons:
>
>
> http://www.google.co.uk/search?q=cucumber+site:commons.wikimedia.org&um=1&h…
> .,cf.osb&biw=1111&bih=774&uss=1#um=1&hl=en&safe=off&tbm=isch&sa=1&q=pearl+necklace+site:
> commons.wikimedia.org&oq=pearl+necklace+site:commons.wikimedia.org
> &aq=f&aqi=&aql=&gs_sm=e&gs_upl=113279l114967l0l115854l14l11l0l0l0l8l261l2003l0.8.3l11l0&bav=on.2,or.r_gc.r_pw.,cf.osb&fp=49f703222a617ec&biw=1111&bih=774
>
>
> Searching for "electric toothbrushes" in Commons shows a woman masturbating
> with a toothbrush as the second image result:
>
>
>
> http://commons.wikimedia.org/w/index.php?title=Special%3ASearch&search=elec…
>
>
> The same image turns up in Google as well (with safe search switched off),
> though not as one of the first results:
>
>
> http://www.google.co.uk/search?q=cucumber+site:commons.wikimedia.org&um=1&h…
> .,cf.osb&biw=1111&bih=774&uss=1#um=1&hl=en&safe=off&tbm=isch&sa=1&q=electric+toothbrushes+site:
> commons.wikimedia.org&pbx=1&oq=electric+toothbrushes+site:
> commons.wikimedia.org
> &aq=f&aqi=&aql=&gs_sm=e&gs_upl=341351l344565l0l345961l21l19l0l0l0l13l255l3528l0.11.8l19l0&bav=on.2,or.r_gc.r_pw.,cf.osb&fp=49f703222a617ec&biw=1111&bih=774
>
>
> Searching for "cucumber" in Commons shows a woman with a cucumber up her
> vagina on the first page of search results:
>
>
> http://commons.wikimedia.org/w/index.php?title=Special%3ASearch&search=cucu…
>
> Doing a Google search for cucumber on Commons (with safe search off) does
> not bring this image up among the first hundred or so results:
>
>
> http://www.google.co.uk/search?q=cucumber+site:commons.wikimedia.org&um=1&h…
> .,cf.osb&biw=1111&bih=774&uss=1
>
>
> Why is our listing so different from the one in Google, and why are sexual
> images so much higher up in our listing of search results?
>
>
> Andreas
>
>
> [1]?http://meta.wikimedia.org/wiki/Controversial_content/Brainstorming?
>
>
I don't know how Google does it, but I'd bet that our search prioritises by
word order in the description. So a description that starts Pearl Necklace
comes before "A white pearl necklace". If you amend the description them I
suspect the search results will change.
WereSpielChequers