On 09/21/2011 03:47 AM, Milos Rancic wrote:
Then governments/ISPs/institutions could block unsafe-Wikipedia via DNS
blocks. This is, compared to DPI, quite easy.
might resolve this issue.
* Create safe.wikimedia.org
. That would be the site
censoring/categorizing Commons images. It shouldn't be Commons itself,
but its virtual fork. The fork would be consisted of hashes of image
names with images themselves. Thus, image on Commons with the name
"Torre_de_H%C3%A9rcules_-_DivesGallaecia2012-62.jpg" would be
"fd37dae713526ee2da82f5a6cf6431de.jpg" on safe.wikimedia.org
image preview located on upload.wikimedia.org
with the name
it would be translated as "thumb/a1f3216e3344ea115bcac778937947f1.jpg"
. (Note: md5 is not likely to be the best hashing
system; some other algorithm could be deployed.)
You're counting on there being too many hashes to go through, which is
But there are far fewer images to go through. You'd only have to create
a list of all hashes of all 11 million or so images on Commons and
compare that list to the list of unsafe images on safe.wikimedia.org
Which is not easy (if you have to download all the files, i.e. if the
files themselves are used for hashing, not only the file name), but
So, in effect, I don't think your proposal properly achieves what it
tries to accomplish. (Sorry if I misunderstood your proposal)