On 9/16/11 9:48 AM, Thomas Gries wrote:
Am 16.09.2011 11:24, schrieb Roan Kattouw:
For some applications, I use the technique of representing the 128 bit of md5 or other checksums
as base-62 character strings instead of hexadecimal (base-16) strings.
MediaWiki already uses a similar technique, storing SHA-1 hashes of images in base 36.
Was there a certain reason to chose base 36 ? Why not recoding to base 62 and saving 3 bytes per checksum ?
11M images in commons x 3 bytes = 33MB / ~1MB per file on average = a savings equivalent to 33 more files in Commons
In the time it took you to write this we probably had more than 33 files uploaded.