On 9/16/11 9:48 AM, Thomas Gries wrote:
On 16.09.2011 11:24, Roan Kattouw wrote:
For some applications, I use the technique of representing the 128 bits
of MD5 or other checksums as base-62 character strings
instead of hexadecimal (base-16) strings.
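
A minimal Python sketch of the technique described above, offered only as an illustration. The alphabet ordering (digits, then lowercase, then uppercase) and the helper names `to_base` and `md5_base62` are assumptions; the thread does not specify either.

```python
import hashlib
import string

# Assumed alphabet order: 0-9, a-z, A-Z (not specified in the thread).
ALPHABET = string.digits + string.ascii_lowercase + string.ascii_uppercase


def to_base(number: int, base: int) -> str:
    """Encode a non-negative integer using the first `base` symbols of ALPHABET."""
    if not 2 <= base <= len(ALPHABET):
        raise ValueError("base must be between 2 and 62")
    if number < 0:
        raise ValueError("number must be non-negative")
    if number == 0:
        return ALPHABET[0]
    digits = []
    while number:
        number, remainder = divmod(number, base)
        digits.append(ALPHABET[remainder])
    return "".join(reversed(digits))


def md5_base62(data: bytes) -> str:
    """Return the MD5 digest of `data` as a base-62 string (not zero-padded)."""
    digest = hashlib.md5(data).digest()
    return to_base(int.from_bytes(digest, "big"), 62)
```

Calling `to_base(value, 36)` on the same integer would give the base-36 form, so the two encodings can be compared side by side. If fixed-width strings are needed, the result would have to be left-padded with `ALPHABET[0]`.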
MediaWiki already uses a similar technique, storing SHA-1 hashes of
images in base 36.
Was there a particular reason to choose base 36?
Why not recode to base 62 and save 3 bytes per checksum?
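
To make the per-checksum saving concrete, here is a small Python sketch that counts how many digits the largest value of a given bit width needs in each base. The function name `max_digits` is hypothetical. Under this calculation the difference between base 36 and base 62 comes out to 3 characters for a 128-bit MD5 and 4 characters for a 160-bit SHA-1.

```python
def max_digits(bits: int, base: int) -> int:
    """Number of base-`base` digits needed for the largest `bits`-bit value."""
    largest = (1 << bits) - 1
    count = 0
    while largest:
        largest //= base
        count += 1
    return count


for name, bits in (("MD5", 128), ("SHA-1", 160)):
    b16, b36, b62 = (max_digits(bits, b) for b in (16, 36, 62))
    print(f"{name}: hex={b16} base36={b36} base62={b62} saved={b36 - b62}")
```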
11M images in Commons x 3 bytes = 33 MB; at ~1 MB per file on average, that is
a savings equivalent to 33 more files in Commons.
In the time it took you to write this, we probably had more than 33 files uploaded.
Neil Kandalgaonkar (| <neilk(a)wikimedia.org>