New password hashing proposal

List overview All Threads
Download

newer

older

Re: [Wikitech-l] New password...

Developing true WISIWYG editor for...

Tim Starling

19 Aug 2010 19 Aug '10

1:37 a.m.

It's been said (e.g. [1]) that hashing passwords with two rounds of MD5 is basically a waste of time these days, because brute-forcing even relatively long passwords is now feasible with cheap hardware. Indeed, you can buy software [2] which claims to be able to check 90 million MediaWiki passwords per second on an ordinary GPU. That would let you crack a random 8-letter password in 20 minutes.

So the time has probably come for us to come up with a "C" type password hashing scheme, to replace the B-type hashes that we use at the moment. I've been thinking along the lines of the following goals:

1. Future-proof: should be adaptable to faster hardware. 2. Upgradeable: it should be possible to compute the C-type hash from the B-type hash, to allow upgrades without bothering users. 3. Efficient in PHP, with default configure options. 4. MediaWiki-specific, so that generic software can't be used to crack our hashes.

The problem with the standard key strengthening algorithms, e.g. PBKDF1, is that they are not efficient in PHP. We don't want a C implementation of our scheme to be orders of magnitude faster than our PHP implementation, because that would allow brute-forcing to be more feasible than is necessary.

The idea I came up with is to hash the output of str_repeat(). This increases the number of rounds of the compression function, while avoiding tight loops in PHP code.

PHP's hash extension has been available by default since PHP 5.1.2, and we can always fall back to using B-type hashes if it's explicitly disabled. The WHIRLPOOL hash is supported. It has no patent or copyright restrictions so it's not going to be yanked out of Debian or PHP for legal reasons. It has a 512-bit block size, the largest of any hash function available in PHP, and its security goals state that it can be truncated without compromising its properties.

My proposed hash function is a B-type MD5 salted hash, which is then further hashed with a configurable number of invocations of WHIRLPOOL, with a 256-bit substring taken from a MediaWiki-specific location. The input to each WHIRLPOOL operation is expanded by a factor of 100 with str_repeat().

The number of WHIRLPOOL iterations is specified in the output string as a base-2 logarithm (whimsically padded out to 3 decimal digits to allow for future universe-sized computers). This number can be upgraded by taking the hash part of the output and applying more rounds to it. A count of 2^7 = 128 gives a time of 55ms on my laptop, and 12ms on one of our servers, so a reasonable default is probably 2^6 or 2^7.

Demo code: http://p.defau.lt/?udYa5CYhHFrgk4SBFiTpGA

Typical output: :C:007:187aabf399e25aa1:9441ccffe8f1afd8c277f4d914ce03c6fcfe157457596709d846ff832022b037

-- Tim Starling

[1] http://www.theregister.co.uk/2010/08/16/password_security_analysis/

[2] http://www.insidepro.com/eng/egb.shtml

Show replies by date

Daniel Kinzler

19 Aug 19 Aug

3:45 a.m.

Tim Starling schrieb:

...

It's been said (e.g. [1]) that hashing passwords with two rounds of MD5 is basically a waste of time these days, because brute-forcing even relatively long passwords is now feasible with cheap hardware. Indeed, you can buy software [2] which claims to be able to check 90 million MediaWiki passwords per second on an ordinary GPU. That would let you crack a random 8-letter password in 20 minutes.

I don't know that much about the mathematical details of hashing, but i'd like to drop a pointer to an article if found interesting in this context:

"Stop using unsafe keyed hashes, use HMAC" http://rdist.root.org/2009/10/29/stop-using-unsafe-keyed-hashes-use-hmac/

So, how does your proposal relate to HMAC?

-- daniel

Tim Starling

4:52 a.m.

On 19/08/10 18:45, Daniel Kinzler wrote:

...

Tim Starling schrieb:

...
It's been said (e.g. [1]) that hashing passwords with two rounds of MD5 is basically a waste of time these days, because brute-forcing even relatively long passwords is now feasible with cheap hardware. Indeed, you can buy software [2] which claims to be able to check 90 million MediaWiki passwords per second on an ordinary GPU. That would let you crack a random 8-letter password in 20 minutes.

I don't know that much about the mathematical details of hashing, but i'd like to drop a pointer to an article if found interesting in this context:

"Stop using unsafe keyed hashes, use HMAC" http://rdist.root.org/2009/10/29/stop-using-unsafe-keyed-hashes-use-hmac/

So, how does your proposal relate to HMAC?

HMAC is for secret keys, there's no secret key in this scheme.

That article mentions collision and second-preimage attacks. As far as I can determine, neither is relevant to a password hashing scheme.

Say if you knew someone's password. Then a second-preimage attack would allow you to construct a new, longer password which also allowed you to log in as them. This would be a waste of time though, since you could have just logged in with the original password.

Similarly, nobody really cares if you can construct two long passwords, set one in your preferences, and use the other to log in. That's all a collision lets you do.

The security goals for password hashing are quite different to those for message authentication, and less well-studied. Key strengthening algorithms use hashing as a proof of work, so a break would be an optimised algorithm. Usually, the designers of hash functions consider optimised algorithms to be useful, not a break.

-- Tim Starling

Robert Rohde

4:02 a.m.

On Wed, Aug 18, 2010 at 11:37 PM, Tim Starling tstarling@wikimedia.org wrote: <snip>

...

The idea I came up with is to hash the output of str_repeat(). This increases the number of rounds of the compression function, while avoiding tight loops in PHP code.

<snip>

...

My proposed hash function is a B-type MD5 salted hash, which is then further hashed with a configurable number of invocations of WHIRLPOOL, with a 256-bit substring taken from a MediaWiki-specific location. The input to each WHIRLPOOL operation is expanded by a factor of 100 with str_repeat().

<snip>

Let me preface my comment by saying that I haven't studied WHIRLPOOL, and the following may not apply to it at all.

However, it is known that some block cypher based hashes behave poorly when fed repeated copies of the same block. In the worst cases the hash space is substantially truncated from its full size (which probably is not the case for any serious cryptographic hash function). Under less severe cases, cryptanalysis can find a new block cipher W' such that N applications of block cipher W is the same as one application of W'. If WHIRLPOOL is vulnerable to that kind of attack then it would negate the effect of using str_repeat in your code.

Like I said, I don't know if either concern applies to WHIRLPOOL. However, these concerns only occur because the 256-bit string you are repeating is a fundamental divisor of the 512-bit block size used by WHIRLPOOL. So, it is trivial to avoid the whole issue simply by using a different repeated block size. For example 97 copies of a 33 byte string should have essentially the same computational performance, while making any associated cryptanalysis threat impossible (or at least less likely).

My only other comment is something you presumably already know. Your proposal is still nothing but an arms race. It makes hashes harder to crack by making the hash function itself much more computationally expensive. However, you'd still have to periodically boost the rep rate with the intention of staying far in front of the hackers.

As a complementary approach it would be nice if there was something in Mediawiki to aid in the selection of strong passwords. Regardless of hash function, it will still take about two billion times longer to find one 10 character password in [A-Za-z0-9] as it does to find a 6 character password in [a-z]. Even if password strength testing algorithms were disabled on Wikipedia sites, it would still be a nice addition to have in the Mediawiki codebase in general.

-Robert Rohde

David Gerard

5:18 a.m.

On 19 August 2010 10:02, Robert Rohde rarohde@gmail.com wrote:

...

As a complementary approach it would be nice if there was something in Mediawiki to aid in the selection of strong passwords. Regardless of hash function, it will still take about two billion times longer to find one 10 character password in [A-Za-z0-9] as it does to find a 6 character password in [a-z]. Even if password strength testing algorithms were disabled on Wikipedia sites, it would still be a nice addition to have in the Mediawiki codebase in general.

http://newsarse.com/2010/08/13/if-you-can-remember-your-password-then-its-ho...

Passwords suck, and people are a problem. Now, if we could distribute RSA fobs to every editor ...

- d.

Ryan Lane

9:50 a.m.

...

http://newsarse.com/2010/08/13/if-you-can-remember-your-password-then-its-ho...

Passwords suck, and people are a problem. Now, if we could distribute RSA fobs to every editor ...

We could do a less secure, but more-secure-than-passwords alternative, which is to use email or SMS as a one time password device. SMS is obviously more secure than email, but would require us to ask people for their phone numbers. We could also make a PKI infrastructure, and allow certificate login, which is obviously safer than passwords.

The real problem with any system stronger than passwords, is that it requires a level of complexity that would be difficult for us, and either annoying or very confusing for users.

Respectfully,

Ryan Lane

Daniel Friesen

20 Aug 20 Aug

4:29 a.m.

Ryan Lane wrote:

...

...
http://newsarse.com/2010/08/13/if-you-can-remember-your-password-then-its-ho...

Passwords suck, and people are a problem. Now, if we could distribute RSA fobs to every editor ...

We could do a less secure, but more-secure-than-passwords alternative, which is to use email or SMS as a one time password device. SMS is obviously more secure than email, but would require us to ask people for their phone numbers. We could also make a PKI infrastructure, and allow certificate login, which is obviously safer than passwords.

The real problem with any system stronger than passwords, is that it requires a level of complexity that would be difficult for us, and either annoying or very confusing for users.

Respectfully,

Ryan Lane

OpenID? The account my own OpenID is tied to has two-factor authentication.

~Daniel Friesen (Dantman, Nadir-Seen-Fire) [http://daniel.friesen.name]

-- ~Daniel Friesen (Dantman, Nadir-Seen-Fire) [http://daniel.friesen.name]

Lane, Ryan

8:13 a.m.

New subject: Alternative authentication methods (was New password hashing proposal)

...

...
We could do a less secure, but more-secure-than-passwords

alternative,

...
which is to use email or SMS as a one time password device. SMS is obviously more secure than email, but would require us to ask people for their phone numbers. We could also make a PKI

infrastructure, and

...
allow certificate login, which is obviously safer than passwords.

The real problem with any system stronger than passwords, is that it requires a level of complexity that would be difficult for us, and either annoying or very confusing for users.

Respectfully,

Ryan Lane

OpenID? The account my own OpenID is tied to has two-factor authentication.

As mentioned in another post, I think we should support OpenID as a provider and a consumer. It pushes the authentication problem elsewhere, but that elsewhere could be more secure that what we are providing, assuming we are providing OpenID over SSL. Unfortunately, that elsewhere may be less secure than us, but that would be the user's choice (or problem, if they don't know their provider is less secure).

Like other methods of authentication, though, providing OpenID as a consumer is confusing for end-users and difficult for us. There are a number of usability issues associated with OpenID that haven't been tackled well yet.

Respectfully,

Ryan Lane

Tim Starling

19 Aug 19 Aug

6:52 a.m.

On 19/08/10 19:02, Robert Rohde wrote:

...

Let me preface my comment by saying that I haven't studied WHIRLPOOL, and the following may not apply to it at all.

However, it is known that some block cypher based hashes behave poorly when fed repeated copies of the same block. In the worst cases the hash space is substantially truncated from its full size (which probably is not the case for any serious cryptographic hash function). Under less severe cases, cryptanalysis can find a new block cipher W' such that N applications of block cipher W is the same as one application of W'. If WHIRLPOOL is vulnerable to that kind of attack then it would negate the effect of using str_repeat in your code.

Like I said, I don't know if either concern applies to WHIRLPOOL. However, these concerns only occur because the 256-bit string you are repeating is a fundamental divisor of the 512-bit block size used by WHIRLPOOL. So, it is trivial to avoid the whole issue simply by using a different repeated block size. For example 97 copies of a 33 byte string should have essentially the same computational performance, while making any associated cryptanalysis threat impossible (or at least less likely).

I think it's unlikely that any structure would appear with repeated input with any modern cryptographic hash function. But the solution you suggest is simple enough, so we may as well add that.

...

My only other comment is something you presumably already know. Your proposal is still nothing but an arms race. It makes hashes harder to crack by making the hash function itself much more computationally expensive. However, you'd still have to periodically boost the rep rate with the intention of staying far in front of the hackers.

Yes, I mentioned that I included the ability to add more iterations without knowing the original password. That's why the hash has to be truncated to the output size at each iteration, instead of using the full hash as an internal state and truncating once before output.

...

As a complementary approach it would be nice if there was something in Mediawiki to aid in the selection of strong passwords. Regardless of hash function, it will still take about two billion times longer to find one 10 character password in [A-Za-z0-9] as it does to find a 6 character password in [a-z]. Even if password strength testing algorithms were disabled on Wikipedia sites, it would still be a nice addition to have in the Mediawiki codebase in general.

I believe a JavaScript password strength meter was recently added to the core.

-- Tim Starling

Aryeh Gregor

2:55 p.m.

On Thu, Aug 19, 2010 at 2:37 AM, Tim Starling tstarling@wikimedia.org wrote:

...

The problem with the standard key strengthening algorithms, e.g. PBKDF1, is that they are not efficient in PHP. We don't want a C implementation of our scheme to be orders of magnitude faster than our PHP implementation, because that would allow brute-forcing to be more feasible than is necessary.

The idea I came up with is to hash the output of str_repeat(). This increases the number of rounds of the compression function, while avoiding tight loops in PHP code.

Seems reasonable.

...

PHP's hash extension has been available by default since PHP 5.1.2, and we can always fall back to using B-type hashes if it's explicitly disabled. The WHIRLPOOL hash is supported. It has no patent or copyright restrictions so it's not going to be yanked out of Debian or PHP for legal reasons. It has a 512-bit block size, the largest of any hash function available in PHP, and its security goals state that it can be truncated without compromising its properties.

My proposed hash function is a B-type MD5 salted hash, which is then further hashed with a configurable number of invocations of WHIRLPOOL, with a 256-bit substring taken from a MediaWiki-specific location. The input to each WHIRLPOOL operation is expanded by a factor of 100 with str_repeat().

The number of WHIRLPOOL iterations is specified in the output string as a base-2 logarithm (whimsically padded out to 3 decimal digits to allow for future universe-sized computers). This number can be upgraded by taking the hash part of the output and applying more rounds to it. A count of 2^7 = 128 gives a time of 55ms on my laptop, and 12ms on one of our servers, so a reasonable default is probably 2^6 or 2^7.

That seems reasonable. It could probably be done a lot faster on GPUs, I guess.

On Thu, Aug 19, 2010 at 4:45 AM, Daniel Kinzler daniel@brightbyte.de wrote:

...

I don't know that much about the mathematical details of hashing, but i'd like to drop a pointer to an article if found interesting in this context:

"Stop using unsafe keyed hashes, use HMAC" http://rdist.root.org/2009/10/29/stop-using-unsafe-keyed-hashes-use-hmac/

So, how does your proposal relate to HMAC?

As Tim said, it doesn't -- we aren't using keyed hashes, and we're only concerned about preimage attacks (not collision or second-preimage). Preimage attacks imply second-preimage attacks, and second-preimage attacks imply collision attacks. Thus something that's secure against collision is also secure against preimage and second-preimage, but a function with collision and second-preimage attacks might have no preimage attacks. For instance, MD5 has tons of trivial collision attacks against it, but no preimage attacks (not sure about second-preimage attacks offhand). Whirlpool has no known collision attacks, and thus no known preimage or second-preimage attacks.

On Thu, Aug 19, 2010 at 5:02 AM, Robert Rohde rarohde@gmail.com wrote:

...

Let me preface my comment by saying that I haven't studied WHIRLPOOL, and the following may not apply to it at all.

However, it is known that some block cypher based hashes behave poorly when fed repeated copies of the same block. In the worst cases the hash space is substantially truncated from its full size (which probably is not the case for any serious cryptographic hash function). Under less severe cases, cryptanalysis can find a new block cipher W' such that N applications of block cipher W is the same as one application of W'. If WHIRLPOOL is vulnerable to that kind of attack then it would negate the effect of using str_repeat in your code.

In principle, we could evade any concern like this by just using a provably secure hash function. The usual reason not to use those is that they're slow, but that's an advantage in our case. For instance (from an exercise in my cryptography course), let p be a prime, q a prime dividing p - 1, and let G be the subgroup of Z_p^* of order q. Let g be a randomly chosen generator for G, and let a_1, ..., a_k be randomly chosen elements of G. If x, y_1, ..., y_k are integers in the range 1 to q, define H(x, y_1, ..., y_k) = g^x a_1^y_1 ... a_k^y_k. Then under the discrete logarithm assumption for G, it's easy to prove that H is collision-resistant. You can make similar functions based on other hard problems -- http://en.wikipedia.org/wiki/Provably_secure_cryptographic_hash_function gives another example using factorization, whose correctness is probably more obvious.

There are two problems with using such a function. One is that there's probably no readily available implementation, so we'd have to write our own, which might be vulnerable to side-channel attacks. Another is that we're more worried about brute-forcing than about the algorithm being broken, so we'd have to write it in C to avoid giving one or two orders of magnitude advantage to the attacker, and then shared hosts can't use it. Perhaps the way to solve both problems is to submit the code for inclusion in future versions of the PHP hash module, after at least casual review by some cryptographers. This would require significant extra work, of course.

Another thing to consider is if we could pick a function that's particularly inconvenient to execute on GPUs. Those are a great way for crackers to easily outdo any CPU implementation.

For a basic implementation, anyway, Tim's proposal sounds like it's much better than what we have now. Fancy stuff like provably correct hash functions is probably overkill.

On Thu, Aug 19, 2010 at 5:02 AM, Robert Rohde rarohde@gmail.com wrote:

...

As a complementary approach it would be nice if there was something in Mediawiki to aid in the selection of strong passwords. Regardless of hash function, it will still take about two billion times longer to find one 10 character password in [A-Za-z0-9] as it does to find a 6 character password in [a-z]. Even if password strength testing algorithms were disabled on Wikipedia sites, it would still be a nice addition to have in the Mediawiki codebase in general.

I agree we should have a good system available for those who want it, and it should probably be enabled by default for sysops. However, as I've said elsewhere, these strength testing things are counterproductive for unprivileged users on open wikis. For such users, a compromised account is approximately as bad as just forgetting your password -- the result in either case is you just lose access to the account. If it's compromised, it might be somewhat worse (because the attacker might prevent an e-mail reset), but on the other hand, forgetting your password is at least a couple orders of magnitude more likely. As such, to minimize harm to users, we should encourage them to use easy-to-remember passwords, not strong passwords.

On Thu, Aug 19, 2010 at 10:12 AM, Jonathan Leybovich jleybov@yahoo.com wrote:

...

What about using public key cryptography? Generate a key-pair and use the "public" key to produce your password hashes. Store the private key offline in an underground vault just in case someday you'll need to recover the original passwords in order to rehash them. Needless to say the key-pair must be entirely for internal use and not already part of some PKI system (i.e. the basis for one of Wikimedia's signed SSL certificates).

If you just delete the private key, this is the same as any old hash function, but with various additional deficiencies. For instance, public-key encryption is not designed to hide length information, while hash functions are -- although due to padding, this might not be a big deal. More seriously, public-key encryption schemes are invariably probabilistic, which makes them useless for our purposes. We'd have to adapt the scheme to remove any randomization. And there might be other problems. There's really no reason to go to all this trouble -- people have designed and studied dedicated hash functions for a reason.

On Thu, Aug 19, 2010 at 10:50 AM, Ryan Lane rlane32@gmail.com wrote:

...

We could do a less secure, but more-secure-than-passwords alternative, which is to use email or SMS as a one time password device. SMS is obviously more secure than email, but would require us to ask people for their phone numbers.

SMS has loads of vulnerabilities:

http://en.wikipedia.org/wiki/SMS#Vulnerabilities

I don't see anyone signing up to get an e-mail or text message every time they want to log in, either. At best, this pushes the authentication problem back to their e-mail or SMS provider. In that case, why not just use OpenID?

...

We could also make a PKI infrastructure, and allow certificate login, which is obviously safer than passwords.

Not if the password is not stored on the computer and the private key is.

...

The real problem with any system stronger than passwords, is that it requires a level of complexity that would be difficult for us, and either annoying or very confusing for users.

Yes, so let's not worry about it, shall we? We aren't the NSA here.

On Thu, Aug 19, 2010 at 2:18 PM, Jonathan Leybovich jleybov@yahoo.com wrote:

...

In that case you could always discard the private portion of the key-pair to produce a strictly "one-way" function. And at least with this scheme you always do have the option of moving to 'C' regardless of whether it can accept the end-products of B as inputs. Plus I would wager that asymmetric ciphers will stand up to attacks far longer than most hashing functions.

As I noted above, there are hash functions whose security is provable based on the exact same assumptions used to prove security of various popular asymmetric encryption schemes. As I also noted above, there are problems with naively trying to use public-key encryption instead of hash functions. It makes more sense to just use known-secure hash functions directly instead of trying to twist public-key encryption to our needs, if we're that worried about Whirlpool (et al.) being broken anytime soon.

For what it's worth, even ancient and thoroughly-broken hash functions like MD4 don't have readily-usable preimage attacks. (MD4 was published in 1990, broken in 1995, and only in 2008 was a preimage attack published -- which still requires 2^102 evaluations, instead of 2^128.) We don't have to worry much about the hash functions being broken.

Lane, Ryan

4:16 p.m.

...

...
We could do a less secure, but more-secure-than-passwords

alternative,

...
which is to use email or SMS as a one time password device. SMS is obviously more secure than email, but would require us to ask people for their phone numbers.

SMS has loads of vulnerabilities:

http://en.wikipedia.org/wiki/SMS#Vulnerabilities

I don't see anyone signing up to get an e-mail or text message every time they want to log in, either. At best, this pushes the authentication problem back to their e-mail or SMS provider. In that case, why not just use OpenID?

Though SMS has a number of vulnerabilties, as listed in the link, in practical terms, it is likely to be safer than email for one time passwords. Remember: one time passwords are used as a form of two factor authentication. The SMS is sent to something they have after the user enters the thing they know. The thing you know in this system is often a password. People tend to use the same password everywhere, and a user's wikipedia password is likely their email password, which would make a one time password sent to the email less effective.

With SMS, an attacker would have to know the user's password, and would have to intercept the SMS, which isn't easy enough to be worth the trouble.

OpenID also just pushes the authentication problem back to the OpenID provider. If we are acting as the user's OpenID provider, then the problem is back with us. That said, I agree we should act as an OpenID provider and consumer. We are far more likely to act as a provider before we act as a consumer. Note that this was discussed at the Berlin conference.

...

...
We could also make a PKI infrastructure, and allow certificate login, which is obviously safer than passwords.

Not if the password is not stored on the computer and the private key is.

But it is, if the private key is stored on a thumb drive (in a crypto application), or on a smart card. Even if the private key and password are stored on the filesystem unencrypted, an x509 key is safer than a password simply because it is *much* more complex, so it is very unlikely to be brute forced.

...

...
The real problem with any system stronger than passwords, is that it requires a level of complexity that would be difficult for us, and either annoying or very confusing for users.

Yes, so let's not worry about it, shall we? We aren't the NSA here.

This is a pretty smug statement.

I think it would be nice to offer more secure methods of authentication to users who choose to take advantage of them. One time passwords would likely be too confusing to force on everyone, but they aren't too confusing to offer as an option. It also isn't very difficult to implement on the authentication server's end either. Also, if we are to act as an OpenID provider, it would be pretty nice to offer these more secure alternatives.

Respectfully,

Ryan Lane

Aryeh Gregor

4:37 p.m.

On Thu, Aug 19, 2010 at 5:16 PM, Lane, Ryan Ryan.Lane@ocean.navo.navy.mil wrote:

...

Though SMS has a number of vulnerabilties, as listed in the link, in practical terms, it is likely to be safer than email for one time passwords. Remember: one time passwords are used as a form of two factor authentication. The SMS is sent to something they have after the user enters the thing they know. The thing you know in this system is often a password. People tend to use the same password everywhere, and a user's wikipedia password is likely their email password, which would make a one time password sent to the email less effective.

With SMS, an attacker would have to know the user's password, and would have to intercept the SMS, which isn't easy enough to be worth the trouble.

I don't get what you're suggesting here. Every time a user logs in, they need to both enter their password and follow a link in a text message? Do you think that more than a double-digit number of Wikipedia users would actually opt into this? Or are you talking about support in MediaWiki to be used on high-security corporate or government sites, not Wikipedia?

If someone is willing to do the work, I'm not objecting to supporting this in the software, but if we're talking about Wikipedia, it doesn't make sense to try going this far.

...

But it is, if the private key is stored on a thumb drive (in a crypto application), or on a smart card. Even if the private key and password are stored on the filesystem unencrypted, an x509 key is safer than a password simply because it is *much* more complex, so it is very unlikely to be brute forced.

Yes, certificates are usually more secure than passwords.

...

This is a pretty smug statement.

I don't think so. I think it's completely reasonable, when talking about Wikipedia. Hackers go after money, and there's no money in hacking Wikipedia. We have nothing secret or valuable that's not already readily available. We have no black-market competitors who want to try disrupting our service. Any malicious action could be easily reversed. The worst we have to worry about is someone with a grudge trying to frame someone else, which has happened, but it's hardly a pressing concern.

I seriously cannot see more than a few dozen Wikimedia users actually going to the effort of using one-time passwords over SMS just to protect their account. Consequently, it's not reasonable to ask Wikimedia sysops to do the deployment work necessary for sending SMS from Wikimedia servers, if it's nonzero. Nor is it reasonable to have the code reviewed, and the option offered (we already have too many options), when there's so little benefit.

Every feature has an inherent cost in code maintenance and complexity of use, so features that are too marginal should not be part of the software.

...

I think it would be nice to offer more secure methods of authentication to users who choose to take advantage of them. One time passwords would likely be too confusing to force on everyone, but they aren't too confusing to offer as an option. It also isn't very difficult to implement on the authentication server's end either. Also, if we are to act as an OpenID provider, it would be pretty nice to offer these more secure alternatives.

There is no point in providing options that virtually no one will use. It wastes the effort of all the people who have the maintain the relevant code, and it's yet more distraction on our already way-too-bloated preferences page. And it will not be useful to anyone when someone turns on the preference by mistake and can now no longer log in because they gave a phone number that doesn't receive SMS, or whatever. When few enough people want a preference that more people are likely to turn it on by mistake than deliberately, and when there's significant harm or confusion from turning it on by mistake, that's a sign that it's a bad preference. (See also: "Use external editor".)

Do you think that more than 0.01% of Wikimedia users will enable any such preference if provided?

David Gerard

4:44 p.m.

On 19 August 2010 22:37, Aryeh Gregor Simetrical+wikilist@gmail.com wrote:

...

Do you think that more than 0.01% of Wikimedia users will enable any such preference if provided?

People are also going to keep thinking they're clever by using "fuck" as a password. Remember last time?

http://davidgerard.co.uk/notes/2007/05/07/tubgirl-is-love/

A better password algorithm will at least solve a part of the problem that's understood. Anyone who would choose to use SMS would, I suspect, have picked a good password in the first place. Can we do anything practical for people who can't remember passwords?

- d.

Lane, Ryan

4:50 p.m.

...

People are also going to keep thinking they're clever by using "fuck" as a password. Remember last time?

http://davidgerard.co.uk/notes/2007/05/07/tubgirl-is-love/

A better password algorithm will at least solve a part of the problem that's understood. Anyone who would choose to use SMS would, I suspect, have picked a good password in the first place. Can we do anything practical for people who can't remember passwords?

OpenID as a consumer somewhat helps with this problem, as people are more likely to use more complex passwords if they have to remember fewer passwords.

...

From a practical point of view, minus enforcing complexity rules, or at

least showing a password strength indicator and encouraging strong passwords, there isn't much to do.

Respectfully,

Ryan Lane

Aryeh Gregor

5:05 p.m.

On Thu, Aug 19, 2010 at 5:44 PM, David Gerard dgerard@gmail.com wrote:

...

People are also going to keep thinking they're clever by using "fuck" as a password. Remember last time?

http://davidgerard.co.uk/notes/2007/05/07/tubgirl-is-love/

Admins need to be forced to use secure passwords, using some standard intelligent password checker. (The default one on RHEL is excellent, if memory serves.) Nothing more than secure passwords is needed even for admins, and regular users should not be encouraged to use hard-to-remember passwords. Maybe we could ban the very most common passwords for regular users, at most. It wasn't too long ago that we allowed the empty string as a password.

On Thu, Aug 19, 2010 at 5:47 PM, Lane, Ryan Ryan.Lane@ocean.navo.navy.mil wrote:

...

World of Warcraft provides RSA cards to their users. People use them.

Because they have many thousands of dollars and man-hours invested in their account. Hackers who will try to guess their password and sell the loot are a very credible and damaging threat. Nothing comparable is true of Wikipedia. You have to tailor the security measures to the real-world threats.

...

Either way, I'd likely be the person writing this support, and it would be as an extension, or through another means that wouldn't require much effort.

I don't object to people writing whatever extensions interest them. Personally, I'd be surprised if you'll get Wikimedia sysadmins interested enough to turn it on, but that's not my decision.

This has strayed rather far from the original topic, though, so maybe it should split to a separate thread if anyone is interested in continuing.

Lane, Ryan

4:47 p.m.

...

There is no point in providing options that virtually no one will use. It wastes the effort of all the people who have the maintain the relevant code, and it's yet more distraction on our already way-too-bloated preferences page. And it will not be useful to anyone when someone turns on the preference by mistake and can now no longer log in because they gave a phone number that doesn't receive SMS, or whatever. When few enough people want a preference that more people are likely to turn it on by mistake than deliberately, and when there's significant harm or confusion from turning it on by mistake, that's a sign that it's a bad preference. (See also: "Use external editor".)

Do you think that more than 0.01% of Wikimedia users will enable any such preference if provided?

World of Warcraft provides RSA cards to their users. People use them. I think some of the same people that use the SSL secured login would also opt to use a more secure method of authentication. I think this is especially the case if we were an OpenID provider, and people used us for this service.

Either way, I'd likely be the person writing this support, and it would be as an extension, or through another means that wouldn't require much effort.

Respectfully,

Ryan Lane

Tgr

20 Aug 20 Aug

7:16 a.m.

Aryeh Gregor <Simetrical+wikilist <at> gmail.com> writes:

...

I don't think so. I think it's completely reasonable, when talking about Wikipedia. Hackers go after money, and there's no money in hacking Wikipedia. We have nothing secret or valuable that's not already readily available. We have no black-market competitors who want to try disrupting our service. Any malicious action could be easily reversed. The worst we have to worry about is someone with a grudge trying to frame someone else, which has happened, but it's hardly a pressing concern.

That is true for regular accounts, but with administrator access you can run malicious javascript on a large number of machines or track the visitors of a certain article. A totalitarian government going after checkuser access is not an unimaginable scenario either.

That said, the two things that would make the most difference (and are also much easier to implement) are SSL and password strength requirements. There is no point in fancy stuff like SMS or asymmetric cyphers which would be much more disruptive, a lot harder to introduce, and would have less effect.

...

When few enough people want a preference that more people are likely to turn it on by mistake than deliberately, and when there's significant harm or confusion from turning it on by mistake, that's a sign that it's a bad preference. (See also: "Use external editor".)

Not to disagree with your general point, but that specific problem would be easy to handle by throwing a dialog with big red exclamation marks saying "WARNING! Arey you REALLY sure you know what you are doing?" when one is about to turn on such a feature. (Or only showing the controls when the user selects "expert mode".)

Lane, Ryan

8:17 a.m.

New subject: Alternative authentication methods (was New password hashing proposal)

...

...
When few enough people want a preference that more people are likely to turn it on by mistake than deliberately, and when there's significant harm or confusion from turning it on by mistake, that's a sign that it's a bad preference. (See also: "Use external editor".)

Not to disagree with your general point, but that specific problem would be easy to handle by throwing a dialog with big red exclamation marks saying "WARNING! Arey you REALLY sure you know what you are doing?" when one is about to turn on such a feature. (Or only showing the controls when the user selects "expert mode".)

It would be fairly difficult to lock yourself out just by enabling the option, as part of enabling the option would be to login with the new authentication method. The biggest issue would be reseting credentials when the "what you have" is lost.

Respectfully,

Ryan Lane

MZMcBride

21 Aug 21 Aug

1:17 a.m.

Aryeh Gregor wrote:

...

I don't think so. I think it's completely reasonable, when talking about Wikipedia. Hackers go after money, and there's no money in hacking Wikipedia. We have nothing secret or valuable that's not already readily available. We have no black-market competitors who want to try disrupting our service. Any malicious action could be easily reversed. The worst we have to worry about is someone with a grudge trying to frame someone else, which has happened, but it's hardly a pressing concern.

Facebook has been having issues with compromised accounts that send out spam, either through Facebook messages or Wall posts. This doesn't completely refute your point, but it is a pretty good example of bad users going after readily available, free-to-make accounts in order to misuse them.

...

Upon promotion, the user should be required to re-enter their password before they get access to elevated privileges, and change it if it's not secure enough.

It would be much easier and convenient to check the password upon login.

...

It makes much more sense to remove the option and let people use the API or custom JavaScript or a browser extension if they want to use an external editor.

So that a local wiki admin can add the custom JavaScript as a gadget and the preference can ultimately move from one tab to another? :-)

Tgr wrote:

...

A totalitarian government going after checkuser access is not an unimaginable scenario either.

Yes, it is.

MZMcBride

Nikola Smolenski

2:49 a.m.

Дана Saturday 21 August 2010 08:17:17 MZMcBride написа:

...

Tgr wrote:

...
A totalitarian government going after checkuser access is not an unimaginable scenario either.

Yes, it is.

Well, given that he has just imagined it, it is definitely not unimaginable :)

soxred93

19 Aug 19 Aug

4:19 p.m.

...

On Thu, Aug 19, 2010 at 10:50 AM, Ryan Lane rlane32@gmail.com wrote:

...
We could do a less secure, but more-secure-than-passwords alternative, which is to use email or SMS as a one time password device. SMS is obviously more secure than email, but would require us to ask people for their phone numbers.

I don't do SMS, and I'm sure I'm not the only one who would rather not pay to get a password.

-X!

Tim Starling

10:06 p.m.

On 20/08/10 05:55, Aryeh Gregor wrote:

...

On Thu, Aug 19, 2010 at 2:37 AM, Tim Starling tstarling@wikimedia.org wrote:

...
The number of WHIRLPOOL iterations is specified in the output string as a base-2 logarithm (whimsically padded out to 3 decimal digits to allow for future universe-sized computers). This number can be upgraded by taking the hash part of the output and applying more rounds to it. A count of 2^7 = 128 gives a time of 55ms on my laptop, and 12ms on one of our servers, so a reasonable default is probably 2^6 or 2^7.

That seems reasonable. It could probably be done a lot faster on GPUs, I guess.

Well, a GPU is fast because it is massively parallel, with hundreds of cores. Each core is typically slower than a CPU. I chose a function which is non-parallelisable, so you'd expect computation of a single hash to be slower on a GPU than on a CPU. But a GPU can calculate hundreds of them at a time.

My idle fantasy of choosing a parallelisable function and then using GPUs to accelerate password hashing ended when I found out how much it would cost to fit out the Wikimedia cluster with half a dozen Tesla cards. I don't think the powers that be would be particularly interested in spending that kind of money for a tiny improvement in security.

[...]

...

Another thing to consider is if we could pick a function that's particularly inconvenient to execute on GPUs. Those are a great way for crackers to easily outdo any CPU implementation.

I think that would be a more useful way to go than provably secure hash functions. The relevant Wikipedia article suggests using a "memory bound function", which is sensitive to memory access time, and gets faster when more memory is available. Personally, I think it would be interesting to attempt to construct a function which is limited by branch prediction errors. They are said to be particularly expensive for GPUs. They also get progressively more expensive for more recent CPUs, which means that people with old hardware would have access to more secure hashing than they would otherwise.

-- Tim Starling

Jared Williams

20 Aug 20 Aug

11:26 a.m.

...

-----Original Message----- From: wikitech-l-bounces@lists.wikimedia.org [mailto:wikitech-l-bounces@lists.wikimedia.org] On Behalf Of Tim Starling Sent: 19 August 2010 07:37 To: wikitech-l@lists.wikimedia.org Subject: [Wikitech-l] New password hashing proposal

It's been said (e.g. [1]) that hashing passwords with two rounds of MD5 is basically a waste of time these days, because brute-forcing even relatively long passwords is now feasible with cheap hardware. Indeed, you can buy software [2] which claims to be able to check 90 million MediaWiki passwords per second on an ordinary GPU. That would let you crack a random 8-letter password in 20 minutes.

So the time has probably come for us to come up with a "C" type password hashing scheme, to replace the B-type hashes that we use at the moment. I've been thinking along the lines of the following goals:

Future-proof: should be adaptable to faster hardware.

Upgradeable: it should be possible to compute the C-type

hash from the B-type hash, to allow upgrades without bothering

users.

...

Efficient in PHP, with default configure options.

MediaWiki-specific, so that generic software can't be used

to crack our hashes.

The problem with the standard key strengthening algorithms, e.g. PBKDF1, is that they are not efficient in PHP. We don't want a C implementation of our scheme to be orders of magnitude faster than our PHP implementation, because that would allow brute-forcing to be more feasible than is necessary.

The idea I came up with is to hash the output of str_repeat(). This increases the number of rounds of the compression function, while avoiding tight loops in PHP code.

PHP's hash extension has been available by default since PHP 5.1.2, and we can always fall back to using B-type hashes if it's explicitly disabled. The WHIRLPOOL hash is supported. It has no patent or copyright restrictions so it's not going to be yanked out of Debian or PHP for legal reasons. It has a 512-bit block size, the largest of any hash function available in PHP, and its security goals state that it can be truncated without compromising its properties.

My proposed hash function is a B-type MD5 salted hash, which is then further hashed with a configurable number of invocations of WHIRLPOOL, with a 256-bit substring taken from a MediaWiki-specific location. The input to each WHIRLPOOL operation is expanded by a factor of 100 with str_repeat().

The number of WHIRLPOOL iterations is specified in the output string as a base-2 logarithm (whimsically padded out to 3 decimal digits to allow for future universe-sized computers). This number can be upgraded by taking the hash part of the output and applying more rounds to it. A count of 2^7 = 128 gives a time of 55ms on my laptop, and 12ms on one of our servers, so a reasonable default is probably 2^6 or 2^7.

Demo code: http://p.defau.lt/?udYa5CYhHFrgk4SBFiTpGA

Typical output: :C:007:187aabf399e25aa1:9441ccffe8f1afd8c277f4d914ce03c6fcfe15 7457596709d846ff832022b037

-- Tim Starling

[1]

http://www.theregister.co.uk/2010/08/16/password_security_analysis/

...

[2] http://www.insidepro.com/eng/egb.shtml

PHP's crypt has been upgraded in recent times to now include Ulrich Dreppers' SHA crypt [1]

Certainly mets 1 & 3.

[1] http://www.akkadia.org/drepper/SHA-crypt.txt

Jared

5233

Age (days ago)

5235

Last active (days ago)

wikitech-l@lists.wikimedia.org

22 comments

13 participants

tags (0)

participants (13)

Aryeh Gregor
Daniel Friesen
Daniel Kinzler
David Gerard
Jared Williams
Lane, Ryan
MZMcBride
Nikola Smolenski
Robert Rohde
Ryan Lane
soxred93
Tgr
Tim Starling