If this seems long and at the start quite boring and familiar and you are really in a hurry, skip to the ###. :)

Let us not forget some lower limits to the number of bits needed to provide sufficient security. The number of bits needed depends on the number of tries an attacker can do per second. Let's start with 128 bits. A 128 bit HMAC, CMAC,... or fingerprint (i.e. hash) is currently thought to provide enough security even against a powerful attacker that can run an attack for years. Now we do not want to use 128 bits because we know that people are bad at comparing a 128 bit fingerprint. So how can we shave off bits?

- make the algorithm slow. The 128-bit starting value is based on the fact that hashes etc. are easy to implement in hardware and can be computed quickly. Making the algorithm n times slower than a plain hash saves ld(n) (i.e. log2(n)) bits. This also means the legit user has to invest more time, but they usually do not need to run the algorithm frequently. Examples that use this approach are:
- PBKDF2
- scrypt
- fingerprints with leading zeros
- fingerprints that are "nicely readable" in some baseXX (*)
- (add more here)
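As a sketch of the slowdown approach (iteration count and inputs are illustrative, not recommendations): Python's standard hashlib exposes PBKDF2, and the bit saving is just the base-2 log of the iteration count.

```python
import hashlib
import math

# Hypothetical parameter: 1,000,000 PBKDF2 iterations versus a single
# hash call. Each factor-of-two slowdown "saves" roughly one bit of
# required secret/fingerprint length.
iterations = 1_000_000
bits_saved = math.log2(iterations)  # ld(n) in the text's notation

# Derive a key; the attacker must pay the same 10^6-hash cost per guess.
dk = hashlib.pbkdf2_hmac("sha256", b"secret", b"salt", iterations)

print(f"~{bits_saved:.1f} bits saved")  # ~19.9 bits saved
```

So a million iterations buy roughly 20 bits; the same ld(n) reasoning applies to leading-zero fingerprints, where the legit party pays the 2^n search cost once up front.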

- only allow a limited number of trials per time unit, or even a limited total number of trials. This reduces the number of bits dramatically. It already leans slightly into the realm of special systems, because you cannot have this in an environment where true offline attacks are possible (that's my theory, surprise me with a counter-example). This is well suited for even the most forgetful people (though I must admit that I was once completely in the dark about my PIN code too... very awkward)
- ATM pin codes ("secure" hardware)
- SIM card pin codes (mobile phones) ("secure" hardware)
- any kind of well-implemented remote login. If you have hundreds of servers that use the same authentication system without telling each other about failed attempts, that of course factors in. (This presumes the impossibility of offline attacks, i.e. the attacker does not have the password's hash and maybe does not even know whether a user exists.) PAKE tries this, but as Trevor mentioned, it is not water-tight, as there are viable possibilities for online and offline brute-forcing.
- (add more here)
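The effect of a trial limit can be put in numbers with a toy model (my own sketch, assuming a uniformly chosen secret and an attacker who never repeats a guess):

```python
# With a hard cap of k distinct guesses against a uniform secret drawn
# from a keyspace of size n, the attacker's success probability is k/n,
# no matter how fast their hardware is.
def success_probability(allowed_tries, keyspace):
    return allowed_tries / keyspace

# A 4-digit ATM PIN (10^4 possibilities) with a 3-try lockout:
p = success_probability(3, 10**4)
print(p)  # 0.0003
```

This is why ~13 bits of PIN can be "enough" behind secure hardware, while the same 13 bits would fall in microseconds to an offline attack.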

- restrict yourself to VERY specific cases:
- people must physically meet each other AND
- bring mobile phones / business cards / ...
- make up and remember (or write down, which is mostly bad, because the value is a secret) bit strings of a certain length, i.e. random words... how do you know whether your random words have the required number of bits? This case really is very shady from a security perspective, even though the procedure is easily understandable, i.e. usability (as in user-friendliness) is high
- you have an existing secure channel (there should not need to be any discussion about this case)
- (add more here)
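On the "how do you know if your random words have the required number of bits" question: the entropy is only well-defined if the words are drawn uniformly from a known list, not made up in one's head. A sketch with a hypothetical diceware-sized list:

```python
import math

# Hypothetical word list of 7776 words (the classic diceware size).
# Drawn uniformly, each word contributes log2(7776) ~ 12.9 bits;
# words a human "thinks up" contribute far less.
WORDLIST_SIZE = 7776
bits_per_word = math.log2(WORDLIST_SIZE)

# Words needed to reach ~84 bits (the entropy figure used further below):
words_needed = math.ceil(84 / bits_per_word)

print(f"{bits_per_word:.1f} bits/word, {words_needed} words for 84 bits")
```

The point is that the bit count comes from the sampling procedure, not from the words looking random.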

- make a weak assumption
- "voice imitation is difficult" (it is not, there are many people who can imitate voices compared to people who can fake hashes) - "people can think up random bitstrings of certain length" (very unlikely. numerous studies have shown that the human being is not a good RNG. E.g. if i knew someone met at a conference and they made up a shared secret there, i'd probably try something with *con*, cake, coffee, napkin... and anything i know about those people personally. Also, people tend to forget these one-time secrets and write them down - subverting security is as easy as stealing their wallets then)
- (add more here)

- more, better ways to shave off bits?

Added complications are extra requirements on the value/bitstring, which basically mean added bits and/or decreased usability, often to the point where people do not know what to do:
- unlinkability
- (add more here)

Anything non-computer-generated with a very low bit count is only applicable in _very_ specific settings, varying depending on what is assumed about the attacker (i.e. are you a high-profile target?). Personally I am more interested in schemes that are as close to unconditionally secure as possible, and I do not trust any of the "weak assumptions" I listed - for me, any scheme that relies on them is just about as good as no scheme. That does not mean it actually IS that way; I am just saying that, for me, the difference between 0.0001 and 0 is negligible.

### Many discussions here have contained several solutions. Some are solutions to very specific settings, where suddenly, under certain assumptions connected with that setting, the number of needed bits is very low. The same discussions sometimes also contain solutions with generalized approaches that do not rely on anything except the presumed hardness of certain problems (ECDLP, prime factoring, "inverting" hashes). More often than not, solutions from both realms need a comparable number of bits for the user to handle - in that case it is obvious that the strong solution is the better one.

I propose making it very clear which case a certain discussion is about, maybe by writing down the assumptions about the system (even the obvious ones) and further requirements (unlinkability etc.). Otherwise a discussion can contain 5 different schemes at the same time that are all secure - each under its own very specific assumptions. This might lead to confusion, and it is doubtful that the discussion can arrive at valuable results if the challenge (system parameters, requirements) is not clear (maybe it was just me to whom the challenge was sometimes not completely clear). Example requirements might be "no business cards, no mobile phones" or "users can easily be tricked into accepting a wrong hash as the right one", both of which would rule out any discussion about fingerprints or random computer-generated values; the discussion would then just be about which scheme provides the most security while being as user-friendly as possible under the given assumptions. There could be discussions that aim for different settings - one allowing business cards, the other not - in which case answers should make clear which of the target scenarios they refer to. These are just ideas :)

(*) Note on the "nicely readable" fingerprints: I am personally not sure how much you actually gain by doing it. Reading such a fingerprint out loud to someone else, you might (as the listener) not be able to distinguish between a's and e's or i's and since the fingerprint is basically an alteration between consonant and vowel, there is a lot of potential for attackers; basically, they just need to get the consonants right, and even there you can swap c with k etc... There need to be studies for this. Of course it is even pretty hard already to just get the consonants right by brute force. Basically what I am saying is that as an attacker you just need to get something that "sounds right", which might be much easier than getting something that is close to the original in a hamming-distance way.

The number of bits you save is easily calculated: as mentioned, the "readable" fingerprint is mostly an alternation between vowels (5 = 2.3 bits) and consonants (21 = 4.4 bits), totaling 6.7 bits per character pair, i.e. 3.4 bits per character (actually slightly more: 1 extra bit for whether we start with a consonant or a vowel, and the alternation is not strict). Comparing that with the theoretical entropy of a base32 string (5 bits per character), we reduce the entropy to 3.4/5 = 67% (of course it doesn't scale that way; if you start with a 256-bit hash you will probably never get close to 67%). A 125-bit value that meets the "niceness" criteria therefore has around 84 bits of entropy. Achieving the same by looking for a hash with leading zeros means finding a 125-bit hash with 41 leading zeros, occurring on average once every 2^41 hashes (2,199,023,255,552, i.e. about 2.2 trillion). However, that hash might be more valuable (depending on the results of the needed "nicely readable hash" study), because you can take the remaining 84 bits and encode them into whatever shape is, coding-theoretically speaking, the shortest one with the most distance by human standards (visual and auditory), therefore providing the lowest error rate at the shortest word length / lowest word count. Of course you need to encode the number of leading zeros somehow, which adds around 6 bits of information to the remaining 84 bits from the hash. Alternatively, find a hash with even more leading zeros to make up for those 6 bits, getting you to 2^47, which is already very uncomfortable to compute on standard hardware: 1 in every 140,737,488,355,328, i.e. about 140 trillion...
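The arithmetic above can be checked in a few lines (a sketch using the same simplification as the text: strict vowel/consonant alternation over 5 vowels and 21 consonants, ignoring the extra ~1 bit for the starting letter):

```python
import math

# Entropy of the "nicely readable" alphabet model.
bits_per_pair = math.log2(5) + math.log2(21)  # ~6.7 bits per vowel+consonant pair
bits_per_char = bits_per_pair / 2             # ~3.4 bits per character

chars = 125 // 5                    # a 125-bit value is 25 base32 characters
entropy = chars * bits_per_char     # ~84 bits survive the "niceness" filter
leading_zeros = 125 - round(entropy)  # zeros needed for the same work factor

print(round(entropy), leading_zeros, 2**leading_zeros)  # 84 41 2199023255552
```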

I hope at least parts of this are useful in the discussion.


Stefan
_______________________________________________
Messaging mailing list
[email protected]
https://moderncrypto.org/mailman/listinfo/messaging
