Re: p1619 (disk): Security concerns of LRW and an alternative mode

David McGrew Thu, 22 Dec 2005 06:48:29 -0800

Hi Matt,

On Dec 21, 2005, at 3:20 PM, Matt Ball wrote:

Sorry for the long e-mail, but there are some important security issues with LRW that we need to look at before finalizing the standard. These issues won't necessarily preclude using LRW, but I think it's important to know how NOT to use LRW.
Abstract:
This document discusses ways to attack LRW through algebraic weaknesses in the Galois Multiplier. This attack becomes strong if Key2 (K2) looks similar to the plaintext (e.g. if K2 is an ASCII password). The result of this attack is to leak 128 bits of plaintext for each detected collision (similar to ECB mode). A covert channel within LRW is discussed. Lastly, an alternative mode is proposed that eliminates these weaknesses.
Alternative mode:
I've attached a picture for a slightly different tweak mode that replaces the Galois multiplier with a second AES engine. Has anyone seen a mode like this? In particular, are there any known IP claims to this mode? (I do not make any claims myself)

The original LRW paper proposed a mode that used two block cipher invocations per block of plaintext, one to produce the tweak T, and the other to compute C = E(K, P + T) + T. AFAIK the LRW authors did not patent their work - someone asked Moses Liskov about patents at the Crypto conference, and he said that they had not filed for a patent. He also didn't rule out doing that in the future, but of course at that point overseas rights were gone (since disclosure preceded any potential filing). Of course, anyone using LRW or a mode like it needs to be aware of the OCB patent.

I'm sending this out because it looks like LRW can leak information if the KEY2 looks similar to the plaintext. We may want to add an alternative mode if people want a higher security level.
Here are some advantages of this alternative mode:
- Using I = 0 is secure (This is also true with LRW if Electronic Code Book mode is secure)
- Knowing the Tweak (T) does not help in recovering Key2 (whereas in LRW, knowledge of T completely reveals Key2)
- It is possible to implement this using less hardware by reusing the AES engine. The throughput would be half as much, but this may not matter with laptop hard disks.
- Detection of collisions becomes significantly harder


Concerning LRW mode:
There is a possible security problem if Key2 is poorly chosen. In particular, if Key2 has low entropy, low hamming weight, or long runs of binary 1s or 0s (or is otherwise systematically non-random), it then becomes possible to detect situations where the AES engine encoded the same data (thus revealing 128 bits about the plaintext). The problem is especially bad if Key2 is a relatively short ASCII password. In this case, the Key2 would look very similar to ASCII data on the hard disk, making it easier to create collisions with the plaintext.
Recovering Key2 in LRW:
What happens if we take the XOR of two ciphertext blocks? Generally speaking, we should get a pseudo-random number. However, what happens when the AES engine encoded the same data? By XORing the cipherblocks, we completely remove the output of the AES engine (which was the same) and are left only with data relating to Key2 (K2) and the IV (a.k.a. I). Let's take a closer look at this:
(Let '+' be bitwise XOR, and '*' be a 128-bit Galois Field multiply; C? = Ciphertext, K2 = Key2, I? = IV, P? = plaintext; substitute '?' with 1 or 2):
From the LRW specification, we get these equations:

        C1 = (K2 * I1) + AES(K1, K2 * I1 + P1)
        C2 = (K2 * I2) + AES(K1, K2 * I2 + P2)
Now, assume that the output of the AES engine is the same for both these equations. Here's what happens when we add the two lines:
        C1 + C2 = (K2 * I1) + (K2 * I2) + (AES1 + AES2)
        C1 + C2 = (K2 * I1) + (K2 * I2) + 0             (output of AES1 and AES2 matches)
        C1 + C2 = K2 * (I1 + I2)                        (distributive law for finite fields)
Before going any further, take a look at the last line: The sum of two ciphertexts involved in a collision equals Key2 times the two Is XORed together. Now, let's assume that we are incrementing the least-significant digit of I each time. In this case, (I1 XOR I2) = 1, assuming I1 is even and I2 immediately follows I1. Note that the XOR operator will cancel out anything within I1 or I2 that matches (e.g. the drive number).
IMPORTANT POINT: This equation holds even if we use the upper bits of I for a drive number or other unique identifier.
So what this means is that for every other consecutive pair of ciphertexts, we get this equation:
        C1 + C2 = K2 * (1) = K2

It then becomes trivial to recover K2.
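The derivation above can be checked with a minimal sketch. Everything named here is an assumption made for illustration: `toy_cipher` is a hypothetical stand-in for AES (a truncated keyed hash, not a real block cipher), and `gf_mul` uses the polynomial x^128 + x^7 + x^2 + x + 1 with bit i taken as the coefficient of x^i, which may not match the bit ordering in the P1619 draft.

```python
import hashlib

POLY = (1 << 128) | 0x87  # x^128 + x^7 + x^2 + x + 1 (assumed bit ordering)

def gf_mul(a, b):
    """Multiply two elements of GF(2^128); bit i is the coefficient of x^i."""
    result = 0
    while b:
        if b & 1:
            result ^= a
        b >>= 1
        a <<= 1
        if a >> 128:  # degree reached 128, so reduce by the field polynomial
            a ^= POLY
    return result

def toy_cipher(k1, x):
    """Hypothetical stand-in for AES(K1, x). NOT AES: a truncated keyed hash."""
    data = k1.to_bytes(16, "big") + x.to_bytes(16, "big")
    return int.from_bytes(hashlib.sha256(data).digest()[:16], "big")

def lrw_encrypt(k1, k2, i, p):
    """C = (K2 * I) + E(K1, K2 * I + P), with '+' as XOR."""
    t = gf_mul(k2, i)
    return t ^ toy_cipher(k1, t ^ p)

k1 = 0x000102030405060708090A0B0C0D0E0F
k2 = int.from_bytes(b"password12345678", "big")  # a weak, ASCII-looking Key2
i1, i2 = 0x1000, 0x1001                          # consecutive, I1 even

p1 = int.from_bytes(b"ASCII plaintext!", "big")
p2 = p1 ^ gf_mul(k2, i1 ^ i2)  # the relation P1 + P2 = K2 * (I1 + I2)

c1 = lrw_encrypt(k1, k2, i1, p1)
c2 = lrw_encrypt(k1, k2, i2, p2)
leak = c1 ^ c2

print(leak.to_bytes(16, "big"))  # the cipher outputs cancel, leaving K2 * 1
```

Because I1 + I2 = 1 here, the XOR of the two ciphertexts is Key2 itself, regardless of which function stands in for the block cipher.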
Now, let's look at what happens when K2 has low hamming weight (i.e. low number of binary 1s). In general, taking (I1 + I2) will also have low hamming weight (this is because we don't vary the Is very much). Because of the nature of the 128-bit multiplier, the result of K2 * (I1 + I2) will also have low hamming weight. This is especially true if the most-significant bits of K2 are zero. In this case, we may not even need to take the modulo of the primitive polynomial after expanding the multiplication. Even if we did, the generator polynomial itself has low hamming weight and will not add too much to the final result (especially if I1 + I2 only has 1 bit set).
In this case, it becomes relatively easy to detect collisions within the AES engine by performing statistical analysis on all the combinations of the ciphertext blocks. If we want to get more sophisticated, it would be possible to completely eliminate the effects of the Galois multiply altogether by dividing each ciphertext sum by (I1 + I2):
        Compute (C1 + C2) * (I1 + I2)^-1 = K2
Since I1 and I2 are known for each cipher block, this computation becomes straightforward. In dedicated hardware, it could be possible to compute the inverse of (I1 + I2) in about 128 clock cycles through successive squaring. Frequently, (I1 + I2) will result in the same value, so it should be possible to precompute all the most common values, making a software-based attack feasible.
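As a rough sketch of the division step, the inverse can be computed by raising to the power 2^128 - 2 with repeated squaring. The helper names `gf_mul` and `gf_inv`, the bit ordering, and the sample values are assumptions for illustration, not taken from the draft.

```python
POLY = (1 << 128) | 0x87  # x^128 + x^7 + x^2 + x + 1 (assumed bit ordering)

def gf_mul(a, b):
    """Multiply two elements of GF(2^128); bit i is the coefficient of x^i."""
    result = 0
    while b:
        if b & 1:
            result ^= a
        b >>= 1
        a <<= 1
        if a >> 128:
            a ^= POLY
    return result

def gf_inv(a):
    """Inverse in GF(2^128) as a^(2^128 - 2), by square-and-multiply."""
    if a == 0:
        raise ZeroDivisionError("zero has no multiplicative inverse")
    result, power, exponent = 1, a, (1 << 128) - 2
    while exponent:
        if exponent & 1:
            result = gf_mul(result, power)
        power = gf_mul(power, power)  # successive squaring
        exponent >>= 1
    return result

# Sample values (hypothetical). 'leak' plays the role of C1 + C2 for a
# colliding pair, which equals K2 * (I1 + I2).
k2 = 0x0123456789ABCDEFFEDCBA9876543210
i1, i2 = 0x10, 0x13
delta = i1 ^ i2
leak = gf_mul(k2, delta)

recovered = gf_mul(leak, gf_inv(delta))  # (C1 + C2) * (I1 + I2)^-1
print(hex(recovered))
```

Since only a handful of (I1 + I2) values occur in practice, an attacker could cache `gf_inv(delta)` per value rather than recomputing it.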
LRW Collisions:
Now, for this attack to be useful, it has to be somewhat likely that we get a collision. A first guess would be to say that we wouldn't start getting collisions until the birthday limit of the cipher, which is around 2^64 ciphertext blocks (128 bits / 2). However, this is assuming that the input into the AES engine is a totally random number. In practice, this is not generally true. In fact, the chance of a collision greatly increases when K2 looks very similar to the plaintext (or rather the XOR of two plaintext blocks). Let's take the example of using an ASCII password for K2. Furthermore, let's assume that the hard disk mostly contains ASCII characters.
How do we create a collision? That is, how do we make the inputs of the AES engine equivalent? Here is the equation for the input into the AES engine:
        AES(K1, K2 * I + P)

For a collision, we need an I and P that match this equation:

        AES(K1, K2 * I1 + P1) = AES(K1, K2 * I2 + P2)
        K2 * I1 + P1 = K2 * I2 + P2                     (if the outputs of AES match, the inputs match)
or
        P1 + P2 = K2 * I1 + K2 * I2                     (rearrange)
        P1 + P2 = K2 * (I1 + I2)                        (distributive law)
Interestingly, we get that (I1 + I2) term again. Keep in mind that this will generally have low hamming weight, making it so K2 * (I1 + I2) will also have low hamming weight. In the case where I1 and I2 are consecutive, we get P1 + P2 = K2 (because I1 + I2 = 1).
If P1, P2 and K2 are all ASCII strings, the average entropy will be much lower than 128 bits. Some studies show that text contains about 1.5 bits of entropy per byte. If this is the case, we could expect a total entropy of about 1.5 bits * 16 bytes = 24 bits. The birthday attack would then succeed after about 2^12 = 4096 ciphertext blocks.
In practice, there's probably more entropy than 1.5 bits. However, if we say there are 4 bits of entropy per character, our birthday limit just went down from 2^64 to 2^32. It would be quite reasonable for a hard disk to contain the 64 GB of data needed to make this attack work.
Obviously, there is a lot of hand waving here. We would probably want to do a study to see what the real numbers look like.
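The back-of-the-envelope figures above can be reproduced with a few lines of arithmetic. The function name is hypothetical, and the model is the same rough one used in the text: the effective entropy of a 16-byte block is the per-byte entropy times 16, and collisions become likely after about 2^(entropy / 2) blocks.

```python
def birthday_blocks_log2(entropy_bits_per_byte, block_bytes=16):
    """log2 of the block count at which a collision becomes likely."""
    return entropy_bits_per_byte * block_bytes / 2

for bits_per_byte in (1.5, 4, 8):
    log2_blocks = birthday_blocks_log2(bits_per_byte)
    data_bytes = 2 ** log2_blocks * 16  # 16 bytes per ciphertext block
    print(f"{bits_per_byte} bits/byte: 2^{log2_blocks:g} blocks, "
          f"{data_bytes / 2**30:g} GiB of data")
```

At 4 bits per byte this gives 2^32 blocks, which at 16 bytes each is the 64 GB mentioned above.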
Here's the bottom line:
If K2 resembles the plaintext, it becomes possible to attack the data well below the theoretical birthday limit.
Ideally, the security of a cryptosystem should be completely independent of the data patterns. This is true of both CBC and CTR modes. ECB falls flat on its face in this regard. It looks like LRW has some of the same characteristics as ECB.

That's an interesting analysis. However, if the key is chosen uniformly at random, then there are no issues with LRW, right? So the concern could be alleviated by requiring that keys be uniformly random. Any mode of operation is going to have problems if its keys are low-entropy ASCII strings. If we need to assume that the user has chosen K2 poorly, then we ought to expect that they have not chosen K1 well either. In that case, it is not clear that LRW fails in a way that's more spectacular than any other mode would, AFAICT.

In practice, if there is a desire to use password-based keys, then I'd suggest that a dedicated password-to-key function be used (PKCS#5 or HMAC, for example). This is a good idea for any system that admits passwords, though of course it is getting into the subject of key management (which the WG should probably start formal work on :-) Alternatively, for LRW, the mode definition could be modified so that K2 is derived from K1, similar to what GCM, CMAC, and some other modes do.


David

Covert Channel with LRW:
If an application is running on the host computer that has knowledge of K2, it becomes possible to create a covert channel by writing blocks that intentionally create collisions. This is done by writing blocks with the relationship P1 + P2 = K2 * (I1 + I2). If several blocks are written in this manner, it becomes possible to detect this using the methods outlined above. However, in this case, it doesn't matter if K2 is strong or weak because it's possible to write several consecutive blocks that contain collisions. The attack algorithm would then search for consecutive blocks that have collisions. Using this covert channel, it is possible to send binary information by either writing blocks with collisions or without. It might even be possible to encode K1 in this manner, causing a total loss of security.
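One way such a channel might look, as a sketch under stated assumptions: the sender knows K2 and writes one pair of consecutive blocks per bit, and the observer sees only ciphertext. `toy_cipher` is a hypothetical stand-in for AES (a truncated keyed hash), the `send` and `receive` helpers are invented for this illustration, and the receiver here needs at least two colliding pairs to spot the repeated value.

```python
import hashlib
from collections import Counter

POLY = (1 << 128) | 0x87  # x^128 + x^7 + x^2 + x + 1 (assumed bit ordering)

def gf_mul(a, b):
    """Multiply two elements of GF(2^128); bit i is the coefficient of x^i."""
    result = 0
    while b:
        if b & 1:
            result ^= a
        b >>= 1
        a <<= 1
        if a >> 128:
            a ^= POLY
    return result

def toy_cipher(k1, x):
    """Hypothetical stand-in for AES(K1, x). NOT AES: a truncated keyed hash."""
    data = k1.to_bytes(16, "big") + x.to_bytes(16, "big")
    return int.from_bytes(hashlib.sha256(data).digest()[:16], "big")

def lrw_encrypt(k1, k2, i, p):
    t = gf_mul(k2, i)
    return t ^ toy_cipher(k1, t ^ p)

def send(bits, k1, k2):
    """Write one pair of consecutive blocks per bit; a 1 bit forces a collision."""
    pairs = []
    for n, bit in enumerate(bits):
        i1, i2 = 2 * n, 2 * n + 1
        p1 = n  # arbitrary filler plaintext
        p2 = p1 ^ gf_mul(k2, i1 ^ i2) if bit else p1
        pairs.append((lrw_encrypt(k1, k2, i1, p1), lrw_encrypt(k1, k2, i2, p2)))
    return pairs

def receive(pairs):
    """Ciphertext-only observer: colliding pairs all share the same XOR value."""
    xors = [c1 ^ c2 for c1, c2 in pairs]
    marker, _ = Counter(xors).most_common(1)[0]
    return [1 if x == marker else 0 for x in xors]

k1 = 0x0F0E0D0C0B0A09080706050403020100
k2 = 0x8E3A5C7719D204B6F1A0C3D5E7092B4D  # a random-looking Key2
message = [1, 0, 1, 1, 0, 1]
print(receive(send(message, k1, k2)))
```

The Key2 in this sketch is not ASCII-like, which matches the point above that the channel works whether K2 is strong or weak.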
Notes on alternative mode:
The alternative proposed mode does not have the shortcomings mentioned above. Since we have replaced the Galois multiplier with a second AES engine, we are now working with strong pseudorandom data for the tweak value. None of the simple algebraic properties hold. The birthday limit returns to 2^64 blocks because the output of the second AES engine is random. Furthermore, if it was somehow possible to recover the Tweak value (T), this would not reveal Key2 (K2). Lastly, there are no weak values for I to worry about (assuming AES is indistinguishable from an ideal cipher).
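The attached picture is not reproduced here, so the following is only a guess at the construction: the tweak is taken as T = E(K2, I) and the ciphertext as C = E(K1, P + T) + T, in line with the two-invocation form mentioned earlier in this thread. `toy_cipher` is again a hypothetical stand-in for AES (a truncated keyed hash), and `alt_encrypt` is a name invented for this sketch.

```python
import hashlib

def toy_cipher(key, x):
    """Hypothetical stand-in for AES(key, x). NOT AES: a truncated keyed hash."""
    data = key.to_bytes(16, "big") + x.to_bytes(16, "big")
    return int.from_bytes(hashlib.sha256(data).digest()[:16], "big")

def alt_encrypt(k1, k2, i, p):
    """Assumed alternative mode: T = E(K2, I), C = E(K1, P + T) + T."""
    t = toy_cipher(k2, i)  # second cipher call replaces the Galois multiply
    return t ^ toy_cipher(k1, t ^ p)

k1 = 0x000102030405060708090A0B0C0D0E0F
k2 = int.from_bytes(b"password12345678", "big")  # the same weak Key2 as before
i1, i2 = 0x1000, 0x1001

# The plaintext relation that forces a collision in LRW for consecutive I
# (P1 + P2 = K2) no longer lines up with the tweaks.
p1 = int.from_bytes(b"ASCII plaintext!", "big")
p2 = p1 ^ k2

c1 = alt_encrypt(k1, k2, i1, p1)
c2 = alt_encrypt(k1, k2, i2, p2)
print((c1 ^ c2) == k2)  # the XOR of the ciphertexts is no longer Key2
```

The trade-off noted earlier still applies: this form needs two cipher invocations per block, so a single shared engine would run at roughly half the throughput.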
Let me know what you all think.

Matt Ball
Embedded Software Engineer
Quantum Corporation
4001 Discovery Drive, Suite 1100
Boulder, CO 80303
(720) 406-5766
<LRW_Alternative.jpg>
