Re: p1619.1 (tape): Link to Hash-Based Key Derivation

David McGrew Wed, 18 Jan 2006 10:21:48 -0800

A little bit more info inline:

On Jan 18, 2006, at 8:52 AM, David McGrew wrote:

Hi Matt,

nice summary of the KDF situation, more inline:

On Jan 5, 2006, at 3:03 PM, Matt Ball wrote:
Can anyone describe the NIST key-derivation algorithm? This would be very helpful in guiding the group.
I went ahead and answered my question about the NIST key derivation. It looks like this is a link to the current proposal to NIST for a standard "Key Derivation Function":
<http://www.ietf.org/internet-drafts/draft-dang-nistkdf-00.txt>
There is also some discussion about this standard on the IETF webpage. Here's a link to the e-mail list:
<http://www1.ietf.org/mail-archive/web/cfrg/current/threads.html>
Look for e-mails with the subject "Hash-Based Key Derivation". Several people from the 1619 group are also members of the IETF group, so we should be able to get some good feedback on pros and cons of a KDF.
The NIST KDF spec above defines the key derivation as follows:
Compute Hash-i = H (counter || SV {|| algorithmID} || contextID {|| SharedInfo}).
Where
counter is a 32-bit integer going from 1 to ceiling(keydatalen / hashlen),
SV is the "Secret Value", or in this case, the 256-bit AES key,
algorithmID is an optional algorithm identifier,
contextID is a combination of an identifier for both the sender and receiver,
SharedInfo  is an optional string with any additional data, and
||          is the concatenation operator
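As a rough illustration of the formula above, here is a minimal Python sketch of the counter loop. It is not taken from the draft: the function name hash_kdf is hypothetical, H is assumed to be SHA-256, keydatalen is assumed to be given in bytes, every field is assumed to arrive already encoded as bytes, and keeping the leftmost keydatalen bytes of the concatenated hashes is an assumption as well.

```python
import hashlib
import math


def hash_kdf(secret_value: bytes, context_id: bytes, keydatalen: int,
             algorithm_id: bytes = b"", shared_info: bytes = b"") -> bytes:
    """Sketch of Hash-i = H(counter || SV || algorithmID || contextID || SharedInfo).

    Assumptions (not from the draft): H is SHA-256, keydatalen is in bytes,
    and the output is the leftmost keydatalen bytes of Hash-1 || Hash-2 || ...
    """
    hashlen = hashlib.sha256().digest_size  # 32 bytes for SHA-256
    reps = math.ceil(keydatalen / hashlen)
    output = b""
    for counter in range(1, reps + 1):
        # counter is a 32-bit big-endian integer starting at 1
        block = (counter.to_bytes(4, "big") + secret_value + algorithm_id
                 + context_id + shared_info)
        output += hashlib.sha256(block).digest()
    return output[:keydatalen]
```

With a 32-byte request the loop runs exactly once, which is the single-hash case discussed next in the message.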
If we were to apply this standard to IEEE 1619.1, it would probably look something like this:
NewKey = SHA-256(0x00000001 || UserKey || Len(FormatID) || FormatID || Len(Misc) || Misc)
Since the output of SHA-256 matches the 256-bit key size, it is only necessary to run the hashing function once (that is, counter only ever equals 0x00000001).
Can anyone from the IETF group make any comments on this approach? What kind of pitfalls should we expect? Is it likely to become a standard?
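A minimal sketch of that single-hash derivation, assuming each Len() is the 32-bit big-endian byte count described later in this message and that FormatID and Misc are passed as raw bytes. The names length_prefixed and derive_key_sha256 are hypothetical.

```python
import hashlib
import struct


def length_prefixed(field: bytes) -> bytes:
    """Prepend a 32-bit big-endian byte count to the field."""
    return struct.pack(">I", len(field)) + field


def derive_key_sha256(user_key: bytes, format_id: bytes, misc: bytes) -> bytes:
    """NewKey = SHA-256(0x00000001 || UserKey || Len(FormatID) || FormatID
                        || Len(Misc) || Misc)"""
    if len(user_key) != 32:
        raise ValueError("UserKey is expected to be a 256-bit key")
    data = (b"\x00\x00\x00\x01" + user_key
            + length_prefixed(format_id) + length_prefixed(misc))
    return hashlib.sha256(data).digest()
```

The length fields keep the encoding unambiguous: without them, FormatID "AB" with Misc "C" would hash the same bytes as FormatID "A" with Misc "BC".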
Since we're getting SV from an out-of-band method, we'll need to conform to the requirements of Section 3.1. "To ensure that distinct keying material is generated, a protocol supporting secret values established out of band MUST include SharedInfo substrings with transaction or application specific information unique to this execution of the protocol." Perhaps the FormatID could be defined as part of the sharedInfo.
As an aside, it is funny that the very definition of contextID shows that the specification has been written with point-to-point communication in scope, and not data storage :-)
Another nit: algorithmID is mandatory, and it appears to be an ASN.1 OID. Which means that we need to get ASN.1 OIDs for all of our algorithms if we want to conform, as far as I can tell.

As many of you probably already know, NIST maintains an ASN object register for their crypto stuff. However, that registry appears to be quite incomplete (I see only AES-128, AES-192, and AES-256 in ECB, CBC, CFB, and OFB modes). The registry is online at http://csrc.nist.gov/csor/algorithms.htm#modules


Does anyone know if there is a more complete registry somewhere?

David

Personally, I'd rather use HMAC-SHA256 instead of just SHA256. This construct would help to reduce the entropy loss and would also make it harder to attack SHA256 if a weakness is later found in the hashing function. We could also remove the 'counter' since it's always 1. Here is how this new approach could look:
NewKey = HMAC-SHA-256(UserKey, Len(FormatID) || FormatID || Len(Misc) || Misc)
or, by the definition of HMAC:
NewKey = SHA-256((UserKey XOR 0x5C5C5C...) || SHA-256((UserKey XOR 0x363636...) || Len(FormatID) || FormatID || Len(Misc) || Misc))
where
UserKey  is the user-provided 256-bit key,
NewKey   is the derived key later used in the AES-GCM engine,
FormatID is a globally unique identifier of the tape format, and
Misc     is any other useful vendor-specific information.
Each length field would be a Big-Endian 32-bit integer indicating the number of 8-bit bytes within the following field. The FormatID and Misc fields would either be a Big-Endian Integer, or a variable length ASCII string. 'FormatID' would need to be a registered, unique identifier that globally identifies the format. 'Misc' would be any other vendor-specific information that would be useful in uniquely identifying the media.
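The HMAC variant could be sketched as follows, using Python's standard hmac module. The function names are hypothetical, and FormatID and Misc are assumed to be passed as raw bytes. The second function spells out the HMAC construction by hand, with the key zero-padded to the 64-byte SHA-256 block size before the XOR, purely to show that both forms produce the same key.

```python
import hashlib
import hmac
import struct

BLOCK_SIZE = 64  # SHA-256 input block size in bytes


def length_prefixed(field: bytes) -> bytes:
    """Prepend a 32-bit big-endian byte count to the field."""
    return struct.pack(">I", len(field)) + field


def derive_key_hmac(user_key: bytes, format_id: bytes, misc: bytes) -> bytes:
    """NewKey = HMAC-SHA-256(UserKey, Len(FormatID) || FormatID || Len(Misc) || Misc)"""
    message = length_prefixed(format_id) + length_prefixed(misc)
    return hmac.new(user_key, message, hashlib.sha256).digest()


def derive_key_hmac_expanded(user_key: bytes, format_id: bytes, misc: bytes) -> bytes:
    """Same derivation written out from the definition of HMAC.

    Only valid as written for keys no longer than BLOCK_SIZE (a 32-byte
    UserKey qualifies); longer keys would have to be hashed first.
    """
    padded = user_key.ljust(BLOCK_SIZE, b"\x00")
    inner_key = bytes(b ^ 0x36 for b in padded)
    outer_key = bytes(b ^ 0x5C for b in padded)
    message = length_prefixed(format_id) + length_prefixed(misc)
    inner = hashlib.sha256(inner_key + message).digest()
    return hashlib.sha256(outer_key + inner).digest()
```

In practice only the first function would be needed; the expanded form is there to make the nesting of the two hash calls explicit.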
Is this approach secure?  Is there too much entropy loss?  Comments?
If we're deriving many keys from the master key, then I agree that HMAC is preferable. I'm not sure if that's the case, though. Your proposal looks fine to me (though I'm not exactly sure on what "Format" would contain).
David
Thanks,
-Matt
