Re: KDF API review, round 2

Michael StJohns Mon, 27 Nov 2017 10:11:50 -0800

On 11/27/2017 1:03 AM, Jamil Nimeh wrote:

One additional topic for discussion: Late in the week we talkedabout the current state of the API internally and one item torevisit is where the DerivationParameterSpec objects are passed. Itwas brought up by a couple people that it would be better to providethe DPS objects pertaining to keys at the time they are called forthrough deriveKey() and deriveKeys() (and possibly deriveData).
Originally we had them all grouped in a List in the init method. Onereason for needing it up there was to know the total length ofmaterial to generate. If we can provide the total length throughthe AlgorithmParameterSpec passed in via init() then things like:
Key deriveKey(DerivationParameterSpec param);
List<Key> deriveKeys(List<DerivationParameterSpec> params);
become possible. To my eyes at least it does make it more clearwhat DPS you're processing since they're provided at derive time,rather than the caller having to keep track in their heads where inthe DPS list they might be with each successive deriveKey orderiveKeys calls. And I think we could do away withderiveKeys(int), too.
See above - the key stream is logically produced in its entiretybefore any assignment of that stream is made to any cryptographicobjects because the mixins (except for the round differentiator) arethe same for each key stream production round. Simply passing inthe total length may not give you the right result if the KDFrequires a per component length (and it should to defeat (5) or itshould only produce a single key).
From looking at 800-108, I don't see any place where the KDF needs aper-component length. It looks like it takes L (total length) as aninput and that is applied to each round of the PRF. HKDF takes Lup-front as an input too, though it doesn't use it as an input to theHMAC function itself. For TLS 1.3 that component length becomes partof the context info (HkdfLabel) through the HKDF-Expand-Labelfunction...and it's only doing one key for a given label which is alsopart of that context specific info, necessitating an init() call. Seems like the length can go into the APS provided via init (for thoseKDFs that need it at least) and you shouldn't need a DPS list up-front.

HKDF and SP800-108 only deal with the creation of the key stream andignore the issues with assigning the key stream to cryptographicobjects. In the TLS version of HDKF, the L value is mandatory and onlya single object is assigned per init/call to the KDF. An HSM can lookat the HKDF label information and set the appropriate policies for theassigned cryptographic object (because if any of the label data changes,the entire key stream changes). That's not the case for the raw HKDFnor for any KDF that allows for multiple objects to be extracted out ofa single key stream. Hence the per-component length values.

Ideally, there should be a complete object spec for each object to begenerated that is part of the mixins (label and context) for any KDF. That allows an HSM to rely upon the object spec when setting policycontrols for each generated object - and incidentally allows for a KDFto generate both public and non-public data in a secure way.

So as long as you allow for the specification of all of the productionobjects as part of the .init() I'm good. A given KDF might not requirethis - but I can't see any way of fixing the current KDFs to work inHSMs without something like this.

As far as your (5) scenario goes, I can see how you can twiddle thelengths to get the keystream output with zero-length keys and large IVbuffers. But that scenario really glosses over what should be a bighurdle and a major access control issue that stands outside the KDFAPI: That the attacker shouldn't have access to the input keyingmaterial in the first place. Protect the input keying materialproperly and their attack cannot be done.

Let me give you an example. I'm running an embedded HSM - to protectTLS keys and to do all of the crypto. An attacker compromises the TLSserver and now has access to the HSM. No problem - I'm going to noticeif the attacker starts extraditing large amounts of data from the server(e.g. copies of the TLS in the clear but possibly reencrypted datastream) so this isn't a threat or is it? Smart attacker does anextraction attack on the TLS 1.2 and before KDF and turns all of the keystream material into IV material and exports it from the HSM. Theattacker now has the much smaller key material so he can send a fewmessages with those keys and allow for the passive external interceptionof the traffic and decryption thereof without the risk of detection ofall that traffic being sent. Alternately, I can place the key materialin a picture via steganography and publish it as part of the server data.

The idea is to protect extraction of the key material from an HSM _*evenfrom authorized users of that key material*_.

KDFs don't currently do this well. Adding the overall length and percomponent length stuff as well as a per component spec to the data usedto derive the key stream means that 1) changes to any of those changethe entire key stream, 2) the per component spec data may be used by thesecurity module policy engine to enforce restrictions and 3) because of(1) and (2) calling the KDF a second time gets me exactly the sameobjects rather than just the same key stream. The last isn't veryimportant in a software based security domain, but turns out to havereal implications for policy enforcing security modules.

This gets worse when you realize that the KDF key is under it all eithera HASH HMAC or CMAC key and all of those algorithms produce publicdata. Ideally you need a way of preventing a KDF key from calling theraw HASH/HMAC/CMAC functions directly (and vice versa).

I would rather see the DPS provided in the deriveKey. It couples whatyou want out with the call that makes the object and it makes a lotmore sense to keep those two together than try to remember where inthe submitted list of DPS objects you are.
95% of the time this will be a call to produce a single key. 4% ofthe time it will be a call to produce multiple keys. Only 1% of thetime will it need to intermix key, data and object productions.Anybody who is doing that is going to write a wrapper around thisclass to make sure they get the key and data production order correctfor each call. So I'm not all that bothered by keeping thecomplexity as a price for keeping flexibility.
You could have a Key deriveKey(Key k, DerivationParameterSpec param)for some things like TLS1.3 (where you can only make a single call toderive key between inits) , but then you'd also need at least abyte[] deriveData (Key k, DerivationParameterSpec param) and anObject deriveObject(Key k, DerivationParameterSpec param).
I don't think those are necessary. If you're just doing HKDF-Expand(for the HKDF-Expand-Label TLS 1.3 key derivation) then you canprovide the input key, label and max length and any other context infothat goes into that HkdfLabel structure...all of that would go intoinit(). Then provide the key alg and desired length via the DPS atderiveKey time. Any subsequent keys in the TLS 1.3 key schedule wouldneed a new init call anyway since the labels change and possibly theoutput length.
Over the next day or so I'm going to have to make some final decisionson this API as there are internal projects that are waiting on thisAPI to proceed. I'm already past the cut-off date I set, but Irecognize these discussions are important to have and I appreciate theinput you and others have provided.
--Jamil

Reading this last I think I've lost the context. Here's where I thinkwe are:

1) Get instance gets the default configuration of a given KDF (and thatdefault will be attached to the instance name defintion)

2) .setParameter() may be used to update the KDF configuration - once.

3) .init() takes at least the key, it may optionally take a set ofderivation parameters. The derivation parameters provided in .init()are intended for use in forming the label and context mixins for theKDF. They may provide - for example - the total length of the keystream, the objects to be derived, the length of the objects, protectionparameters for each of the objects etc.4) A kdf generate a free-running or fixed length key stream depending onthe derivation parameters (e.g. if "L" is not a mixin to the KDF then itis free-running and may produce as much key stream as desired or if theproduction object specifications are not part of the derivation mixins).

Doing (4) is mostly not a good idea, but someone might want to dothis. In that case it may make the most sense to just allow them to doderiveData(int length) calls as the only function (a keyed PRNG basically).

Re the last version of your api - if you add the .setParameter().getParameter() calls to both KeyDerivation and KeyDerivationSpi I thinkI'm happy with this part of the API. I'm wondering if we should talkabout KeyAgreement though.

Re: KDF API review, round 2

Reply via email to