Re: JCA design for RFC 7748

Michael StJohns Thu, 17 Aug 2017 10:45:41 -0700

See inline.

On 8/17/2017 11:19 AM, Adam Petcher wrote:

On 8/16/2017 3:17 PM, Michael StJohns wrote:
On 8/16/2017 11:18 AM, Adam Petcher wrote:
My intention with this ByteArrayValue is to only use it forinformation that has a clear semantics when represented as a bytearray, and a byte array is a convenient and appropriaterepresentation for the algorithms involved (so there isn't a lot ofunnecessary conversion). This is the case for public/private keys inRFC 7748/8032:
1) RFC 8032: "An EdDSA private key is a b-bit string k." "The EdDSApublic key is ENC(A)." (ENC is a function from integers tolittle-endian bit strings.
Oops, minor correction. Here A is a point, so ENC is a function frompoints to little-endian bit strings.
2) RFC 7748: "Alice generates 32 random bytes in a[0] to a[31] andtransmits K_A =X25519(a, 9) to Bob..." The X25519 and X448functions, as described in the RFC, take bit strings as input andproduce bit strings as output.
Thanks for making my point for me. The internal representation ofthe public point is an integer. It's only when encoding or decodingthat it gets externally represented as an array of bytes. (And yes,I understand that the RFC defines an algorithm using little endianbyte array representations of the integers - but that's theimplementation's call, not the API).
With respect to the output of the KeyAgreement algorithm - your (2)above, the transmission representation (e.g. the encoded public key)is little endian byte array representation of an integer. Theinternal representation is - wait for it - integer.
I have no problems at all with any given implementation using littleendian math internally. For the purposes of using JCA, stick withBigInteger to represent your integers. Use your provider encodingmethods to translate between what the math is internally and what thebits are externally if necessary. Implement the conversion methodsfor the factory and for dealing with the existing EC classes. Maybeget BigInteger to be extended to handle (natively) littleEndianrepresentation (as well as fixed length outputs necessary for thingslike ECDH).
All good points, and I think BigInteger may be a reasonablerepresentation to use for public/private key values. I'm just not surethat it is better than byte arrays. I'll share some relevantinformation that affects this decision.
First off, one of the goals of RFC 7748 and 8032 is to address some ofthe implementation challenges related to ECC. These algorithms aredesigned to eliminate the need for checks at various stages, and togenerally make implementation bugs less likely. These improvements aremotivated by all the ECC implementation bugs that have emerged in thelast ~20 years. I mention this because I think it is important that wechoose an API and implementation that allows us to benefit from theseimprovements in the standards. That means we shouldn't necessarilyfollow all the existing ECC patterns in the API and implementation.

No - it means that the authors of the RFCs have a bias to have theircode be the only code. As I note below I don't actually think they goteverything right. The underlying math is really what matters, and theAPI should be able to handle any implementation that gets the math correct.

Specifically, these standards have properties related to byte arrayslike: "The Curve25519 function was carefully designed to allow all32-byte strings as Diffie-Hellman public keys."[1]

This statement is actually a problem. Valid keys are in the range of 1to p-1 for the field (with some additional pruning). 32 byte strings(or 256 bit integers) do not map 1-1 into that space. E.g. there aresome actual canonical keys where multiple (at least 2) 32 byte stringsmap to them. (See the pruning and clamping algorithms). The NISTprivate key generation for EC private keys mitigates this bias by either(a) repeatedly generating random keys until you get one in the range or(b) generating a key stream with extra (64) bits and reducing that mod pof the curve.

If we use representations other than byte strings in the API, then weshould ensure that our representations have the same properties (e.g.every BigInteger is a valid public key).
It's best to talk about each type on its own. Of course, one of thebenefits of using bit strings is that we may have the option of usingthe same class/interface in the API to hold all of these.
RFC 7748 public keys: I think we can reasonably use BigInteger to holdpublic key values. One minor issue is that we need to specify howimplementations should handle non-canonical values (numbers that areless than 0 or greater than p-1). This does not seem like a hugeissue, though, and the existing ECC API has the same issue. Anotherminor issue is that modeling this as a BigInteger may encourageimplementations to use BigInteger in the RFC 7748 Montgomery ladder.This would be unfortunate because it would leak sensitive informationthrough timing channels.

When you do the conversion from a key spec to a key you do what you'resupposed to do - e.g. I think its mod p to reduce it. Either that oryou throw an error. That's implementation side so not a big problemexcept that the documentation should explain what is supposed to happen.

RFC 7748 private keys: This one is a bit more difficult. RFC 7748defines a "clamping" operation that ensures that the integerscorresponding to bit strings have certain properties (e.g. they are amultiple of the cofactor). So if we use BigInteger for private keys inthe API, we need to specify whether the value is clamped or unclamped.If an unclamped value is treated as clamped, then this can result insecurity and correctness issues. Also, the RFC treats private keys asbit strings---they are not used in any integer operations. So modelingthem with byte arrays seems just as valid as modeling them withBigInteger.

Nope. The private keys are actually integers - the first thing that isdone to the bit string is to "decodeLittleEndian". Any programmer worththeir salary is going to do this once on input. The implementation stuffdescribed in the RFC then does big integer math on the bytes.

I assume that the PKCS8 conventions will use the bit string - butinternally, this is going to be an integer of some sort. It would benice if BigInteger had support for input/output of little endian values- but it doesn't. I expect that anyone who implements this set ofcurves will probably extend BigInteger to make little endian supportjust work. Externally, the BigInteger in/out stuff (e.g. key spec's)would then be completely backwards compatible.

In any event - please don't confuse the suggested implementation of thevarious RFCs and the various external representations with the actualunderlying math.

RFC 8042 public keys: The analysis here is similar to RFC 7748 publickeys, except we also need to store the (probably compressed) xcoordinate. So if we don't use byte arrays, we would need to usesomething like ECPoint.

Yup - see my previous email on how to handle this.

RFC 8032 private keys: These are definitely bit strings, and modelingthem as integers doesn't make much sense. The only thing that is everdone with these private keys is that they are used as input to a hashfunction.

Again - no. The actual private key is what you get after stage 3 ofsection 5.1.5. E.g. generate a random string of 32 bytes. Hash it tohelp with the bad random generators (*sheesh*), Interpret the hashafter pruning as a little endian integer. Any programmer worth theirsalary is going to do steps 1-3 once at generation and store the privatekey as that pruned value. An equivalent (and possibly stronger)generation function would be to randomly generate 320 bits as aninteger, take that mod p of the curve and then do any additionalpruning. That reduces the bias introduced by generating only 256 bitsfrom the hash and immediately throwing away the last bit (MSBit of theMSByte in little endian terms).

Bernstein et al hide a lot under the covers in the RFCs, but this isinteger and point math and there's nothing special about it. Forcing thetransmission formats to be little endian when every other public keysystem uses big endian (and trying to hide that by calling them byte andbit strings) seems to be short sighted, but its what we got. But youshouldn't confuse the RFC defined external encodings to be the API youneed for JCA. Especially if yet another edwards RFC comes alongspecifying Big Endian encoding for a different curve type. (Orinterleaved bytes or something that makes sense for a highly parallelprocessing regime but translates poorly to an on the wire representation).

These are Integers (private scalars) and ECPoints. The curve parameterset needs mapping to the JCA API - but the curves are not anything special.


Mike


[1] https://cr.yp.to/ecdh.html

Re: JCA design for RFC 7748

Reply via email to