Re: RFR 8181594: Efficient and constant-time modular arithmetic

Xuelei Fan Sun, 11 Mar 2018 09:06:12 -0700

On 2/26/2018 10:39 AM, Adam Petcher wrote:

http://cr.openjdk.java.net/~apetcher/8181594/webrev.01/

See inline below.

On 2/23/2018 12:46 PM, Xuelei Fan wrote:
ArrayUtil.java:
===============
I'm not very sure how widely this utilities will be used in thefuture. Looks like only BigIntegerModuloP uses this classes. I mayprefer to define private methods for byte array swap inBigIntegerModuloP.
It is also used by XDHPublicKeyImpl (in the XDH code review). XDH publickeys are represented as BigInteger, and I use the array reverse methodto convert encoded keys to BigInteger.

If it is not widely used by other classes, please have these methods inthe class where is get called. The sun.security.util is exported toother modules as well, we may not want to add stuff into this packageunless it is really necessary.

MutableIntegerModuloP.java
==========================
void conditionalSwapWith(MutableIntegerModuloP b, int swap);
As the 'swap' parameter can only be 0 or 1, could it be a booleanparameter?
I couldn't come up with a way to implement this without branching whenthe swap parameter is boolean. See IntegerPolynomial.conditionalSwap tosee how this is implemented in arithmetic with an int swap argument. Ifyou (or anyone) can think of a way to do this with boolean, let me know.
I added a sentence to the comment above conditionalSwapWith thatdescribes why it is an int instead of a boolean.

I did not get the point about the need to avoid branching. Can you havemore details?

Except the conditionalSwapWith() method, I did not get the points whywe need a mutable version. Would you please have more description ofthis requirement?
The comment above the class definition has this sentence:
"This interface can be used to improve performance and avoid theallocation of a large number of temporary objects."
Do you need more information than this in the comments? The performancemotivation is so that a.add(b).multiply(c)... can be done withoutallocating a new buffer for each operation. For example, without mutablefield elements, an X25519 point multiplication would allocate around4,300 temporary arrays totaling 350,000 bytes. If I remember correctly,switching the X25519 implementation to mutable field elements reducedthe point multiplication time by about half.

I see your point.  The benefits is obviously.

OK, why you need the immutable version then? Sounds like the mutableversion interface is sufficient, including performance. If an immutableversion is really needed, we can have the implementation making thedecision. Accordingly, the conditionalSwapWith() can be defined asoptional method, if it is not required to be implemented in immutableimplementation.

It's confusing to me that the immutable and mutable and the baseversions/interfaces mixed together. It would be nice if we can simplifythe interface a little bit.

For internal APIs, sometimes we don't want the same quality level aspublic APIs. I think this set of class will be widely used by new ECcurves, ChaCha20/Poly1305, or more in the future. It would be nice ifwe could do it good at the beginning.

IntegerModuloP_Base.java
========================
default byte[] addModPowerTwo(IntegerModuloP_Base b, int len)
void addModPowerTwo(IntegerModuloP_Base b, byte[] result);
For the first sign of the method names, I thought it is to calculateas "(this + b) ^ 2 mod m".
To be precise, it calculates "((this % p) + (b % p)) % 2^m" (where p isthe prime that defines the field, and m is the desired length, in bits).Note that the addition here is normal integer addition (not addition inGF(p)).
This operation is not used in XDH, but it is used in Poly1305 to add theAES encryption of a nonce to a field element. So you can get moreinformation about this operation by reading the Poly1305 paper/RFC.

I was not meant to say the function of the method. I meant that themethod name is a little bit misleading, not very straightforward to me.

Besides, what's the benefits of the two methods?  Could we just use:
      this.add(b).asByteArray()
No, because that would calculate "((this + b) mod p) mod 2^m". The valueof (this + b) can be larger than p, so this would not produce thedesired result.
 >>
I guess, but not very sure, it is for constant time calculation. Ifthe function is required, could it be renamed as:
      // the result is inside of the size range
      IntegerModuloP addModSize(IntegerModuloP_Base b, int size)
Or
      // the result is wrapped if outside of the size range
      IntegerModuloP addOnWrap(IntegerModuloP_Base b, int size)

and the use may look like:
      this.addModSize(b, size).asByteArray()
Any attempt to perform the addition in IntegerModuloP and then pull outthe byte array will not work.

Does it mean if I perform a addition, and cannot get the byte array inthe following step?

     that = this.add(b);
     byte[] bs = that.asByteArray();          // does not work?
or
     byte[] bs = that.asByteArray(length);    // does not work?
or
     byte[] bs = that.asByteArray(byteArray); // does not work?

This class can only represent fieldelements, so the sum would be in the field, which is not what we want.

I did not get the point. If getting an add result, the add is done (sumin the field). Did you have an example that pulling out the byte arraywill not work?


Xuelei

Will review the rest when I understand more about the interfaces design.

Thanks,
Xuelei

On 1/30/2018 8:52 AM, Adam Petcher wrote:
+core-libs-dev


On 1/26/2018 4:06 PM, Adam Petcher wrote:
JBS: https://bugs.openjdk.java.net/browse/JDK-8181594
Webrev: http://cr.openjdk.java.net/~apetcher/8181594/webrev.00/
This is a code review for the field arithmetic that will be used inimplementations of X25519/X448 key agreement, the Poly1305authenticator, and EdDSA signatures. I believe that the library hasall the features necessary for X25519/X448 and Poly1305, and Iexpect at most a couple of minor enhancements will be required tosupport EdDSA. There is no public API for this library, so we canchange it in the future to suit the needs of new algorithms withoutbreaking compatibility with external code. Still, I made an attemptto clearly structure and document the (internal) API, and I want tomake sure it is understandable and easy to use.
This is not a general-purpose modular arithmetic library. It willonly work well in circumstances where the sequence of operations isrestricted, and where the prime that defines the field has someuseful structure. Moreover, each new field will require somefield-specific code that takes into account the structure of theprime and the way the field is used in the application. The initialimplementation includes a field for Poly1305 and the fields forX25519/X448 which should also work for EdDSA.
The benefits of using this library are that it is much moreefficient than using similar operations in BigInteger. Also, manyoperations are branch-free, making them suitable for use in aside-channel resistant implementation that does not branch on secrets.
To provide some context, I have attached a code snippet describinghow this library can be used. The snippet is the constant-timeMontgomery ladder from my X25519/X448 implementation, which I expectto be out for review soon. X25519/X448 only uses standard arithmeticoperations, and the more unusual features (e.g. add modulo a powerof 2) are needed by Poly1305.
The field arithmetic (for all fields) is implemented using a 32-bitrepresentation similar to the one described in the Ed448 paper[1](in the "Implementation on 32-bit platforms" section). Though myimplementation uses signed limbs, and grade-school multiplicationinstead of Karatsuba. The argument for correctness is essentiallythe same for all three fields: the magnitude of each 64-bit limb isat most 2^(k-1) after reduction, except for the last limb which mayhave a magnitude of up to 2^k. The values of k are between 26 to 28(depending on the field), and we can calculate that the maximummagnitude for any limb during an add-multiply-carry-reduce sequenceis always less than 2^63. Therefore, no overflow occurs and alloperations are correct.
Process note: this enhancement is part of JEP 324 (Key Agreementwith Curve25519 and Curve448). When this code review is complete,nothing will happen until all other work for this JEP is complete,and the JEP is accepted as part of some release. This means thatthis code will be pushed to the repo along with the X25519/X448 codethat uses it.
[1] https://eprint.iacr.org/2015/625.pdf

Re: RFR 8181594: Efficient and constant-time modular arithmetic

Reply via email to