constantine

Commit Graph

Author	SHA1	Message	Date
Mamy Ratsimbazafy	26954f905a	Constant time (#185 ) * Implement fully constant-time division closes #2 closes #9 * constant-time hex parsing * prevent cache timing attacks in toHex() conversion (which is only for test/debug purposes anyway)	2022-02-28 09:23:26 +01:00
Mamy Ratsimbazafy	ffacf61e8a	Don't dump all in "backend" (#184 ) * backend -> math * towers -> extension fields * move ISA and compiler specific code out of math/ * fix export	2022-02-27 01:49:08 +01:00
Mamy Ratsimbazafy	5bc6d1d426	BLS signatures for Ethereum (BLS sig on BLS12-381 G2 with SHA256) (#183 ) * Finally add the (Ethereum) bls signatures (on BLS12-381 G2) * fix test path and remove old low-level signature test	2022-02-26 21:22:34 +01:00
Mamy Ratsimbazafy	fe500a6a79	Productionize: move protocols top-level vs backend (#179 ) * Productionize: move protocols top-level vs backend * fix path * import fix * the last one * benches as well	2022-02-21 01:04:53 +01:00
Mamy Ratsimbazafy	81acfb1626	Nim 1.6 in CI (#170 ) * try 1.6 CI * Try CI with 1.6 and windows. * Bend the knee * have fun debugging CI * have fun debugging CI * more CI spam * branch -> nim_version * fight or flight * properly detect windows * Fix galore * 🐍 🐍 snake: * meh give up on parallelizing windows and dealing with windows PATH issues * ¯\_ (ツ)_/¯	2022-02-20 23:44:00 +01:00
Mamy Ratsimbazafy	dc73c71801	Pairings optimizations (#178 ) * bench for cyclotomic square, exp and rename cyclotomic exp + multipairings for BLS12-377 * refactor/unify lines and cyclotomic functions * Add Karabina's compressed squaring * Use compressed squarings in final exponentiation * Weighted addchain for bn254_snarks * Add new towering options and cost functions * Rearrange bench summaries * fix BW6-761	2022-02-20 20:15:20 +01:00
Mamy Ratsimbazafy	14af7e8724	Low-level refactoring (#175 ) * Add specific fromMont conversion routine. Rename montyResidue to getMont * missed test file * Add x86_64 ASM for fromMont * Add x86_64 MULX/ADCX/ADOX for fromMont * rework Montgomery Multiplication with prefetch/latency hiding techniques * Fix ADX autodetection, closes #174. Rollback faster mul_mont attempt, no improvement and debug pain. * finalSub in fromMont & adx_bmi -> adx * Some {.noInit.} to avoid Nim zeroMem (which should be optimized away but who knows) * Uniformize name 'op+domain': mulmod - mulmont * Fix asm codegen bug "0x0000555555565930 <+896>: sbb 0x20(%r8),%r8" with Clang in final substraction * Prepare for skipping final substraction * Don't forget to copy the result when we skip the final substraction * Seems like we need to stash the idea of skipping the final substraction for now, needs bounds analysis https://eprint.iacr.org/2017/1057.pdf * fix condition for ASM 32-bit * optim modular addition when sparebit is available	2022-02-14 00:16:55 +01:00
Mamy Ratsimbazafy	53c4db7ead	Fast modular inversion (#172 ) * split modular inversion in its own file * Stash fast GCD inversion https://eprint.iacr.org/2020/972.pdf * Stash Pornin's bingcd -> issue with inner modular reduction * Implement Bernstein-Yang inversion * Avoid Nim checks on signed integers (32-bit runtime issue) * cleanup: remove old inversion impls * cleanup: static moduli, move div2 * small comments (skip ci) * comment cleanup (skip ci) * fix total iterations on 32-bit * Add batch conversion to affine coordinates using simultaneous inversion trick * fix conditional setZero and batchAffine conversion * cleanup unneeded branches following affine conversion unification * Fix batchAffine with zero inputs and add fuzz failure to test suite	2022-02-10 14:05:07 +01:00
Mamy Ratsimbazafy	404a966601	^k to ᵏ (skip ci)	2022-02-06 15:38:26 +01:00
Mamy Ratsimbazafy	50717d8de6	Test GT-subgroup for BW6-761 (#171 )	2022-01-08 17:30:26 +01:00
Mamy Ratsimbazafy	f6c02fe075	Optimized subgroup checks and cofactor clearing (#169 ) * Move cofactor clearing to dedicated per-curve subgroups file * Add BLS12-381 fast subgroup checks * Implement fast cofactor clearing for BN254_snarks * Add fast subgroup check to BN254Snarks * add BLS12_377 optimized cofactor and subgroup functions * Add BN254_Nogami * Add GT-subgroup tests * Use the new subgroup checks for Eth1 EVM precompiles	2022-01-03 14:12:58 +01:00
Mamy Ratsimbazafy	c42e2a0251	Rename NotOnTwist/OnTwist => subgroup G1 and G2	2022-01-01 19:17:04 +01:00
Mamy Ratsimbazafy	86a67013dd	glv filename -> endomorphisms	2022-01-01 17:49:26 +01:00
Mamy Ratsimbazafy	bea798e27c	Field sqrt optimization (#168 ) * add more Fp tests for Twisted Edwards curves * add fused sqrt+division bench * Significant fused sqrt+division improvement for any prime field over algorithm described in "High-Speed High-Security Signature", Bernstein et al, p15 "Fast decompression", https://ed25519.cr.yp.to/ed25519-20110705.pdf * Activate secp256k1 field benches + spring renaming of field multiplication * addition chains for inversion and sqrt of Curve25519 * Make isSquare use addition chains * add double-prec mul/square bench for <256-bit prime fields.	2022-01-01 16:19:35 +01:00
Mamy Ratsimbazafy	53f9708c2b	Initial support for Twisted Edwards curves (#167 ) * Point decoding: optimized sqrt for p ≡ 5 (mod 8) (Curve25519) * Implement fused sqrt(u/v) for twisted edwards point deserialization * Introduce twisted edwards affine * Allow declaration of curve field elements (and fight against recursive dependencies * Twisted edwards group law + tests * Add support for jubjub and bandersnatch #162 * test twisted edwards scalar mul	2021-12-29 01:54:17 +01:00
Mamy Ratsimbazafy	1195e5e980	Eth1 evm precompiles (#166 ) * Prepare support for Eth1 EVM * Implement EIP 196 (Ethereum BN254 add/mul) * Implement ETH1 pairing precompile * Accelerate isOnCurve for G2 with precomputation	2021-12-15 00:02:11 +01:00
Mamy Ratsimbazafy	f5c0b6245d	Multipairing (#165 ) * Productionize multipairings for BLS12-381 * typo * arg order + benchmark * Introduce mul_3way_sparse_sparse * cleanup MultiMiller loop * fix init sparse optimization in multimiller loop [skip ci]	2021-08-16 22:22:51 +02:00
Mamy Ratsimbazafy	979d183657	Tests for the eth2 BLS signature protocol (BLS12-381, pubkeys G1, signatures G2) using low-level primitives (#164 )	2021-08-15 11:41:46 +02:00
Mamy Ratsimbazafy	0bc228126a	hash-to-curve BLS12-381 perf (#163 ) * fp square noasm split from non-4 non-6 limbs fallback (40% speedup) * optimized cofactor clearing for BLS12-381 G2 * Support jacobian isogenies and point_add on isogenies * fuse addition and isogeny map * {.noInit.} and sparseMul * poly_eval_horner init * dedicated invsqrt + cleanup square root file * hash to field: reduce copy overhead and don't return arrays * h2c isogeny jacobian reuse pow 3 precomputed value * Fix sqrt bench	2021-08-14 21:01:50 +02:00
Mamy Ratsimbazafy	499f9605b2	Hash to curve - BLS12-381 (#110 ) * Hash to Curve: impl expand_message_xmd * Try to precompute part of hash to curve at compile-time * sha256 bench - use the new hashes module * [WIP] smoke test hash to field * Implement hash_to_field with expected output * unoptimized hash-to-curve G2 for BLS12-381 * Don't run sanitizer on hash to field as it uses GC-ed strings	2021-08-13 22:07:26 +02:00
Mamy Ratsimbazafy	aefd40f455	Square ADX (#160 ) * Add MULX/ADOX/ADCX assembly for squaring 4 limbs * Add squarings for 6 limbs * Use the new square assembly where relevant * Fix 32-bit register name and calling convention * typo * Disable MontRed ASM for 2 limbs or less	2021-02-20 13:18:49 +01:00
Mamy Ratsimbazafy	9ac9862401	Optimize Miller Loop and prepare Multi-pairing (#159 ) * Pairing with affine: align API to BLST and Gurvy and common use-case. * Implement multi-pairing / aggregate verif for BLS12-381 (+2% pairing perf) * Generalize the optimized miller loop for single pairing * Immplement the miller loop addchain for BLS12-377 * Miller addition chain for BN254-Nogami * no Miller adchain for BN254-Snarks * Update the line test with new tower https://github.com/mratsim/constantine/pull/153 * Somewhat sparse for Fp2 M-Twist * Implement line by line multiplication for Fp12 D-Twist * Somewhat sparse Mul for Fp12 D-Twist * Finish the sparse and somewhat sparse multiplications	2021-02-14 13:06:57 +01:00
Mamy Ratsimbazafy	e7296a78a8	Double-precision cubic towering + pairing (#158 ) * Double-precision cubic towering 5% perf+ * Lazy Cubic squaring, yet another 3% boost. * Implement lazy reduced inverse (but inclusive perf boost) * Double precision sparse multiplication for D-Twist ~ 2% for BN254 Nogami and Snarks curves * Implement lazy sparse mul for M-twist * Try to introduce more laziness but need bound proofs	2021-02-12 21:27:58 +01:00
Mamy Ratsimbazafy	5806cc4638	Double-Precision towering (#155 ) * consistent naming for dbl-width * Isolate double-width Fp2 mul * Implement double-width complex multiplication * Lay out Fp4 double-width mul * Off by p in square Fp4 as well :/ * less copies and stack space in addition chains * Address https://github.com/mratsim/constantine/issues/154 partly * Fix #154, faster Fp4 square: less non-residue, no Mul, only square (bit more ops total) * Fix typo * better assembly scheduling for add/sub * Double-width -> Double-precision * Unred -> Unr * double-precision modular addition * Replace canUseNoCarryMontyMul and canUseNoCarryMontySquare by getSpareBits * Complete the double-precision implementation * Use double-precision path for Fp4 squaring and mul * remove mixin annotations * Lazy reduction in Fp4 prod * Fix assembly for sum2xMod * Assembly for double-precision negation * reduce white spaces in pairing benchmarks * ADX implies BMI2	2021-02-09 22:57:45 +01:00
Mamy Ratsimbazafy	491b4d4d21	Drop nim-json-serialization for testing (#156 )	2021-02-09 22:10:16 +01:00
Mamy Ratsimbazafy	e23f990280	Tower drop concepts (#153 ) * Fix affine instantiation * drop concept from the codebase * Remove alignment requirement, this cases problem in sequences on 32-bit for t_fp12_anti_regression * slight sparse optim	2021-02-07 14:03:56 +01:00
Mamy Ratsimbazafy	258e7e516f	[WIP] Pairings for bw6 761 (#108 ) * Prepare BW6-761 pairing constants * Extract the basic miller loop from pairings * template and method call syntax issue * Layout pairing for BW6-761 * Fix rebasing woes * Try to match the paper (still buggy) * Stash BW6-761	2021-02-07 09:46:41 +01:00
Mamy André-Ratsimbazafy	5710a961a1	Rename ECP_ShortW_Proj -> ECP_ShortW_Prj	2021-02-06 16:29:53 +01:00
Mamy Ratsimbazafy	c312210878	Rework towering (#148 ) * naive removal of out-of-place mul by non residue * Use {.inline.} in a consistent manner across the codebase * Handle aliasing for quadratic multiplication * reorg optimization * Handle aliasing for quadratic squaring * handle aliasing in mul_sparse_complex_by_0y * Rework multiplication by nonresidue, assume tower and twist use same non-residue * continue rework * continue on non-residues * Remove "NonResidue " calls handle aliasing in Chung-Hasan SQR2 * Handla aliasing in Chung-Hasan SQR3 * Use one less temporary in Chung Hasan sqr2 * handle aliasing in cubic extensions * merge extension tower in the same file to reduce duplicate proc and allow better inlining * handle aliasing in cubic inversion * drop out-of-place proc from BigInt and finite fields as well * less copies in line_projective * remove a copy in fp12 by lines	2021-02-06 16:28:38 +01:00
Mamy Ratsimbazafy	83dcd988b3	FpDbl revisited (#144 ) - 7% perf improvement everywhere, up to 30% in double-width primitives * reorg mul -> limbs_double_width, ConstantineASM CttASM * Implement squaring specialized scalar path (22% faster than mul) * Implement "portable" assembly for squaring * stash part of the changes * Reorg montgomery reduction - prepare to introduce Comba optimization * Implement comba Montgomery reduce (but it's slower!) * rename t -> a * 30% performance improvement by avoiding toOpenArray! * variable renaming * Fix 32-bit imports * slightly better assembly for sub2x * There is an annoying bottleneck * use out-of-place Fp assembly instead of in-place * diffAlias is unneeded now * cosmetic * speedup fpDbl sub by 20% * Fix Fp2 -> Fp6 -> Fp12 towering. It seems 5% faster * Stash ADCX/ADOX squaring	2021-02-01 03:52:27 +01:00
Mamy Ratsimbazafy	d12d5faf21	Implement Jacobian mixed addition (#142 )	2021-01-30 14:21:55 +01:00
Mamy Ratsimbazafy	95e23339b2	Decimal conversion (#139 ) * Add constant-time fromDecimal conversion. Add warnings on intended purposes of hex/decimals * introduce setuint + cosmetic fixes Wordbitsize -> Wordbitwidth in comments * Add decimal conversion (non-constant-time) * fix comments [skip ci]	2021-01-29 20:42:36 +01:00
Mamy André-Ratsimbazafy	47daefde1f	forgot an import	2021-01-24 13:55:18 +01:00
Mamy André-Ratsimbazafy	98a4b2f91a	constant cosmetics	2021-01-24 12:57:13 +01:00
Mamy André-Ratsimbazafy	75493dfb5b	Fix #131 , inversion tests didn't take into account that the RNG can produce a 0 input and so a.inv can be different from 1	2021-01-24 12:37:02 +01:00
Mamy Ratsimbazafy	7e97cd4ac5	Fuzz fix - non-unique modular representation after Assembly negate (#137 ) * Fix #114 - Negating 0 left the prime modulus, which is working most of the time for everything except for comparison. (also somehow triggers and workaround weird compiler bug where exceptions tracking is activated in macros and all the curve enums were stringified as their ordinal value) * https://github.com/mratsim/constantine/issues/136 was also fixed, add to anti-regression * add comment in test * Fix the pure Nim fallback as well	2021-01-24 12:35:27 +01:00
Mamy Ratsimbazafy	82819b1b10	Square Root & Inversion addition chains - 20% perf increase (#132 ) * Addition chain for sqrt BLS12-381: 20% perf improvement * sqrt addchain for BN254_Snarks - 20% perf improvement as well * Fix operation count [skip ci] * BLS12-377 sqrt - 10% perf improvement * sqrt addition chain for BW6-761 - 6% speedup * BN254_Nogami inversion addchain * sqrt addchain for BN254_Nogami * Inversion addchain for BLS12-377 * inversion ddition chain for BW6-761	2021-01-23 20:55:40 +01:00
Mamy Ratsimbazafy	638cb71e16	Fr: Finite Field parametrized by the curve order (#115 ) * Introduce Fr type: finite field over curve order. Need workaround for https://github.com/nim-lang/Nim/issues/16774 * Split curve properties into core and derived * Attach field properties to an instantiated field instead of the curve enum * Workaround https://github.com/nim-lang/Nim/issues/14021, yet another "working with types in macros" is difficult https://github.com/nim-lang/RFCs/issues/44 * Implement finite field over prime order of a curve subgroup * skip OpenSSL tests on windows	2021-01-22 00:09:52 +01:00
Mamy André-Ratsimbazafy	a5c1d077fb	deal with DLL mess for OpenSSL test	2021-01-03 21:50:22 +01:00
Mamy André-Ratsimbazafy	e89429e822	SHA256 Hash function	2020-12-15 19:18:36 +01:00
Mamy Ratsimbazafy	244f58350c	Implement BW6-761 Endomorphism acceleration (#104 ) * Implement BW6-761 GLV on G1 + Psi Untwist-Frobenius-Twist * Fix frobenius constants for embedding degree != 12 * Fix test type/parsing issues * Generalize frobenius map coefficient formula * Fix Frobenius Psi generalization * Don't confuse t and trace of frobenius + update scalarMul to use Frobenius on Fp Twist * Fix ec_sage type definition * fix decription [skip ci] * update comment [skip ci] * typo * restore frobenius tests iterations	2020-10-13 23:58:35 +02:00
mratsim	1383aae105	Remove outdated TODOs [skip ci] - noinline consts: https://github.com/nim-lang/RFCs/issues/257	2020-10-11 21:33:59 +02:00
Mamy Ratsimbazafy	6530596032	Endomorphism acceleration for BN254-Nogami (#102 )	2020-10-10 18:53:48 +02:00
Mamy Ratsimbazafy	a2f46f77b7	Sage constants & tests codegen (#101 ) * Implement a Sage codegenerator for frobenius constants * Sage codegen for pairings * Autogen of endomorphism acceleration constants * The autogen fixed a copy-paste bug in lattice decomposition. We can use conditional negation now and save an add+dbl in scalar mul * small fixes * sage code for square root bls12-377 is not old * readme updates * Provide test suggestions for derive_frobenius * indentation + add equation form to sage * Sage test vector generator * Use the json vectors - includes type system workaround: generic sandwich https://github.com/nim-lang/Nim/issues/11225 - converting NimNode to typedesc: https://github.com/nim-lang/Nim/issues/6785 * Delete old sage code * Install nim-serialization and nim-json-serialization in CI * CI nimble install force yes	2020-10-10 16:19:23 +02:00
Mamy Ratsimbazafy	71bb4c799a	BW6-761 part 1 (#100 ) * Add Fp, Fp2, Fp6 support for BW6-761 * Add G1 for BW6-761 * Prepare to support G2 twists on the same field as G1 * Remove a useless dependent type for lines * Implement G2 for BW6-761 * Fix Line leftover	2020-10-09 07:51:47 +02:00
Mamy André-Ratsimbazafy	49164b66d8	fix testing canary	2020-10-05 22:20:29 +02:00
Mamy Ratsimbazafy	d622f48507	Unsed imports cleanup (#97 )	2020-10-04 17:33:17 +02:00
Mamy Ratsimbazafy	fc1c3472ce	Fused projective line eval (#96 ) * Reorg line functions to allow for Jacobian eval * 2x faster Miller loop!!! with fused line eval double * Support Line Double Fusion for D-Twists * Implement fused line addition	2020-10-04 09:39:02 +02:00
Mamy Ratsimbazafy	986245b5c1	Jacobian coordinates (#95 ) * Add projective-> affine bench * Add conditional copy and div2 benches * Fp4 benchmarks * Constant-time Jacobian addition * Jacobian doubling * Use a simpler Add+Dbl complete formula * Update tests * Fix conditional negate * Rollaback complete addition, we were only handling curve coef a == 0	2020-10-02 00:01:09 +02:00
Mamy André-Ratsimbazafy	0effd66dbd	SWei -> SHortW, weierstrass -> shortweierstrass	2020-09-27 23:02:48 +02:00

1 2 3 4

177 Commits