constantine

Commit Graph

Author	SHA1	Message	Date
Mamy Ratsimbazafy	cbb454fff1	Codecs (#217 ) * create a codecs.nim file for hex/base64 and other encoding conversions * improve maintenance/readability of hex conversion * add skeleton of constant-time base64 decoding * use raw casts * use raw casts only for same size types	2023-02-07 13:10:17 +01:00
Mamy Ratsimbazafy	495ef4497b	Parallel batchadd (#215 ) * [Threadpool] Fix syncAll releasing while a thread was attempting to steal + force no exception in tasks * fix unguarded access on MacOS barriers * parallel batchadd * moved import	2023-01-29 01:06:37 +01:00
Mamy Ratsimbazafy	ff8c26c1fe	BLS Aggregate and Batch verify (#214 ) * pairing -> pairings, and use alloca arrays instead of static arrays * aggregate and batched BLS signature * DLL generation broken by path changes	2023-01-27 00:42:12 +01:00
Mamy Ratsimbazafy	188f3e710c	add fast_aggregate_verify	2023-01-23 01:54:40 +01:00
Mamy Ratsimbazafy	4be89d309f	chore: remove stew/byteutils dependencies and unneeded imports	2023-01-12 20:25:57 +01:00
Mamy Ratsimbazafy	4052a07611	chore: cleanup TODOs, unused constants	2023-01-12 01:27:23 +01:00
Mamy Ratsimbazafy	1f4bb174a3	[Backend] Add support for Nvidia GPUs (#210 ) * Add PoC of JIT exec on Nvidia GPUs [skip ci] * Split GPU bindings into low-level (ABI) and high-level [skip ci] * small typedef reorg [skip ci] * refine LLVM IR/Nvidia GPU hello worlds * [Nvidia GPU] PoC implementation of field addition [skip ci] * prod-ready field addition + tests on Nvidia GPUs via LLVM codegen	2023-01-12 01:01:57 +01:00
Mamy Ratsimbazafy	c0b30a08be	style: casing of WordBitWidth/WordBitwidth	2023-01-11 19:31:23 +01:00
Mamy Ratsimbazafy	928f515582	Batch additions (#207 ) * Batch elliptic curve addition * accelerate chained muls * jac mixed add handle doubling. jac additions handle aliasing when adding infinity * properly skip sanitizer on BLS signature test * properly skip sanitizer² on BLS signature test	2022-10-29 22:43:40 +02:00
Mamy Ratsimbazafy	351a3f6bd2	Sha256 refactor (#206 ) * sha256: separate message scheduling and state updates to help implement specific use-cases like #205; also implement SSSE3 acceleration (2006, Intel Core 2 Duo) * sha256: simplify update flow, store less metadata in context * sha256: Fix reworked update function * Implement x86 hardware SHA acceleration * typo	2022-09-19 02:02:57 +02:00
Mamy Ratsimbazafy	fb594c5938	OpenSSL upstream: no more SHA256 public function :/, skip in Windows CI	2022-09-19 01:22:01 +02:00
Mamy Ratsimbazafy	df048112c3	Example+Test C API vs GMP (#203 ) * Example+Test C API vs GMP * Create build directory for bindings test * --nimMainPrefix is 1.6 only * Add libdl for dynamic loading * absolute paths * add static link test * Fix man main, rename Nimmain to init_NimMain * Deal with MacOS annoying linker w.r.t. static libraries * use .exe extension to satisfy windows (?) * annoying GCC which doesn't create paths * Try skipping DLL test on windows * windows extensions ... * no lib prefix on windows	2022-09-15 17:11:57 +02:00
Mamy Ratsimbazafy	962e7ccf49	CI: enable GMP tests on Windows and Linux 32-bit and fix caching (#204 ) * Try to compile with GMP on windows and 32-bit linux * remove leftover msys shell * Don't use GMP Mersenne Twister, bad randomness and untested Nim wrapper * properly cache nim * fix path after cache * run pacman in msys2 env * rework msys2 ... again * shell compat for file clearing * shell compat try-again for file clearing * force bash for clearing parallel builds on windows * Use nimscript directly (why didn't it work last time?) * Avoid IO redirection to support any shell * Avoid IO redirection v2 to support any shell * add debug data * add debug again * Introduce pararun, a parallel test runner to remove need of GNU parallel * pararun: style	2022-09-15 09:33:34 +02:00
Mamy Ratsimbazafy	094445482b	Eip2333 (#202 ) * HMAC-SHA256 * EIP2333 * activate EIP2333 tests and faster random test case generation	2022-08-16 12:07:57 +02:00
Mamy Ratsimbazafy	9770b3108c	Fp12 over fp6 (#201 ) * introduce sumprod for direct fp6_mul * change curves -> constants * forgotten constants * Full pairing using Fp2->Fp6->Fp12 towering	2022-08-14 09:48:10 +02:00
Mamy Ratsimbazafy	99c9730793	Self-contained bindings generation (#196 ) * First draft at bindings generation * finite field bindings PoC * support openarray, export NimMain * PoC extension fields and elliptic curve bindings * Pasta * expose more bindings, remove nimZeroMem, remove tracer when unused, codegen name_mangling`gensym issue * workaround bad C gensym codegen with {.inline.} pragma in non-dirty template nested in generic proc instantiated by template	2022-08-06 19:05:54 +02:00
Mamy Ratsimbazafy	e29e529f18	Add multipairing for BN curves (#194 )	2022-05-08 19:01:23 +02:00
Mamy Ratsimbazafy	39a8a413de	Pasta curves (#191 ) * Pasta curves field arithmetic * implement elliptic curve arith for the Pasta curves	2022-04-27 00:58:48 +02:00
Mamy Ratsimbazafy	e9e7a1809c	BN254 - Hash-to-Curve (SVDW method) (#190 ) * Hash to BN254-Snarks * Test SVDW code path with old v7 vectors for BLS12-381 * add benches	2022-04-26 21:24:07 +02:00
Mamy Ratsimbazafy	65eedd1cf7	Hash-to-Curve BLS12-381 G1 (#189 ) * Skeleton of hash to curve for BLS12-381 G1 * Remove isodegree parameter * Fix polynomial evaluation of hashToG1 * Optimize hash_to_curve and add bench for hash to G1 * slight optim of jacobian isomap + v7 test vectors	2022-04-11 00:57:16 +02:00
Mamy Ratsimbazafy	bde4f97b56	Line refactor (#188 ) * Align line evaluations to papers notations * Adjust line fusion op * precompute G2 b' for costly D-Twists	2022-04-04 10:10:36 +02:00
Mamy Ratsimbazafy	742cecce08	Poly1305 Message Authentication Code (#186 ) * Groundwork for Poly1305 MAC * Implement fast reduction for Poly1305 * don't import assembly files when compiling without assembly	2022-03-05 23:39:24 +01:00
Mamy Ratsimbazafy	c2eb42b769	Add ChaCha20 stream cipher	2022-03-02 01:18:47 +01:00
Mamy Ratsimbazafy	26954f905a	Constant time (#185 ) * Implement fully constant-time division closes #2 closes #9 * constant-time hex parsing * prevent cache timing attacks in toHex() conversion (which is only for test/debug purposes anyway)	2022-02-28 09:23:26 +01:00
Mamy Ratsimbazafy	ffacf61e8a	Don't dump all in "backend" (#184 ) * backend -> math * towers -> extension fields * move ISA and compiler specific code out of math/ * fix export	2022-02-27 01:49:08 +01:00
Mamy Ratsimbazafy	5bc6d1d426	BLS signatures for Ethereum (BLS sig on BLS12-381 G2 with SHA256) (#183 ) * Finally add the (Ethereum) bls signatures (on BLS12-381 G2) * fix test path and remove old low-level signature test	2022-02-26 21:22:34 +01:00
Mamy Ratsimbazafy	fe500a6a79	Productionize: move protocols top-level vs backend (#179 ) * Productionize: move protocols top-level vs backend * fix path * import fix * the last one * benches as well	2022-02-21 01:04:53 +01:00
Mamy Ratsimbazafy	81acfb1626	Nim 1.6 in CI (#170 ) * try 1.6 CI * Try CI with 1.6 and windows. * Bend the knee * have fun debugging CI * have fun debugging CI * more CI spam * branch -> nim_version * fight or flight * properly detect windows * Fix galore * 🐍 🐍 snake: * meh give up on parallelizing windows and dealing with windows PATH issues * ¯\_ (ツ)_/¯	2022-02-20 23:44:00 +01:00
Mamy Ratsimbazafy	dc73c71801	Pairings optimizations (#178 ) * bench for cyclotomic square, exp and rename cyclotomic exp + multipairings for BLS12-377 * refactor/unify lines and cyclotomic functions * Add Karabina's compressed squaring * Use compressed squarings in final exponentiation * Weighted addchain for bn254_snarks * Add new towering options and cost functions * Rearrange bench summaries * fix BW6-761	2022-02-20 20:15:20 +01:00
Mamy Ratsimbazafy	14af7e8724	Low-level refactoring (#175 ) * Add specific fromMont conversion routine. Rename montyResidue to getMont * missed test file * Add x86_64 ASM for fromMont * Add x86_64 MULX/ADCX/ADOX for fromMont * rework Montgomery Multiplication with prefetch/latency hiding techniques * Fix ADX autodetection, closes #174. Rollback faster mul_mont attempt, no improvement and debug pain. * finalSub in fromMont & adx_bmi -> adx * Some {.noInit.} to avoid Nim zeroMem (which should be optimized away but who knows) * Uniformize name 'op+domain': mulmod - mulmont * Fix asm codegen bug "0x0000555555565930 <+896>: sbb 0x20(%r8),%r8" with Clang in final substraction * Prepare for skipping final substraction * Don't forget to copy the result when we skip the final substraction * Seems like we need to stash the idea of skipping the final substraction for now, needs bounds analysis https://eprint.iacr.org/2017/1057.pdf * fix condition for ASM 32-bit * optim modular addition when sparebit is available	2022-02-14 00:16:55 +01:00
Mamy Ratsimbazafy	53c4db7ead	Fast modular inversion (#172 ) * split modular inversion in its own file * Stash fast GCD inversion https://eprint.iacr.org/2020/972.pdf * Stash Pornin's bingcd -> issue with inner modular reduction * Implement Bernstein-Yang inversion * Avoid Nim checks on signed integers (32-bit runtime issue) * cleanup: remove old inversion impls * cleanup: static moduli, move div2 * small comments (skip ci) * comment cleanup (skip ci) * fix total iterations on 32-bit * Add batch conversion to affine coordinates using simultaneous inversion trick * fix conditional setZero and batchAffine conversion * cleanup unneeded branches following affine conversion unification * Fix batchAffine with zero inputs and add fuzz failure to test suite	2022-02-10 14:05:07 +01:00
Mamy Ratsimbazafy	404a966601	^k to ᵏ (skip ci)	2022-02-06 15:38:26 +01:00
Mamy Ratsimbazafy	50717d8de6	Test GT-subgroup for BW6-761 (#171 )	2022-01-08 17:30:26 +01:00
Mamy Ratsimbazafy	f6c02fe075	Optimized subgroup checks and cofactor clearing (#169 ) * Move cofactor clearing to dedicated per-curve subgroups file * Add BLS12-381 fast subgroup checks * Implement fast cofactor clearing for BN254_snarks * Add fast subgroup check to BN254Snarks * add BLS12_377 optimized cofactor and subgroup functions * Add BN254_Nogami * Add GT-subgroup tests * Use the new subgroup checks for Eth1 EVM precompiles	2022-01-03 14:12:58 +01:00
Mamy Ratsimbazafy	c42e2a0251	Rename NotOnTwist/OnTwist => subgroup G1 and G2	2022-01-01 19:17:04 +01:00
Mamy Ratsimbazafy	86a67013dd	glv filename -> endomorphisms	2022-01-01 17:49:26 +01:00
Mamy Ratsimbazafy	bea798e27c	Field sqrt optimization (#168 ) * add more Fp tests for Twisted Edwards curves * add fused sqrt+division bench * Significant fused sqrt+division improvement for any prime field over algorithm described in "High-Speed High-Security Signature", Bernstein et al, p15 "Fast decompression", https://ed25519.cr.yp.to/ed25519-20110705.pdf * Activate secp256k1 field benches + spring renaming of field multiplication * addition chains for inversion and sqrt of Curve25519 * Make isSquare use addition chains * add double-prec mul/square bench for <256-bit prime fields.	2022-01-01 16:19:35 +01:00
Mamy Ratsimbazafy	53f9708c2b	Initial support for Twisted Edwards curves (#167 ) * Point decoding: optimized sqrt for p ≡ 5 (mod 8) (Curve25519) * Implement fused sqrt(u/v) for twisted edwards point deserialization * Introduce twisted edwards affine * Allow declaration of curve field elements (and fight against recursive dependencies * Twisted edwards group law + tests * Add support for jubjub and bandersnatch #162 * test twisted edwards scalar mul	2021-12-29 01:54:17 +01:00
Mamy Ratsimbazafy	1195e5e980	Eth1 evm precompiles (#166 ) * Prepare support for Eth1 EVM * Implement EIP 196 (Ethereum BN254 add/mul) * Implement ETH1 pairing precompile * Accelerate isOnCurve for G2 with precomputation	2021-12-15 00:02:11 +01:00
Mamy Ratsimbazafy	f5c0b6245d	Multipairing (#165 ) * Productionize multipairings for BLS12-381 * typo * arg order + benchmark * Introduce mul_3way_sparse_sparse * cleanup MultiMiller loop * fix init sparse optimization in multimiller loop [skip ci]	2021-08-16 22:22:51 +02:00
Mamy Ratsimbazafy	979d183657	Tests for the eth2 BLS signature protocol (BLS12-381, pubkeys G1, signatures G2) using low-level primitives (#164 )	2021-08-15 11:41:46 +02:00
Mamy Ratsimbazafy	0bc228126a	hash-to-curve BLS12-381 perf (#163 ) * fp square noasm split from non-4 non-6 limbs fallback (40% speedup) * optimized cofactor clearing for BLS12-381 G2 * Support jacobian isogenies and point_add on isogenies * fuse addition and isogeny map * {.noInit.} and sparseMul * poly_eval_horner init * dedicated invsqrt + cleanup square root file * hash to field: reduce copy overhead and don't return arrays * h2c isogeny jacobian reuse pow 3 precomputed value * Fix sqrt bench	2021-08-14 21:01:50 +02:00
Mamy Ratsimbazafy	499f9605b2	Hash to curve - BLS12-381 (#110 ) * Hash to Curve: impl expand_message_xmd * Try to precompute part of hash to curve at compile-time * sha256 bench - use the new hashes module * [WIP] smoke test hash to field * Implement hash_to_field with expected output * unoptimized hash-to-curve G2 for BLS12-381 * Don't run sanitizer on hash to field as it uses GC-ed strings	2021-08-13 22:07:26 +02:00
Mamy Ratsimbazafy	aefd40f455	Square ADX (#160 ) * Add MULX/ADOX/ADCX assembly for squaring 4 limbs * Add squarings for 6 limbs * Use the new square assembly where relevant * Fix 32-bit register name and calling convention * typo * Disable MontRed ASM for 2 limbs or less	2021-02-20 13:18:49 +01:00
Mamy Ratsimbazafy	9ac9862401	Optimize Miller Loop and prepare Multi-pairing (#159 ) * Pairing with affine: align API to BLST and Gurvy and common use-case. * Implement multi-pairing / aggregate verif for BLS12-381 (+2% pairing perf) * Generalize the optimized miller loop for single pairing * Immplement the miller loop addchain for BLS12-377 * Miller addition chain for BN254-Nogami * no Miller adchain for BN254-Snarks * Update the line test with new tower https://github.com/mratsim/constantine/pull/153 * Somewhat sparse for Fp2 M-Twist * Implement line by line multiplication for Fp12 D-Twist * Somewhat sparse Mul for Fp12 D-Twist * Finish the sparse and somewhat sparse multiplications	2021-02-14 13:06:57 +01:00
Mamy Ratsimbazafy	e7296a78a8	Double-precision cubic towering + pairing (#158 ) * Double-precision cubic towering 5% perf+ * Lazy Cubic squaring, yet another 3% boost. * Implement lazy reduced inverse (but inclusive perf boost) * Double precision sparse multiplication for D-Twist ~ 2% for BN254 Nogami and Snarks curves * Implement lazy sparse mul for M-twist * Try to introduce more laziness but need bound proofs	2021-02-12 21:27:58 +01:00
Mamy Ratsimbazafy	5806cc4638	Double-Precision towering (#155 ) * consistent naming for dbl-width * Isolate double-width Fp2 mul * Implement double-width complex multiplication * Lay out Fp4 double-width mul * Off by p in square Fp4 as well :/ * less copies and stack space in addition chains * Address https://github.com/mratsim/constantine/issues/154 partly * Fix #154, faster Fp4 square: less non-residue, no Mul, only square (bit more ops total) * Fix typo * better assembly scheduling for add/sub * Double-width -> Double-precision * Unred -> Unr * double-precision modular addition * Replace canUseNoCarryMontyMul and canUseNoCarryMontySquare by getSpareBits * Complete the double-precision implementation * Use double-precision path for Fp4 squaring and mul * remove mixin annotations * Lazy reduction in Fp4 prod * Fix assembly for sum2xMod * Assembly for double-precision negation * reduce white spaces in pairing benchmarks * ADX implies BMI2	2021-02-09 22:57:45 +01:00
Mamy Ratsimbazafy	491b4d4d21	Drop nim-json-serialization for testing (#156 )	2021-02-09 22:10:16 +01:00
Mamy Ratsimbazafy	e23f990280	Tower drop concepts (#153 ) * Fix affine instantiation * drop concept from the codebase * Remove alignment requirement, this cases problem in sequences on 32-bit for t_fp12_anti_regression * slight sparse optim	2021-02-07 14:03:56 +01:00
Mamy Ratsimbazafy	258e7e516f	[WIP] Pairings for bw6 761 (#108 ) * Prepare BW6-761 pairing constants * Extract the basic miller loop from pairings * template and method call syntax issue * Layout pairing for BW6-761 * Fix rebasing woes * Try to match the paper (still buggy) * Stash BW6-761	2021-02-07 09:46:41 +01:00

1 2 3 4

200 Commits