constantine

Commit Graph

Author	SHA1	Message	Date
Mamy André-Ratsimbazafy	d22d981e9e	Implement fused sqrt invsqrt on Fp: Accelerate sqrt on Fp2 by 20% (hashToG2 and property-based testing bottleneck, 4 times slower than inversion and 87 times slower than Fp2 multiplication)	2020-06-17 22:44:52 +02:00
Mamy Ratsimbazafy	d376f08d1b	G2 / Operations on the twisted curve E'(Fp2) (#51) * Split elliptic curve tests to better use parallel testing * Add support for printing points on G2 * Implement multiplication and division by optimal sextic non-residue (BLS12-381) * Implement modular square root in 𝔽p2 * Support EC add and EC double on G2 (for BLS12-381) * Support G2 divisive twists with non-unit sextic-non-residue like BN254 snarks * Add EC G2 bench * cleanup some unused warnings * Reorg the tests for parallelization and to avoid instantiating huge files	2020-06-15 22:58:56 +02:00
Mamy Ratsimbazafy	2613356281	Endomorphism acceleration for Scalar Multiplication (#44) * Add MultiScalar recoding from "Efficient and Secure Algorithms for GLV-Based Scalar Multiplication" by Faz et al * precompute cube root of unity - Add VM precomputation of Fp - workaround upstream bug https://github.com/nim-lang/Nim/issues/14585 * Add the φ-accelerated lookup table builder * Add a dedicated bithacks file * cosmetic import consistency * Build the φ precompute table with n-1 EC additions instead of 2^(n-1) additions * remove binary * Add the GLV precomputations to the sage scripts * You can't avoid it, bigint multiplication is needed at one point * Add bigint multiplication discarding some low words * Implement the lattice decomposition in sage * Proper decomposition for BN254 * Prepare the code for a new scalar mul * We compile, and now debugging hunt * More helpers to debug GLV scalar Mul * Fix conditional negation * Endomorphism accelerated scalar mul working for BN254 curve * Implement endomorphism acceleration for BLS12-381 (needed cofactor clearing of the point) * fix nimble test script after bench rename	2020-06-14 15:39:06 +02:00
Mamy Ratsimbazafy	3d1b1fab98	Fix benchmark on ARM (#31)	2020-06-04 22:09:30 +02:00
Mamy Ratsimbazafy	82ceca6e3b	Scalar mul tests (#28) * Add sage script for BN254 * Implement (failing) scalar multiplication tests * Add a first test against sagemath * Finish the tests against SAGE for BN254 * Add significant test coverage of scalar multiplication with reference checks for BN254_Snarks and BLS12_381	2020-06-04 20:37:29 +02:00
Mamy André-Ratsimbazafy	44350d08af	Add elliptic doubling in projective coordinates	2020-04-15 22:23:46 +02:00
Mamy André-Ratsimbazafy	7ae0f51000	benchmarking skips cycle counting for ARM	2020-04-15 21:24:18 +02:00
Mamy André-Ratsimbazafy	e0c1e0b1c8	Add EC bench on G1 + Add throughput to benches	2020-04-15 19:38:02 +02:00
Mamy André-Ratsimbazafy	aff44f4d8e	Implement constant-time `div2` on finite and extension fields	2020-04-15 02:12:45 +02:00
Mamy Ratsimbazafy	c04721a04e	Refactor: Higher-Kinded Tower of Extension Fields (#25) * Mention that the inverse of 0 is 0 (TODO tests) * Introduce "Higher-Kinded tower extensions" * rename isCOmplexExtension -> fromComplexExtension * update benchmarks with the new tower scheme * Try to recover some speed on mul/squaring for an optimal tower (but this was not it)	2020-04-14 02:05:42 +02:00
Mamy André-Ratsimbazafy	33314fe725	Properly distinguish between Nogami and Snark/Ethereum BN254 closes #19	2020-04-12 03:01:50 +02:00
Mamy André-Ratsimbazafy	a6e4517be2	Implement 𝔽p12 inversion, enable 𝔽p12 tests and bench	2020-04-09 14:28:01 +02:00
Mamy André-Ratsimbazafy	8b7374f405	Cleanup in Montgomery Mul, Square, Pow	2020-03-22 13:24:37 +01:00
Mamy André-Ratsimbazafy	c40bc1977d	Inverse in cubic extension field 𝔽p6 = 𝔽p2[∛(1 + 𝑖)]	2020-03-21 23:47:43 +01:00
Mamy André-Ratsimbazafy	ff4a54daba	Add multiplication in 𝔽p6 = 𝔽p2[∛(1+𝑖)]	2020-03-21 19:03:57 +01:00
Mamy André-Ratsimbazafy	1855d14497	Add more curves for testing: Curve25519, BLS12-377, BN446, FKM-447, BLS12-461, BN462	2020-03-21 13:05:58 +01:00
Mamy André-Ratsimbazafy	9e78cd5d6d	Benchmark template for 𝔽p, 𝔽p2, 𝔽p6	2020-03-21 02:31:31 +01:00
Mamy André-Ratsimbazafy	bde619155b	30% faster constant-time inversion	2020-03-20 23:03:52 +01:00
Mamy Ratsimbazafy	4ff0e3d90b	Internals refactor + renewed focus on perf (#17) * Lay out the refactoring objectives and tradeoffs * Refactor the 32 and 64-bit primitives [skip ci] * BigInts and Modular BigInts compile * Make the bigints test compile * Fix modular reduction * Fix reduction tests vs GMP * Implement montgomery mul, pow, inverse, WIP finite field compilation * Make FiniteField compile * Fix exponentiation compilation * Fix Montgomery magic constant computation for 2^64 words * Fix typo in non-optimized CIOS - passing finite fields IO tests * Add limbs comparisons [skip ci] * Fix on precomputation of the Montgomery magic constant * Passing all tests including 𝔽p2 * modular addition, the test for mersenne prime was wrong * update benches * Fix "nimble test" + typo on out-of-place field addition * bigint division, normalization is needed: https://travis-ci.com/github/mratsim/constantine/jobs/298359743 * missing conversion in subborrow non-x86 fallback - https://travis-ci.com/github/mratsim/constantine/jobs/298359744 * Fix little-endian serialization * Constantine32 flag to run 32-bit constantine on 64-bit machines * IO Field test, ensure that BaseType is used instead of uint64 when the prime can fit in uint32 * Implement proper addcarry and subborrow fallback for the compile-time VM * Fix export issue when the logical wordbitwidth == physical wordbitwidth - passes all tests (32-bit and 64-bit) * Fix uint128 on ARM * Fix C++ conditional copy and ARM addcarry/subborrow * Add investigation for SIGFPE in Travis * Fix debug display for unsafeDiv2n1n * multiplexer typo * moveMem bug in glibc of Ubuntu 16.04? * Was probably missing an early clobbered register annotation on conditional mov * Note on Montgomery-friendly moduli * Strongly suspect a GCC before GCC 7 codegen bug (https://gcc.gnu.org/bugzilla/show_bug.cgi?id=87139) * hex conversion was (for debugging) not taking requested order into account + inlining comment * Use 32-bit limbs on ARM64, uint128 builtin __udivti4 bug? * Revert "Use 32-bit limbs on ARM64, uint128 builtin __udivti4 bug?" This reverts commit 087f9aa7fb40bbd058d05cbd8eec7fc082911f49. * Fix subborrow fallback for non-x86 (need to mask the borrow)	2020-03-16 16:33:51 +01:00
Mamy André-Ratsimbazafy	191bb7710c	Add a warmup to the Fp bench to deal with CPU scaling	2020-03-15 21:02:17 +01:00
Mamy André-Ratsimbazafy	b810422486	Add benchmark for Ethereum 1 and Ethereum 2 curves	2020-03-15 20:54:14 +01:00
Mamy André-Ratsimbazafy	dc0c1c181c	enable subtraction benchmarks	2020-03-07 12:23:46 +01:00
Mamy André-Ratsimbazafy	472823b749	more comprehensive benchmark of Fp	2020-03-06 17:44:30 +01:00
Mamy André-Ratsimbazafy	1fdb1df80a	Add benchmark clock timers	2020-02-29 19:36:35 +01:00
Mamy André-Ratsimbazafy	ca817fcb69	Use Assembly cmov on x86	2020-02-29 18:27:20 +01:00
Mamy André-Ratsimbazafy	05bce529b4	1st experiment at accelerating montgomery multiplication (665 lines of specialized duplicated ASM code for some reason, monomorphization is probably better than that)	2020-02-28 22:46:20 +01:00
Mamy André-Ratsimbazafy	ddce056bb4	make bench compile	2020-02-25 03:07:42 +01:00
Mamy André-Ratsimbazafy	8cbbd40a0c	Add benchmark of constant-time vs unsafe powmod	2020-02-22 18:39:29 +01:00
Mamy André-Ratsimbazafy	10346d83a4	Benchmark: BigInt -> Montgomery conversion: - shlAddMod (with assembly division) is already 4x slower than Montgomery Multiplication based. - constant-time division will be even slower - use montgomery-multiplication based conversion	2020-02-16 01:43:17 +01:00

29 Commits