nim-libp2p

Commit Graph

Author	SHA1	Message	Date
Tanguy Cizain	55a3606ecb	add ping protocol (#584 ) * add ping protocol * add ping protocol handler * ping styling * more ping tests * switch ping to bearssl rng * update ping style * new cancellation test	2021-06-08 18:53:45 +02:00
Tanguy Cizain	caac8191d2	Change newXXXX procs to XXXX.new (#585 ) * newBufferStream -> BufferStream.new * newMultistream -> MultistreamSelect.new * newSecio -> Secio.new * newNoise -> Noise.new * newPlainText -> PlainText.new * newPubSubPeer -> PubSubPeer.new * newIdentify -> Identify.new * newMuxerProvider -> MuxerProvider.new	2021-06-07 09:32:08 +02:00
Dmitriy Ryajov	ac47964377	use pkg namespace	2021-06-02 12:26:02 -06:00
Dmitriy Ryajov	b56ca11b74	use raises defect	2021-06-02 12:26:02 -06:00
Dmitriy Ryajov	ce42674d80	avoid memory safety errors with nim 1.4.x	2021-06-02 12:26:01 -06:00
Dmitriy Ryajov	a6eea0c275	import chronicles otherwise compile breaks	2021-06-02 12:25:37 -06:00
Dmitriy Ryajov	1c3616e3a5	merge latest master	2021-06-02 12:25:36 -06:00
Dmitriy Ryajov	c949f14a99	Merge master to unstable (#570 ) * Revisit Floodsub (#543) Fixes #525 add coverage to unsubscribeAll and testing * add mounted protos to identify message (#546) * add stable/unstable auto bumps * fix auto-bump CI * merge nbc auto bump with CI in order to bump only on CI success * put conditional locks on nbc bump (#549) * Fix minor exception issues (#550) Makes code compatible with https://github.com/status-im/nim-chronos/pull/166 without requiring it. * fix nimbus ref for auto-bump stable's PR * use a builder pattern to build the switch (#551) * use a builder pattern to build the switch * with with * more refs * builders (#559) * More builders (#560) * address some issues pointed out in review * re-add to prevent breaking other projects * mem usage cleanups for pubsub (#564) In `async` functions, a closure environment is created for variables that cross an await boundary - this closure environment is kept in memory for the lifetime of the associated future - this means that although _some_ variables are no longer used, they still take up memory for a long time. In Nimbus, message validation is processed in batches meaning the future of an incoming gossip message stays around for quite a while - this leads to memory consumption peaks of 100-200 mb when there are many attestations in the pipeline. To avoid excessive memory usage, it's generally better to move non-async code into proc's such that the variables therein can be released earlier - this includes the many hidden variables introduced by macro and template expansion (ie chronicles that does expensive exception handling) * move seen table salt to floodsub, use there as well * shorten seen table salt to size of hash * avoid unnecessary memory allocations and copies in a few places * factor out message scoring * avoid reencoding outgoing message for every peer * keep checking validators until reject (in case there's both reject and ignore) * `readOnce` avoids `readExactly` overhead for single-byte read * genericAssign -> assign2 * More gossip coverage (#553) * add floodPublish test * test delivery via control Iwant/have mechanics * fix issues in control, and add testing * fix possible backoff issue with pruned routine overriding it * fix control messages (#566) * remove unused control graft check in handleControl * avoid sending empty Iwant messages * Split dialer (#542) * extracting dialing logic to dialer * exposing upgrade methods on transport * cleanup * fixing tests to use new interfaces * add comments * add base exception class and fix hierarchy * fix imports * Merge master (#555) * Revisit Floodsub (#543) Fixes #525 add coverage to unsubscribeAll and testing * add mounted protos to identify message (#546) * add stable/unstable auto bumps * fix auto-bump CI * merge nbc auto bump with CI in order to bump only on CI success * put conditional locks on nbc bump (#549) * Fix minor exception issues (#550) Makes code compatible with https://github.com/status-im/nim-chronos/pull/166 without requiring it. * fix nimbus ref for auto-bump stable's PR * Split dialer (#542) * extracting dialing logic to dialer * exposing upgrade methods on transport * cleanup * fixing tests to use new interfaces * add comments * add base exception class and fix hierarchy * fix imports * `doAssert` is `ValueError` not `AssertionError`? * revert back to `AssertionError` Co-authored-by: Giovanni Petrantoni <7008900+sinkingsugar@users.noreply.github.com> Co-authored-by: Jacek Sieka <jacek@status.im> * Builders (#558) * use a builder pattern to build the switch (#551) * use a builder pattern to build the switch * with with * more refs * Merge master (#555) * Revisit Floodsub (#543) Fixes #525 add coverage to unsubscribeAll and testing * add mounted protos to identify message (#546) * add stable/unstable auto bumps * fix auto-bump CI * merge nbc auto bump with CI in order to bump only on CI success * put conditional locks on nbc bump (#549) * Fix minor exception issues (#550) Makes code compatible with https://github.com/status-im/nim-chronos/pull/166 without requiring it. * fix nimbus ref for auto-bump stable's PR * Split dialer (#542) * extracting dialing logic to dialer * exposing upgrade methods on transport * cleanup * fixing tests to use new interfaces * add comments * add base exception class and fix hierarchy * fix imports * `doAssert` is `ValueError` not `AssertionError`? * revert back to `AssertionError` Co-authored-by: Giovanni Petrantoni <7008900+sinkingsugar@users.noreply.github.com> Co-authored-by: Jacek Sieka <jacek@status.im> * `doAssert` is `ValueError` not `AssertionError`? * revert back to `AssertionError` * fix builders * more builder stuff * more builders Co-authored-by: Giovanni Petrantoni <7008900+sinkingsugar@users.noreply.github.com> Co-authored-by: Jacek Sieka <jacek@status.im> Co-authored-by: Giovanni Petrantoni <7008900+sinkingsugar@users.noreply.github.com> Co-authored-by: Jacek Sieka <jacek@status.im>	2021-06-02 12:24:46 -06:00
Dmitriy Ryajov	5ee67f31bf	Merge master (#562 ) * builders (#559) * More builders (#560) * address some issues pointed out in review * re-add to prevent breaking other projects * Merge master (#555) * Revisit Floodsub (#543) Fixes #525 add coverage to unsubscribeAll and testing * add mounted protos to identify message (#546) * add stable/unstable auto bumps * fix auto-bump CI * merge nbc auto bump with CI in order to bump only on CI success * put conditional locks on nbc bump (#549) * Fix minor exception issues (#550) Makes code compatible with https://github.com/status-im/nim-chronos/pull/166 without requiring it. * fix nimbus ref for auto-bump stable's PR * Split dialer (#542) * extracting dialing logic to dialer * exposing upgrade methods on transport * cleanup * fixing tests to use new interfaces * add comments * add base exception class and fix hierarchy * fix imports * `doAssert` is `ValueError` not `AssertionError`? * revert back to `AssertionError` Co-authored-by: Giovanni Petrantoni <7008900+sinkingsugar@users.noreply.github.com> Co-authored-by: Jacek Sieka <jacek@status.im> Co-authored-by: Giovanni Petrantoni <7008900+sinkingsugar@users.noreply.github.com> Co-authored-by: Jacek Sieka <jacek@status.im>	2021-06-02 12:24:46 -06:00
Dmitriy Ryajov	530e589e14	Builders (#558 ) * use a builder pattern to build the switch (#551) * use a builder pattern to build the switch * with with * more refs * Merge master (#555) * Revisit Floodsub (#543) Fixes #525 add coverage to unsubscribeAll and testing * add mounted protos to identify message (#546) * add stable/unstable auto bumps * fix auto-bump CI * merge nbc auto bump with CI in order to bump only on CI success * put conditional locks on nbc bump (#549) * Fix minor exception issues (#550) Makes code compatible with https://github.com/status-im/nim-chronos/pull/166 without requiring it. * fix nimbus ref for auto-bump stable's PR * Split dialer (#542) * extracting dialing logic to dialer * exposing upgrade methods on transport * cleanup * fixing tests to use new interfaces * add comments * add base exception class and fix hierarchy * fix imports * `doAssert` is `ValueError` not `AssertionError`? * revert back to `AssertionError` Co-authored-by: Giovanni Petrantoni <7008900+sinkingsugar@users.noreply.github.com> Co-authored-by: Jacek Sieka <jacek@status.im> * `doAssert` is `ValueError` not `AssertionError`? * revert back to `AssertionError` * fix builders * more builder stuff * more builders Co-authored-by: Giovanni Petrantoni <7008900+sinkingsugar@users.noreply.github.com> Co-authored-by: Jacek Sieka <jacek@status.im>	2021-06-02 12:24:44 -06:00
Dmitriy Ryajov	eef5dd0042	fix imports	2021-06-02 12:24:06 -06:00
Dmitriy Ryajov	8e3ef540ea	add base exception class and fix hierarchy	2021-06-02 12:24:04 -06:00
Dmitriy Ryajov	a3c00af945	Split dialer (#542 ) * extracting dialing logic to dialer * exposing upgrade methods on transport * cleanup * fixing tests to use new interfaces * add comments	2021-06-02 12:23:44 -06:00
Dmitriy Ryajov	3da656687b	use LPError more consistently (#582 ) * use LPError more consistently * don't use Exceptino * annotate with raises * don't panic on concatenation * further rework error handling	2021-06-02 15:39:10 +02:00
Dmitriy Ryajov	1b94c3feda	fix #581 (#583 )	2021-06-01 18:06:08 -06:00
Dmitriy Ryajov	34787728a3	don't use MaResult as default in newStandardSwitch (#578 )	2021-05-24 16:48:18 -06:00
Dmitriy Ryajov	24132d7129	More raises cleanup (#575 ) * use toException to map errors * don't initialize address twice * better error messages * allow LPError to escape * allow LPError to escape	2021-05-22 12:27:30 -06:00
Dmitriy Ryajov	ac4e060e1a	adding raises defect across the codebase (#572 ) * adding raises defect across the codebase * use unittest2 * add windows deps caching * update mingw link * die on failed peerinfo initialization * use result.expect instead of get * use expect more consistently and rework inits * use expect more consistently * throw on missing public key * remove unused closure annotation * merge master	2021-05-21 10:27:01 -06:00
Jacek Sieka	9674a6a6f6	simplify connmanager (#573 ) * no need to init orderedset * array more simple than table	2021-05-19 08:49:55 +02:00
Jacek Sieka	83a20a992a	gossipsub: unsubscribe fixes (#569 ) * gossipsub: unsubscribe fixes * fix KeyError when updating metric of unsubscribed topic * fix unsubscribe message not being sent to all peers causing them to keep thinking we're still subscribed * release memory earlier in a few places * floodsub fix	2021-05-07 00:43:45 +02:00
Giovanni Petrantoni	9f301964ed	fix control messages (#566 ) * remove unused control graft check in handleControl * avoid sending empty Iwant messages	2021-04-28 10:03:03 +09:00
Giovanni Petrantoni	f81a085d0b	More gossip coverage (#553 ) * add floodPublish test * test delivery via control Iwant/have mechanics * fix issues in control, and add testing * fix possible backoff issue with pruned routine overriding it	2021-04-22 18:51:22 +09:00
Jacek Sieka	e285d8bbf4	mem usage cleanups for pubsub (#564 ) In `async` functions, a closure environment is created for variables that cross an await boundary - this closure environment is kept in memory for the lifetime of the associated future - this means that although _some_ variables are no longer used, they still take up memory for a long time. In Nimbus, message validation is processed in batches meaning the future of an incoming gossip message stays around for quite a while - this leads to memory consumption peaks of 100-200 mb when there are many attestations in the pipeline. To avoid excessive memory usage, it's generally better to move non-async code into proc's such that the variables therein can be released earlier - this includes the many hidden variables introduced by macro and template expansion (ie chronicles that does expensive exception handling) * move seen table salt to floodsub, use there as well * shorten seen table salt to size of hash * avoid unnecessary memory allocations and copies in a few places * factor out message scoring * avoid reencoding outgoing message for every peer * keep checking validators until reject (in case there's both reject and ignore) * `readOnce` avoids `readExactly` overhead for single-byte read * genericAssign -> assign2	2021-04-18 10:08:33 +02:00
Dmitriy Ryajov	6b930ae7e6	More builders (#560 ) * address some issues pointed out in review * re-add to prevent breaking other projects	2021-04-06 14:16:23 -06:00
Dmitriy Ryajov	290866dd62	builders (#559 )	2021-04-05 16:06:45 -06:00
Giovanni Petrantoni	795a651839	use a builder pattern to build the switch (#551 ) * use a builder pattern to build the switch * with with * more refs	2021-04-02 10:20:51 +09:00
Jacek Sieka	54031c9e9b	Fix minor exception issues (#550 ) Makes code compatible with https://github.com/status-im/nim-chronos/pull/166 without requiring it.	2021-03-23 07:45:25 +01:00
Dmitriy Ryajov	f7a9d83545	add mounted protos to identify message (#546 )	2021-03-15 15:29:05 -06:00
Giovanni Petrantoni	aeb18c4e41	now wild `except:`	2021-03-15 16:48:11 +09:00
Giovanni Petrantoni	4760df1e31	fix build with libp2p_agents_metrics switch	2021-03-15 01:42:47 +00:00
Jacek Sieka	70deac9e0d	fix peer score accumulation (#541 ) * fix accumulating peer score * fix missing exception handling * remove unnecessary initHashSet/initTable calls * simplify peer stats management * clean up tests a little * fix some missing raises annotations	2021-03-09 13:22:52 +01:00
Giovanni Petrantoni	269d3df351	Consolidate metrics collection for mesh (#540 ) * Consolidate metrics collection for mesh * more fixes * wrapping up * Update libp2p/protocols/pubsub/gossipsub/behavior.nim Co-authored-by: Jacek Sieka <jacek@status.im> * Update libp2p/protocols/pubsub/gossipsub/behavior.nim Co-authored-by: Jacek Sieka <jacek@status.im> * Update libp2p/protocols/pubsub/gossipsub/behavior.nim Co-authored-by: Jacek Sieka <jacek@status.im> Co-authored-by: Jacek Sieka <jacek@status.im>	2021-03-03 22:11:21 +01:00
Giovanni Petrantoni	34c2fbeb16	small gossipsub metrics change	2021-03-03 08:41:21 +00:00
Giovanni Petrantoni	02ad017107	Gossipsub fixes and Initiator flagging fixes (#539 ) * properly propagate initiator information for gossipsub * Fix pubsubpeer lifetime management * restore old behavior * tests fixing * clamp backoff time value received * fix member name collisions * internal test fixes * better names and explaining of the importance of transport direction * fixes	2021-03-03 08:23:40 +09:00
Giovanni Petrantoni	c1334c6d89	pubsubpeer better address management	2021-02-28 04:53:17 +00:00
Giovanni Petrantoni	7b2727d930	avoid leaking in peersInIP, don't depend on sendConn	2021-02-27 23:49:56 +09:00
Giovanni Petrantoni	67d0926e89	use in any case PeerID for peersInIP to avoid keeping references	2021-02-27 21:31:59 +09:00
Giovanni Petrantoni	fae38e0146	fix PubSubPeer hashing issues	2021-02-26 19:19:15 +09:00
Giovanni Petrantoni	45300c28a9	[SEC] gossipsub - handleIHAVE/handleIWANT recommendations & notes (#535 ) Fixes #400	2021-02-26 14:27:42 +09:00
Giovanni Petrantoni	c1d8317e3c	fix badly merged code in gossipsub.colocationFactor	2021-02-26 12:39:57 +09:00
Giovanni Petrantoni	eac6cd3dbf	Debt: cleanup warnings #426 (#536 ) * testswitch cleanups * Debt: cleanup warnings Fixes #426	2021-02-25 09:24:49 -06:00
Giovanni Petrantoni	922cd92f94	don't check if peers have `sendConn` when disconnecting for bad scoring	2021-02-22 10:04:02 +09:00
Giovanni Petrantoni	51d8cd4ade	[SEC] gossipsub - rebalanceMesh may prune up to D_lo on oversubscription (#531 ) Fixes #403	2021-02-13 13:39:32 +09:00
Giovanni Petrantoni	e124e342b0	n subscription limits (#528 ) * subscription high water, cleanups * subscription limits test * newline	2021-02-12 12:27:26 +09:00
Dmitriy Ryajov	12adefb4de	add multi types to exports (#527 ) * add multitypes to exports * export standard setup	2021-02-10 11:42:46 -06:00
Dmitriy Ryajov	f4145ebbfa	More exports cleanup (#522 ) * annotate `SecureProtocol.Secio` as deprecated * dont export varint * add `errors` to exports - convenient error utils	2021-02-09 15:41:49 -06:00
Giovanni Petrantoni	fff54fa23c	add more diagnostics when gossip publish fails	2021-02-09 18:42:59 +09:00
Ștefan Talpalaru	d9563d65ae	support compilation with Nim-1.4 HEAD (#521 )	2021-02-08 15:21:43 -06:00
Dmitriy Ryajov	2658181df9	Merge unstable (#518 ) * Address Book POC implementation (#499) * Address Book POC implementation * Feat/peerstore impl (#505) Co-authored-by: Hanno Cornelius <68783915+jm-clius@users.noreply.github.com>	2021-02-08 15:16:23 -06:00
Dmitriy Ryajov	4dea23c394	Remove secio usage and cleanup exports (#519 ) * cleaned up exports * remove secio use * added more useful exports * proper import	2021-02-08 14:33:34 -06:00
Giovanni Petrantoni	646557597d	lower some gossipsub logging to debug level	2021-02-08 10:11:41 +09:00
Giovanni Petrantoni	fd73cf9f9d	Refactor gossipsub into multiple modules (#515 ) * Refactor gossipsub into multiple modules * splitup further gossipsub * move more mesh related stuff to behavior * fix internal tests * fix PubSubPeer.outbound flag, make it more reliable * use discard rather then _	2021-02-06 09:13:04 +09:00
Dmitriy Ryajov	5c234ddd37	add hash proc to support using with containers (#516 )	2021-02-05 10:12:44 -06:00
Giovanni Petrantoni	5aebf0990e	peer stats fixes (#511 ) Gossipsub fix, required by nimbus, merging into master as low impact	2021-01-29 12:41:51 +09:00
Dmitriy Ryajov	fb493d1a4a	Connection limits tests (#509 ) * connection limit tests * remove use of secio * check that upgraded fut is not nil * rebuild	2021-01-27 21:27:33 -06:00
Giovanni Petrantoni	1d77d37f17	Gossipsub scoring fixes (#508 ) * Fix some problematics when running with full scoring * more fixes	2021-01-25 21:13:42 +09:00
Dmitriy Ryajov	0959877b29	Connection limits (#384 ) * master merge * wip * avoid deadlocks * tcp limits * expose client field in chronosstream * limit incoming connections * update with new listen api * fix release * don't override peerinfo in connection * rework transport with accept * use semaphore to track resource ussage * rework with new transport accept api * move events to conn manager (#373) * use semaphore to track resource ussage * merge master * expose api to acquire conn slots * don't fail expensive metrics * allow tracking and updating connections * set global connection limits to 80 * add per peer connection limits * make sure conn is closed if tracking failed * more descriptive naming for handle * rework with new transport accept api * add `getStream` hide `selectConn` * add TransportClosedError * make nil explicit * don't make unnecessary copies of message * logging * error handling * cleanup semaphore * track connections properly * throw `TooManyConnections` when tracking outgoing * use proper exception and handle conventions * check onCloseHandle for nil * revert internalConnect changes * adding upgraded flag * await stream before closing * simplify tracking * wip * logging * split connection limits into incoming and outgoing * further streamline connection limits split counts * don't use closeWithEOF * move peer and conn event triggers from switch * wip * wip * wip * merge master * handle nil connections properly * add clarifying comment * don't raise exc on nil * no finally * add proper min/max connections logic * rebase master * merge master * master merge * remove request timeout should be addressed in separate PR * merge master * share semaphore when in/out limits arent enforced * merge master * use import * pass semaphore to trackConn * don't close last conn * use storeConn * merge master * use storeConn	2021-01-20 22:00:24 -06:00
Dmitriy Ryajov	96c01e5e69	Split upgrade flow (#507 ) * splitting upgrade flow * bring back master changes * re-export `Upgrade` * export public methods/procs in derived class * style fixes	2021-01-20 11:28:32 -06:00
Dmitriy Ryajov	34e330353f	better `upgraded` lifetime handling (avoid NPE) (#506 ) * avoid npe on connection upgrade * add `onUpgraded` event	2021-01-18 16:27:29 -06:00
Dmitriy Ryajov	64b822e8f0	remove blank spaces	2021-01-18 15:32:42 -06:00
Giovanni Petrantoni	b57101f265	add an invalid topic subscriptions metric	2021-01-15 18:55:54 +09:00
Giovanni Petrantoni	1fb783eb7f	let apps decide if they want to penalize peers on invalid topic	2021-01-15 18:50:42 +09:00
Giovanni Petrantoni	6542b913df	set "ignoring invalid topic subscription" to trace level	2021-01-15 18:48:58 +09:00
Giovanni Petrantoni	240ec84ffb	Gossipsub wip (#502 ) * Remove unused connections in pubsubpeer, also removed wrong usages, add a disconnect bad peers parameter * handle exceptions in disconnectPeer * small fix * use the proper disconnection procedure for gossip peers * fixes, more metrics add test about disconnection * hot fix possible null pointers in switch * silly isnil sugar * Fix and test gossip directPeer connections	2021-01-15 13:48:03 +09:00
Dmitriy Ryajov	3878a95b23	Semaphore cancellations (#503 ) * add proper cancelation handling * remove cancelled futures explicitly * use fifo to keep proper order * add out of order cancelations test * make count public * use `new` instead of `init` * remove private `queue` from tests * expose count as a readonly prop * use `delete()` to preserve seq order	2021-01-14 10:11:12 +01:00
Giovanni Petrantoni	dc48170b0d	Gossip subscription improvements (#497 ) * salt ids in seen table * add subscription validation callback and avoid processing topics we don't care of * apply penalty on bad subscription * fix IHave handling IDs * reduce indenting, add some comments * fix gossip randombytes generation * do not descore unwanted topics (might happen, due to timing, needs improvements) * cleaning up and added tests * validate subscriptions only when subscribing * set notice level for failed publish * fix floodsub behavior	2021-01-13 23:49:44 +09:00
Giovanni Petrantoni	b902c030a0	add metrics into chronosstream to identify peers agents (#458 ) * add metrics into chronosstream to identify peers agents * avoid too many agent strings * use gauge instead of counter for stream metrics * filter identity on / * also track bytes traffic * fix identity tracking closeimpl call * add gossip rpc metrics * fix missing metrics inclusions * metrics fixes and additions * add a KnownLibP2PAgents strdefine * enforse toLowerAscii to agent names (metrics) * incoming rpc metrics * fix silly mistake in rpc metrics * fix agent metrics logic * libp2p_gossipsub_failed_publish metric * message ids metrics * libp2p_pubsub_broadcast_ihave metric improvement * refactor expensive gossip metrics * more detailed metrics * metrics improvements * remove generic metrics for `set` users * small fixes, add debug counters * fix counter and add missing subs metrics! * agent metrics behind -d:libp2p_agents_metrics * remove testing related code from this PR * small rebroadcast metric fix * fix small mistake * add some guide to the readme in order to use new metrics * add libp2p_gossipsub_peers_scores metric * add protobuf metrics to understand bytes traffic precisely * refactor gossipsub metrics * remove unused variable * add more metrics, refactor rebalance metrics * avoid bad metric concurrent states * use a stack structure for gossip mesh metrics * refine sub metrics * add received subs metrics fixes * measure handlers of known topics * sub/unsub counter * unsubscribeAll log unknown topics * expose a way to specify known topics at runtime	2021-01-08 14:21:24 +09:00
Dmitriy Ryajov	8e57746f3a	improving connection estblishing metrics (#500 )	2021-01-07 17:06:41 -06:00
Dmitriy Ryajov	b2ea5a3c77	Concurrent upgrades (#489 ) * adding an upgraded event to conn * set stopped flag asap * trigger upgradded event on conn * set concurrency limit for accepts * backporting semaphore from tcp-limits2 * export unittests module * make params explicit * tone down debug logs * adding semaphore tests * use semaphore to throttle concurent upgrades * add libp2p scope * trigger upgraded event before any other events * add event handler for connection upgrade * cleanup upgraded event on conn close * make upgrades slot release rebust * dont forget to release slot on nil connection * misc * make sure semaphore is always released * minor improvements and a nil check * removing unneeded comment * make upgradeMonitor a non-closure proc * make sure the `upgraded` event is initialized * handle exceptions in accepts when stopping * don't leak exceptions when stopping accept loops	2021-01-04 12:59:05 -06:00
Giovanni Petrantoni	5e79d3ab9c	avoid triggering unsubscribeAll from unsubscribe (gossip)	2020-12-20 17:08:03 +09:00
Giovanni Petrantoni	4858e0ab15	Gossipsub refactor pt2 (#495 ) * add sub/unsub test * remove unused variable from gossip	2020-12-20 00:45:34 +09:00
Giovanni Petrantoni	05e789a34f	Gossipsub refactor (#490 ) * refactor peerStats, re-enable scores for testing * remove gossip 1.0 * cleanup * codecov matrix fixes * restore previous score on onNewPeer * fix coverage n checks * unsubscribeAll gossipsub fixes * refactor unsub/sub * refactor onNewPeer and fix score flow * disable scores by default (change in tests later) * fix tests, enable scores in tests * fix wrongly merged test * ensure topic removal from topics table * small typo fix * testinterop fixes	2020-12-19 15:43:32 +01:00
Giovanni Petrantoni	6c2e743ff3	Race condition in pubsub #469 (#471 ) * Race condition in pubsub #469 * use allFinished * improve cancellation handling	2020-12-19 00:56:46 +09:00
Jacek Sieka	a1a5f9abac	nice msgid log formatting (#491 )	2020-12-18 16:14:07 +01:00
Giovanni Petrantoni	52628d1a2e	avoid using debug logging in gossip, just because	2020-12-17 17:21:09 +09:00
Jacek Sieka	0c331d5200	simplify noise frame construction (#478 )	2020-12-16 13:10:06 +01:00
Jacek Sieka	9e5ba64c48	avoid creating prune message unless we're pruning (#487 )	2020-12-15 22:46:03 +01:00
Giovanni Petrantoni	18d69a5c41	Workaround the gossipsub table race condition (#486 )	2020-12-15 12:32:11 -06:00
Jacek Sieka	b52dab9fd7	use stew/leb128 (#481 ) * avoids multiple reallocations in readLp * simplifies varint implementation * remove vbuffer.length (unused)	2020-12-15 12:15:22 -06:00
Giovanni Petrantoni	5543f6681f	first pass, use results for Cid module (#480 ) * first pass, use results for Cid module * improvements to decode	2020-12-15 14:19:18 +01:00
Giovanni Petrantoni	f8f0bc1bd8	Gossip improvements (#476 ) * add more traces, remove async from rebalance * more traces * avoid computng scores when weight is 0.0 * debug colocation, fix an indent in unsubpeer (minor) * add full ValidationResult coverage * store in cache only after validation * gossip 1.0 fixes * fix typo * gossip 10 internal test fixes * test fixing * refactor peerstats usages * populate tables if missing when scoring	2020-12-15 10:25:22 +09:00
Mamy Ratsimbazafy	42d264d8b0	Rm bearssl + Deactivate Travis completely (#477 ) * Rm bearssl added in #2167 * Travis ARM doesn't work	2020-12-10 14:19:27 +01:00
Mamy André-Ratsimbazafy	8805e5f061	Use Travis only for ARM64 - https://github.com/status-im/nimbus-eth2/pull/2167	2020-12-09 16:05:41 -06:00
Jacek Sieka	6f1ecc8df7	streamline socket read/write hot path (#473 ) * streamline socket read/write hot path This avoids some unnecessary memory copying on the hot path of noise / mplex, as well as getting rid of a few futures - profiling shows that this is one of the main culprits of small memory allocations, which makes sense - this is where gossip fan-out happens. * fewer futures (and corresponding closures) when sending lpchannel messages * avoid data copies when encrypting and framing noise messages * avoid copying tuple when reading noise data (poor c codegen) * fix setting eof flag in secure read * write noise frames in one go ...and closing secure socket once is enough	2020-12-09 08:56:40 -06:00
Jacek Sieka	1befeb8c2e	clean up peerid (#470 ) * fix dangling cstring on error return * remove some useless inlines * less mallocs in shortlog * proc -> func * rename test	2020-12-03 13:53:16 -06:00
Dmitriy Ryajov	e9d4679059	Race in connection setup (#464 ) * check that connection is not closed or eof * don't release connection lock prematurely * test that only valid connections can be added * correct exception type on closed connection * add clarifying comment * use closeWithEOF for more stable test * misc comments * log stream id in buffestream asserts * use closeWithEOF to prevent races in tests * give some time to the remote handler to trigger * adding more tests to make codecov happy	2020-12-02 19:24:48 -06:00
Dmitriy Ryajov	d1c689e5ab	adding libp2p tag to logScope (#465 )	2020-12-01 11:34:27 -06:00
Giovanni Petrantoni	e1648d4404	fix mcache logic check in gossipsub	2020-12-01 23:55:51 +09:00
Giovanni Petrantoni	b4738d723c	Some gossip fixes (#467 ) * fix some missing rpc in rebalanceMesh * clarify some variable names and lifetime * further improvements	2020-12-01 11:44:09 +01:00
Dmitriy Ryajov	94e672ead0	allow concurrent closeWithEOF (#466 ) * allow concurrent closeWithEOF * add dedicated closedWithEOF flag	2020-12-01 09:44:21 +01:00
Jacek Sieka	5c2a54bdd9	fix timeoutmonitor loop (#463 ) * fix timeoutmonitor loop * Clarify that cancellation can happen while in timeoutMonitor	2020-11-29 13:34:19 +01:00
Dmitriy Ryajov	18443dafc1	rework peer event to take an initiator flag (#456 ) * rework peer event to take an initiator flag * use correct direction for initiator	2020-11-28 10:59:47 -06:00
Dmitriy Ryajov	3d44fcb8b3	use cancelAndAwait to mitigate further hangs (#459 )	2020-11-28 09:48:06 -06:00
Dmitriy Ryajov	a8f5f7a8bb	move dialing logic to it's own proc to avoid try/finally bugs (#461 ) * move dialing logic to it's own proc to avoid try/finally bugs * re-export transport * lint * add cancelation test * test remote conn close on dial	2020-11-28 09:05:12 +01:00
Giovanni Petrantoni	02b20440f2	Limit ihave emission (#462 ) * add some limits to ihave emission, go has them, rust does not actually * restore shuffling of IDs * add some context	2020-11-28 16:27:39 +09:00
Giovanni Petrantoni	12db9a4cf2	TopicParams validation tuning	2020-11-28 00:23:09 +09:00
Giovanni Petrantoni	809df8d04d	add some extra gossip metrics	2020-11-26 16:20:34 +09:00
Giovanni Petrantoni	6c7f2766fe	expose seenTTL parameters (#457 )	2020-11-26 14:45:10 +09:00
Dmitriy Ryajov	ca9c5c85e4	dont break chronicles logging streamline connsetup (#455 )	2020-11-25 13:34:48 -06:00
Dmitriy Ryajov	7b1e652224	Allow custom identify agent string (#451 ) * allow custom agent version string * rework tests and add test for custom agent version	2020-11-25 07:42:02 -06:00
Dmitriy Ryajov	164892776b	get rid of hangs cleanup (#453 )	2020-11-25 07:35:25 -06:00
Dmitriy Ryajov	21110636cb	fixing log level to avoid sacring users (#452 )	2020-11-24 12:07:27 -06:00
Dmitriy Ryajov	351489bfa9	getMuxedStream to more appropriate getStream (#448 )	2020-11-24 00:37:45 -06:00
Dmitriy Ryajov	69ae24dc8d	less leak prone cleanup (#447 ) * less leak prone cleanup * fix double allFinished	2020-11-23 18:22:15 -06:00
Dmitriy Ryajov	6cc3f4283a	update conn peerinfo instead of replacing (#445 ) * update conn peerinfo instead of replacing * remove unnecesary peerid var	2020-11-23 15:15:55 -06:00
Dmitriy Ryajov	034a1e8b1b	small cleanups from tcp-limits2 (#446 )	2020-11-23 15:02:23 -06:00
Dmitriy Ryajov	1d16d22f5f	Don't allow concurrent pushdata (#444 ) * handle resets properly with/without pushes/reads * add clarifying comments * pushEof should also not be concurrent * move channel reset to bufferstream this is where the action happens - lpchannel merely redefines how close is done Co-authored-by: Jacek Sieka <jacek@status.im>	2020-11-23 09:07:11 -06:00
Dmitriy Ryajov	c42009d56e	don't quit accept prematurelly (#443 )	2020-11-19 09:10:25 -06:00
Giovanni Petrantoni	93b6c4dc52	Gossip runtime params (#437 ) * move gossip parameters to runtime * internal test fixes * add missing params * restore const parameters are soldi base and use them in init * more constants tuning	2020-11-19 16:48:17 +09:00
Dmitriy Ryajov	92fa4110c1	Rework transport to use chronos accept (#420 ) * rework transport to use the new accept api * use the new chronos primits * fixup tests to use the new transport api * handle all exceptions in upgradeIncoming * master merge * add multiaddress exception type * raise appropriate exception on invalida address * allow retrying on TransportTooManyError * adding TODO * wip * merge master * add sleep if nil is returned * accept loop handles all exceptions * avoid issues with tray/except/finally * make consistent with master * cleanup accept loop * logging * Update libp2p/transports/tcptransport.nim Co-authored-by: Jacek Sieka <jacek@status.im> * use Direction enum instead of initiator flag * use consistent import style * remove experimental `closeWithEOF()` Co-authored-by: Jacek Sieka <jacek@status.im>	2020-11-18 20:06:42 -06:00
Dmitriy Ryajov	8c8d73380f	Re-add connection manager tests (#441 ) * use table.getOrDefault() * re-add missing connection manager tests	2020-11-17 18:48:26 -06:00
Jacek Sieka	74acd0a33a	fix channels not being reset (#439 ) * fix channels not being reset silly for loop.. * allow only one concurrent read * fix mplex test race condition * add some bufferstream eof tests * deadlock, lost data and hung channel fixes * prevent concurrent `reset` calls * reset LPChannel when read is cancelled (since data is lost) * ensure there's one, and one only, 0-byte readOnce on EOF * ensure that all data is returned before EOF is returned * keep running activity monitor for half-closed channels (or they never get closed)	2020-11-17 08:59:25 -06:00
Dmitriy Ryajov	da37eee285	Test disconnect from conn event (#432 ) * logs * adding disconnect test in connection events * adding immediate disconnect from connection event	2020-11-11 13:20:14 -06:00
Giovanni Petrantoni	a9948b0b05	clarify validation messages (#431 ) * clarify validation messages * add codecov threshold	2020-11-12 01:42:12 +09:00
Dmitriy Ryajov	90921bff09	move some importance trace logs to debug (#428 )	2020-11-09 22:14:46 -06:00
Dmitriy Ryajov	4fb3f50d2c	Reset channels on close (#425 ) * reset when failed to read/write muxed conn * add more comprehensive resource cleanup tests * style * cleanup tests	2020-11-06 09:24:24 -06:00
Dmitriy Ryajov	3956f3fd69	make sure all streams are tracked (#422 ) * make sure all streams are tracked * revert unnecesary change	2020-11-04 21:52:54 -06:00
Dmitriy Ryajov	6040cb4ef1	fix debugutils (#423 )	2020-11-04 19:56:28 -06:00
Giovanni Petrantoni	7cc42ce219	start adding more tests + minor fixes (#419 ) * start adding more tests + minor fixes * add wrong secure negotiation test * add noise failed handshake test	2020-11-04 23:24:41 +09:00
Giovanni Petrantoni	e496802943	Least expensive metrics (#421 ) * add more general and useful metrics * fix gossipsub peers metrics in heartbeat	2020-11-04 15:18:00 +01:00
Dmitriy Ryajov	7b5259dbc7	Move triggers (#416 ) * move event triggers to connmanager * use base error type * avoid deadlocks * handle eof and closed when identifying incoming * use `closeWait`	2020-11-02 14:35:26 -06:00
Dmitriy Ryajov	43a77e60a1	split stream counts by direction (#418 )	2020-11-01 16:23:26 -06:00
Jacek Sieka	03639f1446	Revert "Channel leaks (#413 )" (#417 ) This reverts commit `1de1d49223`.	2020-11-01 14:49:25 -06:00
Giovanni Petrantoni	9c1633bf87	fix ValidIpAddress multiaddress init return type	2020-10-31 13:20:29 +09:00
cheatfate	04c95cb7b0	Fix write should be writeArray.	2020-10-31 03:23:34 +02:00
cheatfate	ff48d0b1a2	Proper fix for init(ValidIpAddress).	2020-10-30 17:52:38 +02:00
Giovanni Petrantoni	3d9948a65e	ensure all multiaddress routines use Result	2020-10-30 23:50:04 +09:00
Giovanni Petrantoni	75b023c9e5	gossipsub audit fixes (#412 ) * [SEC] gossipsub - rebalanceMesh grafts peers giving preference to low scores #405 * comment score choices * compiler warning fixes/bug fixes (unsubscribe) * rebalanceMesh does not enforce D_out quota * fix outbound grafting * fight the nim compiler * fix closure capture bs... * another closure fix * #403 rebalance prune fixes * more test fixing * #403 fixes * #402 avoid removing scores on unsub * #401 handleGraft improvements * [SEC] handleIHAVE/handleIWANT recommendations * add a note about peer exchange handling	2020-10-30 21:49:54 +09:00
Dmitriy Ryajov	1de1d49223	Channel leaks (#413 ) * break stream tracking by type * use closeWithEOF to await wrapped stream * fix cancelation leaks * fix channel leaks * logging * use close monitor and always call closeUnderlying * don't use closeWithEOF * removing close monitor * logging	2020-10-27 11:21:03 -06:00
Giovanni Petrantoni	eeaa62feec	add more debug details to multiaddress assertions	2020-10-21 18:29:59 +09:00
Giovanni Petrantoni	462da1f7a8	gossip MessageID as seq[byte] (#391 ) * gossip MessageID as seq[byte] * combina hashes in defaultMsgIdProvider * wip * fix defaultMsgIdProvider	2020-10-21 12:26:04 +09:00
Giovanni Petrantoni	27b9bf436e	fix validation according to specification (#410 )	2020-10-21 12:25:42 +09:00
Giovanni Petrantoni	5c19668b2d	avoid verbose EOF messages in readOnce(secure) (#411 ) * avoid verbose EOF messages in readOnce(secure) * shorten azure tests further	2020-10-21 10:08:24 +09:00
Giovanni Petrantoni	32623b930e	handle secure errors in readOnce (secure) (#397 ) * handle secure errors in readOnce(secure) * small synthax fix * fix mistake in readOnce's isNil	2020-10-19 14:13:14 +09:00
Giovanni Petrantoni	556213abf4	Extended validators (#395 ) * gossip extended validation * fix flood tests * fix gossip 1.0 tests * synthax consistency	2020-10-12 16:56:00 +09:00
Giovanni Petrantoni	e3bdb9eb13	decode properly ControlPrune (#392 )	2020-10-09 09:12:38 +09:00
Giovanni Petrantoni	0f2435f551	better opportunistic grafting score (when score is disabled) (#389 )	2020-10-03 09:26:45 +09:00
Dean Eigenmann	853238a215	feature/expose-matcher (#387 ) * exposes matcher * might work * fix * fix	2020-10-02 08:59:15 -06:00
Giovanni Petrantoni	4a98a8af5a	gossip pruning fixes related to #371 (#385 ) * gossip pruning fixes related to #371 * better trace for grafted/pruned * shorted azure testing again	2020-10-02 13:09:31 +09:00
Mamy Ratsimbazafy	03f5bbba6d	saner logging (#381 )	2020-09-29 09:40:06 -06:00
Giovanni Petrantoni	98d0cc3a16	defaultMsgIdProvider alternative/test anonymize (#379 ) * defaultMsgIdProvider alternative/test anonymize * avoid freeze during flood tests * avoid `empty message, skipping` situation * test observers * avoid double initPubSub * fix gossip testing (specially when anonymize is on) * make azure tests shorter	2020-09-28 09:11:18 +02:00
Jacek Sieka	8ecef46738	reencode gossipsub messages with anonymization (#378 ) This helps protect against clients sending more data than they should and thus getting penalized on topics that require anonymity	2020-09-25 18:39:34 +02:00
Jacek Sieka	17e00e642a	limit write queue length (#376 ) To break a potential read/write deadlock, gossipsub uses an unbounded queue for writes - when peers are too slow to process this queue, it may end up growing without bounds causing high memory usage. Here, we introduce a maximum write queue length after which the peer is disconnected - the queue is generous enough that any "normal" usage should be fine - writes that are `await`:ed are not affected, only writes that are launched in an `asyncSpawn` task or similar. * avoid unnecessary copy of message when there are no send observers * release message memory earlier in gossipsub * simplify pubsubpeer logging	2020-09-24 18:43:20 +02:00
Jacek Sieka	25bd0a18f4	small fixes (#374 ) * add helper to read EOF marker after closing stream (else stream stay alive until timeout/reset) * don't assert on empty channel message * don't loop when writing to chronos (no need)	2020-09-24 07:30:19 +02:00
Giovanni Petrantoni	ec322124ac	allow to omit peerId and seqno (#372 ) * allow to omit peerId and seqno * small refactor * wip * fix message encoding * improve rpc signature logic * remove peerid from verify * trace fixes * fix message test * fix gossip 1.0 tests	2020-09-23 17:56:33 +02:00
Dmitriy Ryajov	abd234601b	move events to conn manager (#373 )	2020-09-23 08:07:16 -06:00
Jacek Sieka	471e5906f6	fix gossipsub memory leak on disconnected peer (#371 ) When messages can't be sent to peer, we try to establish a send connection - this causes messages to stack up as more and more unsent messages are blocked on the dial lock. * remove dial lock * run reconnection loop in background task	2020-09-22 09:05:53 +02:00
Jacek Sieka	49a12e619d	channel close race and deadlock fixes (#368 ) * channel close race and deadlock fixes * remove send lock, write chunks in one go * push some of half-closed implementation to BufferStream * fix some hangs where LPChannel readers and writers would not always wake up * simplify lazy channels * fix close happening more than once in some orderings * reenable connection tracking tests * close channels first on mplex close such that consumers can read bytes A notable difference is that BufferedStream is no longer considered EOF until someone has actually read the EOF marker. * docs, simplification	2020-09-21 19:48:19 +02:00
Giovanni Petrantoni	b99d2039a8	Gossip one one (#240 ) * allow multiple codecs per protocol (without breaking things) * add 1.1 protocol to gossip * explicit peering part 1 * explicit peering part 2 * explicit peering part 3 * PeerInfo and ControlPrune protocols * fix encodePrune * validated always, even explicit peers * prune by score (score is stub still) * add a way to pass parameters to gossip * standard setup fixes * take into account explicit direct peers in publish * add floodPublish logic * small fixes, publish still half broken * make sure to waitsub in sparse test * use var semantics to optimize table access * wip... lvalues don't work properly sadly... * big publish refactor, replenish and balance * fix internal tests * use g.peers for fanout (todo: don't include flood peers) * exclude non gossip from fanout * internal test fixes * fix flood tests * fix test's trypublish * test interop fixes * make sure to not remove peers from gossip table * restore old replenishFanout * cleanups * restore utility module import * restore trace vs debug in gossip * improve fanout replenish behavior further * triage publish nil peers (issue is on master too but just hidden behind a if/in) * getGossipPeers fixes * remove topics from pubsubpeer (was unused) * simplify rebalanceMesh (following spec) and make it finally reach D_high * better diagnostics * merge new pubsubpeer, copy 1.1 to new module * fix up merge * conditional enable gossip11 module * add back topics in peers, re-enable flood publish * add more heartbeat locking to prevent races * actually lock the heartbeat * minor fixes * with sugar * merge 1.0 * remove assertion in publish * fix multistream 1.1 multi proto * Fix merge oops * wip * fix gossip 11 upstream * gossipsub11 -> gossipsub * support interop testing * tests fixing * fix directchat build * control prune updates (pb) * wip parameters * gossip internal tests fixes * parameters wip * finishup with params * cleanups/wip * small sugar * grafted and pruned procs * wip updateScores * wip * fix logging issue * pubsubpeer, chronicles explicit override * fix internal gossip tests * wip * tables troubleshooting * score wip * score wip * fixes * fix test utils generateNodes * don't delete while iterating in score update * fix grafted defect * add a handleConnect in subscribeTopic * pruning improvements * wip * score fixes * post merge - builds gossip tests * further merge fixes * rebalance improvements and opportunistic grafting * fix test for now * restore explicit peering * implement peer exchange graft message * add an hard cap to PX * backoff time management * IWANT cap/budget * Adaptive gossip dissemination * outbound mesh quota, internal tests fixing * oversub prune score based, finish outbound quota * finishup with score and ihave budget * use go daemon 0.3.0 * import fixes * byScore cleanup score sorting * remove pointless scaling in `/` Duration operator * revert using libp2p org for daemon * interop fixes * fixes and cleanup * remove heartbeat assertion, minor debug fixes * logging improvements and cleaning up * (to revert) add some traces * add explicit topic to gossip rpcs * pubsub merge fixes and type fix in switch * Revert "(to revert) add some traces" This reverts commit `4663eaab6c`. * cleanup some now irrelevant todo * shuffle peers anyway as score might be disabled * add missing shuffle * old merge fix * more merge fixes * debug improvements * re-enable gossip internal tests * add gossip10 fallback (dormant but tested) * split gossipsub internal tests into 1.0 and 1.1 Co-authored-by: Dmitriy Ryajov <dryajov@gmail.com>	2020-09-21 11:16:29 +02:00
Jacek Sieka	b7e5d1122c	cleanups (#366 ) * reuse connection timeout for noise handshake (avoid extra timer) * enforce nbytes > 0 for readOnce * avoid some unnecessary memory zeroing * simplify noise * fix dumping when noise splits message	2020-09-16 11:55:25 +02:00
Dmitriy Ryajov	b0d86b95dd	add peer lifecycle events (#357 ) * add peer lifecycle events * rework peer events to not use connection events * don't use result in pubsub and switch init * wip * use ordered hashes and remove logscope * logging * add missing test * small fixes	2020-09-15 14:19:22 -06:00
Giovanni Petrantoni	a6007be428	avoid sending empty seqno and/or fromPeer (gossip rpc) (#364 )	2020-09-15 12:33:18 +02:00
Eugene Kabanov	e2d46c6b53	Add `libp2p_dump` and `libp2p_dump_dir` compiler time options. (#359 ) * Add `libp2p_dump` and `libp2p_dump_dir` compiler time options. * Add tools/pbcap_parser. * Add mplex control messages decoding.	2020-09-15 11:16:43 +02:00
Oskar Thorén	5e66f6fbd8	Add logScope to connmanager and pubsubprotobuf (#363 )	2020-09-15 08:03:53 +02:00
Jacek Sieka	0db45462cd	mplex fixes (#362 ) * remove almost-empty types module * lock when writing message (that's the only place the lock matters, and only when the message is > max msg size) * logging updates (log in consistent order, makes reading logs easier) * raise EOF from readExactly only if no bytes have been read (to signal that _no_ bytes were lost)	2020-09-14 10:19:54 +02:00
Jacek Sieka	96d4c44fec	refactor bufferstream to use a queue (#346 ) This change modifies how the backpressure algorithm in bufferstream works - in particular, instead of working byte-by-byte, it will now work seq-by-seq. When data arrives, it usually does so in packets - in the current bufferstream, the packet is read then split into bytes which are fed one by one to the bufferstream. On the reading side, the bytes are popped of the bufferstream, again byte by byte, to satisfy `readOnce` requests - this introduces a lot of synchronization traffic because the checks for full buffer and for async event handling must be done for every byte. In this PR, a queue of length 1 is used instead - this means there will at most exist one "packet" in `pushTo`, one in the queue and one in the slush buffer that is used to store incomplete reads. * avoid byte-by-byte copy to buffer, with synchronization in-between * reuse AsyncQueue synchronization logic instead of rolling own * avoid writeHandler callback - implement `write` method instead * simplify EOF signalling by only setting EOF flag in queue reader (and reset) * remove BufferStream pipes (unused) * fixes drainBuffer deadlock when drain is called from within read loop and thus blocks draining * fix lpchannel init order	2020-09-10 08:19:13 +02:00
Jacek Sieka	5b347adf58	logging fixes and small cleanups (#361 ) In particular, allow longer multistream select reads	2020-09-09 19:12:08 +02:00
Jacek Sieka	63b38734bd	fix poor performance in LRU cache (#360 ) it turns out (in NBC) a heap is sufficiently slow becuase of all the deletes that it makes more sense to go with a linked list	2020-09-09 18:28:46 +02:00
Jacek Sieka	82c179db9e	mplex fixes (#356 ) * close the right connection when channel send fails * don't crash on channel id that is not unique	2020-09-08 08:24:28 +02:00
Jacek Sieka	2b72d485a3	a few more log fixes (#355 )	2020-09-07 14:15:11 +02:00
Jacek Sieka	c1856fda53	simplify and unify logging (#353 ) * use short format for logging peerid * log peerid:oid for connections	2020-09-06 10:31:47 +02:00
Jacek Sieka	9b815efe8f	gossipsub: don't subscribe to floodsub also (#352 )	2020-09-04 22:53:03 +02:00
Jacek Sieka	16a008db75	fix connection event order when connection dies early (#351 ) if the connection is already closed (because the remote closes during identfiy for example), an exception would be raised which would leave the connection in limbo, beacuse it would not go through the rest of internalConnect. Also, if the connection is already closed, the disconnect event would be scheduled before the connect event :/	2020-09-04 20:30:26 +02:00
Jacek Sieka	6d91d61844	small cleanups & docs (#347 ) * simplify gossipsub heartbeat start / stop * avoid alloc in peerid check * stop iterating over seq after unsubscribing item (could crash) * don't crash on missing private key with enabled sigs (shouldn't happen but...)	2020-09-04 18:31:43 +02:00
Eugene Kabanov	0b85192119	Remove asyncCheck from codebase. (#345 ) * Remove asyncCheck from codebase. * Replace all `discard` statements with new `asyncSpawn`. * Bump `nim-chronos` requirement.	2020-09-04 18:30:45 +02:00
Jacek Sieka	5819c6a9a7	gossipsub / floodsub fixes (#348 ) * mcache fixes * remove timed cache - the window shifting already removes old messages * ref -> object * avoid unnecessary allocations with `[]` operator * simplify init * fix several gossipsub/floodsub issues * floodsub, gossipsub: don't rebroadcast messages that fail validation (!) * floodsub, gossipsub: don't crash when unsubscribing from unknown topics (!) * gossipsub: don't send message to peers that are not interested in the topic, when messages don't share topic list * floodsub: don't repeat all messages for each message when rebroadcasting * floodsub: allow sending empty data * floodsub: fix inefficient unsubscribe * sync floodsub/gossipsub logging * gossipsub: include incoming messages in mcache (!) * gossipsub: don't rebroadcast already-seen messages (!) * pubsubpeer: remove incoming/outgoing seen caches - these are already handled in gossipsub, floodsub and will cause trouble when peers try to resubscribe / regraft topics (because control messages will have same digest) * timedcache: reimplement without timers (fixes timer leaks and extreme inefficiency due to per-message closures, futures etc) * timedcache: ref -> obj	2020-09-04 08:10:32 +02:00
Jacek Sieka	cd1c68dbc5	avoid send deadlock by not allowing send to block (#342 ) * avoid send deadlock by not allowing send to block * handle message issues more consistently	2020-09-01 09:33:03 +02:00
Dmitriy Ryajov	d3182c4dba	No raise send (#339 ) * dont raise in send * check that the lock is acquire on release	2020-08-20 20:50:33 -06:00
Giovanni Petrantoni	840a76915e	warn -> debug log levels in errors.nim	2020-08-20 16:53:28 +09:00
Jacek Sieka	eb13845f65	work around send that may raise `send` can raise exceptions that together with asyncCheck will crash NBC	2020-08-19 14:25:30 +03:00
Zahary Karadjov	af0955c58b	Add comments explaning a possible deadlock	2020-08-18 13:51:41 +03:00
Zahary Karadjov	60122a044c	Restore interop with Lighthouse by preventing concurrent meshsub dials	2020-08-17 22:40:58 +03:00
Jacek Sieka	833a5b8e57	add muxer nil check	2020-08-17 13:32:02 +02:00
Jacek Sieka	cfcda3c3ef	work around race conditions between identify and other protocols when identify is run on incoming connections, the connmanager tables are updated too late for incoming connections to properly be handled this is a quickfix that will eventually need cleaning up	2020-08-17 13:29:45 +02:00
Jacek Sieka	790b67c923	work around bufferstream deadlock (#332 ) mplex backpressure handling deadlocks with something	2020-08-17 12:45:54 +02:00
Jacek Sieka	53877e97bd	trace logs	2020-08-17 12:39:25 +02:00
Jacek Sieka	f46bf0faa4	remove send lock (#334 ) * remove send lock When mplex receives data it will block until a reader has processed the data. Thus, when a large message is received, such as a gossipsub subscription table, all of mplex will be blocked until all reading is finished. However, if at the same time a `dial` to establish a gossipsub send connection is ongoing, that `dial` will be blocked because mplex is no longer reading data - specifically, it might indeed be the connection that's processing the previous data that is waiting for a send connection. There are other problems with the current code: * If an exception is raised, it is not necessarily raised for the same connection as `p.sendConn`, so resetting `p.sendConn` in the exception handling is wrong * `p.isConnected` is checked before taking the lock - thus, if it returns false, a new dial will be started. If a new task enters `send` before dial is finished, it will also determine `p.isConnected` is false, then get stuck on the lock - when the previous task finishes and releases the lock, the new task will _also_ dial and thus reset `p.sendConn` causing a leak. * prefer existing connection simplifies flow	2020-08-17 12:38:27 +02:00
Jacek Sieka	b12145dff7	avoid crash when subscribe is received (#333 ) ...by making subscribeTopic synchronous, avoiding a peer table lookup completely. rebalanceMesh will be called a second later - it's fine	2020-08-17 12:10:22 +02:00
Jacek Sieka	ab864fc747	logging cleanups and small fixes (#331 )	2020-08-15 21:50:31 +02:00
Jacek Sieka	397f9edfd4	simplify mplex (#327 ) * less async * less copying of data * less redundant cleanup	2020-08-15 07:58:30 +02:00
Jacek Sieka	9c7e055310	set activity flag on noise / secio (#330 )	2020-08-15 07:36:15 +02:00
Dmitriy Ryajov	d1f1e1b31e	add missing mplex half closed test (#326 )	2020-08-12 07:23:49 +02:00
Dmitriy Ryajov	b76b3e0e9b	Rework pubsub (#322 ) * move pubsub of off switch, pass switch into pubsub * use join on lpstreams * properly cleanup up failed peers * fix tests * fix peertable hasPeerId * fix tests * rework sending, remove helpers from pubsubpeer, unify in broadcast * further split broadcast into send * use send where appropriate * use formatIt * improve trace Co-authored-by: Giovanni Petrantoni <giovanni@fragcolor.xyz>	2020-08-11 18:05:49 -06:00
Eugene Kabanov	59b290fcc7	Refactor minasn1 and fix security issues. (#323 ) * Refactor minasn1 and fix security issues. * Fix for RSA test vectors.	2020-08-11 16:58:51 -06:00
Eugene Kabanov	d47b2d805f	Use constant-time hex encoding/decoding procedures explicitly. (#305 ) * Use constant-time hex encoding/decoding procedures explicitly. * Add comments.	2020-08-11 08:48:21 -06:00
Dmitriy Ryajov	2325692f55	Fix half closed (#324 ) * don't call `close` in `remoteClose` * make sure timeout are properly propagted * fix tests * adding remote close write test	2020-08-10 16:17:11 -06:00
Giovanni Petrantoni	6ffd5be059	fix crash at TRACE log level	2020-08-08 16:31:37 +09:00
Eugene Kabanov	7c1aac5dc1	Use constant-time comparison for keys and signatures. (#299 )	2020-08-08 08:53:33 +02:00
Jacek Sieka	f303954989	peer hooks -> events (#320 ) * peer hooks -> events * peerinfo -> peerid * include connection direction in event * check connection status after event * lock connmanager lookup also when dialling peer * clean up un-upgraded connection when upgrade fails * await peer eventing * remove join/lifetime future from peerinfo Peerinfo instances are not unique per peer so the lifetime future is misleading - it fires when a random connection is closed, not the "last" one * document switch values * naming * peerevent->conneevent	2020-08-08 08:52:20 +02:00
zah	fbb59c3638	`msg` is a reserved property name in Chronicles (#321 ) Every Chronicles log record has an existing `msg` property matching the static string supplied in the log statement. Thus, it's currently not possible to use `msg` as the name of a user property: https://github.com/status-im/nim-chronicles/issues/86	2020-08-07 16:46:00 -06:00
Jacek Sieka	7c2ab38da1	cleanups (#319 )	2020-08-06 20:14:40 +02:00
Jacek Sieka	c6c0c152c0	Dial peerid (#308 ) * prefer PeerID in switch api This avoids ref issues like ref identity and nil * use existing peerinfo instance if possible * remove secureCodec there may be multiple connections per peerinfo with different codecs * avoid some extra async::	2020-08-06 09:29:27 +02:00
Giovanni Petrantoni	9bbe5e4841	Fix subclass calls to handleDisconnect (#314 ) * Fix subclass calls to handleDisconnect * add peer ID to nil peer debug message	2020-08-06 11:12:52 +09:00
Giovanni Petrantoni	5c986cf657	Fix build, add some raises (#315 ) * Fix build, add some raises * wip * wip more raises * missing exc object in mplex * proper lifetime for subscribePeer Co-authored-by: Dmitriy Ryajov <dryajov@gmail.com>	2020-08-05 19:30:57 -06:00
Ștefan Talpalaru	bd5d43874a	more expensive metrics (#312 )	2020-08-05 14:02:26 +02:00
Dmitriy Ryajov	74a6dccd80	adding channel limits to mplex (#309 )	2020-08-04 23:16:04 -06:00
Ștefan Talpalaru	843d32f8db	put expensive metrics under a Nim define (#310 )	2020-08-04 17:27:59 -06:00
Dmitriy Ryajov	cf2b42b914	Moving idle timeout to Connection to enable across all connection streams (#307 ) * move idle timeout logic to connection * more informative logs * more informative logs	2020-08-04 07:22:05 -06:00
Giovanni Petrantoni	5f0637c49a	Audit curve fixes part2 (#298 ) * refactor and fix mulgen (curve25519) * crypto tests fixing * fix some confusion in curve25519 mul * removing ForbiddenCurveValues table and checks * fix remaining merge issues	2020-08-04 18:19:26 +09:00
Giovanni Petrantoni	5cd93540fa	add a timeout during noise handshake (#294 ) * add a timeout during noise handshake * noise hs timeout into const	2020-08-04 17:04:16 +09:00
Giovanni Petrantoni	504e0444d3	refactor and fix mulgen (curve25519) (#293 ) * refactor and fix mulgen (curve25519) * crypto tests fixing	2020-08-04 14:07:53 +09:00
Dmitriy Ryajov	b6877b8aac	increase send timeout for prune and graft msgs (#306 ) * increase send timeout for prune and graft msgs * use trace logs for subscribe monitor	2020-08-03 17:55:42 -06:00
Dmitriy Ryajov	980764774e	pubsub timeouts tuning (#295 ) * add finegrained timeouts to pubsub * use 10 millis timeout in tests * finalization * revert timeouts * use `atEof` for reads * adjust timeouts and use atEof for reads * use atEof for reads * set isEof flag * no backoff for pubsub streams * temp timer increase, make macos finalize * don't call `subscribePeer` in libp2p anymore * more traces * leak tests * lower timeouts * handle exceptions in control message * don't use `cancelAndWait` * handle exceptions in helpers * wip * don't send empty messages * check for leaks properly * don't use cancelAndWait * don't await subscribption sends * remove subscrivePeer calls from switch * trying without the hooks again	2020-08-02 23:20:11 -06:00
Jacek Sieka	e655a510cd	misc cleanups (#303 )	2020-08-02 12:22:49 +02:00
Jacek Sieka	d544b64010	resolve several races in connmanager (#302 ) * resolve several races in connmanager collections may change while doing await * close conn * simplify connmanager API PeerID avoids nil and ref issues * remove silly condition	2020-08-01 22:50:40 +02:00
Giovanni Petrantoni	afcfd27aa0	add some verbosity to multistream handshake for debugging pruposes	2020-07-31 14:02:03 +09:00
Giovanni Petrantoni	0f06ae5a1d	Audit multistream fixes (#291 ) * Don't ignore missing \n in multistream requests Also make sure to except and quit an existing connection if multistream handshake fails * solve handshake tracking in ms handler	2020-07-28 23:03:22 +09:00
Dmitriy Ryajov	f7fdf31365	Pubsub lifetime (#284 ) * lifecycle hooks * tests * move trace after closed check * restore 1 second heartbeat * await close event * fix tests * print direction string * more trace logging * add pubsub monitor * add log scope * adjust idle timeout * add exc.msg to trace	2020-07-27 13:33:51 -06:00
Dmitriy Ryajov	ed0df74bbd	Connection lifecycle hooks (#288 ) * lifecycle hooks * trigger hooks as tasks * handle exceptions in trigger hooks * trigger hooks after storing the connection * add disconnected hook * tests	2020-07-24 13:24:31 -06:00
Eugene Kabanov	6af3cb6406	Public key infrastructure filters. (#272 ) * Initial commit. * Workaround nim's bug and add some other compilation error fixes. * Rename to libp2p_pki_schemes. Fix secio. Add tests. * Attempt to fix command line. * Fix command line. Show status in tests.	2020-07-21 14:10:21 -06:00
Giovanni Petrantoni	c3404f6eea	Handle cancellation in timeoutMonitor (#283 ) * Handle cancellation in timeoutMonitor * refactor lpchannel timeout as suggested by cheatfate	2020-07-21 09:03:41 -06:00
Giovanni Petrantoni	3b088f8980	Fix some unsubscribe issues and add unsubscribeAll helper (#282 ) * Fix some unsub issues and add unsuball helper * batch sendprune in unsubscribe methods * add unsubscribeAll for floodsub	2020-07-20 10:16:13 -06:00
Dmitriy Ryajov	38eb36efae	don't use close event to stop timer (#280 )	2020-07-18 11:00:44 -06:00
Dmitriy Ryajov	94196fee71	Connections and pubsub peers cleanup (#279 ) * better peer tracking and cleanup * check if peer and conn is nil * test name * make timeout more agressive * rename method for better clarity	2020-07-17 13:46:24 -06:00
Dmitriy Ryajov	ba071cafa6	Channel timeout (#278 ) * add support for channel timeouts * tests for channel timeout * add timeouts to standard switch * fix mplex init * cleanup timer on stream close * add comment for `isConnected` * move cleanup event	2020-07-17 12:44:41 -06:00
Dmitriy Ryajov	0348773ec9	Connection manager (#277 ) * splitting out connection management * wip * wip conn mngr tests * set peerinfo in contructor * comments and documentation * tests * wip * add `None` to detect untagged connections * use `PeerID` to index connections * fix tests * remove useless equality	2020-07-17 09:36:48 -06:00
Jacek Sieka	170685f9c6	gossipsub fixes (#276 ) * graft up to D peers * fix logging so it's clear who is grafting/pruning who * clear fanout when grafting	2020-07-16 21:26:57 +02:00
Jacek Sieka	c76152f2c1	Simplify send (#271 ) * PubSubPeer.send single message * gossipsub: simplify send further	2020-07-16 12:06:57 +02:00
Dmitriy Ryajov	f35b8999b3	some light cleanup for pub/gossip sub (#273 ) * move peer table out to its own file * move peer table * cleanup `==` and add one to peerinfo * add peertable * missed equality check	2020-07-15 13:18:55 -06:00
Eugene Kabanov	b832668768	Minprotobuf refactoring 2 (#269 ) * Protobuf refactoring stage II. * Remove NoError. * Change trace level for invalid message.	2020-07-15 10:25:39 +02:00
Eugene Kabanov	9eb5828a42	Fix #266 . (#270 ) * Fix security issue #266. * Add more tests. * Fix PeerID tests should not use RSA-512 keys. * Fix crypto tests to use vectors with 2048+ bits. * Disable 4096bit RSA key generation for CI debug runs.	2020-07-15 10:24:04 +02:00
Giovanni Petrantoni	d7bab37119	Fix gossip messages seqno according to spec (#253 ) * Fix gossip messages seqno according to spec * Add peers back to gossipsub table, slow down heartbeat * Revert "Add peers back to gossipsub table, slow down heartbeat" This reverts commit `01e2e62172`. * make seqno a threadvar, remove from peerinfo * seqno refactor, into pubsub	2020-07-14 21:51:33 -06:00
Ștefan Talpalaru	b8b0a2b4bc	CI: build binaries with TRACE & JSON logs (#268 ) Also: remove unused imports.	2020-07-14 02:02:16 +02:00
Jacek Sieka	c6c2d99907	one more log fix	2020-07-13 20:19:20 +02:00
Jacek Sieka	76853f064a	json logging again	2020-07-13 19:59:49 +02:00
Jacek Sieka	6620b7a00b	more comment fixes	2020-07-13 19:30:18 +02:00
Jacek Sieka	0d4c74b33a	comment log that can't be json-serialized	2020-07-13 18:36:49 +02:00
Jacek Sieka	061c54d3c6	logging fixes	2020-07-13 17:26:05 +02:00
Jacek Sieka	87e58c1c8d	metrics: one more pubsub peers fix	2020-07-13 16:16:46 +02:00
Jacek Sieka	c7895ccc52	metrics: fix pubsub_peers add metric	2020-07-13 16:15:27 +02:00
Giovanni Petrantoni	fcda0f6ce1	PubSubPeer tables refactor (#263 ) * refactor peer tables * tests fixing * override PubSubPeer equality * fix pubsubpeer comparison	2020-07-13 15:32:38 +02:00
Eugene Kabanov	efb952f18b	[WIP] Minprotobuf refactoring (#259 ) * Minprotobuf initial commit * Fix noise. * Add signed integers support. Add checks for field number value. Remove some casts. * Fix compile errors. * Fix comments and constants.	2020-07-13 14:43:07 +02:00
Dmitriy Ryajov	181cf73ca7	Drain buffer (#264 ) * drain lpchannel on reset * move drainBuffer to bufferstream	2020-07-12 18:37:10 +02:00
Dmitriy Ryajov	bec9a0658f	Cleanup rpc handler (#261 ) * more cleanup * fix tests * merging master * remove `withLock` as it conflicts with stdlib * wip * more fanout ttl Co-authored-by: Giovanni Petrantoni <giovanni@fragcolor.xyz>	2020-07-09 17:54:16 -06:00
Dmitriy Ryajov	4c815d75e7	More gossip cleanup (#257 ) * more cleanup * correct pubsub peer count * close the stream first * handle cancelation * fix tests * fix fanout ttl * merging master * remove `withLock` as it conflicts with stdlib * fix trace build Co-authored-by: Giovanni Petrantoni <giovanni@fragcolor.xyz>	2020-07-09 14:21:47 -06:00
Jacek Sieka	c720e042fc	clean up mesh handling logic (#260 ) * gossipsub is a function of subscription messages only * graft/prune work with mesh, get filled up from gossipsub * fix race conditions with await * fix exception unsafety when grafting/pruning * fix allowing up to DHi peers in mesh on incoming graft * fix metrics in several places	2020-07-09 11:16:46 -06:00
Jacek Sieka	9a3684c221	init from concrete key type (#252 )	2020-07-09 02:59:09 -06:00
Jacek Sieka	45c089ff0d	noise updates (#255 ) * clear secrets explicitly * simplify keygen * avoid some trivial memory allocations * fix little endian encoding of nonce	2020-07-09 02:53:19 -06:00
Giovanni Petrantoni	4e12d0d97a	nil check peer before disconnect	2020-07-09 17:20:45 +09:00
Giovanni Petrantoni	f9e0a1f069	CI fix handleDisconnect (pubsub)	2020-07-09 13:56:59 +09:00
Giovanni Petrantoni	9b8b159abb	Remove other spurious getStacktrace in pubsub traces	2020-07-09 13:19:34 +09:00
Giovanni Petrantoni	4bcb567d47	fix gossip tests	2020-07-09 12:34:36 +09:00
Giovanni Petrantoni	4698f41a91	Remove stacktrace logging from pubsub connect	2020-07-09 12:23:03 +09:00
Giovanni Petrantoni	fec507e755	Add peers back to gossipsub table, slow down heartbeat (#256 ) * Add peers back to gossipsub table, slow down heartbeat * exclude on unsub from mesh and fanout	2020-07-08 11:06:26 -06:00
Dmitriy Ryajov	a52763cc6d	fix publishing (#250 ) * use var semantics to optimize table access * wip... lvalues don't work properly sadly... * big publish refactor, replenish and balance * fix internal tests * use g.peers for fanout (todo: don't include flood peers) * exclude non gossip from fanout * internal test fixes * fix flood tests * fix test's trypublish * test interop fixes * make sure to not remove peers from gossip table * restore old replenishFanout * cleanups * Cleanup resources (#246) * consolidate reading in lpstream * remove debug echo * tune log level * add channel cleanup and cancelation handling * cancelation handling * cancelation handling * cancelation handling * cancelation handling * cleanup and cancelation handling * cancelation handling * cancelation * tests * rename isConnected to connected * remove testing trace * comment out debug stacktraces * explicit raises * restore trace vs debug in gossip * improve fanout replenish behavior further * cleanup stale peers more eaguerly * synchronize connection cleanup and small refactor * close client first and call parent second * disconnect failed peers on publish * check for publish result * fix tests * fix tests * always call close Co-authored-by: Giovanni Petrantoni <giovanni@fragcolor.xyz>	2020-07-07 18:33:05 -06:00
Eugene Kabanov	775cab414a	Remove SHA1 from crypto and crypto tests. (#251 ) * Remove SHA1 from crypto and crypto tests. * Simplify RSA comparison procedure. Refactor some procedures in crypto.nim.	2020-07-07 15:48:15 +02:00
Jacek Sieka	d522537b19	reuse single RNG instance for all crypto key generation (#249 ) * reuse single RNG instance for all crypto key generation * use foolproof rng * initRng -> newRng (because it's ref) * fix test * imports/exports, chat fix * fix rsa * imports and exports * work around threadvar issue * fixup * mac workaround test	2020-07-07 13:14:11 +02:00
Jacek Sieka	b49c619ca8	export public field types (#248 ) * export public field types * one more	2020-07-01 09:22:29 +02:00
Giovanni Petrantoni	ec00c7fc50	Peer resultification and defect only (#245 ) * Peer resultification and defect only * Fixing some tests * test fixes * Rename peer into peerid * better result error message in identify * further merge fixes	2020-07-01 08:25:09 +02:00
Dmitriy Ryajov	c788a6a3c0	Cleanup resources (#246 ) * consolidate reading in lpstream * remove debug echo * tune log level * add channel cleanup and cancelation handling * cancelation handling * cancelation handling * cancelation handling * cancelation handling * cleanup and cancelation handling * cancelation handling * cancelation * tests * rename isConnected to connected * remove testing trace * comment out debug stacktraces * explicit raises	2020-06-29 09:15:31 -06:00

... 3 4 5 6 7 ...

990 Commits