nim-libp2p-experimental

Commit Graph

Author	SHA1	Message	Date
Giovanni Petrantoni	f81a085d0b	More gossip coverage (#553 ) * add floodPublish test * test delivery via control Iwant/have mechanics * fix issues in control, and add testing * fix possible backoff issue with pruned routine overriding it	2021-04-22 18:51:22 +09:00
Jacek Sieka	e285d8bbf4	mem usage cleanups for pubsub (#564 ) In `async` functions, a closure environment is created for variables that cross an await boundary - this closure environment is kept in memory for the lifetime of the associated future - this means that although _some_ variables are no longer used, they still take up memory for a long time. In Nimbus, message validation is processed in batches meaning the future of an incoming gossip message stays around for quite a while - this leads to memory consumption peaks of 100-200 MB when there are many attestations in the pipeline. To avoid excessive memory usage, it's generally better to move non-async code into procs such that the variables therein can be released earlier - this includes the many hidden variables introduced by macro and template expansion (i.e. chronicles that does expensive exception handling) * move seen table salt to floodsub, use there as well * shorten seen table salt to size of hash * avoid unnecessary memory allocations and copies in a few places * factor out message scoring * avoid reencoding outgoing message for every peer * keep checking validators until reject (in case there's both reject and ignore) * `readOnce` avoids `readExactly` overhead for single-byte read * genericAssign -> assign2	2021-04-18 10:08:33 +02:00
Giovanni Petrantoni	795a651839	use a builder pattern to build the switch (#551 ) * use a builder pattern to build the switch * with with * more refs	2021-04-02 10:20:51 +09:00
Giovanni Petrantoni	03bbdd2261	Revisit Floodsub (#543 ) Fixes #525 add coverage to unsubscribeAll and testing	2021-03-15 13:16:03 -06:00
Jacek Sieka	70deac9e0d	fix peer score accumulation (#541 ) * fix accumulating peer score * fix missing exception handling * remove unnecessary initHashSet/initTable calls * simplify peer stats management * clean up tests a little * fix some missing raises annotations	2021-03-09 13:22:52 +01:00
Giovanni Petrantoni	02ad017107	Gossipsub fixes and Initiator flagging fixes (#539 ) * properly propagate initiator information for gossipsub * Fix pubsubpeer lifetime management * restore old behavior * tests fixing * clamp backoff time value received * fix member name collisions * internal test fixes * better names and explaining of the importance of transport direction * fixes	2021-03-03 08:23:40 +09:00
Giovanni Petrantoni	45300c28a9	[SEC] gossipsub - handleIHAVE/handleIWANT recommendations & notes (#535 ) Fixes #400	2021-02-26 14:27:42 +09:00
Giovanni Petrantoni	d7469b2286	[SEC] gossipsub - a peer score is not retained up until expiry if abusive peer unsubscribes (#534 ) * [SEC] gossipsub - a peer score is not retained up until expiry if abusive peer unsubscribes Fixes #402 * remove debug logging	2021-02-26 14:15:58 +09:00
Giovanni Petrantoni	8236319a91	[SEC] gossipsub - handleGraft/handlePrune recommendations & notes (#530 ) Fixes #401	2021-02-22 12:04:20 +09:00
Giovanni Petrantoni	1368bf7ecb	test rebalanceMesh with low score peers (#529 )	2021-02-22 10:05:25 +09:00
Giovanni Petrantoni	51d8cd4ade	[SEC] gossipsub - rebalanceMesh may prune up to D_lo on oversubscription (#531 ) Fixes #403	2021-02-13 13:39:32 +09:00
Giovanni Petrantoni	e124e342b0	n subscription limits (#528 ) * subscription high water, cleanups * subscription limits test * newline	2021-02-12 12:27:26 +09:00
Dmitriy Ryajov	4dea23c394	Remove secio usage and cleanup exports (#519 ) * cleaned up exports * remove secio use * added more useful exports * proper import	2021-02-08 14:33:34 -06:00
Dmitriy Ryajov	0959877b29	Connection limits (#384 ) * master merge * wip * avoid deadlocks * tcp limits * expose client field in chronosstream * limit incoming connections * update with new listen api * fix release * don't override peerinfo in connection * rework transport with accept * use semaphore to track resource usage * rework with new transport accept api * move events to conn manager (#373) * use semaphore to track resource usage * merge master * expose api to acquire conn slots * don't fail expensive metrics * allow tracking and updating connections * set global connection limits to 80 * add per peer connection limits * make sure conn is closed if tracking failed * more descriptive naming for handle * rework with new transport accept api * add `getStream` hide `selectConn` * add TransportClosedError * make nil explicit * don't make unnecessary copies of message * logging * error handling * cleanup semaphore * track connections properly * throw `TooManyConnections` when tracking outgoing * use proper exception and handle conventions * check onCloseHandle for nil * revert internalConnect changes * adding upgraded flag * await stream before closing * simplify tracking * wip * logging * split connection limits into incoming and outgoing * further streamline connection limits split counts * don't use closeWithEOF * move peer and conn event triggers from switch * wip * wip * wip * merge master * handle nil connections properly * add clarifying comment * don't raise exc on nil * no finally * add proper min/max connections logic * rebase master * merge master * master merge * remove request timeout should be addressed in separate PR * merge master * share semaphore when in/out limits aren't enforced * merge master * use import * pass semaphore to trackConn * don't close last conn * use storeConn * merge master * use storeConn	2021-01-20 22:00:24 -06:00
Giovanni Petrantoni	240ec84ffb	Gossipsub wip (#502 ) * Remove unused connections in pubsubpeer, also removed wrong usages, add a disconnect bad peers parameter * handle exceptions in disconnectPeer * small fix * use the proper disconnection procedure for gossip peers * fixes, more metrics add test about disconnection * hot fix possible null pointers in switch * silly isnil sugar * Fix and test gossip directPeer connections	2021-01-15 13:48:03 +09:00
Giovanni Petrantoni	dc48170b0d	Gossip subscription improvements (#497 ) * salt ids in seen table * add subscription validation callback and avoid processing topics we don't care of * apply penalty on bad subscription * fix IHave handling IDs * reduce indenting, add some comments * fix gossip randombytes generation * do not descore unwanted topics (might happen, due to timing, needs improvements) * cleaning up and added tests * validate subscriptions only when subscribing * set notice level for failed publish * fix floodsub behavior	2021-01-13 23:49:44 +09:00
Giovanni Petrantoni	4858e0ab15	Gossipsub refactor pt2 (#495 ) * add sub/unsub test * remove unused variable from gossip	2020-12-20 00:45:34 +09:00
Giovanni Petrantoni	05e789a34f	Gossipsub refactor (#490 ) * refactor peerStats, re-enable scores for testing * remove gossip 1.0 * cleanup * codecov matrix fixes * restore previous score on onNewPeer * fix coverage n checks * unsubscribeAll gossipsub fixes * refactor unsub/sub * refactor onNewPeer and fix score flow * disable scores by default (change in tests later) * fix tests, enable scores in tests * fix wrongly merged test * ensure topic removal from topics table * small typo fix * testinterop fixes	2020-12-19 15:43:32 +01:00
Giovanni Petrantoni	f8f0bc1bd8	Gossip improvements (#476 ) * add more traces, remove async from rebalance * more traces * avoid computing scores when weight is 0.0 * debug colocation, fix an indent in unsubpeer (minor) * add full ValidationResult coverage * store in cache only after validation * gossip 1.0 fixes * fix typo * gossip 10 internal test fixes * test fixing * refactor peerstats usages * populate tables if missing when scoring	2020-12-15 10:25:22 +09:00
Dmitriy Ryajov	4224f12503	fix memory safety issue in tests (#484 )	2020-12-14 15:22:53 -06:00
Giovanni Petrantoni	93b6c4dc52	Gossip runtime params (#437 ) * move gossip parameters to runtime * internal test fixes * add missing params * restore const parameters as solid base and use them in init * more constants tuning	2020-11-19 16:48:17 +09:00
Dmitriy Ryajov	55b763264e	Cleanup tests (#435 ) * add async testing methods * refactor with async testing methods * use iffy in async tests	2020-11-12 21:44:02 -06:00
Giovanni Petrantoni	75b023c9e5	gossipsub audit fixes (#412 ) * [SEC] gossipsub - rebalanceMesh grafts peers giving preference to low scores #405 * comment score choices * compiler warning fixes/bug fixes (unsubscribe) * rebalanceMesh does not enforce D_out quota * fix outbound grafting * fight the nim compiler * fix closure capture bs... * another closure fix * #403 rebalance prune fixes * more test fixing * #403 fixes * #402 avoid removing scores on unsub * #401 handleGraft improvements * [SEC] handleIHAVE/handleIWANT recommendations * add a note about peer exchange handling	2020-10-30 21:49:54 +09:00
Giovanni Petrantoni	556213abf4	Extended validators (#395 ) * gossip extended validation * fix flood tests * fix gossip 1.0 tests * syntax consistency	2020-10-12 16:56:00 +09:00
Giovanni Petrantoni	98d82fce5c	fix opportunistic graft in internal 11 testing (#390 )	2020-10-05 11:35:03 +09:00
Giovanni Petrantoni	98d0cc3a16	defaultMsgIdProvider alternative/test anonymize (#379 ) * defaultMsgIdProvider alternative/test anonymize * avoid freeze during flood tests * avoid `empty message, skipping` situation * test observers * avoid double initPubSub * fix gossip testing (specially when anonymize is on) * make azure tests shorter	2020-09-28 09:11:18 +02:00
Giovanni Petrantoni	ec322124ac	allow to omit peerId and seqno (#372 ) * allow to omit peerId and seqno * small refactor * wip * fix message encoding * improve rpc signature logic * remove peerid from verify * trace fixes * fix message test * fix gossip 1.0 tests	2020-09-23 17:56:33 +02:00
Jacek Sieka	471e5906f6	fix gossipsub memory leak on disconnected peer (#371 ) When messages can't be sent to peer, we try to establish a send connection - this causes messages to stack up as more and more unsent messages are blocked on the dial lock. * remove dial lock * run reconnection loop in background task	2020-09-22 09:05:53 +02:00
Jacek Sieka	49a12e619d	channel close race and deadlock fixes (#368 ) * channel close race and deadlock fixes * remove send lock, write chunks in one go * push some of half-closed implementation to BufferStream * fix some hangs where LPChannel readers and writers would not always wake up * simplify lazy channels * fix close happening more than once in some orderings * reenable connection tracking tests * close channels first on mplex close such that consumers can read bytes A notable difference is that BufferedStream is no longer considered EOF until someone has actually read the EOF marker. * docs, simplification	2020-09-21 19:48:19 +02:00
Giovanni Petrantoni	b99d2039a8	Gossip one one (#240 ) * allow multiple codecs per protocol (without breaking things) * add 1.1 protocol to gossip * explicit peering part 1 * explicit peering part 2 * explicit peering part 3 * PeerInfo and ControlPrune protocols * fix encodePrune * validated always, even explicit peers * prune by score (score is stub still) * add a way to pass parameters to gossip * standard setup fixes * take into account explicit direct peers in publish * add floodPublish logic * small fixes, publish still half broken * make sure to waitsub in sparse test * use var semantics to optimize table access * wip... lvalues don't work properly sadly... * big publish refactor, replenish and balance * fix internal tests * use g.peers for fanout (todo: don't include flood peers) * exclude non gossip from fanout * internal test fixes * fix flood tests * fix test's trypublish * test interop fixes * make sure to not remove peers from gossip table * restore old replenishFanout * cleanups * restore utility module import * restore trace vs debug in gossip * improve fanout replenish behavior further * triage publish nil peers (issue is on master too but just hidden behind a if/in) * getGossipPeers fixes * remove topics from pubsubpeer (was unused) * simplify rebalanceMesh (following spec) and make it finally reach D_high * better diagnostics * merge new pubsubpeer, copy 1.1 to new module * fix up merge * conditional enable gossip11 module * add back topics in peers, re-enable flood publish * add more heartbeat locking to prevent races * actually lock the heartbeat * minor fixes * with sugar * merge 1.0 * remove assertion in publish * fix multistream 1.1 multi proto * Fix merge oops * wip * fix gossip 11 upstream * gossipsub11 -> gossipsub * support interop testing * tests fixing * fix directchat build * control prune updates (pb) * wip parameters * gossip internal tests fixes * parameters wip * finishup with params * cleanups/wip * small sugar * grafted and pruned procs * wip updateScores * wip * fix logging issue * pubsubpeer, chronicles explicit override * fix internal gossip tests * wip * tables troubleshooting * score wip * score wip * fixes * fix test utils generateNodes * don't delete while iterating in score update * fix grafted defect * add a handleConnect in subscribeTopic * pruning improvements * wip * score fixes * post merge - builds gossip tests * further merge fixes * rebalance improvements and opportunistic grafting * fix test for now * restore explicit peering * implement peer exchange graft message * add an hard cap to PX * backoff time management * IWANT cap/budget * Adaptive gossip dissemination * outbound mesh quota, internal tests fixing * oversub prune score based, finish outbound quota * finishup with score and ihave budget * use go daemon 0.3.0 * import fixes * byScore cleanup score sorting * remove pointless scaling in `/` Duration operator * revert using libp2p org for daemon * interop fixes * fixes and cleanup * remove heartbeat assertion, minor debug fixes * logging improvements and cleaning up * (to revert) add some traces * add explicit topic to gossip rpcs * pubsub merge fixes and type fix in switch * Revert "(to revert) add some traces" This reverts commit 4663eaab6cc336c81cee50bc54025cf0b7bcbd99. * cleanup some now irrelevant todo * shuffle peers anyway as score might be disabled * add missing shuffle * old merge fix * more merge fixes * debug improvements * re-enable gossip internal tests * add gossip10 fallback (dormant but tested) * split gossipsub internal tests into 1.0 and 1.1 Co-authored-by: Dmitriy Ryajov <dryajov@gmail.com>	2020-09-21 11:16:29 +02:00
Dmitriy Ryajov	b0d86b95dd	add peer lifecycle events (#357 ) * add peer lifecycle events * rework peer events to not use connection events * don't use result in pubsub and switch init * wip * use ordered hashes and remove logscope * logging * add missing test * small fixes	2020-09-15 14:19:22 -06:00
Jacek Sieka	63b38734bd	fix poor performance in LRU cache (#360 ) it turns out (in NBC) a heap is sufficiently slow because of all the deletes that it makes more sense to go with a linked list	2020-09-09 18:28:46 +02:00
Jacek Sieka	c1856fda53	simplify and unify logging (#353 ) * use short format for logging peerid * log peerid:oid for connections	2020-09-06 10:31:47 +02:00
Jacek Sieka	5819c6a9a7	gossipsub / floodsub fixes (#348 ) * mcache fixes * remove timed cache - the window shifting already removes old messages * ref -> object * avoid unnecessary allocations with `[]` operator * simplify init * fix several gossipsub/floodsub issues * floodsub, gossipsub: don't rebroadcast messages that fail validation (!) * floodsub, gossipsub: don't crash when unsubscribing from unknown topics (!) * gossipsub: don't send message to peers that are not interested in the topic, when messages don't share topic list * floodsub: don't repeat all messages for each message when rebroadcasting * floodsub: allow sending empty data * floodsub: fix inefficient unsubscribe * sync floodsub/gossipsub logging * gossipsub: include incoming messages in mcache (!) * gossipsub: don't rebroadcast already-seen messages (!) * pubsubpeer: remove incoming/outgoing seen caches - these are already handled in gossipsub, floodsub and will cause trouble when peers try to resubscribe / regraft topics (because control messages will have same digest) * timedcache: reimplement without timers (fixes timer leaks and extreme inefficiency due to per-message closures, futures etc) * timedcache: ref -> obj	2020-09-04 08:10:32 +02:00
Jacek Sieka	cd1c68dbc5	avoid send deadlock by not allowing send to block (#342 ) * avoid send deadlock by not allowing send to block * handle message issues more consistently	2020-09-01 09:33:03 +02:00
Jacek Sieka	f46bf0faa4	remove send lock (#334 ) * remove send lock When mplex receives data it will block until a reader has processed the data. Thus, when a large message is received, such as a gossipsub subscription table, all of mplex will be blocked until all reading is finished. However, if at the same time a `dial` to establish a gossipsub send connection is ongoing, that `dial` will be blocked because mplex is no longer reading data - specifically, it might indeed be the connection that's processing the previous data that is waiting for a send connection. There are other problems with the current code: * If an exception is raised, it is not necessarily raised for the same connection as `p.sendConn`, so resetting `p.sendConn` in the exception handling is wrong * `p.isConnected` is checked before taking the lock - thus, if it returns false, a new dial will be started. If a new task enters `send` before dial is finished, it will also determine `p.isConnected` is false, then get stuck on the lock - when the previous task finishes and releases the lock, the new task will _also_ dial and thus reset `p.sendConn` causing a leak. * prefer existing connection simplifies flow	2020-08-17 12:38:27 +02:00
Dmitriy Ryajov	b76b3e0e9b	Rework pubsub (#322 ) * move pubsub off of switch, pass switch into pubsub * use join on lpstreams * properly cleanup up failed peers * fix tests * fix peertable hasPeerId * fix tests * rework sending, remove helpers from pubsubpeer, unify in broadcast * further split broadcast into send * use send where appropriate * use formatIt * improve trace Co-authored-by: Giovanni Petrantoni <giovanni@fragcolor.xyz>	2020-08-11 18:05:49 -06:00
Dmitriy Ryajov	980764774e	pubsub timeouts tuning (#295 ) * add fine-grained timeouts to pubsub * use 10 millis timeout in tests * finalization * revert timeouts * use `atEof` for reads * adjust timeouts and use atEof for reads * use atEof for reads * set isEof flag * no backoff for pubsub streams * temp timer increase, make macos finalize * don't call `subscribePeer` in libp2p anymore * more traces * leak tests * lower timeouts * handle exceptions in control message * don't use `cancelAndWait` * handle exceptions in helpers * wip * don't send empty messages * check for leaks properly * don't use cancelAndWait * don't await subscription sends * remove subscribePeer calls from switch * trying without the hooks again	2020-08-02 23:20:11 -06:00
Dmitriy Ryajov	f7fdf31365	Pubsub lifetime (#284 ) * lifecycle hooks * tests * move trace after closed check * restore 1 second heartbeat * await close event * fix tests * print direction string * more trace logging * add pubsub monitor * add log scope * adjust idle timeout * add exc.msg to trace	2020-07-27 13:33:51 -06:00
Giovanni Petrantoni	c3af7659b0	Add more checks and fix some issues in gossip tests (#281 )	2020-07-20 15:55:00 +09:00
Dmitriy Ryajov	f35b8999b3	some light cleanup for pub/gossip sub (#273 ) * move peer table out to its own file * move peer table * cleanup `==` and add one to peerinfo * add peertable * missed equality check	2020-07-15 13:18:55 -06:00
Giovanni Petrantoni	d7bab37119	Fix gossip messages seqno according to spec (#253 ) * Fix gossip messages seqno according to spec * Add peers back to gossipsub table, slow down heartbeat * Revert "Add peers back to gossipsub table, slow down heartbeat" This reverts commit 01e2e62172a7793bb17f0eb8314e2faeb2682173. * make seqno a threadvar, remove from peerinfo * seqno refactor, into pubsub	2020-07-14 21:51:33 -06:00
Ștefan Talpalaru	b8b0a2b4bc	CI: build binaries with TRACE & JSON logs (#268 ) Also: remove unused imports.	2020-07-14 02:02:16 +02:00
Giovanni Petrantoni	fcda0f6ce1	PubSubPeer tables refactor (#263 ) * refactor peer tables * tests fixing * override PubSubPeer equality * fix pubsubpeer comparison	2020-07-13 15:32:38 +02:00
Dmitriy Ryajov	4c815d75e7	More gossip cleanup (#257 ) * more cleanup * correct pubsub peer count * close the stream first * handle cancelation * fix tests * fix fanout ttl * merging master * remove `withLock` as it conflicts with stdlib * fix trace build Co-authored-by: Giovanni Petrantoni <giovanni@fragcolor.xyz>	2020-07-09 14:21:47 -06:00
Jacek Sieka	c720e042fc	clean up mesh handling logic (#260 ) * gossipsub is a function of subscription messages only * graft/prune work with mesh, get filled up from gossipsub * fix race conditions with await * fix exception unsafety when grafting/pruning * fix allowing up to DHi peers in mesh on incoming graft * fix metrics in several places	2020-07-09 11:16:46 -06:00
Dmitriy Ryajov	a52763cc6d	fix publishing (#250 ) * use var semantics to optimize table access * wip... lvalues don't work properly sadly... * big publish refactor, replenish and balance * fix internal tests * use g.peers for fanout (todo: don't include flood peers) * exclude non gossip from fanout * internal test fixes * fix flood tests * fix test's trypublish * test interop fixes * make sure to not remove peers from gossip table * restore old replenishFanout * cleanups * Cleanup resources (#246) * consolidate reading in lpstream * remove debug echo * tune log level * add channel cleanup and cancelation handling * cancelation handling * cancelation handling * cancelation handling * cancelation handling * cleanup and cancelation handling * cancelation handling * cancelation * tests * rename isConnected to connected * remove testing trace * comment out debug stacktraces * explicit raises * restore trace vs debug in gossip * improve fanout replenish behavior further * cleanup stale peers more eagerly * synchronize connection cleanup and small refactor * close client first and call parent second * disconnect failed peers on publish * check for publish result * fix tests * fix tests * always call close Co-authored-by: Giovanni Petrantoni <giovanni@fragcolor.xyz>	2020-07-07 18:33:05 -06:00
Jacek Sieka	d522537b19	reuse single RNG instance for all crypto key generation (#249 ) * reuse single RNG instance for all crypto key generation * use foolproof rng * initRng -> newRng (because it's ref) * fix test * imports/exports, chat fix * fix rsa * imports and exports * work around threadvar issue * fixup * mac workaround test	2020-07-07 13:14:11 +02:00
Giovanni Petrantoni	ec00c7fc50	Peer resultification and defect only (#245 ) * Peer resultification and defect only * Fixing some tests * test fixes * Rename peer into peerid * better result error message in identify * further merge fixes	2020-07-01 08:25:09 +02:00
Jacek Sieka	aa6756dfe0	allow message id provider to be specified (#243 ) * don't send public key in message when not signing (information leak) * don't run rebalance if there are peers in gossip (see #242) * don't crash randomly on bad peer id from remote	2020-06-28 09:56:38 -06:00

90 Commits