nim-libp2p

Commit Graph

Author	SHA1	Message	Date
Giovanni Petrantoni	4a98a8af5a	gossip pruning fixes related to #371 (#385 ) * gossip pruning fixes related to #371 * better trace for grafted/pruned * shorted azure testing again	2020-10-02 13:09:31 +09:00
Jacek Sieka	17e00e642a	limit write queue length (#376 ) To break a potential read/write deadlock, gossipsub uses an unbounded queue for writes - when peers are too slow to process this queue, it may end up growing without bounds causing high memory usage. Here, we introduce a maximum write queue length after which the peer is disconnected - the queue is generous enough that any "normal" usage should be fine - writes that are `await`:ed are not affected, only writes that are launched in an `asyncSpawn` task or similar. * avoid unnecessary copy of message when there are no send observers * release message memory earlier in gossipsub * simplify pubsubpeer logging	2020-09-24 18:43:20 +02:00
Giovanni Petrantoni	ec322124ac	allow to omit peerId and seqno (#372 ) * allow to omit peerId and seqno * small refactor * wip * fix message encoding * improve rpc signature logic * remove peerid from verify * trace fixes * fix message test * fix gossip 1.0 tests	2020-09-23 17:56:33 +02:00
Jacek Sieka	471e5906f6	fix gossipsub memory leak on disconnected peer (#371 ) When messages can't be sent to peer, we try to establish a send connection - this causes messages to stack up as more and more unsent messages are blocked on the dial lock. * remove dial lock * run reconnection loop in background task	2020-09-22 09:05:53 +02:00
Giovanni Petrantoni	b99d2039a8	Gossip one one (#240 ) * allow multiple codecs per protocol (without breaking things) * add 1.1 protocol to gossip * explicit peering part 1 * explicit peering part 2 * explicit peering part 3 * PeerInfo and ControlPrune protocols * fix encodePrune * validated always, even explicit peers * prune by score (score is stub still) * add a way to pass parameters to gossip * standard setup fixes * take into account explicit direct peers in publish * add floodPublish logic * small fixes, publish still half broken * make sure to waitsub in sparse test * use var semantics to optimize table access * wip... lvalues don't work properly sadly... * big publish refactor, replenish and balance * fix internal tests * use g.peers for fanout (todo: don't include flood peers) * exclude non gossip from fanout * internal test fixes * fix flood tests * fix test's trypublish * test interop fixes * make sure to not remove peers from gossip table * restore old replenishFanout * cleanups * restore utility module import * restore trace vs debug in gossip * improve fanout replenish behavior further * triage publish nil peers (issue is on master too but just hidden behind a if/in) * getGossipPeers fixes * remove topics from pubsubpeer (was unused) * simplify rebalanceMesh (following spec) and make it finally reach D_high * better diagnostics * merge new pubsubpeer, copy 1.1 to new module * fix up merge * conditional enable gossip11 module * add back topics in peers, re-enable flood publish * add more heartbeat locking to prevent races * actually lock the heartbeat * minor fixes * with sugar * merge 1.0 * remove assertion in publish * fix multistream 1.1 multi proto * Fix merge oops * wip * fix gossip 11 upstream * gossipsub11 -> gossipsub * support interop testing * tests fixing * fix directchat build * control prune updates (pb) * wip parameters * gossip internal tests fixes * parameters wip * finishup with params * cleanups/wip * small sugar * grafted and pruned procs * wip updateScores * wip * fix logging issue * pubsubpeer, chronicles explicit override * fix internal gossip tests * wip * tables troubleshooting * score wip * score wip * fixes * fix test utils generateNodes * don't delete while iterating in score update * fix grafted defect * add a handleConnect in subscribeTopic * pruning improvements * wip * score fixes * post merge - builds gossip tests * further merge fixes * rebalance improvements and opportunistic grafting * fix test for now * restore explicit peering * implement peer exchange graft message * add an hard cap to PX * backoff time management * IWANT cap/budget * Adaptive gossip dissemination * outbound mesh quota, internal tests fixing * oversub prune score based, finish outbound quota * finishup with score and ihave budget * use go daemon 0.3.0 * import fixes * byScore cleanup score sorting * remove pointless scaling in `/` Duration operator * revert using libp2p org for daemon * interop fixes * fixes and cleanup * remove heartbeat assertion, minor debug fixes * logging improvements and cleaning up * (to revert) add some traces * add explicit topic to gossip rpcs * pubsub merge fixes and type fix in switch * Revert "(to revert) add some traces" This reverts commit 4663eaab6cc336c81cee50bc54025cf0b7bcbd99. * cleanup some now irrelevant todo * shuffle peers anyway as score might be disabled * add missing shuffle * old merge fix * more merge fixes * debug improvements * re-enable gossip internal tests * add gossip10 fallback (dormant but tested) * split gossipsub internal tests into 1.0 and 1.1 Co-authored-by: Dmitriy Ryajov <dryajov@gmail.com>	2020-09-21 11:16:29 +02:00
Jacek Sieka	c1856fda53	simplify and unify logging (#353 ) * use short format for logging peerid * log peerid:oid for connections	2020-09-06 10:31:47 +02:00
Jacek Sieka	9b815efe8f	gossipsub: don't subscribe to floodsub also (#352 )	2020-09-04 22:53:03 +02:00
Jacek Sieka	6d91d61844	small cleanups & docs (#347 ) * simplify gossipsub heartbeat start / stop * avoid alloc in peerid check * stop iterating over seq after unsubscribing item (could crash) * don't crash on missing private key with enabled sigs (shouldn't happen but...)	2020-09-04 18:31:43 +02:00
Eugene Kabanov	0b85192119	Remove asyncCheck from codebase. (#345 ) * Remove asyncCheck from codebase. * Replace all `discard` statements with new `asyncSpawn`. * Bump `nim-chronos` requirement.	2020-09-04 18:30:45 +02:00
Jacek Sieka	5819c6a9a7	gossipsub / floodsub fixes (#348 ) * mcache fixes * remove timed cache - the window shifting already removes old messages * ref -> object * avoid unnecessary allocations with `[]` operator * simplify init * fix several gossipsub/floodsub issues * floodsub, gossipsub: don't rebroadcast messages that fail validation (!) * floodsub, gossipsub: don't crash when unsubscribing from unknown topics (!) * gossipsub: don't send message to peers that are not interested in the topic, when messages don't share topic list * floodsub: don't repeat all messages for each message when rebroadcasting * floodsub: allow sending empty data * floodsub: fix inefficient unsubscribe * sync floodsub/gossipsub logging * gossipsub: include incoming messages in mcache (!) * gossipsub: don't rebroadcast already-seen messages (!) * pubsubpeer: remove incoming/outgoing seen caches - these are already handled in gossipsub, floodsub and will cause trouble when peers try to resubscribe / regraft topics (because control messages will have same digest) * timedcache: reimplement without timers (fixes timer leaks and extreme inefficiency due to per-message closures, futures etc) * timedcache: ref -> obj	2020-09-04 08:10:32 +02:00
Jacek Sieka	cd1c68dbc5	avoid send deadlock by not allowing send to block (#342 ) * avoid send deadlock by not allowing send to block * handle message issues more consistently	2020-09-01 09:33:03 +02:00
Dmitriy Ryajov	d3182c4dba	No raise send (#339 ) * dont raise in send * check that the lock is acquire on release	2020-08-20 20:50:33 -06:00
Jacek Sieka	b12145dff7	avoid crash when subscribe is received (#333 ) ...by making subscribeTopic synchronous, avoiding a peer table lookup completely. rebalanceMesh will be called a second later - it's fine	2020-08-17 12:10:22 +02:00
Jacek Sieka	ab864fc747	logging cleanups and small fixes (#331 )	2020-08-15 21:50:31 +02:00
Dmitriy Ryajov	b76b3e0e9b	Rework pubsub (#322 ) * move pubsub of off switch, pass switch into pubsub * use join on lpstreams * properly cleanup up failed peers * fix tests * fix peertable hasPeerId * fix tests * rework sending, remove helpers from pubsubpeer, unify in broadcast * further split broadcast into send * use send where appropriate * use formatIt * improve trace Co-authored-by: Giovanni Petrantoni <giovanni@fragcolor.xyz>	2020-08-11 18:05:49 -06:00
Giovanni Petrantoni	9bbe5e4841	Fix subclass calls to handleDisconnect (#314 ) * Fix subclass calls to handleDisconnect * add peer ID to nil peer debug message	2020-08-06 11:12:52 +09:00
Giovanni Petrantoni	5c986cf657	Fix build, add some raises (#315 ) * Fix build, add some raises * wip * wip more raises * missing exc object in mplex * proper lifetime for subscribePeer Co-authored-by: Dmitriy Ryajov <dryajov@gmail.com>	2020-08-05 19:30:57 -06:00
Ștefan Talpalaru	bd5d43874a	more expensive metrics (#312 )	2020-08-05 14:02:26 +02:00
Ștefan Talpalaru	843d32f8db	put expensive metrics under a Nim define (#310 )	2020-08-04 17:27:59 -06:00
Dmitriy Ryajov	980764774e	pubsub timeouts tuning (#295 ) * add finegrained timeouts to pubsub * use 10 millis timeout in tests * finalization * revert timeouts * use `atEof` for reads * adjust timeouts and use atEof for reads * use atEof for reads * set isEof flag * no backoff for pubsub streams * temp timer increase, make macos finalize * don't call `subscribePeer` in libp2p anymore * more traces * leak tests * lower timeouts * handle exceptions in control message * don't use `cancelAndWait` * handle exceptions in helpers * wip * don't send empty messages * check for leaks properly * don't use cancelAndWait * don't await subscribption sends * remove subscrivePeer calls from switch * trying without the hooks again	2020-08-02 23:20:11 -06:00
Jacek Sieka	e655a510cd	misc cleanups (#303 )	2020-08-02 12:22:49 +02:00
Dmitriy Ryajov	f7fdf31365	Pubsub lifetime (#284 ) * lifecycle hooks * tests * move trace after closed check * restore 1 second heartbeat * await close event * fix tests * print direction string * more trace logging * add pubsub monitor * add log scope * adjust idle timeout * add exc.msg to trace	2020-07-27 13:33:51 -06:00
Giovanni Petrantoni	3b088f8980	Fix some unsubscribe issues and add unsubscribeAll helper (#282 ) * Fix some unsub issues and add unsuball helper * batch sendprune in unsubscribe methods * add unsubscribeAll for floodsub	2020-07-20 10:16:13 -06:00
Dmitriy Ryajov	94196fee71	Connections and pubsub peers cleanup (#279 ) * better peer tracking and cleanup * check if peer and conn is nil * test name * make timeout more agressive * rename method for better clarity	2020-07-17 13:46:24 -06:00
Jacek Sieka	170685f9c6	gossipsub fixes (#276 ) * graft up to D peers * fix logging so it's clear who is grafting/pruning who * clear fanout when grafting	2020-07-16 21:26:57 +02:00
Jacek Sieka	c76152f2c1	Simplify send (#271 ) * PubSubPeer.send single message * gossipsub: simplify send further	2020-07-16 12:06:57 +02:00
Dmitriy Ryajov	f35b8999b3	some light cleanup for pub/gossip sub (#273 ) * move peer table out to its own file * move peer table * cleanup `==` and add one to peerinfo * add peertable * missed equality check	2020-07-15 13:18:55 -06:00
Giovanni Petrantoni	d7bab37119	Fix gossip messages seqno according to spec (#253 ) * Fix gossip messages seqno according to spec * Add peers back to gossipsub table, slow down heartbeat * Revert "Add peers back to gossipsub table, slow down heartbeat" This reverts commit 01e2e62172a7793bb17f0eb8314e2faeb2682173. * make seqno a threadvar, remove from peerinfo * seqno refactor, into pubsub	2020-07-14 21:51:33 -06:00
Jacek Sieka	c6c2d99907	one more log fix	2020-07-13 20:19:20 +02:00
Jacek Sieka	76853f064a	json logging again	2020-07-13 19:59:49 +02:00
Jacek Sieka	6620b7a00b	more comment fixes	2020-07-13 19:30:18 +02:00
Jacek Sieka	0d4c74b33a	comment log that can't be json-serialized	2020-07-13 18:36:49 +02:00
Jacek Sieka	061c54d3c6	logging fixes	2020-07-13 17:26:05 +02:00
Giovanni Petrantoni	fcda0f6ce1	PubSubPeer tables refactor (#263 ) * refactor peer tables * tests fixing * override PubSubPeer equality * fix pubsubpeer comparison	2020-07-13 15:32:38 +02:00
Dmitriy Ryajov	bec9a0658f	Cleanup rpc handler (#261 ) * more cleanup * fix tests * merging master * remove `withLock` as it conflicts with stdlib * wip * more fanout ttl Co-authored-by: Giovanni Petrantoni <giovanni@fragcolor.xyz>	2020-07-09 17:54:16 -06:00
Dmitriy Ryajov	4c815d75e7	More gossip cleanup (#257 ) * more cleanup * correct pubsub peer count * close the stream first * handle cancelation * fix tests * fix fanout ttl * merging master * remove `withLock` as it conflicts with stdlib * fix trace build Co-authored-by: Giovanni Petrantoni <giovanni@fragcolor.xyz>	2020-07-09 14:21:47 -06:00
Jacek Sieka	c720e042fc	clean up mesh handling logic (#260 ) * gossipsub is a function of subscription messages only * graft/prune work with mesh, get filled up from gossipsub * fix race conditions with await * fix exception unsafety when grafting/pruning * fix allowing up to DHi peers in mesh on incoming graft * fix metrics in several places	2020-07-09 11:16:46 -06:00
Giovanni Petrantoni	4e12d0d97a	nil check peer before disconnect	2020-07-09 17:20:45 +09:00
Giovanni Petrantoni	4bcb567d47	fix gossip tests	2020-07-09 12:34:36 +09:00
Giovanni Petrantoni	fec507e755	Add peers back to gossipsub table, slow down heartbeat (#256 ) * Add peers back to gossipsub table, slow down heartbeat * exclude on unsub from mesh and fanout	2020-07-08 11:06:26 -06:00
Dmitriy Ryajov	a52763cc6d	fix publishing (#250 ) * use var semantics to optimize table access * wip... lvalues don't work properly sadly... * big publish refactor, replenish and balance * fix internal tests * use g.peers for fanout (todo: don't include flood peers) * exclude non gossip from fanout * internal test fixes * fix flood tests * fix test's trypublish * test interop fixes * make sure to not remove peers from gossip table * restore old replenishFanout * cleanups * Cleanup resources (#246) * consolidate reading in lpstream * remove debug echo * tune log level * add channel cleanup and cancelation handling * cancelation handling * cancelation handling * cancelation handling * cancelation handling * cleanup and cancelation handling * cancelation handling * cancelation * tests * rename isConnected to connected * remove testing trace * comment out debug stacktraces * explicit raises * restore trace vs debug in gossip * improve fanout replenish behavior further * cleanup stale peers more eaguerly * synchronize connection cleanup and small refactor * close client first and call parent second * disconnect failed peers on publish * check for publish result * fix tests * fix tests * always call close Co-authored-by: Giovanni Petrantoni <giovanni@fragcolor.xyz>	2020-07-07 18:33:05 -06:00
Giovanni Petrantoni	ec00c7fc50	Peer resultification and defect only (#245 ) * Peer resultification and defect only * Fixing some tests * test fixes * Rename peer into peerid * better result error message in identify * further merge fixes	2020-07-01 08:25:09 +02:00
Dmitriy Ryajov	c788a6a3c0	Cleanup resources (#246 ) * consolidate reading in lpstream * remove debug echo * tune log level * add channel cleanup and cancelation handling * cancelation handling * cancelation handling * cancelation handling * cancelation handling * cleanup and cancelation handling * cancelation handling * cancelation * tests * rename isConnected to connected * remove testing trace * comment out debug stacktraces * explicit raises	2020-06-29 09:15:31 -06:00
Jacek Sieka	aa6756dfe0	allow message id provider to be specified (#243 ) * don't send public key in message when not signing (information leak) * don't run rebalance if there are peers in gossip (see #242) * don't crash randomly on bad peer id from remote	2020-06-28 09:56:38 -06:00
Giovanni Petrantoni	7852c6dd0f	Noise and eth2/nbc fixes (#226 ) * Remove noise padding payload (spec removed it) * add log scope in secure * avoid defect array out of range in switch secure when "na" * improve identify traces * wip noise fixes * noise protobuf adjustments (trying) * add more debugging messages/traces, improve their actual contents * re-enable ID check in noise * bump go daemon tag version * bump go daemon tag version * enable noise in daemonapi * interop testing, (both secio and noise will be tested) * azure cache bump (p2pd) * CI changes - Travis: use Go 1.14 - azure-pipelines.yml: big cleanup - Azure: bump cache keys - build 64-bit p2pd on 32-bit Windows - install both Mingw-w64 architectures * noise logging fixes * alternate testing between noise and secio * increase timeout to avoid VM errors in CI (multistream tests) * refactor heartbeat management in gossipsub * remove locking within heartbeat * refactor heartbeat management in gossipsub * remove locking within heartbeat Co-authored-by: Ștefan Talpalaru <stefantalpalaru@yahoo.com>	2020-06-20 19:56:55 +09:00
Dmitriy Ryajov	7a1c1c2ea6	fixing some key not found exceptions (#231 )	2020-06-19 15:19:07 -06:00
Dmitriy Ryajov	5b28e8c488	Cleanup lpstream, Connection and BufferStream (#228 ) * count published messages * don't call `switch.dial` in `subscribeToPeer` * don't use delegation in connection * move connection out to own file * don't breakout on reset * make sure to call close on secured conn * add lpstream tracing * don't breackdown by conn id * fix import * remove unused lable * reset connection on exception * add additional metrics for skipped messages * check for nil in secure.close	2020-06-19 11:29:43 -06:00
Dmitriy Ryajov	719744f46a	Small fixes (#230 ) * count published messages * don't call `switch.dial` in `subscribeToPeer` * make sure sending doesn't fail * add `contains` * review comment from prev pr	2020-06-19 11:29:25 -06:00
Dmitriy Ryajov	fe828d87d8	count published messages (#224 )	2020-06-16 22:14:02 -06:00
Dmitriy Ryajov	7cb6c81159	Don't modify iterables while iterating them (#219 ) * don't modify iterables while iterating * assert handlers to properly close connections	2020-06-15 12:30:09 -06:00

1 2

75 Commits