nim-libp2p

Commit Graph

Author	SHA1	Message	Date
Dmitriy Ryajov	034a1e8b1b	small cleanups from tcp-limits2 (#446 )	2020-11-23 15:02:23 -06:00
Giovanni Petrantoni	93b6c4dc52	Gossip runtime params (#437 ) * move gossip parameters to runtime * internal test fixes * add missing params * restore const parameters as solid base and use them in init * more constants tuning	2020-11-19 16:48:17 +09:00
Giovanni Petrantoni	a9948b0b05	clarify validation messages (#431 ) * clarify validation messages * add codecov threshold	2020-11-12 01:42:12 +09:00
Dmitriy Ryajov	90921bff09	move some importance trace logs to debug (#428 )	2020-11-09 22:14:46 -06:00
Giovanni Petrantoni	e496802943	Least expensive metrics (#421 ) * add more general and useful metrics * fix gossipsub peers metrics in heartbeat	2020-11-04 15:18:00 +01:00
Jacek Sieka	03639f1446	Revert "Channel leaks (#413 )" (#417 ) This reverts commit `1de1d49223`.	2020-11-01 14:49:25 -06:00
Giovanni Petrantoni	75b023c9e5	gossipsub audit fixes (#412 ) * [SEC] gossipsub - rebalanceMesh grafts peers giving preference to low scores #405 * comment score choices * compiler warning fixes/bug fixes (unsubscribe) * rebalanceMesh does not enforce D_out quota * fix outbound grafting * fight the nim compiler * fix closure capture bs... * another closure fix * #403 rebalance prune fixes * more test fixing * #403 fixes * #402 avoid removing scores on unsub * #401 handleGraft improvements * [SEC] handleIHAVE/handleIWANT recommendations * add a note about peer exchange handling	2020-10-30 21:49:54 +09:00
Dmitriy Ryajov	1de1d49223	Channel leaks (#413 ) * break stream tracking by type * use closeWithEOF to await wrapped stream * fix cancelation leaks * fix channel leaks * logging * use close monitor and always call closeUnderlying * don't use closeWithEOF * removing close monitor * logging	2020-10-27 11:21:03 -06:00
Giovanni Petrantoni	462da1f7a8	gossip MessageID as seq[byte] (#391 ) * gossip MessageID as seq[byte] * combine hashes in defaultMsgIdProvider * wip * fix defaultMsgIdProvider	2020-10-21 12:26:04 +09:00
Giovanni Petrantoni	27b9bf436e	fix validation according to specification (#410 )	2020-10-21 12:25:42 +09:00
Giovanni Petrantoni	556213abf4	Extended validators (#395 ) * gossip extended validation * fix flood tests * fix gossip 1.0 tests * syntax consistency	2020-10-12 16:56:00 +09:00
Giovanni Petrantoni	e3bdb9eb13	decode properly ControlPrune (#392 )	2020-10-09 09:12:38 +09:00
Giovanni Petrantoni	0f2435f551	better opportunistic grafting score (when score is disabled) (#389 )	2020-10-03 09:26:45 +09:00
Giovanni Petrantoni	4a98a8af5a	gossip pruning fixes related to #371 (#385 ) * gossip pruning fixes related to #371 * better trace for grafted/pruned * shorted azure testing again	2020-10-02 13:09:31 +09:00
Mamy Ratsimbazafy	03f5bbba6d	saner logging (#381 )	2020-09-29 09:40:06 -06:00
Giovanni Petrantoni	98d0cc3a16	defaultMsgIdProvider alternative/test anonymize (#379 ) * defaultMsgIdProvider alternative/test anonymize * avoid freeze during flood tests * avoid `empty message, skipping` situation * test observers * avoid double initPubSub * fix gossip testing (specially when anonymize is on) * make azure tests shorter	2020-09-28 09:11:18 +02:00
Jacek Sieka	8ecef46738	reencode gossipsub messages with anonymization (#378 ) This helps protect against clients sending more data than they should and thus getting penalized on topics that require anonymity	2020-09-25 18:39:34 +02:00
Jacek Sieka	17e00e642a	limit write queue length (#376 ) To break a potential read/write deadlock, gossipsub uses an unbounded queue for writes - when peers are too slow to process this queue, it may end up growing without bounds causing high memory usage. Here, we introduce a maximum write queue length after which the peer is disconnected - the queue is generous enough that any "normal" usage should be fine - writes that are `await`:ed are not affected, only writes that are launched in an `asyncSpawn` task or similar. * avoid unnecessary copy of message when there are no send observers * release message memory earlier in gossipsub * simplify pubsubpeer logging	2020-09-24 18:43:20 +02:00
Jacek Sieka	25bd0a18f4	small fixes (#374 ) * add helper to read EOF marker after closing stream (else stream stay alive until timeout/reset) * don't assert on empty channel message * don't loop when writing to chronos (no need)	2020-09-24 07:30:19 +02:00
Giovanni Petrantoni	ec322124ac	allow to omit peerId and seqno (#372 ) * allow to omit peerId and seqno * small refactor * wip * fix message encoding * improve rpc signature logic * remove peerid from verify * trace fixes * fix message test * fix gossip 1.0 tests	2020-09-23 17:56:33 +02:00
Jacek Sieka	471e5906f6	fix gossipsub memory leak on disconnected peer (#371 ) When messages can't be sent to peer, we try to establish a send connection - this causes messages to stack up as more and more unsent messages are blocked on the dial lock. * remove dial lock * run reconnection loop in background task	2020-09-22 09:05:53 +02:00
Giovanni Petrantoni	b99d2039a8	Gossip one one (#240 ) * allow multiple codecs per protocol (without breaking things) * add 1.1 protocol to gossip * explicit peering part 1 * explicit peering part 2 * explicit peering part 3 * PeerInfo and ControlPrune protocols * fix encodePrune * validated always, even explicit peers * prune by score (score is stub still) * add a way to pass parameters to gossip * standard setup fixes * take into account explicit direct peers in publish * add floodPublish logic * small fixes, publish still half broken * make sure to waitsub in sparse test * use var semantics to optimize table access * wip... lvalues don't work properly sadly... * big publish refactor, replenish and balance * fix internal tests * use g.peers for fanout (todo: don't include flood peers) * exclude non gossip from fanout * internal test fixes * fix flood tests * fix test's trypublish * test interop fixes * make sure to not remove peers from gossip table * restore old replenishFanout * cleanups * restore utility module import * restore trace vs debug in gossip * improve fanout replenish behavior further * triage publish nil peers (issue is on master too but just hidden behind a if/in) * getGossipPeers fixes * remove topics from pubsubpeer (was unused) * simplify rebalanceMesh (following spec) and make it finally reach D_high * better diagnostics * merge new pubsubpeer, copy 1.1 to new module * fix up merge * conditional enable gossip11 module * add back topics in peers, re-enable flood publish * add more heartbeat locking to prevent races * actually lock the heartbeat * minor fixes * with sugar * merge 1.0 * remove assertion in publish * fix multistream 1.1 multi proto * Fix merge oops * wip * fix gossip 11 upstream * gossipsub11 -> gossipsub * support interop testing * tests fixing * fix directchat build * control prune updates (pb) * wip parameters * gossip internal tests fixes * parameters wip * finishup with params * cleanups/wip * small sugar * grafted and pruned procs * wip updateScores * wip * fix logging issue * pubsubpeer, chronicles explicit override * fix internal gossip tests * wip * tables troubleshooting * score wip * score wip * fixes * fix test utils generateNodes * don't delete while iterating in score update * fix grafted defect * add a handleConnect in subscribeTopic * pruning improvements * wip * score fixes * post merge - builds gossip tests * further merge fixes * rebalance improvements and opportunistic grafting * fix test for now * restore explicit peering * implement peer exchange graft message * add an hard cap to PX * backoff time management * IWANT cap/budget * Adaptive gossip dissemination * outbound mesh quota, internal tests fixing * oversub prune score based, finish outbound quota * finishup with score and ihave budget * use go daemon 0.3.0 * import fixes * byScore cleanup score sorting * remove pointless scaling in `/` Duration operator * revert using libp2p org for daemon * interop fixes * fixes and cleanup * remove heartbeat assertion, minor debug fixes * logging improvements and cleaning up * (to revert) add some traces * add explicit topic to gossip rpcs * pubsub merge fixes and type fix in switch * Revert "(to revert) add some traces" This reverts commit `4663eaab6c`. * cleanup some now irrelevant todo * shuffle peers anyway as score might be disabled * add missing shuffle * old merge fix * more merge fixes * debug improvements * re-enable gossip internal tests * add gossip10 fallback (dormant but tested) * split gossipsub internal tests into 1.0 and 1.1 Co-authored-by: Dmitriy Ryajov <dryajov@gmail.com>	2020-09-21 11:16:29 +02:00
Dmitriy Ryajov	b0d86b95dd	add peer lifecycle events (#357 ) * add peer lifecycle events * rework peer events to not use connection events * don't use result in pubsub and switch init * wip * use ordered hashes and remove logscope * logging * add missing test * small fixes	2020-09-15 14:19:22 -06:00
Giovanni Petrantoni	a6007be428	avoid sending empty seqno and/or fromPeer (gossip rpc) (#364 )	2020-09-15 12:33:18 +02:00
Oskar Thorén	5e66f6fbd8	Add logScope to connmanager and pubsubprotobuf (#363 )	2020-09-15 08:03:53 +02:00
Jacek Sieka	96d4c44fec	refactor bufferstream to use a queue (#346 ) This change modifies how the backpressure algorithm in bufferstream works - in particular, instead of working byte-by-byte, it will now work seq-by-seq. When data arrives, it usually does so in packets - in the current bufferstream, the packet is read then split into bytes which are fed one by one to the bufferstream. On the reading side, the bytes are popped of the bufferstream, again byte by byte, to satisfy `readOnce` requests - this introduces a lot of synchronization traffic because the checks for full buffer and for async event handling must be done for every byte. In this PR, a queue of length 1 is used instead - this means there will at most exist one "packet" in `pushTo`, one in the queue and one in the slush buffer that is used to store incomplete reads. * avoid byte-by-byte copy to buffer, with synchronization in-between * reuse AsyncQueue synchronization logic instead of rolling own * avoid writeHandler callback - implement `write` method instead * simplify EOF signalling by only setting EOF flag in queue reader (and reset) * remove BufferStream pipes (unused) * fixes drainBuffer deadlock when drain is called from within read loop and thus blocks draining * fix lpchannel init order	2020-09-10 08:19:13 +02:00
Jacek Sieka	5b347adf58	logging fixes and small cleanups (#361 ) In particular, allow longer multistream select reads	2020-09-09 19:12:08 +02:00
Jacek Sieka	63b38734bd	fix poor performance in LRU cache (#360 ) it turns out (in NBC) a heap is sufficiently slow because of all the deletes that it makes more sense to go with a linked list	2020-09-09 18:28:46 +02:00
Jacek Sieka	c1856fda53	simplify and unify logging (#353 ) * use short format for logging peerid * log peerid:oid for connections	2020-09-06 10:31:47 +02:00
Jacek Sieka	9b815efe8f	gossipsub: don't subscribe to floodsub also (#352 )	2020-09-04 22:53:03 +02:00
Jacek Sieka	16a008db75	fix connection event order when connection dies early (#351 ) if the connection is already closed (because the remote closes during identify for example), an exception would be raised which would leave the connection in limbo, because it would not go through the rest of internalConnect. Also, if the connection is already closed, the disconnect event would be scheduled before the connect event :/	2020-09-04 20:30:26 +02:00
Jacek Sieka	6d91d61844	small cleanups & docs (#347 ) * simplify gossipsub heartbeat start / stop * avoid alloc in peerid check * stop iterating over seq after unsubscribing item (could crash) * don't crash on missing private key with enabled sigs (shouldn't happen but...)	2020-09-04 18:31:43 +02:00
Eugene Kabanov	0b85192119	Remove asyncCheck from codebase. (#345 ) * Remove asyncCheck from codebase. * Replace all `discard` statements with new `asyncSpawn`. * Bump `nim-chronos` requirement.	2020-09-04 18:30:45 +02:00
Jacek Sieka	5819c6a9a7	gossipsub / floodsub fixes (#348 ) * mcache fixes * remove timed cache - the window shifting already removes old messages * ref -> object * avoid unnecessary allocations with `[]` operator * simplify init * fix several gossipsub/floodsub issues * floodsub, gossipsub: don't rebroadcast messages that fail validation (!) * floodsub, gossipsub: don't crash when unsubscribing from unknown topics (!) * gossipsub: don't send message to peers that are not interested in the topic, when messages don't share topic list * floodsub: don't repeat all messages for each message when rebroadcasting * floodsub: allow sending empty data * floodsub: fix inefficient unsubscribe * sync floodsub/gossipsub logging * gossipsub: include incoming messages in mcache (!) * gossipsub: don't rebroadcast already-seen messages (!) * pubsubpeer: remove incoming/outgoing seen caches - these are already handled in gossipsub, floodsub and will cause trouble when peers try to resubscribe / regraft topics (because control messages will have same digest) * timedcache: reimplement without timers (fixes timer leaks and extreme inefficiency due to per-message closures, futures etc) * timedcache: ref -> obj	2020-09-04 08:10:32 +02:00
Jacek Sieka	cd1c68dbc5	avoid send deadlock by not allowing send to block (#342 ) * avoid send deadlock by not allowing send to block * handle message issues more consistently	2020-09-01 09:33:03 +02:00
Dmitriy Ryajov	d3182c4dba	No raise send (#339 ) * dont raise in send * check that the lock is acquired on release	2020-08-20 20:50:33 -06:00
Jacek Sieka	eb13845f65	work around send that may raise `send` can raise exceptions that together with asyncCheck will crash NBC	2020-08-19 14:25:30 +03:00
Zahary Karadjov	af0955c58b	Add comments explaining a possible deadlock	2020-08-18 13:51:41 +03:00
Zahary Karadjov	60122a044c	Restore interop with Lighthouse by preventing concurrent meshsub dials	2020-08-17 22:40:58 +03:00
Jacek Sieka	53877e97bd	trace logs	2020-08-17 12:39:25 +02:00
Jacek Sieka	f46bf0faa4	remove send lock (#334 ) * remove send lock When mplex receives data it will block until a reader has processed the data. Thus, when a large message is received, such as a gossipsub subscription table, all of mplex will be blocked until all reading is finished. However, if at the same time a `dial` to establish a gossipsub send connection is ongoing, that `dial` will be blocked because mplex is no longer reading data - specifically, it might indeed be the connection that's processing the previous data that is waiting for a send connection. There are other problems with the current code: * If an exception is raised, it is not necessarily raised for the same connection as `p.sendConn`, so resetting `p.sendConn` in the exception handling is wrong * `p.isConnected` is checked before taking the lock - thus, if it returns false, a new dial will be started. If a new task enters `send` before dial is finished, it will also determine `p.isConnected` is false, then get stuck on the lock - when the previous task finishes and releases the lock, the new task will _also_ dial and thus reset `p.sendConn` causing a leak. * prefer existing connection simplifies flow	2020-08-17 12:38:27 +02:00
Jacek Sieka	b12145dff7	avoid crash when subscribe is received (#333 ) ...by making subscribeTopic synchronous, avoiding a peer table lookup completely. rebalanceMesh will be called a second later - it's fine	2020-08-17 12:10:22 +02:00
Jacek Sieka	ab864fc747	logging cleanups and small fixes (#331 )	2020-08-15 21:50:31 +02:00
Dmitriy Ryajov	b76b3e0e9b	Rework pubsub (#322 ) * move pubsub of off switch, pass switch into pubsub * use join on lpstreams * properly cleanup up failed peers * fix tests * fix peertable hasPeerId * fix tests * rework sending, remove helpers from pubsubpeer, unify in broadcast * further split broadcast into send * use send where appropriate * use formatIt * improve trace Co-authored-by: Giovanni Petrantoni <giovanni@fragcolor.xyz>	2020-08-11 18:05:49 -06:00
zah	fbb59c3638	`msg` is a reserved property name in Chronicles (#321 ) Every Chronicles log record has an existing `msg` property matching the static string supplied in the log statement. Thus, it's currently not possible to use `msg` as the name of a user property: https://github.com/status-im/nim-chronicles/issues/86	2020-08-07 16:46:00 -06:00
Jacek Sieka	c6c0c152c0	Dial peerid (#308 ) * prefer PeerID in switch api This avoids ref issues like ref identity and nil * use existing peerinfo instance if possible * remove secureCodec there may be multiple connections per peerinfo with different codecs * avoid some extra async::	2020-08-06 09:29:27 +02:00
Giovanni Petrantoni	9bbe5e4841	Fix subclass calls to handleDisconnect (#314 ) * Fix subclass calls to handleDisconnect * add peer ID to nil peer debug message	2020-08-06 11:12:52 +09:00
Giovanni Petrantoni	5c986cf657	Fix build, add some raises (#315 ) * Fix build, add some raises * wip * wip more raises * missing exc object in mplex * proper lifetime for subscribePeer Co-authored-by: Dmitriy Ryajov <dryajov@gmail.com>	2020-08-05 19:30:57 -06:00
Ștefan Talpalaru	bd5d43874a	more expensive metrics (#312 )	2020-08-05 14:02:26 +02:00
Ștefan Talpalaru	843d32f8db	put expensive metrics under a Nim define (#310 )	2020-08-04 17:27:59 -06:00
Dmitriy Ryajov	b6877b8aac	increase send timeout for prune and graft msgs (#306 ) * increase send timeout for prune and graft msgs * use trace logs for subscribe monitor	2020-08-03 17:55:42 -06:00
Dmitriy Ryajov	980764774e	pubsub timeouts tuning (#295 ) * add finegrained timeouts to pubsub * use 10 millis timeout in tests * finalization * revert timeouts * use `atEof` for reads * adjust timeouts and use atEof for reads * use atEof for reads * set isEof flag * no backoff for pubsub streams * temp timer increase, make macos finalize * don't call `subscribePeer` in libp2p anymore * more traces * leak tests * lower timeouts * handle exceptions in control message * don't use `cancelAndWait` * handle exceptions in helpers * wip * don't send empty messages * check for leaks properly * don't use cancelAndWait * don't await subscribption sends * remove subscrivePeer calls from switch * trying without the hooks again	2020-08-02 23:20:11 -06:00
Jacek Sieka	e655a510cd	misc cleanups (#303 )	2020-08-02 12:22:49 +02:00
Dmitriy Ryajov	f7fdf31365	Pubsub lifetime (#284 ) * lifecycle hooks * tests * move trace after closed check * restore 1 second heartbeat * await close event * fix tests * print direction string * more trace logging * add pubsub monitor * add log scope * adjust idle timeout * add exc.msg to trace	2020-07-27 13:33:51 -06:00
Giovanni Petrantoni	3b088f8980	Fix some unsubscribe issues and add unsubscribeAll helper (#282 ) * Fix some unsub issues and add unsuball helper * batch sendprune in unsubscribe methods * add unsubscribeAll for floodsub	2020-07-20 10:16:13 -06:00
Dmitriy Ryajov	94196fee71	Connections and pubsub peers cleanup (#279 ) * better peer tracking and cleanup * check if peer and conn is nil * test name * make timeout more aggressive * rename method for better clarity	2020-07-17 13:46:24 -06:00
Dmitriy Ryajov	0348773ec9	Connection manager (#277 ) * splitting out connection management * wip * wip conn mngr tests * set peerinfo in constructor * comments and documentation * tests * wip * add `None` to detect untagged connections * use `PeerID` to index connections * fix tests * remove useless equality	2020-07-17 09:36:48 -06:00
Jacek Sieka	170685f9c6	gossipsub fixes (#276 ) * graft up to D peers * fix logging so it's clear who is grafting/pruning who * clear fanout when grafting	2020-07-16 21:26:57 +02:00
Jacek Sieka	c76152f2c1	Simplify send (#271 ) * PubSubPeer.send single message * gossipsub: simplify send further	2020-07-16 12:06:57 +02:00
Dmitriy Ryajov	f35b8999b3	some light cleanup for pub/gossip sub (#273 ) * move peer table out to its own file * move peer table * cleanup `==` and add one to peerinfo * add peertable * missed equality check	2020-07-15 13:18:55 -06:00
Eugene Kabanov	b832668768	Minprotobuf refactoring 2 (#269 ) * Protobuf refactoring stage II. * Remove NoError. * Change trace level for invalid message.	2020-07-15 10:25:39 +02:00
Giovanni Petrantoni	d7bab37119	Fix gossip messages seqno according to spec (#253 ) * Fix gossip messages seqno according to spec * Add peers back to gossipsub table, slow down heartbeat * Revert "Add peers back to gossipsub table, slow down heartbeat" This reverts commit `01e2e62172`. * make seqno a threadvar, remove from peerinfo * seqno refactor, into pubsub	2020-07-14 21:51:33 -06:00
Ștefan Talpalaru	b8b0a2b4bc	CI: build binaries with TRACE & JSON logs (#268 ) Also: remove unused imports.	2020-07-14 02:02:16 +02:00
Jacek Sieka	c6c2d99907	one more log fix	2020-07-13 20:19:20 +02:00
Jacek Sieka	76853f064a	json logging again	2020-07-13 19:59:49 +02:00
Jacek Sieka	6620b7a00b	more comment fixes	2020-07-13 19:30:18 +02:00
Jacek Sieka	0d4c74b33a	comment log that can't be json-serialized	2020-07-13 18:36:49 +02:00
Jacek Sieka	061c54d3c6	logging fixes	2020-07-13 17:26:05 +02:00
Jacek Sieka	87e58c1c8d	metrics: one more pubsub peers fix	2020-07-13 16:16:46 +02:00
Jacek Sieka	c7895ccc52	metrics: fix pubsub_peers add metric	2020-07-13 16:15:27 +02:00
Giovanni Petrantoni	fcda0f6ce1	PubSubPeer tables refactor (#263 ) * refactor peer tables * tests fixing * override PubSubPeer equality * fix pubsubpeer comparison	2020-07-13 15:32:38 +02:00
Eugene Kabanov	efb952f18b	[WIP] Minprotobuf refactoring (#259 ) * Minprotobuf initial commit * Fix noise. * Add signed integers support. Add checks for field number value. Remove some casts. * Fix compile errors. * Fix comments and constants.	2020-07-13 14:43:07 +02:00
Dmitriy Ryajov	bec9a0658f	Cleanup rpc handler (#261 ) * more cleanup * fix tests * merging master * remove `withLock` as it conflicts with stdlib * wip * more fanout ttl Co-authored-by: Giovanni Petrantoni <giovanni@fragcolor.xyz>	2020-07-09 17:54:16 -06:00
Dmitriy Ryajov	4c815d75e7	More gossip cleanup (#257 ) * more cleanup * correct pubsub peer count * close the stream first * handle cancelation * fix tests * fix fanout ttl * merging master * remove `withLock` as it conflicts with stdlib * fix trace build Co-authored-by: Giovanni Petrantoni <giovanni@fragcolor.xyz>	2020-07-09 14:21:47 -06:00
Jacek Sieka	c720e042fc	clean up mesh handling logic (#260 ) * gossipsub is a function of subscription messages only * graft/prune work with mesh, get filled up from gossipsub * fix race conditions with await * fix exception unsafety when grafting/pruning * fix allowing up to DHi peers in mesh on incoming graft * fix metrics in several places	2020-07-09 11:16:46 -06:00
Giovanni Petrantoni	4e12d0d97a	nil check peer before disconnect	2020-07-09 17:20:45 +09:00
Giovanni Petrantoni	f9e0a1f069	CI fix handleDisconnect (pubsub)	2020-07-09 13:56:59 +09:00
Giovanni Petrantoni	9b8b159abb	Remove other spurious getStacktrace in pubsub traces	2020-07-09 13:19:34 +09:00
Giovanni Petrantoni	4bcb567d47	fix gossip tests	2020-07-09 12:34:36 +09:00
Giovanni Petrantoni	4698f41a91	Remove stacktrace logging from pubsub connect	2020-07-09 12:23:03 +09:00
Giovanni Petrantoni	fec507e755	Add peers back to gossipsub table, slow down heartbeat (#256 ) * Add peers back to gossipsub table, slow down heartbeat * exclude on unsub from mesh and fanout	2020-07-08 11:06:26 -06:00
Dmitriy Ryajov	a52763cc6d	fix publishing (#250 ) * use var semantics to optimize table access * wip... lvalues don't work properly sadly... * big publish refactor, replenish and balance * fix internal tests * use g.peers for fanout (todo: don't include flood peers) * exclude non gossip from fanout * internal test fixes * fix flood tests * fix test's trypublish * test interop fixes * make sure to not remove peers from gossip table * restore old replenishFanout * cleanups * Cleanup resources (#246) * consolidate reading in lpstream * remove debug echo * tune log level * add channel cleanup and cancelation handling * cancelation handling * cancelation handling * cancelation handling * cancelation handling * cleanup and cancelation handling * cancelation handling * cancelation * tests * rename isConnected to connected * remove testing trace * comment out debug stacktraces * explicit raises * restore trace vs debug in gossip * improve fanout replenish behavior further * cleanup stale peers more eaguerly * synchronize connection cleanup and small refactor * close client first and call parent second * disconnect failed peers on publish * check for publish result * fix tests * fix tests * always call close Co-authored-by: Giovanni Petrantoni <giovanni@fragcolor.xyz>	2020-07-07 18:33:05 -06:00
Giovanni Petrantoni	ec00c7fc50	Peer resultification and defect only (#245 ) * Peer resultification and defect only * Fixing some tests * test fixes * Rename peer into peerid * better result error message in identify * further merge fixes	2020-07-01 08:25:09 +02:00
Dmitriy Ryajov	c788a6a3c0	Cleanup resources (#246 ) * consolidate reading in lpstream * remove debug echo * tune log level * add channel cleanup and cancelation handling * cancelation handling * cancelation handling * cancelation handling * cancelation handling * cleanup and cancelation handling * cancelation handling * cancelation * tests * rename isConnected to connected * remove testing trace * comment out debug stacktraces * explicit raises	2020-06-29 09:15:31 -06:00
Jacek Sieka	aa6756dfe0	allow message id provider to be specified (#243 ) * don't send public key in message when not signing (information leak) * don't run rebalance if there are peers in gossip (see #242) * don't crash randomly on bad peer id from remote	2020-06-28 09:56:38 -06:00
Dmitriy Ryajov	7a95f1844b	Concurrent dials (#238 ) * count published messages * don't call `switch.dial` in `subscribeToPeer` * add secureconn constructor * close in the correct order * concurent dial lock and track in/out conns better * make tests pass * add todo comment * disconect peers that open too many connections * wip * do connection and muxer tracking in one place * prevent nil pointer in observers * drop connections when peers is over max * prevent channel leaks * don't use closure to handle channel	2020-06-24 09:08:44 -06:00
Giovanni Petrantoni	7852c6dd0f	Noise and eth2/nbc fixes (#226 ) * Remove noise padding payload (spec removed it) * add log scope in secure * avoid defect array out of range in switch secure when "na" * improve identify traces * wip noise fixes * noise protobuf adjustments (trying) * add more debugging messages/traces, improve their actual contents * re-enable ID check in noise * bump go daemon tag version * bump go daemon tag version * enable noise in daemonapi * interop testing, (both secio and noise will be tested) * azure cache bump (p2pd) * CI changes - Travis: use Go 1.14 - azure-pipelines.yml: big cleanup - Azure: bump cache keys - build 64-bit p2pd on 32-bit Windows - install both Mingw-w64 architectures * noise logging fixes * alternate testing between noise and secio * increase timeout to avoid VM errors in CI (multistream tests) * refactor heartbeat management in gossipsub * remove locking within heartbeat * refactor heartbeat management in gossipsub * remove locking within heartbeat Co-authored-by: Ștefan Talpalaru <stefantalpalaru@yahoo.com>	2020-06-20 19:56:55 +09:00
Dmitriy Ryajov	7a1c1c2ea6	fixing some key not found exceptions (#231 )	2020-06-19 15:19:07 -06:00
Dmitriy Ryajov	5b28e8c488	Cleanup lpstream, Connection and BufferStream (#228 ) * count published messages * don't call `switch.dial` in `subscribeToPeer` * don't use delegation in connection * move connection out to own file * don't breakout on reset * make sure to call close on secured conn * add lpstream tracing * don't breackdown by conn id * fix import * remove unused lable * reset connection on exception * add additional metrics for skipped messages * check for nil in secure.close	2020-06-19 11:29:43 -06:00
Dmitriy Ryajov	719744f46a	Small fixes (#230 ) * count published messages * don't call `switch.dial` in `subscribeToPeer` * make sure sending doesn't fail * add `contains` * review comment from prev pr	2020-06-19 11:29:25 -06:00
Dmitriy Ryajov	fe828d87d8	count published messages (#224 )	2020-06-16 22:14:02 -06:00
Dmitriy Ryajov	9d9f793b4f	add metrics for sent messages by topic and peer (#220 )	2020-06-15 17:39:03 -06:00
Dmitriy Ryajov	7cb6c81159	Don't modify iterables while iterating them (#219 ) * don't modify iterables while iterating * assert handlers to properly close connections	2020-06-15 12:30:09 -06:00
Oskar Thorén	f97e2deec7	Make methods in FloodSub, GossipSub public (#216 ) Similar to https://github.com/status-im/nim-libp2p/pull/193 but doing it for all methods to avoid this issue in PubSub in the future. Got hit by this behavior again in rpcHandler and took me a while to figure out. See https://github.com/oskarth/nim-private-method-skipping-case/ for minimal repro	2020-06-12 17:54:12 -06:00
Dmitriy Ryajov	ac04ca6e31	make sure keys exist and more metrics (#215 )	2020-06-11 20:20:58 -06:00
Dmitriy Ryajov	55a294a5c9	better pubsub metrics (#214 )	2020-06-11 12:09:34 -06:00
Dmitriy Ryajov	6b196ad7b4	remove pubsub peer on disconnect (#212 ) * remove pubsub peer on disconnect * make sure lock is acquired * add $ * count upgrades/dials/disconnects	2020-06-11 08:45:59 -06:00
Viktor Kirilov	1afec627c2	proper name for topics so that we can filter dynamically using chronicles (#210 ) * proper name for topics so that we can filter dynamically using chronicles * lowercase	2020-06-10 10:48:01 +02:00
Dmitriy Ryajov	ee281310c0	move trace log	2020-06-08 10:40:08 -06:00
Giovanni Petrantoni	82b4ed8f44	use declareCounter rather than gauge for certain metrics	2020-06-07 16:41:23 +09:00
Giovanni Petrantoni	a6a2a81711	Start adding some metrics to pubsub (#192 ) * Start adding some metrics to pubsub In order to visualize it's functionality Still WIP * more metrics * add per topic metrics * finishup with requested metrics * add a metrisServer define to start local server * PR fixes and cleanup	2020-06-07 09:15:21 +02:00
Dmitriy Ryajov	130c64f33a	don't return nil in dial (#205 ) * dont return nil in dial * don't crash on pubsub send	2020-06-05 18:17:05 -06:00
Dmitriy Ryajov	5960d42c50	remove casts from (#203 )	2020-06-02 20:21:11 -06:00
Dmitriy Ryajov	bb8bff2195	add sparse message propagation tests to gossipsub (#202 ) * add sparse tests to gossipsub * add send hooks * remove `all`	2020-06-02 17:53:38 -06:00
Dmitriy Ryajov	1b4876d26d	emulate `defered`	2020-06-02 09:10:27 -06:00
Dmitriy Ryajov	86e1c8169c	decorate observers hooks with {.raises: [Defect].} move hooks logic out into standalone procs License: MIT Signed-off-by: Dmitriy Ryajov <dryajov@gmail.com>	2020-06-02 09:10:27 -06:00
Dmitriy Ryajov	293b7da295	typo	2020-06-02 09:10:27 -06:00
Dmitriy Ryajov	daef00fc7b	don't crash schlesi-dev	2020-06-02 09:10:27 -06:00
Dmitriy Ryajov	93e5805c01	better exception handling	2020-06-02 09:10:27 -06:00
Dmitriy Ryajov	d4bdb42046	gossipsub fixes	2020-06-02 09:10:27 -06:00
Dmitriy Ryajov	802299e69a	breakout from publish loop	2020-06-02 09:10:27 -06:00
Oskar Thorén	b88bfc05f8	Make GossipSub initPubSub method public (#193 ) This means we can use it from other protocols that inherit GossipSub. Otherwise, a lot of internal state (heartbeat lock etc) doesn't get initialized properly.	2020-05-29 09:35:03 -06:00
Dmitriy Ryajov	7b6e1c0688	Gossipsub interop (#189 ) * interop fixes * add custom messageid provider and fix seqno * use ECDSA for speed * adding messageid tests * breakout from publish loop * addressing review comments * remove unneeded var * dont stop broadcasting on failed peers	2020-05-27 12:33:49 -06:00
Dmitriy Ryajov	9132f16927	gossipsub fixes (#186 )	2020-05-21 14:24:20 -06:00
Dmitriy Ryajov	ba53c08b3c	Track incoming connections (#181 ) * call write until all is written out * wip: rework with proper half-closed * add eof and closed handling * wip * close connection on chronos close * don't use read * make noise work again * don't reraise just yet * fixes after backporting * remove on transport close cleanup * revert back allread * rust interop fixes * read from stream * inc count before closing * rebasing master * store incoming connections * fix merge * remove unneeded changes * use internal close flag to indicate disposal	2020-05-21 11:33:48 -06:00
Dmitriy Ryajov	7900fd9f61	Half closed (#174 ) * call write until all is written out * add comments to lpchannel fields * add an eof flag to signal which end closed * wip: rework with proper half-closed * add eof and closed handling * propagate closes to piped * call parent close * moving bufferstream trackers out * move writeLock to bufferstream * move writeLock out * remove unused call * wip * rebasing master * fix mplex tests * wip * fix bufferstream after backport * wip * rename to differentiate from chronos tracker * close connection on chronos close * make reset request asyncCheck * fix channel cleanup * misc * don't use read * fix backports * make noise work again * proper exception handling * don't reraise just yet * add convenience templates * dont double wrap * use async pragma * fixes after backporting * muxer owns connection * remove on transport close cleanup * revert back allread * adding some todos * read from stream * inc count before closing * rebasing master * rebase master * use correct exception type * use try/finally insted of defer * fix compile in trace mode * reset channels on mplex close	2020-05-19 18:14:15 -06:00
Dmitriy Ryajov	f8029e7359	use sha256 digest as cache keys (#135 ) * use sha256 digest as cache keys * rebasing master	2020-05-18 14:49:49 -06:00
Giovanni Petrantoni	7dcb807f64	Crypto utilities resultification (#150 )	2020-05-18 07:25:55 +02:00
Jacek Sieka	69abf5097d	handle a few exceptions (#170 ) * handle a few exceptions Some of these are maybe too aggressive, but in return, they'll log their exception - more refactoring needed to sort this out - right now we get crashes on unhandled exceptions of unknown origin * during connection setup * while closing channels * while processing pubsubs * catch exceptions that are raised and don't try to catch exceptions that are not raised * propagate cancellederror * one more * more * more * make interop tests less fragile * Raise expiration time in gossipsub fanout test for slow CI Co-authored-by: Dmitriy Ryajov <dryajov@gmail.com> Co-authored-by: Giovanni Petrantoni <giovanni@fragcolor.xyz>	2020-05-14 21:56:56 -06:00
Jacek Sieka	3053f03814	fix varint issues * fixes #111	2020-05-11 09:12:23 -06:00
Dmitriy Ryajov	6196d56fc2	check for nil observers	2020-05-08 15:44:18 -06:00
Giovanni Petrantoni	c889224012	Add PubSub observer+ hooks (they can modify as well)	2020-05-08 13:31:52 -06:00
Jacek Sieka	330da51819	removals (#159 ) * remove unused stream methods * reimplement some of them with proc's * remove broken tests * Error->Defect for defect * warning fixes	2020-05-06 18:31:47 +02:00
Dmitriy Ryajov	6da4d2af48	Pubsub signatures flags (#161 ) * add verify signature flag * add sign flag to enable/disable msg signing * moving internal tests out to their own file * cleanup nimble file * remove unneeded tests * move pubsub tests out * fix tests	2020-05-06 11:26:08 +02:00
Giovanni Petrantoni	4c6a123d31	Add chronos trackers and used them to sanitize resource disposal (#131 ) * Add chronos trackers and used them to sanitize resource disposal * Chronos trackers for transport tests wip * No more chronos leaks in testtransport * Make tcp transport and test more robust when closing * Test async leaking tracking wip * Fix a regression in wire connect * Add chronos trackers to more tests and sanitize resource closure * Wip fixing floodsub tests * Floodsub wip * Made floodsub basically deterministic, hit a nim bug with captures tho * Wrap up floodsub tests refactor * Wrapping up * Add allFuturesThrowing utility * Fix missing allFuturesThrowing in noise tests! * Make tests green * attempt fixing gossipsub failing cases * Make sure to check also fanout in waitSub * More verbose traces * Gossipsub test improvments * Refactor TcpTransport remove asyncCheck * Add Connection trackers * Add stricter connection tracking, wip mplex fix * More asynccheck removal, in order to avoid connection leaks * bump chronicles requirement * Enable tracker dump to check CI output * Wait for more futures in testmplex * Remove tracker dump messages * add tryAndWarn utility, fix mplex issue with go interop * All allFuturesThrowing to directchat too * make sure to cleanup on transport close	2020-04-21 10:24:42 +09:00
Giovanni Petrantoni	303ec297da	Start removing allFutures (#125 ) * Start removing allFutures * More allfutures removal * Complete allFutures removal except legacy and tests * Introduce table values copies to prevent error * Switch to allFinished * Resolve TODOs in flood/gossip * muxer handler, log and re-raise * Add a common and flexible way to check multiple futures	2020-04-11 13:08:25 +09:00
Dmitriy Ryajov	6cbcc7859e	reduse usssage of asyncCheck	2020-04-07 12:16:59 -06:00
Giovanni Petrantoni	35a48fa560	Re-enable gossipsub internal tests when running CI minor bonus: add a link in the comments about bearssl issue with callbacks	2020-04-07 22:07:00 +09:00
Giovanni Petrantoni	3514733060	Fix table assertion, edited while iterating (the fix is not so nice.. adds plenty of allocations, but for now should be ok)	2020-04-05 01:19:10 +09:00
Dmitriy Ryajov	3effb95f10	close underlying bufferstream in lpchannel	2020-03-28 09:29:43 -06:00
cheatfate	1f5d994700	Fix compilation errors introduced by latest chronos.	2020-03-24 09:48:05 +02:00
Giovanni Petrantoni	0a3e4a764b	Less verbose traces (#112 ) * Make traces less verbose with shortHexDump utility * Rename shortHexDump into shortLog * Improve shortLog, add shortLog for crypto keys * Add proper shortLog implementations in messages	2020-03-23 15:03:36 +09:00
Eugene Kabanov	5701d937c8	Signed variable integers fixes. (#96 ) * Fix signed varints. Add tests for signed varints. Remove some casts to allow usage at compile time. * Fix vsizeof() on 32bit platforms. * Add `hint` and `zint` types for proper signed integer encoding. * Fix varint related bugs. * Update requirements. * Fix interop tests because of fixed readLine. * Add putVarint, getVarint and tests.	2020-03-06 20:19:43 +01:00
Eugene Kabanov	381630f185	Fix and refactoring of some procedures which are able to return nil as result (#97 ) * Fix do not return nil as result. * Fix mplex test to properly raise.	2020-03-04 21:45:14 +02:00
Dmitriy Ryajov	fbcef69891	implicitelly dial pubsub if enabled	2020-02-21 09:21:06 -06:00
Dmitriy Ryajov	9023bf786d	remove sleeps	2020-02-16 11:31:35 -06:00
Dmitriy Ryajov	7f8eb0272e	cleanup and fix tests	2020-02-16 11:31:35 -06:00
Dmitriy Ryajov	88a030d8fb	fix: removing timeouts from conn	2020-02-05 20:38:43 +01:00
Dmitriy Ryajov	2232ca598e	don't timeout in pubsub	2020-02-04 17:59:57 +01:00
Zahary Karadjov	1bd933cd5a	More precise tracing	2020-02-04 17:27:32 +01:00
Zahary Karadjov	7bd305471c	Make sure the library can compile with json logging in trace mode	2020-02-04 15:17:39 +01:00
Dmitriy Ryajov	667691f784	send messages in batches	2020-01-09 12:55:21 -06:00
Dmitriy Ryajov	0fb1f1c5b8	strenghten pubsub interop testing	2019-12-24 10:35:35 -06:00
Dmitriy Ryajov	68cc57669e	Feat/pubsub validators (#58 ) * feat: adding validator hooks to pubsub * expose add/remove validators on switch * do less unnecessary copyng	2019-12-16 23:24:03 -06:00
Dmitriy Ryajov	293a219dbe	Cleanup (#55 ) * fix: don't allow replacing pubkey * fix: several small improvements * removing pubkey setter * improove error handling * remove the use of Option[T] if not needed * don't use optional * fix-ci: temporarily pin p2pd to a working tag * fix example to comply with latest changes * bumping p2pd again to a higher version	2019-12-10 14:50:35 -06:00
Zahary Karadjov	454f658ba8	Fixes and tweaks related to the beacon node integration * Bugfix: Dialing an already connected peer may lead to crash * Introduced a standard_setup module allowing to instantiate the `Switch` object in an easier manner. * Added `Switch.disconnect(peer)` * Trailing space removed (sorry about polluting the diff)	2019-12-08 23:58:43 +02:00
Dmitriy Ryajov	5f6fcc3d90	extract public and private keys fields from peerid (#44 ) * extract public and private keys fields from peerid * allow assigning a public key * cleaned up TODOs * make pubsub prefix a const * public key should be an `Option`	2019-12-07 10:36:39 -06:00
Dmitriy Ryajov	e623e70e7b	PubSub (Gossip & Flood) Implementation (#36 ) This adds gossipsub and floodsub, as well as basic interop testing with the go libp2p daemon. * add close event * wip: gossipsub * splitting rpc message * making message handling more consistent * initial gossipsub implementation * feat: nim 1.0 cleanup * wip: gossipsub protobuf * adding encoding/decoding of gossipsub messages * add disconnect handler * add proper gossipsub msg handling * misc: cleanup for nim 1.0 * splitting floodsub and gossipsub tests * feat: add mesh rebalansing * test pubsub * add mesh rebalansing tests * testing mesh maintenance * finishing mcache implementatin * wip: commenting out broken tests * wip: don't run heartbeat for now * switchout debug for trace logging * testing gossip peer selection algorithm * test stream piping * more work around message amplification * get the peerid from message * use timed cache as backing store * allow setting timeout in constructor * several changes to improve performance * more through testing of msg amplification * prevent gc issues * allow piping to self and prevent deadlocks * improove floodsub * allow running hook on cache eviction * prevent race conditions * prevent race conditions and improove tests * use hashes as cache keys * removing useless file * don't create a new seq * re-enable pubsub tests * fix imports * reduce number of runs to speed up tests * break out control message processing * normalize sleeps between steps * implement proper transport filtering * initial interop testing * clean up floodsub publish logic * allow dialing without a protocol * adding multiple reads/writes * use protobuf varint in mplex * don't loose conn's peerInfo * initial interop pubsub tests * don't duplicate connections/peers * bring back interop tests * wip: interop * re-enable interop and daemon tests * add multiple read write tests from handlers * don't cleanup channel prematurely * use correct channel to send/receive msgs * adjust tests with latest changes * include interop tests * remove temp logging output * fix ci * use correct public key serialization * additional tests for pubsub interop	2019-12-05 20:16:18 -06:00
Dmitriy Ryajov	903e79ede1	Feat/conn cleanup (#41 ) Backporting proper connection cleanup from #36 to align with latest chronos changes. * add close event * use proper varint encoding * add proper channel cleanup in mplex * add connection cleanup in secio * tidy up * add dollar operator * fix tests * don't close connections prematurely * handle closing streams properly * misc * implement address filtering logic * adding pipe tests * don't use gcsafe if not needed * misc * proper connection cleanup and stream muxing * re-enable pubsub tests	2019-12-03 22:44:54 -06:00
Dmitriy Ryajov	1df16bdbce	set log level to trace - not enabled by default	2019-12-02 18:43:21 -06:00
Dmitriy Ryajov	2066e81658	set default timeout to 10 secs	2019-10-11 08:15:24 +09:00
Dmitriy Ryajov	1d4b51413e	option to allow triggering own handlers on publish	2019-10-11 08:15:24 +09:00
Dmitriy Ryajov	37d7a03fba	use a timed cache in floodsub	2019-10-11 08:15:24 +09:00
Dmitriy Ryajov	f190c155d3	don't throw on missing peer	2019-10-11 08:15:24 +09:00
Dmitriy Ryajov	3b9d34116d	decrease floodsub traffic	2019-10-11 08:15:24 +09:00
Dmitriy Ryajov	6cfbf2c124	don't send messages to self	2019-10-11 08:15:24 +09:00
Dmitriy Ryajov	34d1a641de	cleanup/test pubsub	2019-10-11 08:15:24 +09:00
Dmitriy Ryajov	9862064234	changed copyright year	2019-10-11 08:15:24 +09:00
Dmitriy Ryajov	5b3f93ba1c	feat: allow multiple handlers per topic in pubsub	2019-10-11 08:15:24 +09:00
Dmitriy Ryajov	b270515bb3	feat: make private/public keys Option[T]	2019-10-11 08:15:24 +09:00
Dmitriy Ryajov	663ce6c589	misc: nimpretty	2019-10-11 08:15:24 +09:00
Dmitriy Ryajov	054085620c	logging: switch debug for trace in most cases	2019-10-11 08:15:24 +09:00
Dmitriy Ryajov	3eb0cdd5f7	misc	2019-10-11 08:15:24 +09:00
Dmitriy Ryajov	68d50a97f8	properly initialize hashsets	2019-10-11 08:15:24 +09:00
Dmitriy Ryajov	9f3b80b60c	got pubsub working without signing	2019-10-11 08:15:24 +09:00
Dmitriy Ryajov	8920cd7d60	misc: pubsub/floodsub and logging	2019-10-11 08:15:24 +09:00
Dmitriy Ryajov	177eb71ffa	wip: floodsub initial implementation	2019-10-11 08:15:24 +09:00
Dmitriy Ryajov	827a8caba6	wip: modeling floodsub	2019-10-11 08:15:24 +09:00

318 Commits