nim-libp2p

Commit Graph

Author	SHA1	Message	Date
Giovanni Petrantoni	02ad017107	Gossipsub fixes and Initiator flagging fixes (#539 ) * properly propagate initiator information for gossipsub * Fix pubsubpeer lifetime management * restore old behavior * tests fixing * clamp backoff time value received * fix member name collisions * internal test fixes * better names and explaining of the importance of transport direction * fixes	2021-03-03 08:23:40 +09:00
Jacek Sieka	6f1ecc8df7	streamline socket read/write hot path (#473 ) * streamline socket read/write hot path This avoids some unnecessary memory copying on the hot path of noise / mplex, as well as getting rid of a few futures - profiling shows that this is one of the main culprits of small memory allocations, which makes sense - this is where gossip fan-out happens. * fewer futures (and corresponding closures) when sending lpchannel messages * avoid data copies when encrypting and framing noise messages * avoid copying tuple when reading noise data (poor c codegen) * fix setting eof flag in secure read * write noise frames in one go ...and closing secure socket once is enough	2020-12-09 08:56:40 -06:00
Dmitriy Ryajov	d1c689e5ab	adding libp2p tag to logScope (#465 )	2020-12-01 11:34:27 -06:00
Dmitriy Ryajov	21110636cb	fixing log level to avoid sacring users (#452 )	2020-11-24 12:07:27 -06:00
Dmitriy Ryajov	034a1e8b1b	small cleanups from tcp-limits2 (#446 )	2020-11-23 15:02:23 -06:00
Dmitriy Ryajov	1d16d22f5f	Don't allow concurrent pushdata (#444 ) * handle resets properly with/without pushes/reads * add clarifying comments * pushEof should also not be concurrent * move channel reset to bufferstream this is where the action happens - lpchannel merely redefines how close is done Co-authored-by: Jacek Sieka <jacek@status.im>	2020-11-23 09:07:11 -06:00
Jacek Sieka	74acd0a33a	fix channels not being reset (#439 ) * fix channels not being reset silly for loop.. * allow only one concurrent read * fix mplex test race condition * add some bufferstream eof tests * deadlock, lost data and hung channel fixes * prevent concurrent `reset` calls * reset LPChannel when read is cancelled (since data is lost) * ensure there's one, and one only, 0-byte readOnce on EOF * ensure that all data is returned before EOF is returned * keep running activity monitor for half-closed channels (or they never get closed)	2020-11-17 08:59:25 -06:00
Dmitriy Ryajov	4fb3f50d2c	Reset channels on close (#425 ) * reset when failed to read/write muxed conn * add more comprehensive resource cleanup tests * style * cleanup tests	2020-11-06 09:24:24 -06:00
Dmitriy Ryajov	3956f3fd69	make sure all streams are tracked (#422 ) * make sure all streams are tracked * revert unnecesary change	2020-11-04 21:52:54 -06:00
Jacek Sieka	03639f1446	Revert "Channel leaks (#413 )" (#417 ) This reverts commit `1de1d49223`.	2020-11-01 14:49:25 -06:00
Dmitriy Ryajov	1de1d49223	Channel leaks (#413 ) * break stream tracking by type * use closeWithEOF to await wrapped stream * fix cancelation leaks * fix channel leaks * logging * use close monitor and always call closeUnderlying * don't use closeWithEOF * removing close monitor * logging	2020-10-27 11:21:03 -06:00
Jacek Sieka	17e00e642a	limit write queue length (#376 ) To break a potential read/write deadlock, gossipsub uses an unbounded queue for writes - when peers are too slow to process this queue, it may end up growing without bounds causing high memory usage. Here, we introduce a maximum write queue length after which the peer is disconnected - the queue is generous enough that any "normal" usage should be fine - writes that are `await`:ed are not affected, only writes that are launched in an `asyncSpawn` task or similar. * avoid unnecessary copy of message when there are no send observers * release message memory earlier in gossipsub * simplify pubsubpeer logging	2020-09-24 18:43:20 +02:00
Jacek Sieka	25bd0a18f4	small fixes (#374 ) * add helper to read EOF marker after closing stream (else stream stay alive until timeout/reset) * don't assert on empty channel message * don't loop when writing to chronos (no need)	2020-09-24 07:30:19 +02:00
Jacek Sieka	49a12e619d	channel close race and deadlock fixes (#368 ) * channel close race and deadlock fixes * remove send lock, write chunks in one go * push some of half-closed implementation to BufferStream * fix some hangs where LPChannel readers and writers would not always wake up * simplify lazy channels * fix close happening more than once in some orderings * reenable connection tracking tests * close channels first on mplex close such that consumers can read bytes A notable difference is that BufferedStream is no longer considered EOF until someone has actually read the EOF marker. * docs, simplification	2020-09-21 19:48:19 +02:00
Giovanni Petrantoni	b99d2039a8	Gossip one one (#240 ) * allow multiple codecs per protocol (without breaking things) * add 1.1 protocol to gossip * explicit peering part 1 * explicit peering part 2 * explicit peering part 3 * PeerInfo and ControlPrune protocols * fix encodePrune * validated always, even explicit peers * prune by score (score is stub still) * add a way to pass parameters to gossip * standard setup fixes * take into account explicit direct peers in publish * add floodPublish logic * small fixes, publish still half broken * make sure to waitsub in sparse test * use var semantics to optimize table access * wip... lvalues don't work properly sadly... * big publish refactor, replenish and balance * fix internal tests * use g.peers for fanout (todo: don't include flood peers) * exclude non gossip from fanout * internal test fixes * fix flood tests * fix test's trypublish * test interop fixes * make sure to not remove peers from gossip table * restore old replenishFanout * cleanups * restore utility module import * restore trace vs debug in gossip * improve fanout replenish behavior further * triage publish nil peers (issue is on master too but just hidden behind a if/in) * getGossipPeers fixes * remove topics from pubsubpeer (was unused) * simplify rebalanceMesh (following spec) and make it finally reach D_high * better diagnostics * merge new pubsubpeer, copy 1.1 to new module * fix up merge * conditional enable gossip11 module * add back topics in peers, re-enable flood publish * add more heartbeat locking to prevent races * actually lock the heartbeat * minor fixes * with sugar * merge 1.0 * remove assertion in publish * fix multistream 1.1 multi proto * Fix merge oops * wip * fix gossip 11 upstream * gossipsub11 -> gossipsub * support interop testing * tests fixing * fix directchat build * control prune updates (pb) * wip parameters * gossip internal tests fixes * parameters wip * finishup with params * cleanups/wip * small sugar * grafted and pruned procs * wip updateScores * wip * fix logging issue * pubsubpeer, chronicles explicit override * fix internal gossip tests * wip * tables troubleshooting * score wip * score wip * fixes * fix test utils generateNodes * don't delete while iterating in score update * fix grafted defect * add a handleConnect in subscribeTopic * pruning improvements * wip * score fixes * post merge - builds gossip tests * further merge fixes * rebalance improvements and opportunistic grafting * fix test for now * restore explicit peering * implement peer exchange graft message * add an hard cap to PX * backoff time management * IWANT cap/budget * Adaptive gossip dissemination * outbound mesh quota, internal tests fixing * oversub prune score based, finish outbound quota * finishup with score and ihave budget * use go daemon 0.3.0 * import fixes * byScore cleanup score sorting * remove pointless scaling in `/` Duration operator * revert using libp2p org for daemon * interop fixes * fixes and cleanup * remove heartbeat assertion, minor debug fixes * logging improvements and cleaning up * (to revert) add some traces * add explicit topic to gossip rpcs * pubsub merge fixes and type fix in switch * Revert "(to revert) add some traces" This reverts commit `4663eaab6c`. * cleanup some now irrelevant todo * shuffle peers anyway as score might be disabled * add missing shuffle * old merge fix * more merge fixes * debug improvements * re-enable gossip internal tests * add gossip10 fallback (dormant but tested) * split gossipsub internal tests into 1.0 and 1.1 Co-authored-by: Dmitriy Ryajov <dryajov@gmail.com>	2020-09-21 11:16:29 +02:00
Jacek Sieka	0db45462cd	mplex fixes (#362 ) * remove almost-empty types module * lock when writing message (that's the only place the lock matters, and only when the message is > max msg size) * logging updates (log in consistent order, makes reading logs easier) * raise EOF from readExactly only if no bytes have been read (to signal that _no_ bytes were lost)	2020-09-14 10:19:54 +02:00
Jacek Sieka	96d4c44fec	refactor bufferstream to use a queue (#346 ) This change modifies how the backpressure algorithm in bufferstream works - in particular, instead of working byte-by-byte, it will now work seq-by-seq. When data arrives, it usually does so in packets - in the current bufferstream, the packet is read then split into bytes which are fed one by one to the bufferstream. On the reading side, the bytes are popped of the bufferstream, again byte by byte, to satisfy `readOnce` requests - this introduces a lot of synchronization traffic because the checks for full buffer and for async event handling must be done for every byte. In this PR, a queue of length 1 is used instead - this means there will at most exist one "packet" in `pushTo`, one in the queue and one in the slush buffer that is used to store incomplete reads. * avoid byte-by-byte copy to buffer, with synchronization in-between * reuse AsyncQueue synchronization logic instead of rolling own * avoid writeHandler callback - implement `write` method instead * simplify EOF signalling by only setting EOF flag in queue reader (and reset) * remove BufferStream pipes (unused) * fixes drainBuffer deadlock when drain is called from within read loop and thus blocks draining * fix lpchannel init order	2020-09-10 08:19:13 +02:00
Jacek Sieka	5b347adf58	logging fixes and small cleanups (#361 ) In particular, allow longer multistream select reads	2020-09-09 19:12:08 +02:00
Jacek Sieka	82c179db9e	mplex fixes (#356 ) * close the right connection when channel send fails * don't crash on channel id that is not unique	2020-09-08 08:24:28 +02:00
Jacek Sieka	c1856fda53	simplify and unify logging (#353 ) * use short format for logging peerid * log peerid:oid for connections	2020-09-06 10:31:47 +02:00
Eugene Kabanov	0b85192119	Remove asyncCheck from codebase. (#345 ) * Remove asyncCheck from codebase. * Replace all `discard` statements with new `asyncSpawn`. * Bump `nim-chronos` requirement.	2020-09-04 18:30:45 +02:00
Jacek Sieka	397f9edfd4	simplify mplex (#327 ) * less async * less copying of data * less redundant cleanup	2020-08-15 07:58:30 +02:00
Dmitriy Ryajov	b76b3e0e9b	Rework pubsub (#322 ) * move pubsub of off switch, pass switch into pubsub * use join on lpstreams * properly cleanup up failed peers * fix tests * fix peertable hasPeerId * fix tests * rework sending, remove helpers from pubsubpeer, unify in broadcast * further split broadcast into send * use send where appropriate * use formatIt * improve trace Co-authored-by: Giovanni Petrantoni <giovanni@fragcolor.xyz>	2020-08-11 18:05:49 -06:00
Dmitriy Ryajov	2325692f55	Fix half closed (#324 ) * don't call `close` in `remoteClose` * make sure timeout are properly propagted * fix tests * adding remote close write test	2020-08-10 16:17:11 -06:00
Giovanni Petrantoni	5c986cf657	Fix build, add some raises (#315 ) * Fix build, add some raises * wip * wip more raises * missing exc object in mplex * proper lifetime for subscribePeer Co-authored-by: Dmitriy Ryajov <dryajov@gmail.com>	2020-08-05 19:30:57 -06:00
Dmitriy Ryajov	74a6dccd80	adding channel limits to mplex (#309 )	2020-08-04 23:16:04 -06:00
Ștefan Talpalaru	843d32f8db	put expensive metrics under a Nim define (#310 )	2020-08-04 17:27:59 -06:00
Dmitriy Ryajov	cf2b42b914	Moving idle timeout to Connection to enable across all connection streams (#307 ) * move idle timeout logic to connection * more informative logs * more informative logs	2020-08-04 07:22:05 -06:00
Dmitriy Ryajov	980764774e	pubsub timeouts tuning (#295 ) * add finegrained timeouts to pubsub * use 10 millis timeout in tests * finalization * revert timeouts * use `atEof` for reads * adjust timeouts and use atEof for reads * use atEof for reads * set isEof flag * no backoff for pubsub streams * temp timer increase, make macos finalize * don't call `subscribePeer` in libp2p anymore * more traces * leak tests * lower timeouts * handle exceptions in control message * don't use `cancelAndWait` * handle exceptions in helpers * wip * don't send empty messages * check for leaks properly * don't use cancelAndWait * don't await subscribption sends * remove subscrivePeer calls from switch * trying without the hooks again	2020-08-02 23:20:11 -06:00
Jacek Sieka	e655a510cd	misc cleanups (#303 )	2020-08-02 12:22:49 +02:00
Dmitriy Ryajov	f7fdf31365	Pubsub lifetime (#284 ) * lifecycle hooks * tests * move trace after closed check * restore 1 second heartbeat * await close event * fix tests * print direction string * more trace logging * add pubsub monitor * add log scope * adjust idle timeout * add exc.msg to trace	2020-07-27 13:33:51 -06:00
Giovanni Petrantoni	c3404f6eea	Handle cancellation in timeoutMonitor (#283 ) * Handle cancellation in timeoutMonitor * refactor lpchannel timeout as suggested by cheatfate	2020-07-21 09:03:41 -06:00
Dmitriy Ryajov	38eb36efae	don't use close event to stop timer (#280 )	2020-07-18 11:00:44 -06:00
Dmitriy Ryajov	ba071cafa6	Channel timeout (#278 ) * add support for channel timeouts * tests for channel timeout * add timeouts to standard switch * fix mplex init * cleanup timer on stream close * add comment for `isConnected` * move cleanup event	2020-07-17 12:44:41 -06:00
Ștefan Talpalaru	b8b0a2b4bc	CI: build binaries with TRACE & JSON logs (#268 ) Also: remove unused imports.	2020-07-14 02:02:16 +02:00
Dmitriy Ryajov	181cf73ca7	Drain buffer (#264 ) * drain lpchannel on reset * move drainBuffer to bufferstream	2020-07-12 18:37:10 +02:00
Dmitriy Ryajov	c788a6a3c0	Cleanup resources (#246 ) * consolidate reading in lpstream * remove debug echo * tune log level * add channel cleanup and cancelation handling * cancelation handling * cancelation handling * cancelation handling * cancelation handling * cleanup and cancelation handling * cancelation handling * cancelation * tests * rename isConnected to connected * remove testing trace * comment out debug stacktraces * explicit raises	2020-06-29 09:15:31 -06:00
Dmitriy Ryajov	902880ef1f	consolidate reading in lpstream (#241 ) * consolidate reading in lpstream * remove debug echo * throw if not enough bytes where read * tune log level * set eof flag * test readExactly to fail on not enough bytes	2020-06-27 11:33:34 -06:00
Dmitriy Ryajov	7a95f1844b	Concurrent dials (#238 ) * count published messages * don't call `switch.dial` in `subscribeToPeer` * add secureconn constructor * close in the correct order * concurent dial lock and track in/out conns better * make tests pass * add todo comment * disconect peers that open too many connections * wip * do connection and muxer tracking in one place * prevent nil pointer in observers * drop connections when peers is over max * prevent channel leaks * don't use closure to handle channel	2020-06-24 09:08:44 -06:00
Jacek Sieka	b99fd88deb	logging fixes	2020-06-21 11:14:19 +02:00
Giovanni Petrantoni	7852c6dd0f	Noise and eth2/nbc fixes (#226 ) * Remove noise padding payload (spec removed it) * add log scope in secure * avoid defect array out of range in switch secure when "na" * improve identify traces * wip noise fixes * noise protobuf adjustments (trying) * add more debugging messages/traces, improve their actual contents * re-enable ID check in noise * bump go daemon tag version * bump go daemon tag version * enable noise in daemonapi * interop testing, (both secio and noise will be tested) * azure cache bump (p2pd) * CI changes - Travis: use Go 1.14 - azure-pipelines.yml: big cleanup - Azure: bump cache keys - build 64-bit p2pd on 32-bit Windows - install both Mingw-w64 architectures * noise logging fixes * alternate testing between noise and secio * increase timeout to avoid VM errors in CI (multistream tests) * refactor heartbeat management in gossipsub * remove locking within heartbeat * refactor heartbeat management in gossipsub * remove locking within heartbeat Co-authored-by: Ștefan Talpalaru <stefantalpalaru@yahoo.com>	2020-06-20 19:56:55 +09:00
Dmitriy Ryajov	5b28e8c488	Cleanup lpstream, Connection and BufferStream (#228 ) * count published messages * don't call `switch.dial` in `subscribeToPeer` * don't use delegation in connection * move connection out to own file * don't breakout on reset * make sure to call close on secured conn * add lpstream tracing * don't breackdown by conn id * fix import * remove unused lable * reset connection on exception * add additional metrics for skipped messages * check for nil in secure.close	2020-06-19 11:29:43 -06:00
Dmitriy Ryajov	3c4c10d871	con't crash on nil ptr	2020-06-15 13:37:25 -06:00
Dmitriy Ryajov	7cb6c81159	Don't modify iterables while iterating them (#219 ) * don't modify iterables while iterating * assert handlers to properly close connections	2020-06-15 12:30:09 -06:00
Dmitriy Ryajov	9cd47fe816	add a logScope to make tracing less ugly (#218 ) * add a logScope to make tracing less ugly * don't crash on nil pointer	2020-06-15 12:10:41 -06:00
Dmitriy Ryajov	85b56d0b3a	Observedaddr (#217 ) * send correct observerAddr * run tests with info log level * set observedaddr for channels	2020-06-12 19:56:17 -06:00
Dmitriy Ryajov	6b196ad7b4	remove pubsub peer on disconnect (#212 ) * remove pubsub peer on disconnect * make sure lock is aquired * add $ * count upgrades/dials/disconnects	2020-06-11 08:45:59 -06:00
Viktor Kirilov	1afec627c2	proper name for topics so that we can filter dynamically using chronicles (#210 ) * proper name for topics so that we can filter dynamically using chronicles * lowercase	2020-06-10 10:48:01 +02:00
Dmitriy Ryajov	5960d42c50	remove casts from (#203 )	2020-06-02 20:21:11 -06:00
Dmitriy Ryajov	285884c20c	Close peers (#201 ) * wip * exceptions and resource cleanup * correct peerlifetime on disconnect * emulate defered * remove comment	2020-06-02 11:32:42 -06:00

1 2 3 4

152 Commits