reviews updated

2025-02-26 05:35:22 +00:00 · 2024-01-23 23:48:39 +05:30 · 2024-01-23 23:48:39 +05:30 · 9eca986676
commit 9eca986676
parent 5c04d86198
1 changed files with 78 additions and 44 deletions
--- a/content/docs/rfcs/75/README.md
+++ b/content/docs/rfcs/75/README.md
@ -1,38 +1,45 @@
 ---
 slug: "75"
-title: 75/WAKU2-MESSAGE-RELIABILITY
-name: Message reliability for Waku Store
+title: 75/WAKU2-SYNC
+name: Synchronization protocol for Waku messages
 status: raw
 category: Standards Track
 tags:
-  - message-reliability
+  - sync-protocol
 editor: Abhimanyu Rawat <abhi@status.im>
 contributors:
 ---

 # Abstract

-Sync protocol enables a node to synchronize itself with messages that it may have missed due to any reason, from one of its peers. This is an efficient method for the nodes present in the network to stay synchronized while sharing minimal information with each other.
+The Sync protocol allows operators within peer-to-peer Waku network to efficiently synchronize Waku messages by exchanging their [message hashes](https://rfc.vac.dev/spec/14/#deterministic-message-hashing) using their respective Prolly trees (spec of Prolly tree to be published).
+This process ensures that all operators stay updated with minimal information exchange, accommodating situations where operators may have missed some messages due to any reason.

-This document describes the Sync store protocol and how it is used to synchronize the nodes in the network. It also explains the various components of the protocol and how they interact with each other. The detailed explanation of the protocol is given in the [research doc](https://github.com/waku-org/research/issues/78). In a Sync request-response protocol, a client receives the messages through Sync mechanism using their peer nodes only. Store is planned to work in conjunction with Sync to provide a reliable message delivery mechanism.
+This document describes the Sync store protocol and how it is used to synchronize the operators in the network.
+It also explains the various components of the protocol and how they interact with each other.
+In a Sync request-response protocol, a client receives the list of missing Waku message hashes through Sync mechanism using their peer nodes only.

 # Background / Rationale / Motivation

-Nodes who want to synchronize with each other should be able to do so with minimal amount of communication and bandwidth usage. Existing methods present in Waku Store protocol are based on full message delivery using time range query where there is no way to know apart from a given timestamp range if other messages are also missing, making them unreliable and bandwidth consuming. Node that wants to synchronize sends a request to the peer node, and in return peer node send the response to the requesting node. After a brief low-bandwidth consuming back and flow exchange, resulting will be a list values (message_hashes) that the requesting node needs to sync with the peer node. These hashes can be used by other parts of the Store protocol to make sure the related full messages are wired to the requesting node and stored in the Store, making the requesting node fully synchronized with the peer node.
-
-A detailed view in the process can be tracked using the research [issue](https://github.com/waku-org/research/issues/62).
+Operators who want to synchronize with each other should be able to do so with minimal amount of communication and bandwidth usage.
+Existing methods present in Waku Store protocol are based on full Waku message delivery using time range query where there is no way to know apart from a given timestamp range if other Waku messages are also missing, making Store only method unreliable and bandwidth consuming.
+Operator that wants to synchronize sends a request to its peer, and in return peer operator send the response.
+After a brief low-bandwidth consuming back and flow exchange, resulting will be a list message hashes that the requesting operator needs to sync with the peer operator.
+These messages hashes can be used by other parts of the Store protocol to make sure the related full messages are wired to the requesting operator and stored in its accessible (local for now) Store/DB.
+Making the requesting operator fully synchronized with its peer operator.

 # Theory / Semantics

 The key words “MUST”, “MUST NOT”, “REQUIRED”, “SHALL”, “SHALL NOT”, “SHOULD”, “SHOULD NOT”, “RECOMMENDED”, “NOT RECOMMENDED”, “MAY”, and “OPTIONAL” in this document are to be interpreted as described in [RFC 2119](https://www.ietf.org/rfc/rfc2119.txt).

 Consider a request-response protocol with two roles: a client and a server.
-Client and Server both MUST be peers supporting the Waku Sync protocol. There is no other eligibility for the node at this point in time to be a client or a server who can synchronize with each other.
+Client and Server both MUST be peers supporting the Waku Sync protocol.
+Each operator MUST contain empty or populated Prolly tree of Waku message hashes.
+There is no other eligibility for the node at this point in time to be a client or a server who can synchronize with each other.

 # Wire Format Specification / Syntax

 ```
-
    syntax = "proto3";

    package prollytree;
@ -42,32 +49,34 @@ Client and Server both MUST be peers supporting the Waku Sync protocol. There is
        int64 timestamp = 2;          // Timestamp or other unique identifier
        bytes node_hash = 3;         // Hash of the node
        int32 level = 4;              // Level of the node in the tree
-        bytes merkel_hash = 5;       // Merkel hash of the node
+        bytes merkle_hash = 5;       // Merkle hash of the node
        bool is_tail = 6;             // Flag indicating if it's a tail node

        // Relationships (IDs or indices of related nodes)
        // Note: Relationships like left, right, up, and down are typically represented using references
        // (like indices or IDs) rather than directly embedding the objects to avoid deep nesting.
        // In our case we can use the key of the node itself and figure out the level based on the current node level.
-        int32 left = 7;               // Reference to the left node
-        int32 right = 8;              // Reference to the right node
-        int32 up = 9;                 // Reference to the parent node
-        int32 down = 10;              // Reference to the child node
+        int64 left = 7;               // Reference to the left node
+        int64 right = 8;              // Reference to the right node
+        int64 up = 9;                 // Reference to the parent node
+        int64 down = 10;              // Reference to the child node
    }

-    // Request message for GetRoot RPC. No fields needed for this example.
-    message GetRootRequest {
+    // Request message for GetRootAtHeight
+    message ExchangeRootAtHeightRequest {
+        Node root = 1;    // The root node of the Prolly tree
+        int32 height = 1;  // The desired height
    }

    // Response message for GetRoot RPC
-    message GetRootResponse {
-        Node root = 1;  // The root node of the ProllyTree
+    message ExchangeRootAtHeightResponse {
+        Node root = 1;  // The root node of the Prolly tree
        // Add additional fields if necessary, e.g., a status message or an error code.
    }

    // Request message for GetIntermediateNode to resolve the reference indices as mentioned above
    message GetIntermediateNodeRequest {
-        int32 node_id = 1;  // ID of the node to fetch
+        int64 node_id = 1;  // ID/timestamp of the node to fetch
        int32 level = 2;    // Level of the node, if necessary for retrieval
    }

@ -76,12 +85,6 @@ Client and Server both MUST be peers supporting the Waku Sync protocol. There is
        Node node = 1;  // The requested node
    }

-
-    // Request message for GetRootAtHeight RPC
-    message GetRootAtHeightRequest {
-        int32 height = 1;  // The desired height
-    }
-
    // Request message for non boundary nodes
    message GetNonBoundaryNodesRequest {
        repeated Node start_nodes = 1;  // Nodes from which to start searching
@ -95,68 +98,99 @@ Client and Server both MUST be peers supporting the Waku Sync protocol. There is

 A brief description of each message request-response is described as follows:

-### GetRootRequest
+### ExchangeRootRequest

-This message is sent by the node that wants to sync with the peer node. It contains no fields. The peer node if using the Sync store protocol will respond with the GetRootResponse message. It sends the root node of the tree to the requesting node.
+This message is sent by the node that wants to sync with the peer node.
+It contains local Prolly Tree root along with its height in the tree.
+The peer node if using the Sync store protocol will respond with the GetRootResponse message.
+It sends the root node of the tree to the requesting node.

-### GetRootResponse
+### ExchangeRootResponse

-This message is sent by the peer node in response to the GetRootRequest message. It contains the root node of the tree. The requesting node can then use this node to traverse the tree and find the nodes that it needs to sync with the peer node.
+This message is sent by the peer node in response to the GetRootRequest message.
+It contains the root node of the tree.
+The requesting node can then use this node to traverse the tree and find the nodes that it needs to sync with the peer node.

 ### GetRootAtHeightRequest

-This message is sent by the node that wants to sync with the peer node. It contains the height of the tree that the requesting node wants to sync with the peer node. The peer node if using the Sync store protocol will respond with the GetRootResponse message. It sends the root node of the tree to the requesting node.
+This message is sent by the node that wants to sync with the peer node.
+It contains the height of the tree that the requesting node wants to sync with the peer node.
+The peer node if using the Sync store protocol will respond with the GetRootResponse message.
+It sends the root node of the tree to the requesting node.

 ### GetIntermediateNodeRequest

-This message is sent by the node that wants to sync with the peer node. It contains the node id/key and the level of the Prolly tree, uisng this information peer node searches and in response send back the subjected node. The peer node if using the Sync store protocol will respond with the GetIntermediateNodeResponse message. It sends the node of the tree to the requesting node.
+This message is sent by the node that wants to sync with the peer node.
+It contains the node id/key and the level of the Prolly tree, uisng this information peer node searches and in response send back the subjected node.
+The peer node if using the Sync store protocol will respond with the GetIntermediateNodeResponse message.
+It sends the node of the tree to the requesting node.

 ### GetIntermediateNodeResponse

-This message is sent by the peer node in response to the GetIntermediateNodeRequest message. It contains an intermediate node of the tree. The requesting node can then use this node to traverse the tree and find the missing nodes.
+This message is sent by the peer node in response to the GetIntermediateNodeRequest message.
+It contains an intermediate node of the tree.
+The requesting node can then use this node to traverse the tree and find the missing nodes.

 ### GetNonBoundaryNodesRequest

-This message is sent by the node that wants to sync with the peer node. It contains the start nodes from which the peer node will start searching for the non boundary nodes. The peer node if using the Sync store protocol will respond with the GetNonBoundaryNodesResponse message. It sends the non boundary nodes of the tree to the requesting node.
+This message is sent by the node that wants to sync with the peer node.
+It contains the start nodes from which the peer node will start searching for the non boundary nodes.
+The peer node if using the Sync store protocol will respond with the GetNonBoundaryNodesResponse message.
+It sends the non boundary nodes of the tree to the requesting node.

 ### GetNonBoundaryNodesResponse

-This message is sent by the peer node in response to the GetNonBoundaryNodesRequest message. It contains the non boundary nodes of the tree. The requesting node can then use this nodes to traverse the tree and find the missing nodes.
+This message is sent by the peer node in response to the GetNonBoundaryNodesRequest message.
+It contains the non boundary nodes of the tree.
+The requesting node can then use this nodes to traverse the tree and find the missing nodes.

 # Implementation in of Sync using Prolly tree (PoC version)

 This Section describes a proof-of-concept (PoC) implementation of the Sync protocol using Prolly tree.

-The PoC code is written in Python and can be found [here](https://github.com/ABresting/Prolly-Tree-Waku-Message). One can also run the code using the README.md guide.
+The PoC code is written in Python and can be found [here](https://github.com/ABresting/Prolly-Tree-Waku-Message).
+One can also run the code using the README.md guide.

 The Sync Process works in following steps:

 Step 1: A key-value store is created using the Store protocol where the key is the timestamp of a message, and the value is the `message_hash` attribute which uniquely identifies a message in Waku network.

-Step 2: A Prolly tree is populated using the key-value store. This construction is similar at both client and server side.
+Step 2: A Prolly tree is populated using the key-value store.
+This construction is similar at both client and server side.

-Step 3: A client node sends a request to the server node to get the root node of the Prolly tree. The server node responds with the root node of the Prolly tree.
+Step 3: A client node sends a request to the server node to get the root node of the Prolly tree at a particular height.
+It also send its own local root node of the Prolly tree.

-Step 4: As Prolly tree is designed to compare diff based on equal hieght so if the height of the root of the Prolly tree is different then the client node sends a request to the server node to get the root node of the Prolly tree at the height of the root of the client's Prolly tree. The server node responds with the root node of the Prolly tree at the height of the root of the client's Prolly tree.
+Step 4: The server node responds with the root node of the Prolly tree present at that height.
+If the height at server is less than the height requested by the client, then the server responds with the root node of the Prolly tree at the maximum height present at the server.

-Step 5: The client node then traverses the Prolly tree from the root node to the leaf nodes and finds the missing nodes at every level of the tree starting from the top. The missing intermediate Prolly tree Node is requested by client to the server. Server responds with the requested nodes.
+Step 5: The client node starts traversing the Prolly tree from the root node to the leaf nodes and finds the missing nodes at every level of the tree starting from the top.
+If the client detects the difference of merkle hash at a key of a certain level, then it requests the server node to get the intermediate child node belonging to that key.
+And the keys with no difference in merkle hash are skipped.
+This process continues until the client reaches the leaf nodes of the tree.

-Step 6: The client eventually computes the missing `message_hash`es and sends a request to server to get the missing messages. Server responds with the requested messages.
+Step 6: The client eventually computes the missing message hashes and sends a request to server to get the missing Waku messages corresponding to the computed message hashes.
+Server responds with the requested Waku messages.
+
+When a server receives the `ExchangeRootAtHeightRequest` message from the client, it can also detect if server also needs to Sync message hashes with the client.
+This step helps reducing the bandwidth usage and the number of messages exchanged between the client and the server nodes in longer run.

 Upon receiving the missing messages that are present in the Prolly tree, the client node request the onward messages using the existing Store [method](https://github.com/waku-org/nwaku/blob/master/waku/waku_archive/driver.nim#L36) since these nodes are for sure missing from the client Prolly tree.

 # Security/Privacy Considerations

-Sync protocol on an abstract level provides an interface to synchronize Store of Waku messages with peer nodes in the network. The security and privacy considerations are limited to the storage of messages on the server node, not disclosing any other information about the origin of the messages on server.
+Sync protocol on an abstract level provides an interface to synchronize Store of Waku messages with peer nodes in the network.
+The security and privacy considerations are limited to the storage of messages on the server node, not disclosing any other information about the origin of the messages on server.

 # Limitations and Future Work

 This document is intentionally simplified in its initial version.
-It is to showcase the working pieces of the Sync protocol and how it can be used to synchronize the nodes in the network. The PoC of the Sync store is initself feature complete but not sufficient for production code without sanitizing it to work with Waku Store.
+It is to showcase the working pieces of the Sync protocol and how it can be used to synchronize the nodes in the network.
+The PoC of the Sync store is initself feature complete but not sufficient for production code without sanitizing it to work with Waku Store.

 The following ideas may be explored in future:

- Batch sequenced insertions: The design and the advantage of the Waku Prolly tree over others is such that it can be used to insert multiple sequenced messages at once without affection the Merkel hashes of the messages already present in the tree i.e. left side of the tree.
+- Batch sequenced insertions/deletions: The design and the advantage of the Waku Prolly tree over others is such that it can be used to insert/delete multiple sequenced messages at once without affection the Merkle hashes of the messages already present in the tree i.e. left side of the tree.

 - It can itself be served as a Store of messages in light weight nodes with limited storage capacity.