mirror of https://github.com/logos-messaging/specs.git synced 2026-01-02 14:13:06 +00:00

Jazz Turner-Baggs 8ff0caae76

2025-10-27 12:32:13 -07:00

12 KiB

Raw Blame History

title	name	category	tags	editor	contributors
PRIVATE1	Private conversation	Standards Track		Jazz Alyxzander (@Jazzz)

Abstract

Background

Pairwise encrypted messaging channels are a foundational component in building chat systems. They allow for confidential, authenticated payloads to be delivered between two clients. Groupchats and channel based communication often rely on pairwise channels (at least partially) to deliver state updates and coordination messages.

Having robust pairwise communication channels allow for 1:1 communication while also providing the infrastructure for more complicated communication.

Private V1

PrivateV1 is conversation type which establishes a full-duplex secure channel between two participants.

Private Conversations have the following properties:

Payload Confidentiality: Only the participants can read the contents of any message sent.
Content Integrity: Recipients can detect if the contents were modified by a third party.
Sender Privacy: Only the recipient can determine who the sender was.
Forward Secrecy: A compromise in the future does not allow previous messages to be decrypted by a third party.
Post Compromise Security: Conversations eventually recover from a compromise which occurs today.
Dropped Message Observability: Messages which were lost in transit are eventually visible to both sender and recipient.

Definitions

This document makes use of the shared terminology defined in the CHAT-DEFINITIONS specification.

The terms include:

Application
Content
Participant
Payload
Recipient
Sender

Architecture

This conversation type assumes there is some service or application which wishes to generate and receive end-to-end encrypted content. It also assumes that some other component will be responsible for delivering the generated payloads. At its core this protocol takes the content provided and creates a series of payloads to be sent to the recipient.

flowchart LR
    Content:::plain--> Privatev1 --> Payload:::plain
    classDef plain fill:none,stroke:transparent;

Content

Content is provided to the protocol as encoded bytes. Due to segmentation limitations there is a restriction on the maximum size of content. This value is variable and is dependent upon which delivery service is used. In practice content size MUST be less that 255 * max_seg_size see: initialization

Other than its size, the protocol is agnostic of content.

Payload Delivery

How payloads are sent and received by clients is not described in this protocol. The choice of delivery method has no impact on the security of this conversation type, though the choice may affect sender privacy and censorship resistance. In practice, any best-effort method of transmitting payloads will suffice, as no assumptions are made.

Initialization

The channel is initialized by both sender and recipient agreeing on the following values for each conversation:

sk - initial secret key [32 bytes]
ssk - sender DH seed key
rsk - recipient DH seed key

To maintain the security properties:

sk MUST be known only by the participants.
sk MUST be derived in a way that ensures mutual authentication of the participants
sk SHOULD have forward secrecy by incorporating ephemeral key material
rsk and ssk SHOULD incorporate ephemeral key material

Additionally implementations MUST determine the following constants:

max_seg_size - maximum segmentation size to be used.
max_skip - number of keys which can be skipped per session. Values are determined by

Frame Encoding

There are 3 phases to operation.

flowchart TD
    C("Content"):::plain
    S(Segmentation)
    R(Reliability)
    E(Encryption) 
     D(Delivery):::plain
    C --> S --> R --> E --> D

    classDef plain fill:none,stroke:transparent;

Segmentation: Divides content into smaller fragments for transportation.
Reliability: Adds tracking information to detect dropped messages.
Encryption: Provides confidentiality and tamper resistance.

The output of each phase of the operational pipeline is the input of the next.

Segmentation

Thought the protocol has no limitation, it is assumed that a delivery mechanism MAY have restrictions on the max message size. While this is a transport level issue, it's included here because deferring segmentation has negative impacts on bandwidth efficiency and privacy. Forcing the transport layer to handle segmentation would require either reassembling unauthenticated segments (which are open to malicious interference) or implementing encryption at the transport layer. In the event of a dropped payload, segmentation after reliability would require clients to re-broadcast entire frames, rather than only the missing segments. This unnecessarily increases load on the network/clients and increases a DOS attack surface. To optimize the entire pipeline, segmentation is handled first so that segments can benefit from the reliability and robust encryption already in place.

The segmentation strategy used is defined by !TODO: Flatten link once completed

Implementation specifics:

Error correction is not used, as reliable delivery is already provided by lower layers.
segmentSize = max_seg_size !TODO: ^Spec currently has a limit of

Message Reliability

Scalable Data Sync is used to detect missing messages and provide delivery receipts to the sender after successful reception of a payload. SDS is implemented according to the specification.

!TODO: define: sender_id mapping !TODO: define: message_id mapping !TODO: update to latest version and include SDS-R

!NOTE: The defaultConfig in nim-SDS creates a bloom filter with the parameters n=10000, p=0.001 which has a size of ~18KiB. The bloom filter is included in every message which results in a best-case overhead rate of 13.3% (assuming waku's MSS of 150KB). Given a target content size of 4KB, that puts the utilization factor at 80+% (Without considering other layers). This needs to be looked at, lowering to n=2000 would lower overhead to ~3.5 KiB.

Encryption

Payloads are encrypted using doubleratchet.

With the following choices for external functions:

DH: X25519
KDF_RF: HKDF with SHA256, info = logoschat_privatev1
KDF_CK: HKDF with SHA256, input = "0x01 for message_key, and "0x02" for chain_key
KDF_MK: HKDF with SHA256, hdkf.info = "PrivateV1MessageKey"
ENCRYPT: Implemented with AEAD_CHACHA20_POLY1305

!TODO: Define AssociatedData

AEAD_CHACHA20_POLY1305 is implemented using randomly generated nonces. The nonce and tag are combined with the ciphertext for transport where ciphertext = nonce || encrypted_bytes || tag.

Frame Handling

This protocol uses explicit tagging of content, to remove ambiguity when parsing/handling frames. This creates a clear distinction between frames generated by the protocol, and content which was passed in. Even if new frames are added in the future, Clients can be certain whether the payload is intended for itself or applications. This is achieved through an invariant - All non-content frames are intended to be consumed by the client. When a new unknown frame arrives it can be certain that a version compatibility issue has occurred.

All application level content MUST use the content frameType.
Clients SHALL only pass content tagged frames to Applications
Clients MAY drop unrecognized frames

Wire Format Specification / Syntax

Payload Parse Tree

A deterministic parse tree is used to avoid ambiguity when receiving payloads.

flowchart TD

    D[DoubleRatchet]
    S[SDS Message]
    Segment1[ Segment]
    Segment2[ Segment]
    Segment3[ Segment]
    P[PrivateV1Frame]

    start@{ shape: start }
    start --> D
    D -->|Payload| S
    S -->|Payload| Segment1

    Segment1 --> P
    Segment2:::plain --> P
    Segment3:::plain --> P
    
    P --> T{frame_type}
    T --content--> Bytes
    T --> Placeholder

    classDef plain fill:none,stroke:transparent;

!TODO: Replace placeholder

Payloads

!TODO: Don't duplicate payload definitions from other specs. Though its helpful for now.

Encrypted Payload

message Doubleratchet {
    bytes dh = 1;               // 32 byte publickey
    uint32 msgNum = 2;          
    uint32 prevChainLen = 3;     
    bytes ciphertext = 4;       // arbitrary length bytes
}

dh: the x component of the dh_pair.publickey encoded as raw bytes. ciphertext: A protobuf encoded SDS Message

SDS Message

This payload is used without modification from the SDS Spec.

message HistoryEntry {
    string message_id = 1;        
    bytes retrieval_hint = 2;                      
  }
  
message ReliablePayload {
    string message_id = 2;      
    string channel_id = 3;  
    int32 lamport_timestamp = 10;    
    repeated HistoryEntry causal_history = 11;   
    bytes bloom_filter = 12; 
    bytes content = 20;                           
  }

content: This field is an protobuf encoded Segment

!TODO: Why is SDS using signed int for timestamps?

Segmentation

This payload is used without modification from the Segmentation specification


message SegmentMessageProto {
  bytes  entire_message_hash    = 1; // 32 Bytes
  uint32 index                  = 2; 
  uint32 segments_count         = 3;
  bytes  payload                = 4; 
  uint32 parity_segment_index   = 5;
  uint32 parity_segments_count  = 6; 
}

payload: This field is an protobuf encoded PrivateV1Frame

!TODO: This should be encoded as a FrameType so it can be optional.

Frame

message PrivateV1Frame {                 
    uint64 timestamp = 1;             // Sender reported timestamp
	oneof frame_type {
		bytes content = 10;
        Placeholder placeholder = 11;
        // ....
	}
}

content: is encoded as bytes in order to allow implementations to define the type at runtime.

Implementation Suggestions

Content Types

Implementors need to be mindful of maintaining interoperability between clients, when deciding how content is encoded prior to transmission. In a decentralized context, clients cannot be assumed to be using the same version let alone application. It is recommended that implementors use a self-describing content payload such as CONTENTFRAME specification. This provides the ability for clients to determine support for incoming frames, regardless of the software used to receive them.

Initialization

Mutual authentication is provided by the sk, so there is no requirement of using authenticated keys for ssk and rsk. Implementations SHOULD use the most ephemeral key available in order incorporate as much key material as possible. This means that senders SHOULD generate a new ephemeral key for ssk for every conversation assuming channels are asynchronously initialized.

Excessive Skipped Message

Handling of skipped message keys is not strictly defined in double ratchet. Implementations need to choose an strategy which works best for their environment, and delivery mechanism. Halting operation of the channel is the safest, as it bounds resource utilization in the event of a DOS attack but is not always possible.

If eventual delivery of messages is not guaranteed, implementors should regularly delete keys that are older than a given time window. Unreliable delivery mechanisms will result in increased key storage over time, as more messages are lost with no hope of delivery.

12 KiB

Raw Blame History

Abstract

Background

Private V1

Definitions

Architecture

Content

Payload Delivery

Initialization

Frame Encoding

Segmentation

Message Reliability

Encryption

Frame Handling

Wire Format Specification / Syntax

Payload Parse Tree

Payloads

Encrypted Payload

SDS Message

Segmentation

Frame

Implementation Suggestions

Content Types

Initialization

Excessive Skipped Message

Security/Privacy Considerations

Sender Derivation

Segmentation Session Binding

Privacy - ContentSize

Copyright

References

12 KiB Raw Blame History

Abstract

Background

Private V1

Definitions

Architecture

Content

Payload Delivery

Initialization

Frame Encoding

Segmentation

Message Reliability

Encryption

Frame Handling

Wire Format Specification / Syntax

Payload Parse Tree

Payloads

Encrypted Payload

SDS Message

Segmentation

Frame

Implementation Suggestions

Content Types

Initialization

Excessive Skipped Message

Security/Privacy Considerations

Sender Derivation

Segmentation Session Binding

Privacy - ContentSize

Copyright

References

12 KiB

Raw Blame History