logos-storage-docs-obsidian/10 Notes/Codex Encryption Design.md

---
related-to:
  - "[[Codex Encryption Basis]]"
  - "[[How to get a pointer to a seq to pass it to a C library]]"
---
As summarized in [[Codex Encryption Basis#^b2e265|the proposal]], we:

- use a freshly generated random master key (at least 256 bits) per dataset (generated and kept on the user's machine)
- derive a new encryption key and also an IV for each block from the master key and the block index
- use for example AES192-CBC

For example, we could have

    key = SHA256( MASTER_KEY || block_index ), truncated to 192 bits
    IV  = SHA256( MASTER_IV  || block_index ), truncated to 128 bits
    
where both `MASTER_KEY` and `MASTER_IV` are 256 bit random numbers, and `||` denotes concatenation.

If storing 512 bits (as opposed to a 256 bit minimum) of key material is a problem, we could derive both by the same key, for example as

    blockKEY = SHA256( MASTER_KEY || 0x01 || block_index ), truncated to 192 bits
    blockIV  = SHA256( MASTER_KEY || 0x02 || block_index ), truncated to 128 bits

In our implementation, we will be using the second scheme - starting with a random master key, and then for each block deriving a block level key (`blockKEY`) and block level initialization vector (`blockIV`).

For some introduction and examples on BearSSL, please consult:

- [[How to generate a random number using BearSSL]]
- [[How to create a hash using BearSSL]]
- [[How to encrypt and decrypt content using symmetric encryption in BearSSL]]

In Codex, we use a slightly modified AES-CBC mode of operation:

1. we first generate a random `MASTER_KEY` - this key will be returned to the user,
2. from the `MASTER_KEY`, for each block, we derive the corresponding block level encryption key `blockKEY` and block level initialization vector `blockIV` as shown in the proposal above,
3. using the derived `blockKEY` and `blockIV`, we then encrypt the block using the BearSSL encryption primitives as demonstrated above.

As we see above, the block index is used in the process of the key and initialization vector derivation. For this reason we also need to remember to convert the block index to a byte representation - we use big-endian ordering. For this conversion a very simple function can be used:

```nim
func toBytes[T: SomeInteger](value: T): seq[byte] =
  let v =
    if system.cpuEndian == bigEndian: value
    else: swapBytes(value)
  
  result = newSeq[byte](sizeof(T))
  copyMem(addr result[0], unsafeAddr v, sizeof(T))
```

or, we can just use the `endians2.toBytes` function from `nim-stew`: `toBytes(blockIndex.uint32, bigEndian)`

Below we show the code for the derivation of the block level key and initialization vector, followed by the encryption and decryption of a block:

```nim
import std/sequtils
import bearssl/[blockx, hash]
import stew/[byteutils, endians2]

import ./rng
import ./hash

let masterKey = newSeqWith(32, Rng.instance.rand(uint8.high).byte)

let blockIndex = 1.uint32  
let blockIndexArray = toBytes(blockIndex, bigEndian)

const 
  KEY_SIZE = 24 # 192 bits for AES-192
  IV_SIZE = 16  # 128 bits
  KeyDerivationIdentifier = "aes192_block_key".toBytes
  IvDerivationIdentifier = "aes192_block_iv".toBytes
  DefaultBlockSize* = uint 1024 * 64 # as used in Codex

let keyForBlock = hash(addr sha256Vtable, masterKey & KeyDerivationIdentifier & blockIndexArray.toSeq)[0 ..< KEY_SIZE]
let ivForBlock = hash(addr sha256Vtable, masterKey & IvDerivationIdentifier & blockIndexArray.toSeq)[0 ..< IV_SIZE]

var plaintext = newSeqWith(DefaultBlockSize.int, Rng.instance.rand(uint8.high).byte)

let key = keyForBlock
let ive = ivForBlock
let encBuffer = plaintext

# encryption
var encCtx: AesBigCbcencKeys
aesBigCbcencInit(encCtx, addr key[0], key.len.uint)
aesBigCbcencRun(encCtx, addr ive[0], addr encBuffer[0], encBuffer.len.uint)

assert encBuffer != plaintext, "Encryption failed, output should differ from input!"

# decryption
let ivd = ivForBlock

var decCtx: AesBigCbcdecKeys
aesBigCbcdecInit(decCtx, addr key[0], key.len.uint)
aesBigCbcdecRun(decCtx, addr ivd[0], addr encBuffer[0], encBuffer.len.uint)

assert encBuffer == plaintext, "Decryption failed, output should match input!"
```

where `rng` and `hash` are defined as shown in [[How to generate a random number using BearSSL]] and [[How to create a hash using BearSSL]].

In a similar way we will proceed with the following, successive blocks.

### API

All data uploaded to Codex will, be default, be encrypted.

The standard upload API - `/api/codex/v1/data`  - after successful upload, returns the content identifier of the uploaded content, followed by the hex encoded master encryption key, separated by a single comma character `:`, eg:

```bash
curl -X POST \
  http://localhost:8001/api/codex/v1/data \
  -H 'Content-Type: application/octet-stream' \
  -H 'Content-Disposition: filename="enc.txt"' \
  -w '\n' \
  -T enc.txt
zDvZRwzm22eSYNdLBuNHVi7jSTR2a4n48yy4Ur9qws4vHV6madiz:541cc266b2ef426486cd981f2d7429347c68113bda2a80d21c9f18e250bfdaff
```

When downloading, the user is allowed to either retrieve the encrypted content, and then use an external utility (to be provided) to decrypt it, or the user can provide the encryption master key, which then will be used by the Codex client to decrypt the content, which then will be returned to the user:

```bash
export CID="zDvZRwzm22eSYNdLBuNHVi7jSTR2a4n48yy4Ur9qws4vHV6madiz"
export KEY="541cc266b2ef426486cd981f2d7429347c68113bda2a80d21c9f18e250bfdaff"
curl "http://localhost:8001/api/codex/v1/data/${CID}/network/stream?key=${KEY}" -o "dec.txt"
```

Notice that when no encryption key is provided, the returned content size will always be multiply of the block size.
### Links

- [bearssl](https://bearssl.org/)
- [bearssl Nim bindings](https://github.com/status-im/nim-bearssl)
- A nice example of using BearSSL encryption API (Arduino) https://github.com/kakopappa/esp8266-aes-cbc-encryption-decryption/blob/main/esp8266-aes-cbc-encryption-decryption.ino
Renames "Code Encryption Basis" to "Codex Encryption Basis" 2025-06-17 13:34:32 +02:00			`---`
			`related-to:`
			`- "[[Codex Encryption Basis]]"`
updates encryption docs 2025-06-30 14:19:35 +02:00			`- "[[How to get a pointer to a seq to pass it to a C library]]"`
Renames "Code Encryption Basis" to "Codex Encryption Basis" 2025-06-17 13:34:32 +02:00			`---`
updates encryption docs 2025-06-30 14:19:35 +02:00			`As summarized in [[Codex Encryption Basis#^b2e265\|the proposal]], we:`

			`- use a freshly generated random master key (at least 256 bits) per dataset (generated and kept on the user's machine)`
			`- derive a new encryption key and also an IV for each block from the master key and the block index`
			`- use for example AES192-CBC`

			`For example, we could have`

			`key = SHA256( MASTER_KEY \|\| block_index ), truncated to 192 bits`
			`IV = SHA256( MASTER_IV \|\| block_index ), truncated to 128 bits`

			where both `MASTER_KEY` and `MASTER_IV` are 256 bit random numbers, and `\|\|` denotes concatenation.

			`If storing 512 bits (as opposed to a 256 bit minimum) of key material is a problem, we could derive both by the same key, for example as`

updates encryption docs 2025-07-02 19:55:28 +02:00			`blockKEY = SHA256( MASTER_KEY \|\| 0x01 \|\| block_index ), truncated to 192 bits`
			`blockIV = SHA256( MASTER_KEY \|\| 0x02 \|\| block_index ), truncated to 128 bits`
updates encryption docs 2025-06-30 14:19:35 +02:00
updates encryption docs 2025-07-02 19:55:28 +02:00			In our implementation, we will be using the second scheme - starting with a random master key, and then for each block deriving a block level key (`blockKEY`) and block level initialization vector (`blockIV`).
updates encryption docs 2025-06-30 14:19:35 +02:00
updates encryption docs 2025-07-02 19:55:28 +02:00			`For some introduction and examples on BearSSL, please consult:`

			`- [[How to generate a random number using BearSSL]]`
			`- [[How to create a hash using BearSSL]]`
			`- [[How to encrypt and decrypt content using symmetric encryption in BearSSL]]`

Updates Codex Encryption docs 2025-07-08 03:14:42 +02:00			`In Codex, we use a slightly modified AES-CBC mode of operation:`
updates encryption docs 2025-07-02 19:55:28 +02:00
Updates Codex Encryption docs 2025-07-08 03:14:42 +02:00			1. we first generate a random `MASTER_KEY` - this key will be returned to the user,
			2. from the `MASTER_KEY`, for each block, we derive the corresponding block level encryption key `blockKEY` and block level initialization vector `blockIV` as shown in the proposal above,
updates encryption docs 2025-07-02 19:55:28 +02:00			3. using the derived `blockKEY` and `blockIV`, we then encrypt the block using the BearSSL encryption primitives as demonstrated above.

			`As we see above, the block index is used in the process of the key and initialization vector derivation. For this reason we also need to remember to convert the block index to a byte representation - we use big-endian ordering. For this conversion a very simple function can be used:`

			```nim
			`func toBytes[T: SomeInteger](value: T): seq[byte] =`
			`let v =`
			`if system.cpuEndian == bigEndian: value`
			`else: swapBytes(value)`

			`result = newSeq[byte](sizeof(T))`
			`copyMem(addr result[0], unsafeAddr v, sizeof(T))`
			```

			or, we can just use the `endians2.toBytes` function from `nim-stew`: `toBytes(blockIndex.uint32, bigEndian)`

			`Below we show the code for the derivation of the block level key and initialization vector, followed by the encryption and decryption of a block:`

			```nim
			`import std/sequtils`
			`import bearssl/[blockx, hash]`
			`import stew/[byteutils, endians2]`

			`import ./rng`
			`import ./hash`

			`let masterKey = newSeqWith(32, Rng.instance.rand(uint8.high).byte)`

			`let blockIndex = 1.uint32`
			`let blockIndexArray = toBytes(blockIndex, bigEndian)`

Updates Codex Encryption docs 2025-07-08 03:14:42 +02:00			`const`
			`KEY_SIZE = 24 # 192 bits for AES-192`
			`IV_SIZE = 16 # 128 bits`
			`KeyDerivationIdentifier = "aes192_block_key".toBytes`
			`IvDerivationIdentifier = "aes192_block_iv".toBytes`
			`DefaultBlockSize* = uint 1024 * 64 # as used in Codex`
updates encryption docs 2025-07-02 19:55:28 +02:00
Updates Codex Encryption docs 2025-07-08 03:14:42 +02:00			`let keyForBlock = hash(addr sha256Vtable, masterKey & KeyDerivationIdentifier & blockIndexArray.toSeq)[0 ..< KEY_SIZE]`
			`let ivForBlock = hash(addr sha256Vtable, masterKey & IvDerivationIdentifier & blockIndexArray.toSeq)[0 ..< IV_SIZE]`
updates encryption docs 2025-07-02 19:55:28 +02:00
updates the encryption design doc to use the data size different from initialization vector size 2025-07-03 14:58:51 +02:00			`var plaintext = newSeqWith(DefaultBlockSize.int, Rng.instance.rand(uint8.high).byte)`
updates encryption docs 2025-07-02 19:55:28 +02:00
			`let key = keyForBlock`
			`let ive = ivForBlock`
			`let encBuffer = plaintext`

			`# encryption`
			`var encCtx: AesBigCbcencKeys`
			`aesBigCbcencInit(encCtx, addr key[0], key.len.uint)`
Updates Codex Encryption docs 2025-07-08 03:14:42 +02:00			`aesBigCbcencRun(encCtx, addr ive[0], addr encBuffer[0], encBuffer.len.uint)`
updates encryption docs 2025-07-02 19:55:28 +02:00
			`assert encBuffer != plaintext, "Encryption failed, output should differ from input!"`

			`# decryption`
			`let ivd = ivForBlock`

			`var decCtx: AesBigCbcdecKeys`
			`aesBigCbcdecInit(decCtx, addr key[0], key.len.uint)`
Updates Codex Encryption docs 2025-07-08 03:14:42 +02:00			`aesBigCbcdecRun(decCtx, addr ivd[0], addr encBuffer[0], encBuffer.len.uint)`
updates encryption docs 2025-07-02 19:55:28 +02:00
			`assert encBuffer == plaintext, "Decryption failed, output should match input!"`
			```

			where `rng` and `hash` are defined as shown in [[How to generate a random number using BearSSL]] and [[How to create a hash using BearSSL]].

Updates Codex Encryption docs 2025-07-08 03:14:42 +02:00			`In a similar way we will proceed with the following, successive blocks.`

			`### API`

			`All data uploaded to Codex will, be default, be encrypted.`
updates encryption docs 2025-07-02 19:55:28 +02:00
Updates Codex Encryption docs 2025-07-08 03:14:42 +02:00			The standard upload API - `/api/codex/v1/data` - after successful upload, returns the content identifier of the uploaded content, followed by the hex encoded master encryption key, separated by a single comma character `:`, eg:

			```bash
			`curl -X POST \`
			`http://localhost:8001/api/codex/v1/data \`
			`-H 'Content-Type: application/octet-stream' \`
			`-H 'Content-Disposition: filename="enc.txt"' \`
			`-w '\n' \`
			`-T enc.txt`
			`zDvZRwzm22eSYNdLBuNHVi7jSTR2a4n48yy4Ur9qws4vHV6madiz:541cc266b2ef426486cd981f2d7429347c68113bda2a80d21c9f18e250bfdaff`
			```

			`When downloading, the user is allowed to either retrieve the encrypted content, and then use an external utility (to be provided) to decrypt it, or the user can provide the encryption master key, which then will be used by the Codex client to decrypt the content, which then will be returned to the user:`

			```bash
			`export CID="zDvZRwzm22eSYNdLBuNHVi7jSTR2a4n48yy4Ur9qws4vHV6madiz"`
			`export KEY="541cc266b2ef426486cd981f2d7429347c68113bda2a80d21c9f18e250bfdaff"`
			`curl "http://localhost:8001/api/codex/v1/data/${CID}/network/stream?key=${KEY}" -o "dec.txt"`
			```
updates encryption docs 2025-07-02 19:55:28 +02:00
Updates Codex Encryption docs 2025-07-08 03:14:42 +02:00			`Notice that when no encryption key is provided, the returned content size will always be multiply of the block size.`
updates encryption docs 2025-07-02 19:55:28 +02:00			`### Links`
Renames "Code Encryption Basis" to "Codex Encryption Basis" 2025-06-17 13:34:32 +02:00
			`- [bearssl](https://bearssl.org/)`
adds one more link in Codex Encryption Design 2025-06-30 16:25:33 +02:00			`- [bearssl Nim bindings](https://github.com/status-im/nim-bearssl)`
			`- A nice example of using BearSSL encryption API (Arduino) https://github.com/kakopappa/esp8266-aes-cbc-encryption-decryption/blob/main/esp8266-aes-cbc-encryption-decryption.ino`