multicodec/README.md

# multicodec

[![](https://img.shields.io/badge/made%20by-Protocol%20Labs-blue.svg?style=flat-square)](http://ipn.io)
[![](https://img.shields.io/badge/project-multiformats-blue.svg?style=flat-square)](https://github.com/multiformats/multiformats)
[![](https://img.shields.io/badge/freenode-%23ipfs-blue.svg?style=flat-square)](https://webchat.freenode.net/?channels=%23ipfs)
[![](https://img.shields.io/badge/readme%20style-standard-brightgreen.svg?style=flat-square)](https://github.com/RichardLitt/standard-readme)

> Canonical table of of codecs used by various multiformats

## Table of Contents

- [Motivation](#motivation)
- [Description](#description)
- [Examples](#examples)
- [Multicodec table](#multicodec-table)
  - [Adding new multicodecs to the table](#adding-new-multicodecs-to-the-table)
- [Implementations](#implementations)
- [Reserved Code Ranges](#reserved-code-ranges)
- [FAQ](#faq)
- [Contribute](#contribute)
- [License](#license)

## Motivation

Multicodec is an agreed-upon codec table. It is designed for use in binary representations, such as keys or identifiers (i.e [CID](https://github.com/ipld/cid)).

## Description

The code of a multicodec is usually encoded as unsigned varint as defined by [multiformats/unsigned-varint](https://github.com/multiformats/unsigned-varint). It is then used as a prefix to identify the data that follows.

## Examples

Multicodec is used in various [Multiformats](https://github.com/multiformats/multiformats). In [Multihash](https://github.com/multiformats/multihash) it is used to identify the hashes, in the machine-readable [Multiaddr](https://github.com/multiformats/multiaddr) to identify components such as IP addresses, domain names, identities, etc.

## Multicodec table

Find the canonical table of multicodecs at [table.csv](/table.csv). There's also a sortable [viewer](https://ipfs.io/ipfs/QmXec1jjwzxWJoNbxQF5KffL8q6hFXm9QwUGaa3wKGk6dT/#title=Multicodecs&src=https://raw.githubusercontent.com/multiformats/multicodec/master/table.csv).

### Status

Each multicodec is marked with a status:

* draft - this codec has been reserved but may be reassigned if it doesn't gain wide adoption.
* permanent - this codec has been widely adopted and may not reassigned.

NOTE: Just because a codec is marked draft, don't assume that it can be re-assigned. Check to see if it ever gained wide adoption and, if so, mark it as permanent.

### Adding new multicodecs to the table

The process to add a new multicodec to the table is the following:

1. Fork this repo
2. Add your codecs to the table. Each newly proposed codec must have:
  1. A unique codec.
  2. A unique name.
  3. A category.
  4. A status of "draft".
3. Submit a Pull Request

This ["first come, first assign"](https://github.com/multiformats/multicodec/pull/16#issuecomment-260146609) policy is a way to assign codes as they are most needed, without increasing the size of the table (and therefore the size of the multicodecs) too rapidly.

The first 127 bits are encoded as a single-byte varint, hence they are reserved for the most widely used multicodecs. So if you are adding your own codec to the table, you most likely would want to ask for a codec bigger than `0x80`.

Codec names should be easily convertible to constants in common programming languages using basic transformation rules (e.g. upper-case, conversion of `-` to `_`, etc.). Therefore they should contain alphanumeric characters, with the first character being alphabetic. The primary delimiter for multi-part names should be `-`, with `_` reserved for cases where a secondary delimiter is required. For example: `bls12_381-g1-pub` contains 3 parts: `bls_381`, `g1` and `pub`, where `bls_381` is "BLS 381" which is not commonly written as "BLS381" and therefore requires a secondary separator.

The `validate.py` script can be used to validate the table once it's edited.

## Implementations

- [go](https://github.com/multiformats/go-multicodec/)
- [JavaScript](https://github.com/multiformats/js-multicodec)
- [Python](https://github.com/multiformats/py-multicodec)
- [Haskell](https://github.com/multiformats/haskell-multicodec)
- [Elixir](https://github.com/nocursor/ex-multicodec)
- [Scala](https://github.com/fluency03/scala-multicodec)
- [Ruby](https://github.com/sleeplessbyte/ruby-multicodec)
- [Add yours today!](https://github.com/multiformats/multicodec/edit/master/table.csv)

## Reserved Code Ranges

The following code ranges have special meaning and may only have meanings assigned to as specified in their description:

### Private Use Area

*Range*: `0x300000 – 0x3FFFFF`

Codes in this range are reserved for internal use by applications and will never be assigned any meaning as part of the Multicodec specification.

## FAQ

> Why varints?

So that we have no limitation on protocols.

> What kind of varints?

An Most Significant Bit unsigned varint, as defined by the [multiformats/unsigned-varint](https://github.com/multiformats/unsigned-varint).

> Don't we have to agree on a table of protocols?

Yes, but we already have to agree on what protocols themselves are, so this is not so hard. The table even leaves some room for custom protocol paths, or you can use your own tables. The standard table is only for common things.

> Where did multibase go?

For a period of time, the [multibase](https://github.com/multiformats/multibase) prefixes lived in this table. However, multibase prefixes are *symbols* that may map to *multiple* underlying byte representations (that may overlap with byte sequences used for other multicodecs). Including them in a table for binary/byte identifiers lead to more confusion than it solved.

You can still find the table in [multibase.csv](https://github.com/multiformats/multibase/blob/master/multibase.csv).

> Can I use multicodec for my own purpose?

Sure, you can use multicodec whenever you have the need for self-identifiable data. Just prefix your own data with the corresponding varint encodec multicodec.

## Contribute

Contributions welcome. Please check out [the issues](https://github.com/multiformats/multicodec/issues).

Check out our [contributing document](https://github.com/multiformats/multiformats/blob/master/contributing.md) for more information on how we work, and about contributing in general. Please be aware that all interactions related to multiformats are subject to the IPFS [Code of Conduct](https://github.com/ipfs/community/blob/master/code-of-conduct.md).

Small note: If editing the README, please conform to the [standard-readme](https://github.com/RichardLitt/standard-readme) specification.

## License

This repository is only for documents. All of these are licensed under the [CC-BY-SA 3.0](https://ipfs.io/ipfs/QmVreNvKsQmQZ83T86cWSjPu2vR3yZHGPm5jnxFuunEB9u) license © 2016 Protocol Labs Inc. Any code is under a [MIT](LICENSE) © 2016 Protocol Labs Inc.
-												Standardized Readme

See multiformats/multiformats#13

											
										
										
											2016-08-15 20:53:49 +00:00
+								# multicodec
-												wip

											
										
										
											2015-08-23 22:34:57 +00:00
-												Standardized Readme

See multiformats/multiformats#13

											
										
										
											2016-08-15 20:53:49 +00:00
+								[![](https://img.shields.io/badge/made%20by-Protocol%20Labs-blue.svg?style=flat-square)](http://ipn.io)
-												Edited README

- Fixed https issue in links
- Added standard-readme badge
- Changed descrption by capitalizing Compact and changing on GitHub
- Fixed edit link to point to the table
- Added note about readme to contribute
- Added CC license to license section
- Added year and Protocol Labs to MIT code license

											
										
										
											2016-12-27 19:27:39 +00:00
+								[![](https://img.shields.io/badge/project-multiformats-blue.svg?style=flat-square)](https://github.com/multiformats/multiformats)
 								[![](https://img.shields.io/badge/freenode-%23ipfs-blue.svg?style=flat-square)](https://webchat.freenode.net/?channels=%23ipfs)
 								[![](https://img.shields.io/badge/readme%20style-standard-brightgreen.svg?style=flat-square)](https://github.com/RichardLitt/standard-readme)
-												bring updates on updated multistream

											
										
										
											2015-08-24 10:16:30 +00:00
-												Make README reflect what multicodec currently is used for

Multicodec changed over time what it actually is. The README should reflect
the current state on how it is used.

Closes #133.

											
										
										
											2019-05-29 11:48:11 +00:00
+								> Canonical table of of codecs used by various multiformats
-												wip

											
										
										
											2015-08-23 22:34:57 +00:00
-												Standardized Readme

See multiformats/multiformats#13

											
										
										
											2016-08-15 20:53:49 +00:00
+								## Table of Contents
-												bring updates on updated multistream

											
										
										
											2015-08-24 10:16:30 +00:00
-												Standardized Readme

See multiformats/multiformats#13

											
										
										
											2016-08-15 20:53:49 +00:00
+								- [Motivation](#motivation)
-												Make README reflect what multicodec currently is used for

Multicodec changed over time what it actually is. The README should reflect
the current state on how it is used.

Closes #133.

											
										
										
											2019-05-29 11:48:11 +00:00
+								- [Description](#description)
 								- [Examples](#examples)
-												docs: remove references to multicodec-packed (#72)

multicodec-packed isn't a thing anymore, it was what multicodec is now.
Hence remove all references to it.
											
										
										
											2018-02-01 21:20:11 +00:00
+								- [Multicodec table](#multicodec-table)
-												Make README reflect what multicodec currently is used for

Multicodec changed over time what it actually is. The README should reflect
the current state on how it is used.

Closes #133.

											
										
										
											2019-05-29 11:48:11 +00:00
+								  - [Adding new multicodecs to the table](#adding-new-multicodecs-to-the-table)
-												Standardized Readme

See multiformats/multiformats#13

											
										
										
											2016-08-15 20:53:49 +00:00
+								- [Implementations](#implementations)
-												Add reserved code range for private use by applications (#191)

See GH/multiformats/multicodec#158. Changes are analogous to those proposed in GH/multiformats/multicodec#159.
											
										
										
											2020-08-28 04:14:39 +00:00
+								- [Reserved Code Ranges](#reserved-code-ranges)
-												Standardized Readme

See multiformats/multiformats#13

											
										
										
											2016-08-15 20:53:49 +00:00
+								- [FAQ](#faq)
 								- [Contribute](#contribute)
 								- [License](#license)
-												bring updates on updated multistream

											
										
										
											2015-08-24 10:16:30 +00:00
 								## Motivation
-												wip

											
										
										
											2015-08-23 22:34:57 +00:00
-												Make README reflect what multicodec currently is used for

Multicodec changed over time what it actually is. The README should reflect
the current state on how it is used.

Closes #133.

											
										
										
											2019-05-29 11:48:11 +00:00
+								Multicodec is an agreed-upon codec table. It is designed for use in binary representations, such as keys or identifiers (i.e [CID](https://github.com/ipld/cid)).
-												Standardized Readme

See multiformats/multiformats#13

											
										
										
											2016-08-15 20:53:49 +00:00
-												Make README reflect what multicodec currently is used for

Multicodec changed over time what it actually is. The README should reflect
the current state on how it is used.

Closes #133.

											
										
										
											2019-05-29 11:48:11 +00:00
+								## Description
-												Standardized Readme

See multiformats/multiformats#13

											
										
										
											2016-08-15 20:53:49 +00:00
-												Make README reflect what multicodec currently is used for

Multicodec changed over time what it actually is. The README should reflect
the current state on how it is used.

Closes #133.

											
										
										
											2019-05-29 11:48:11 +00:00
+								The code of a multicodec is usually encoded as unsigned varint as defined by [multiformats/unsigned-varint](https://github.com/multiformats/unsigned-varint). It is then used as a prefix to identify the data that follows.
-												Standardized Readme

See multiformats/multiformats#13

											
										
										
											2016-08-15 20:53:49 +00:00
-												Make README reflect what multicodec currently is used for

Multicodec changed over time what it actually is. The README should reflect
the current state on how it is used.

Closes #133.

											
										
										
											2019-05-29 11:48:11 +00:00
+								## Examples
-												wip

											
										
										
											2015-08-23 22:34:57 +00:00
-												Make README reflect what multicodec currently is used for

Multicodec changed over time what it actually is. The README should reflect
the current state on how it is used.

Closes #133.

											
										
										
											2019-05-29 11:48:11 +00:00
+								Multicodec is used in various [Multiformats](https://github.com/multiformats/multiformats). In [Multihash](https://github.com/multiformats/multihash) it is used to identify the hashes, in the machine-readable [Multiaddr](https://github.com/multiformats/multiaddr) to identify components such as IP addresses, domain names, identities, etc.
-												wip

											
										
										
											2015-08-23 22:34:57 +00:00
-												fix: fix some mistakes in the spec

											
										
										
											2016-09-25 07:46:07 +00:00
+								## Multicodec table
-												wip

											
										
										
											2015-08-23 22:34:57 +00:00
-												Make README reflect what multicodec currently is used for

Multicodec changed over time what it actually is. The README should reflect
the current state on how it is used.

Closes #133.

											
										
										
											2019-05-29 11:48:11 +00:00
+								Find the canonical table of multicodecs at [table.csv](/table.csv). There's also a sortable [viewer](https://ipfs.io/ipfs/QmXec1jjwzxWJoNbxQF5KffL8q6hFXm9QwUGaa3wKGk6dT/#title=Multicodecs&src=https://raw.githubusercontent.com/multiformats/multicodec/master/table.csv).
-												break multicodec table into its own file, add a note of how to add new codes

											
										
										
											2016-11-13 05:23:37 +00:00
-												add draft/standard statuses to the readme

											
										
										
											2020-03-16 19:35:29 +00:00
+								### Status
 								Each multicodec is marked with a status:
 								* draft - this codec has been reserved but may be reassigned if it doesn't gain wide adoption.
-												Rename "standard" to "permanent"

This makes it clearer that there doesn't necessarily be an official standard,
but having wide adoption and being a de-facto standard could still be enough.

											
										
										
											2020-04-15 16:40:15 +00:00
+								* permanent - this codec has been widely adopted and may not reassigned.
-												add draft/standard statuses to the readme

											
										
										
											2020-03-16 19:35:29 +00:00
-												README: add a note about draft

											
										
										
											2021-04-13 17:46:44 +00:00
+								NOTE: Just because a codec is marked draft, don't assume that it can be re-assigned. Check to see if it ever gained wide adoption and, if so, mark it as permanent.
-												break multicodec table into its own file, add a note of how to add new codes

											
										
										
											2016-11-13 05:23:37 +00:00
+								### Adding new multicodecs to the table
 								The process to add a new multicodec to the table is the following:
-												add draft/standard statuses to the readme

											
										
										
											2020-03-16 19:35:29 +00:00
+. Fork this repo
 . Add your codecs to the table. Each newly proposed codec must have:
 . A unique codec.
 . A unique name.
 . A category.
 . A status of "draft".
 . Submit a Pull Request
-												break multicodec table into its own file, add a note of how to add new codes

											
										
										
											2016-11-13 05:23:37 +00:00
 								This ["first come, first assign"](https://github.com/multiformats/multicodec/pull/16#issuecomment-260146609) policy is a way to assign codes as they are most needed, without increasing the size of the table (and therefore the size of the multicodecs) too rapidly.
-												bring updates on updated multistream

											
										
										
											2015-08-24 10:16:30 +00:00
-												Make README reflect what multicodec currently is used for

Multicodec changed over time what it actually is. The README should reflect
the current state on how it is used.

Closes #133.

											
										
										
											2019-05-29 11:48:11 +00:00
+								The first 127 bits are encoded as a single-byte varint, hence they are reserved for the most widely used multicodecs. So if you are adding your own codec to the table, you most likely would want to ask for a codec bigger than `0x80`.
-												Naming restrictions (doc & validate), rename `0xcert` to be code-friendly

Fixes: https://github.com/multiformats/multicodec/issues/181
Ref: https://github.com/multiformats/multicodec/pull/177

											
										
										
											2020-07-20 03:59:24 +00:00
+								Codec names should be easily convertible to constants in common programming languages using basic transformation rules (e.g. upper-case, conversion of `-` to `_`, etc.). Therefore they should contain alphanumeric characters, with the first character being alphabetic. The primary delimiter for multi-part names should be `-`, with `_` reserved for cases where a secondary delimiter is required. For example: `bls12_381-g1-pub` contains 3 parts: `bls_381`, `g1` and `pub`, where `bls_381` is "BLS 381" which is not commonly written as "BLS381" and therefore requires a secondary separator.
 								The `validate.py` script can be used to validate the table once it's edited.
-												bring updates on updated multistream

											
										
										
											2015-08-24 10:16:30 +00:00
+								## Implementations
-												update with regards to https://github.com/multiformats/multicodec/pull/16\#issuecomment-249497577

											
										
										
											2016-09-26 08:06:15 +00:00
+								- [go](https://github.com/multiformats/go-multicodec/)
 								- [JavaScript](https://github.com/multiformats/js-multicodec)
-												Add py-multicodec to the list of implementations (#60)


											
										
										
											2017-09-16 12:17:15 +00:00
+								- [Python](https://github.com/multiformats/py-multicodec)
-												Update README.md (#67)


											
										
										
											2017-11-04 16:42:07 +00:00
+								- [Haskell](https://github.com/multiformats/haskell-multicodec)
-												Added Elixir implementation link
											
										
										
											2018-11-16 19:21:45 +00:00
+								- [Elixir](https://github.com/nocursor/ex-multicodec)
-												Add link to Scala implementation

add fluency03/scala-multicodec

											
										
										
											2018-11-15 21:53:51 +00:00
+								- [Scala](https://github.com/fluency03/scala-multicodec)
-												Add Ruby implementation
											
										
										
											2019-06-20 15:58:26 +00:00
+								- [Ruby](https://github.com/sleeplessbyte/ruby-multicodec)
-												Edited README

- Fixed https issue in links
- Added standard-readme badge
- Changed descrption by capitalizing Compact and changing on GitHub
- Fixed edit link to point to the table
- Added note about readme to contribute
- Added CC license to license section
- Added year and Protocol Labs to MIT code license

											
										
										
											2016-12-27 19:27:39 +00:00
+								- [Add yours today!](https://github.com/multiformats/multicodec/edit/master/table.csv)
-												bring updates on updated multistream

											
										
										
											2015-08-24 10:16:30 +00:00
-												Add reserved code range for private use by applications (#191)

See GH/multiformats/multicodec#158. Changes are analogous to those proposed in GH/multiformats/multicodec#159.
											
										
										
											2020-08-28 04:14:39 +00:00
+								## Reserved Code Ranges
 								The following code ranges have special meaning and may only have meanings assigned to as specified in their description:
 								### Private Use Area
 								*Range*: `0x300000 – 0x3FFFFF`
 								Codes in this range are reserved for internal use by applications and will never be assigned any meaning as part of the Multicodec specification.
-												Standardized Readme

See multiformats/multiformats#13

											
										
										
											2016-08-15 20:53:49 +00:00
+								## FAQ
-												wip

											
										
										
											2015-08-23 22:34:57 +00:00
-												Make README reflect what multicodec currently is used for

Multicodec changed over time what it actually is. The README should reflect
the current state on how it is used.

Closes #133.

											
										
										
											2019-05-29 11:48:11 +00:00
+								> Why varints?
-												wip

											
										
										
											2015-08-23 22:34:57 +00:00
-												Remove implementation note

There are already multicodec codes in the table which are > 127
(Blake and Stein hashes). Hence implementations need to implement
varint.

											
										
										
											2018-12-03 12:34:31 +00:00
+								So that we have no limitation on protocols.
-												wip

											
										
										
											2015-08-23 22:34:57 +00:00
-												Make README reflect what multicodec currently is used for

Multicodec changed over time what it actually is. The README should reflect
the current state on how it is used.

Closes #133.

											
										
										
											2019-05-29 11:48:11 +00:00
+								> What kind of varints?
-												wip

											
										
										
											2015-08-23 22:34:57 +00:00
-												update with regards to https://github.com/multiformats/multicodec/pull/16\#issuecomment-249497577

											
										
										
											2016-09-26 08:06:15 +00:00
+								An Most Significant Bit unsigned varint, as defined by the [multiformats/unsigned-varint](https://github.com/multiformats/unsigned-varint).
-												Standardized Readme

See multiformats/multiformats#13

											
										
										
											2016-08-15 20:53:49 +00:00
-												Make README reflect what multicodec currently is used for

Multicodec changed over time what it actually is. The README should reflect
the current state on how it is used.

Closes #133.

											
										
										
											2019-05-29 11:48:11 +00:00
+								> Don't we have to agree on a table of protocols?
-												added multicodec-packed beginning

The table is still missing.

											
										
										
											2016-08-25 01:18:07 +00:00
-												update with regards to https://github.com/multiformats/multicodec/pull/16\#issuecomment-249497577

											
										
										
											2016-09-26 08:06:15 +00:00
+								Yes, but we already have to agree on what protocols themselves are, so this is not so hard. The table even leaves some room for custom protocol paths, or you can use your own tables. The standard table is only for common things.
-												added multicodec-packed beginning

The table is still missing.

											
										
										
											2016-08-25 01:18:07 +00:00
-												Make README reflect what multicodec currently is used for

Multicodec changed over time what it actually is. The README should reflect
the current state on how it is used.

Closes #133.

											
										
										
											2019-05-29 11:48:11 +00:00
+								> Where did multibase go?
-												fill out multibase table and treat multibases as symbols

This extends the concept of multicodecs to general symbolic (text) strings, not
just byte strings.

											
										
										
											2017-09-07 00:25:51 +00:00
-												Make README reflect what multicodec currently is used for

Multicodec changed over time what it actually is. The README should reflect
the current state on how it is used.

Closes #133.

											
										
										
											2019-05-29 11:48:11 +00:00
+								For a period of time, the [multibase](https://github.com/multiformats/multibase) prefixes lived in this table. However, multibase prefixes are *symbols* that may map to *multiple* underlying byte representations (that may overlap with byte sequences used for other multicodecs). Including them in a table for binary/byte identifiers lead to more confusion than it solved.
-												move multibase prefixes out of this table

Resolution from a discussion with Juan and the discussion on the following
issues:

fixes #89
fixes #76

											
										
										
											2018-11-15 19:23:21 +00:00
 								You can still find the table in [multibase.csv](https://github.com/multiformats/multibase/blob/master/multibase.csv).
-												fill out multibase table and treat multibases as symbols

This extends the concept of multicodecs to general symbolic (text) strings, not
just byte strings.

											
										
										
											2017-09-07 00:25:51 +00:00
-												Make README reflect what multicodec currently is used for

Multicodec changed over time what it actually is. The README should reflect
the current state on how it is used.

Closes #133.

											
										
										
											2019-05-29 11:48:11 +00:00
+								> Can I use multicodec for my own purpose?
 								Sure, you can use multicodec whenever you have the need for self-identifiable data. Just prefix your own data with the corresponding varint encodec multicodec.
-												Standardized Readme

See multiformats/multiformats#13

											
										
										
											2016-08-15 20:53:49 +00:00
+								## Contribute
 								Contributions welcome. Please check out [the issues](https://github.com/multiformats/multicodec/issues).
 								Check out our [contributing document](https://github.com/multiformats/multiformats/blob/master/contributing.md) for more information on how we work, and about contributing in general. Please be aware that all interactions related to multiformats are subject to the IPFS [Code of Conduct](https://github.com/ipfs/community/blob/master/code-of-conduct.md).
-												Edited README

- Fixed https issue in links
- Added standard-readme badge
- Changed descrption by capitalizing Compact and changing on GitHub
- Fixed edit link to point to the table
- Added note about readme to contribute
- Added CC license to license section
- Added year and Protocol Labs to MIT code license

											
										
										
											2016-12-27 19:27:39 +00:00
+								Small note: If editing the README, please conform to the [standard-readme](https://github.com/RichardLitt/standard-readme) specification.
-												Standardized Readme

See multiformats/multiformats#13

											
										
										
											2016-08-15 20:53:49 +00:00
+								## License
-												Edited README

- Fixed https issue in links
- Added standard-readme badge
- Changed descrption by capitalizing Compact and changing on GitHub
- Fixed edit link to point to the table
- Added note about readme to contribute
- Added CC license to license section
- Added year and Protocol Labs to MIT code license

											
										
										
											2016-12-27 19:27:39 +00:00
+								This repository is only for documents. All of these are licensed under the [CC-BY-SA 3.0](https://ipfs.io/ipfs/QmVreNvKsQmQZ83T86cWSjPu2vR3yZHGPm5jnxFuunEB9u) license © 2016 Protocol Labs Inc. Any code is under a [MIT](LICENSE) © 2016 Protocol Labs Inc.