Serialization Overview

Introduction

NimYAML tries hard to make transforming YAML characters streams to native Nim types and vice versa as easy as possible. In simple scenarios, you might not need anything else than the two procs dump and load. On the other side, the process should be as customizable as possible to allow the user to tightly control how the generated YAML character stream will look and how a YAML character stream is interpreted.

An important thing to remember in NimYAML is that unlike in interpreted languages like Ruby, Nim cannot load a YAML character stream without knowing the resulting type beforehand. For example, if you want to load this piece of YAML:

%YAML 1.2
--- !nim:system:seq(nim:system:int8)
- 1
- 2
- 3

You would need to know that it will load a seq[int8] at compile time. This is not really a problem because without knowing which type you will load, you cannot do anything useful with the result afterwards in the code. But it may be unfamiliar for programmers who are used to the YAML libraries of Python or Ruby.

Supported Types

NimYAML supports a growing number of types of Nim's system module and standard library, and it also supports user-defined object, tuple and enum types out of the box.

Important: NimYAML currently does not support polymorphism or variant object types. This may be added in the future.

This also means that NimYAML is generally able to work with object, tuple and enum types defined in the standard library or a third-party library without further configuration.

Scalar Types

The following integer types are supported by NimYAML: int8, int16, int32, int64, uint8, uint16, uint32, uint64. Note that the int type is not supported, since its sice varies based on the target operation system. This makes it unfit for usage within NimYAML, because a value might not round-trip between a binary for a 32-bit OS and another binary for a 64-bit OS. The same goes for uint.

The floating point types float32 and float64 are also supported, but float is not, for the same reason.

string is supported and one of the few Nim types which directly map to a standard YAML type. char is also supported.

To support new scalar types, you must implement the constructObject() and representObject() procs on that type (see below).

Collection Types

NimYAML supports Nim's seq and Table types out of the box. Unlike the native YAML types !!seq and !!map, seq and Table define the type of all their items (or keys and values). So YAML objects with heterogeneous types in them cannot be loaded to Nim collection types. For example, this sequence:

%YAML 1.2
--- !!seq
- !!int 1
- !!string foo

Cannot be loaded to a Nim seq. For this reason, you cannot load YAML's native !!map and !!seq types directly into Nim types.

Reference Types

A reference to any supported non-reference type (including user defined types, see below) is supported by NimYAML. A reference type will be treated like its base type, but NimYAML is able to detect multiple references to the same object and dump the structure properly with anchors and aliases in place. It is possible to dump and load cyclic data structures without further configuration. It is possible for reference types to hold a nil value, which will be mapped to the !!null YAML scalar type.

Pointer types are not supported because it seems dangerous to automatically allocate memory which the user must then manually deallocate.

User Defined Types

For an object or tuple type to be directly usable with NimYAML, the following conditions must be met:

Every type contained in the object/tuple must be supported
All fields of an object type must be accessible from the code position where you call NimYAML. If an object has non-public member fields, it can only be processed in the module where it is defined.
The object may not contain a case clause.
The object may not have a generic parameter

NimYAML will present enum types as YAML scalars, and tuple and object types as YAML maps. Some of the conditions above may be loosened in future releases.

Customize Serialization

It is possible to customize the serialization of a type. For this, you need to implement two procs, constructObject̀ and representObject. If you only need to process the type in one direction (loading or dumping), you can omit the other proc.

constructObject

proc constructObject*(s: var YamlStream, c: ConstructionContext,
                      result: var MyObject)
        {.raises: [YamlConstructionError, YamlStreamError.}

This proc should construct the type from a YamlStream. Follow the following guidelines when implementing a custom constructObject proc:

For constructing a value from a YAML scalar, consider using the constructScalarItem template, which will automatically catch exceptions and wrap them with a YamlConstructionError, and also will assure that the item you use for construction is a yamlScalar. See below for an example.
For constructing a value from a YAML sequence or map, you must use the constructChild proc for child values if you want to use their constructObject implementation. This will check their tag and anchor. Always try to construct child values that way.
For non-scalars, make sure that the last value you remove from the stream is the object's ending event (yamlEndMap or yamlEndSequence)
Use peek for inspecting the next event in the YamlStream without removing it.
Never write a constructObject proc for a ref type. ref types are always handled by NimYAML itself. You can only customize the construction of the underlying object.

The following example for constructing from a YAML scalar value is the actual implementation of constructing int types:

proc constructObject*[T: int8|int16|int32|int64](
        s: var YamlStream, c: ConstructionContext, result: var T)
        {.raises: [YamlConstructionError, YamlStreamError].} =
    var item: YamlStreamEvent
    constructScalarItem(s, item, name(T)):
        result = T(parseBiggestInt(item.scalarContent))

The following example for constructiong from a YAML non-scalar is the actual implementation of constructing seq types:

proc constructObject*[T](s: var YamlStream, c: ConstructionContext,
                         result: var seq[T])
        {.raises: [YamlConstructionError, YamlStreamError].} =
    let event = s.next()
    if event.kind != yamlStartSequence:
        raise newException(YamlConstructionError, "Expected sequence start")
    result = newSeq[T]()
    while s.peek().kind != yamlEndSequence:
        var item: T
        constructChild(s, c, item)
        result.add(item)
    discard s.next()

representObject

proc representObject*(value: MyObject, ts: TagStyle = tsNone,
                      c: SerializationContext): RawYamlStream {.raises: [].}

This proc should return an iterator over YamlStreamEvent which represents the type. Follow the following guidelines when implementing a custom representObject proc:

You can use the helper template presentTag for outputting the tag.
Always output the first tag with a yAnchorNone. Anchors will be set automatically by ref type handling.
When outputting non-scalar types, you should use the representObject implementation of the child types, if possible.
Check if the given TagStyle equals tsRootOnly and if yes, change it to tsNone for the child values.
Never write a representObject proc for ref types.

The following example for representing to a YAML scalar is the actual implementation of representing int types:

proc representObject*[T: uint8|uint16|uint32|uint64](
        value: T, ts: TagStyle, c: SerializationContext):
        RawYamlStream {.raises: [].} =
    result = iterator(): YamlStreamEvent =
        yield scalarEvent($value, presentTag(T, ts), yAnchorNone)

The following example for representing to a YAML non-scalar is the actual implementation of representing seq types:

proc representObject*[T](value: seq[T], ts: TagStyle,
        c: SerializationContext): RawYamlStream {.raises: [].} =
    result = iterator(): YamlStreamEvent =
        let childTagStyle = if ts == tsRootOnly: tsNone else: ts
        yield YamlStreamEvent(kind: yamlStartSequence,
                              seqTag: presentTag(seq[T], ts),
                              seqAnchor: yAnchorNone)
        for item in value:
            var events = representObject(item, childTagStyle, c)
            while true:
                let event = events()
                if finished(events): break
                yield event
        yield YamlStreamEvent(kind: yamlEndSequence)