125 lines
4.4 KiB
Markdown
125 lines
4.4 KiB
Markdown
# stm
|
|
|
|
[![Go Reference](https://pkg.go.dev/badge/github.com/anacrolix/stm.svg)](https://pkg.go.dev/github.com/anacrolix/stm)
|
|
[![Go Report Card](https://goreportcard.com/badge/github.com/anacrolix/stm)](https://goreportcard.com/report/github.com/anacrolix/stm)
|
|
|
|
Package `stm` provides [Software Transactional Memory](https://en.wikipedia.org/wiki/Software_transactional_memory) operations for Go. This is
|
|
an alternative to the standard way of writing concurrent code (channels and
|
|
mutexes). STM makes it easy to perform arbitrarily complex operations in an
|
|
atomic fashion. One of its primary advantages over traditional locking is that
|
|
STM transactions are composable, whereas locking functions are not -- the
|
|
composition will either deadlock or release the lock between functions (making
|
|
it non-atomic).
|
|
|
|
The `stm` API tries to mimic that of Haskell's [`Control.Concurrent.STM`](https://hackage.haskell.org/package/stm-2.4.4.1/docs/Control-Concurrent-STM.html), but
|
|
this is not entirely possible due to Go's type system; we are forced to use
|
|
`interface{}` and type assertions. Furthermore, Haskell can enforce at compile
|
|
time that STM variables are not modified outside the STM monad. This is not
|
|
possible in Go, so be especially careful when using pointers in your STM code.
|
|
|
|
Unlike Haskell, data in Go is not immutable by default, which means you have
|
|
to be careful when using STM to manage pointers. If two goroutines have access
|
|
to the same pointer, it doesn't matter whether they retrieved the pointer
|
|
atomically; modifying the pointer can still cause a data race. To resolve
|
|
this, either use immutable data structures, or replace pointers with STM
|
|
variables. A more concrete example is given below.
|
|
|
|
It remains to be seen whether this style of concurrency has practical
|
|
applications in Go. If you find this package useful, please tell us about it!
|
|
|
|
## Examples
|
|
|
|
See the package examples in the Go package docs for examples of common operations.
|
|
|
|
See [example_santa_test.go](example_santa_test.go) for a more complex example.
|
|
|
|
## Pointers
|
|
|
|
Note that `Operation` now returns a value of type `interface{}`, which isn't included in the
|
|
examples throughout the documentation yet. See the type signatures for `Atomically` and `Operation`.
|
|
|
|
Be very careful when managing pointers inside transactions! (This includes
|
|
slices, maps, channels, and captured variables.) Here's why:
|
|
|
|
```go
|
|
p := stm.NewVar([]byte{1,2,3})
|
|
stm.Atomically(func(tx *stm.Tx) {
|
|
b := tx.Get(p).([]byte)
|
|
b[0] = 7
|
|
tx.Set(p, b)
|
|
})
|
|
```
|
|
|
|
This transaction looks innocent enough, but it has a hidden side effect: the
|
|
modification of b is visible outside the transaction. Instead of modifying
|
|
pointers directly, prefer to operate on immutable values as much as possible.
|
|
Following this advice, we can rewrite the transaction to perform a copy:
|
|
|
|
```go
|
|
stm.Atomically(func(tx *stm.Tx) {
|
|
b := tx.Get(p).([]byte)
|
|
c := make([]byte, len(b))
|
|
copy(c, b)
|
|
c[0] = 7
|
|
tx.Set(p, c)
|
|
})
|
|
```
|
|
|
|
This is less efficient, but it preserves atomicity.
|
|
|
|
In the same vein, it would be a mistake to do this:
|
|
|
|
```go
|
|
type foo struct {
|
|
i int
|
|
}
|
|
p := stm.NewVar(&foo{i: 2})
|
|
stm.Atomically(func(tx *stm.Tx) {
|
|
f := tx.Get(p).(*foo)
|
|
f.i = 7
|
|
tx.Set(p, f)
|
|
})
|
|
```
|
|
|
|
...because setting `f.i` is a side-effect that escapes the transaction. Here,
|
|
the correct approach is to move the `Var` inside the struct:
|
|
|
|
```go
|
|
type foo struct {
|
|
i *stm.Var
|
|
}
|
|
f := foo{i: stm.NewVar(2)}
|
|
stm.Atomically(func(tx *stm.Tx) {
|
|
i := tx.Get(f.i).(int)
|
|
i = 7
|
|
tx.Set(f.i, i)
|
|
})
|
|
```
|
|
|
|
## Benchmarks
|
|
|
|
In synthetic benchmarks, STM seems to have a 1-5x performance penalty compared
|
|
to traditional mutex- or channel-based concurrency. However, note that these
|
|
benchmarks exhibit a lot of data contention, which is where STM is weakest.
|
|
For example, in `BenchmarkIncrementSTM`, each increment transaction retries an
|
|
average of 2.5 times. Less contentious benchmarks are forthcoming.
|
|
|
|
```
|
|
BenchmarkAtomicGet-4 50000000 26.7 ns/op
|
|
BenchmarkAtomicSet-4 20000000 65.7 ns/op
|
|
|
|
BenchmarkIncrementSTM-4 500 2852492 ns/op
|
|
BenchmarkIncrementMutex-4 2000 645122 ns/op
|
|
BenchmarkIncrementChannel-4 2000 986317 ns/op
|
|
|
|
BenchmarkReadVarSTM-4 5000 268726 ns/op
|
|
BenchmarkReadVarMutex-4 10000 248479 ns/op
|
|
BenchmarkReadVarChannel-4 10000 240086 ns/op
|
|
|
|
```
|
|
|
|
## Credits
|
|
|
|
Package stm was [originally](https://github.com/lukechampine/stm/issues/3#issuecomment-549087541)
|
|
created by lukechampine.
|