* Bit-order in-place reversal optimizations
* optimization/simplification
* Done modulo documentation and testing on x86
* Minor type fixes on non-ARM
* Minor x86
* Transpose docs
* Docs
* Make rustfmt happy
* Bug fixes + tests
* Minor docs + lints