Debugging these tests in CI is very difficult. Because they depend on older versions of previously published modules, a failure caused by a bug in already-released code is very hard to fix.
Instead, follow the pattern in the perf tests and include the full test implementations here.