912 Commits

Author SHA1 Message Date
Eric
139aa6eef8
readd prevention of hanging when closing process streams on windows 2025-06-05 15:58:06 +10:00
Eric
e7a9adab21
Add debugging and assertions after test run
Mostly to debug in windows
2025-06-05 15:58:06 +10:00
Eric
f4b78f048f
CI yml fix for gcc-14 2025-06-05 15:58:05 +10:00
Eric
f3901d13b1
Update NBS action.yml in CI 2025-06-05 15:58:05 +10:00
Eric
67719ee395
formatting 2025-06-05 15:58:05 +10:00
Eric
e648954eea
formatting 2025-06-05 15:58:05 +10:00
Eric
357840db4d
Fix process hanging after restart 2025-06-05 15:58:05 +10:00
Eric
eb77643405
refactor: debug and logging
- Move test manager values to config object
- Increase codex port separation (between api and disc ports) in an attempt to prevent overlap across tests (ie Test1: api=8000(tcp), disc=9000(udp), and Test2: api=9000(tcp), disc=10000(udp))
- print stderr when exitcode == 1 if there's < 3 lines of stdout
- Logging:
  - Always write test manager harness chronicles logs to file, ie testmanager.chronicles.log in the root of the `integration/logs/<run name>` dir
  - Always write individual test stdout to file, ie `<test file name>.stdout.log` in the root of the `integration/logs/<run name>/<test file name>` dir
  - On error, print stderr to screen and write stderr to file. Or on failure, if stdout is sufficiently short, write stderr to screen and file in `integration/logs/<run name>/<test file name>/<test file name>.stderr.log`
  - When debugging, ie DebugCodexNodes == true
- Removes DebugTestHarness from controlling anything other than printing chronicles output from the testmanager to the terminal
- Now, if DebugCodexNodes is set to true:
  - Codex node (chronicles) output for multinodesuite tests is logged to file, eg `integration/logs/<run name>/<test file name>/<test name>/<role>_<idx>.log`
  - Codex chronicles output is logged to stdout, which also written to file (see above)
2025-06-05 15:58:05 +10:00
Eric
0a4369d0cd
enable more logging to file
- test harness chronicles logs to log file
- test process stdout and stderr to log file for each integration test file

Note: does not work when DEBUG=1 is not present in make command
2025-06-05 15:58:04 +10:00
Eric
595e1d3fdb
Ensure started tests are torn down properly on timeout 2025-06-05 15:58:04 +10:00
Eric
b9e3e07dc1
filter out log topics that are making too much noise
- set disabled topics during codex compilation due to lack of support for disabling chronicles topics at runtime in codex
- ensure that TRACE logs are set for nodes running in multinodesuite when DebugCodexNodes is set
- add TRACE loglevel to testcli
2025-06-05 15:58:04 +10:00
Slava
494a3b9244
ci: check runner for parallel integration tests and use value from vars 2025-06-05 15:58:04 +10:00
Eric
40c243a080
Colorise stdout from test runs
Adds terminal colour to [OK], [FAILED], and [Suite]

# Conflicts:
#	tests/integration/testmanager.nim
2025-06-05 15:58:04 +10:00
Eric
72d2931183
Refactor process launch
Refactor to keep a process handle so that the processes can be stopped on timeout.
2025-06-05 15:58:04 +10:00
Eric
bc5fdd49be
run tests async from start so that stop can cancel all tracked futures 2025-06-05 15:58:03 +10:00
Eric
fb2e3436f6
Add proper async exception tracking to multinodesuite 2025-06-05 15:58:03 +10:00
Eric
592e7c1e27
bump chronos to 4.0.4
potentially fixes an issue on macos where process exit codes were reported -1
2025-06-05 15:58:03 +10:00
Eric
c3558e1e04
formatting 2025-06-05 15:58:03 +10:00
Eric
8257ec29b8
CI: enable artefact upload for parallel tests 2025-06-05 15:58:03 +10:00
Eric
23875b875d
run npm install for parallel tests in ci 2025-06-05 15:58:03 +10:00
Eric
22cdacb0fe
Allow debug runs from the github UI 2025-06-05 15:58:03 +10:00
Eric
cddb407ddc
clean up 2025-06-05 15:58:02 +10:00
Eric
c59fa9f1e5
increase port gap by 1000 to avoid clashes in windows 2025-06-05 15:58:02 +10:00
Eric
aa6678b8ff
lock codex port selection in multinodesuites
Codex api/disc ports were clashing in windows because `lastUsedCodexApiPort` and `lastUsedCodeexDiscPort` were not locked when read and written, while client and provider nodes were started concurrently (async).
2025-06-05 15:58:02 +10:00
Eric
5333856083
add test name to datadir path
avoids reusing datadirs between tests
2025-06-05 15:58:02 +10:00
Eric
25ce97e81e
update Make flag to simply debug
DEBUG -> enables DebugCodexNodes, DebugTestHarness, and ShowContinuousStatusUpdates
DEBUG_HARDHAT -> enables DebugHardhat
PARALLEL -> enables EnableParallelTests

Additionally, when DEBUG is enabled, all integration tests debug configs are enabled for Codex nodes, the Codex node output is printed with the test output (not interleaved), and the Codex node output is logged to file in `tests/integrations/logs/<starttime>__IntegrationTests/<integration_test_name>/<suite_name>/<testname>/<role>_<idx>.log`.
When DEBUG_HARDHAT is enabled, all hardhat output is printed with the test output (not interleaved), and the output is also written to a log file in `tests/integrations/logs/<starttime>__IntegrationTests/<integration_test_name>/hardhat.log
2025-06-05 15:58:02 +10:00
Eric
29df100e93
cancel processes soon
Windows really does not like to terminate processes, so if we try to cancel the process, we will hang in Windows.
2025-06-05 15:58:01 +10:00
Eric
92ca6de95b
fix: hardhot not being stopped when test times out 2025-06-05 15:58:01 +10:00
Eric
7ea21795c9
Allow parallel integration tests to be disabled
Disable with `make ENABLE_PARALLEL_TESTS=0 testIntegration`
2025-06-05 15:58:01 +10:00
Eric
44f99f5d39
fix: use the last used port instead of the globally defined port 2025-06-05 15:58:01 +10:00
Eric
42701ee54e
Increase hardhat, codex api, and codex discovery ports for each subsequent test
Previously, each test would start with a port specified by a global const and overwritable using `define` cli flag. This seems to be causing some port clashes that `nextFreePort` is not picking up. Instead of relying on `nextFreePort`, do not reuse port numbers for any of the tests.
2025-06-05 15:58:01 +10:00
Eric
fb39f12fe4
enable debug logs for testpurchasing since the failure in ci 2025-06-05 15:58:01 +10:00
Eric
680430ef81
swap linux large runner 2025-06-05 15:58:00 +10:00
Eric
7695610307
fix waiting for hardhat node process exit on windows 2025-06-05 15:58:00 +10:00
Eric
3554c772cc
handle cancellations for race and allFutures 2025-06-05 15:58:00 +10:00
Eric
dca40e0682
fix hardhat running on windows
Solution was two-fold:
1. Run the hardhat.cmd script instead of hardhat
2. Use the absolute path to the script executable instead of relative to the working directory
2025-06-05 15:58:00 +10:00
Eric
7c248b45ab
Bubble errors from hardhat start
- bubble errors from hardhatprocess.start (instead of just logging)
- push raises: [] in all nodeprocess procs/methods to avoid leaking `Exception` exception types in method overrides
2025-06-05 15:58:00 +10:00
Eric
3e4d677b82
re-add windows to CI 2025-06-05 15:58:00 +10:00
Eric
52479ef0e8
nil check hardhat process before stopping 2025-06-05 15:58:00 +10:00
Eric
d260e6350c
fix exit status 2025-06-05 15:57:59 +10:00
Eric
5659e442e1
use macos-14-large
windows is failing with SIGSEGV so remove it for now
2025-06-05 15:57:59 +10:00
Eric
3040d27a58
add windows large runner 32vcpu 2025-06-05 15:57:59 +10:00
Eric
0dcf8ded69
indicate with exit code 1 if the tests failed 2025-06-05 15:57:59 +10:00
Eric
7d3013f3b3
fix: pass show continuous updates into test manager instance 2025-06-05 15:57:59 +10:00
Eric
702706d4ee
trying a self-hosted runner to see if it has more resources 2025-06-05 15:57:59 +10:00
Eric
14cc8fdc75
bump to larger runner, linux only 2025-06-05 15:57:58 +10:00
Eric
3fd43f38e8
temp remove coverage from ci workflow 2025-06-05 15:57:58 +10:00
Eric
28dea5bfd2
try large runners for linux/windows 2025-06-05 15:57:58 +10:00
Eric
373d4b5a1a
use large linux runner and disable other tests 2025-06-05 15:57:58 +10:00
Eric
e902df5365
undo changes done for CI debugging 2025-06-05 15:57:58 +10:00