Chrysostomos Nanakos
0a68259dc7
fix(argo): ensure vector scales up even on workflow failure
...
Signed-off-by: Chrysostomos Nanakos <chris@include.gr>
2025-10-31 13:16:58 +02:00
Chrysostomos Nanakos
200c749cb5
feat(workflows): add Vector log parsing workflow template and synchronization
...
Add workflow template for parsing logs collected by Vector from Kubernetes
pods, with semaphore synchronization to prevent concurrent access conflicts.
- log-parsing-workflow-template-vector: New workflow template that scales
down Vector aggregator to access RWO PVC, parses JSONL logs, then scales
aggregator back up
- vector-log-parsing-semaphore: ConfigMap semaphore limiting to one log
parsing workflow at a time (prevents RWO PVC mount conflicts)
- codex-workflows-rbac: Added deployment get/patch/update permissions to
executor role (required for scaling Vector aggregator)
Signed-off-by: Chrysostomos Nanakos <chris@include.gr>
2025-10-21 13:25:00 +03:00
gmega
04828514e4
feat: port bug fixes and features from swarm branch
2025-06-09 20:06:19 -03:00
gmega
b2491c26f9
fix: fix workflow expressions
2025-02-27 18:49:48 -03:00
gmega
a366f04e7c
feat: allow re-running failed experiments from previous workflow runs
2025-02-25 12:14:15 -03:00
gmega
5a9543259b
feat: add support for region k8s annotations
2025-02-24 14:16:59 -03:00
gmega
8dbc3faed8
feat: add tunable parallelism
2025-02-23 11:33:49 -03:00
gmega
48e71a315a
feat: add support for setting the node tag in benchmark workflow
2025-02-20 12:14:49 -03:00
gmega
a8c19364b7
fix: minikube env param in workflow
2025-02-20 10:21:45 -03:00
gmega
0d08814929
feat: generalize benchmark workflow to run Codex in addition to Deluge
2025-02-17 10:44:00 -03:00
gmega
e8441b7bea
fix: respect logger increments even when stream returns less data than expected
2025-02-14 15:59:28 -03:00
gmega
99992d2e7e
fix: enable cleanup on failure by default
2025-02-03 15:46:26 -03:00
gmega
61f2172304
feat: add workflow for the final experiment
2025-01-30 11:48:09 -03:00
gmega
94893c0f93
fix: conditional expression for cleanup
2025-01-29 20:35:26 -03:00
gmega
a29c010e7a
feat: allow keeping pods around on failure, add optional log parsing at end of experiment run
2025-01-29 08:47:01 -03:00
gmega
1b83f8047c
feat: update RBAC for codex workflows
2025-01-28 18:20:47 -03:00
gmega
ee67a92726
feat: grant codex runner permissions to launch subworkflows
2025-01-27 18:07:56 -03:00
gmega
90dda4f932
fix: add -C so tars do not include parent folders
2025-01-24 19:19:54 -03:00
gmega
4d4d06e7a9
feat: add log parsing workflow with upload to hetzner storage bucket
2025-01-24 18:28:28 -03:00
gmega
a9b9fd8332
fix: quotation so argo does not screw up the value array
2025-01-23 08:06:43 -03:00
gmega
8096c9f4e0
feat: add ordering to parameter matrix expander
2025-01-22 17:12:46 -03:00
gmega
d70b87d2bb
fix: production values for Argo workflows and RBAC
2025-01-22 10:31:08 -03:00
gmega
882392bef2
fix: add missing parameters to cleanup hook
2025-01-20 18:41:11 -03:00
gmega
7e07eda3c2
feat: allow running workflows from locally loaded images under Minikube
2025-01-20 17:57:21 -03:00
gmega
5a203fad18
chore: eliminate 5GB experiment for now
2025-01-20 15:29:27 -03:00
gmega
bfabd1c4c8
feat: label components with /component label, use /name to refer to benchmark pods; add README
2025-01-09 09:27:21 -03:00
gmega
a4fe12e620
feat: add new Helm chart parameters to workflow
2025-01-08 16:43:01 -03:00
gmega
59f3a9a584
fix: remove useless sync point which was causing issues
2024-12-20 18:00:32 -03:00
gmega
f3a66d9637
fix: workaround for broken Argo exit hooks
2024-12-20 07:51:58 -03:00
gmega
5447762f45
feat: add parameter expansion to deluge workflow
2024-12-19 20:48:40 -03:00
gmega
469ae82009
feat: add workflow matrix expansion sketch
2024-12-19 11:18:15 -03:00
gmega
9fb91179bd
docs: minor rewording for RBAC manifest
2024-12-18 19:09:47 -03:00
gmega
b4b625ad34
feat: add working parametererized Argo workflow for deluge benchmarks
2024-12-18 18:47:21 -03:00
gmega
bea51a5adf
feat: add argo workflow sketch
2024-12-18 14:34:22 -03:00