Commit Graph

503 Commits

Author SHA1 Message Date
Jakub Sokołowski d1d762a686
nimbus.fluffy: lower sensitivity for consul checks
Signed-off-by: Jakub Sokołowski <jakub@status.im>
2024-04-23 12:49:37 +02:00
Jakub Sokołowski 29e7870a8b
nimbus.prater: decommission last two remaining hosts
The testnet is dead, and keeping it around is not necessary.

Signed-off-by: Jakub Sokołowski <jakub@status.im>
2024-04-19 10:13:23 +02:00
Jakub Sokołowski ce13aac5ce
nimbus.mainnet: download BN when branch is stable
Signed-off-by: Jakub Sokołowski <jakub@status.im>
2024-04-01 13:50:34 +02:00
Jakub Sokołowski f26dfe57c4
nimbus.holesky: reduce critical alerts sensitivity
Signed-off-by: Jakub Sokołowski <jakub@status.im>
2024-03-30 09:59:34 +01:00
Alexis Pentori 29bce4b7da
sepolia: update port open for waku network monitor
Signed-off-by: Alexis Pentori <alexis@status.im>
2024-03-26 15:33:11 +01:00
Jakub Sokołowski 6aeb204614
nimbus.holesky: deploy windows-01 host with nodes
Currently we have issues building and running Beacon node:
https://github.com/status-im/nimbus-eth2/issues/6139

Signed-off-by: Jakub Sokołowski <jakub@status.im>
2024-03-26 12:42:26 +01:00
Etan Kissling eccb234f17
nimbus.prater: use custom research branch
As prater is approaching end of life, it is suitable for testing edge
cases with partitioned network. To avoid interfering with other networks
a separate branch should be tracked instead.

Signed-off-by: Etan Kissling <etan@status.im>
2024-03-25 16:26:18 +01:00
Jakub Sokołowski 10ff179299
nimbus.holesky: add Geth hosts for MacOS and Windows
No Windows host with beacon nodes currently.

Signed-off-by: Jakub Sokołowski <jakub@status.im>
2024-03-21 16:53:36 +01:00
Jakub Sokołowski 2ad97c7be4
nimbus.prater: lower alert sensitivity even further
Signed-off-by: Jakub Sokołowski <jakub@status.im>
2024-03-18 19:59:55 +01:00
Etan Kissling 140e1835ce
Revert "nimbus.prater: use stable instead of unstable branch"
This reverts commit c48ac78797.

The underlying issue that prompted switching to `stable` has been fixed.

- https://github.com/status-im/nimbus-eth2/issues/5753

Signed-off-by: Etan Kissling <etan@status.im>
2024-03-16 17:21:20 +01:00
Jakub Sokołowski 2aed8560e6
nimbus.prater: bump consul alerts thresholds
We really don't care about this fleet.

Signed-off-by: Jakub Sokołowski <jakub@status.im>
2024-03-10 13:08:53 +01:00
Jakub Sokołowski 4a7355d91d
nimbus.mainnet: disable subscribe-all for bootstrap
Generates high CPU load and deteriorates bootstrap node performance.

Signed-off-by: Jakub Sokołowski <jakub@status.im>
2024-03-08 15:19:17 +01:00
kdeme bcbc19ec16
all: remove SSH access for daniil@status.im
Signed-off-by: Jakub Sokołowski <jakub@status.im>
2024-03-05 12:45:15 +01:00
Jakub Sokołowski de1ae7b873
nimbus.mainnet: rename nodes to include bootstrap word
This is an additional precaution to avoid accidental destruction of these hosts
as they are used by Ethereum clients to connect to the network.

Signed-off-by: Jakub Sokołowski <jakub@status.im>
2024-03-05 10:41:52 +01:00
Jakub Sokołowski 5f903426a5
nimbus.prater: reduce layout to just 2 hosts
This testnet will be dead within ~3 weeks.

Signed-off-by: Jakub Sokołowski <jakub@status.im>
2024-03-04 17:49:56 +01:00
Jakub Sokołowski aaa3c437a9
nimbus.holesky: lower limit for detailed val metrics
Signed-off-by: Jakub Sokołowski <jakub@status.im>
2024-02-29 02:31:01 +01:00
Anton Iakimov 23ba0d0d18
nimbus.sepolia: open ports for waku.sandbox
2024-02-27 16:47:23 +01:00
Anton Iakimov 0395e9a253
nimbus.mainnet: remove beacon-node-mainnet-testing-02
2024-02-27 16:06:23 +01:00
Alexis Pentori f4ffeb4d6e
mainnet: remove BN stable-02 from holesky.mainnet
Signed-off-by: Alexis Pentori <alexis@status.im>
2024-02-23 12:29:48 +01:00
Diego 26d6d34bfa
all: grant SSH access to diego@status.im
Signed-off-by: Jakub Sokołowski <jakub@status.im>
2024-02-22 12:53:56 +01:00
Jakub Sokołowski 907c2714e2
nimbus.holesky: use Nim 2.0 on a selection of nodes
This is part of the preparation to upgrade to Nim 2.0.

Signed-off-by: Jakub Sokołowski <jakub@status.im>
2024-02-22 12:24:22 +01:00
Jakub Sokołowski 674cb764c7
nimbus.sepolia: open WS ports for waku-nmon.misc
https://github.com/status-im/infra-misc/pull/242

Signed-off-by: Jakub Sokołowski <jakub@status.im>
2024-02-13 11:12:16 +01:00
Jakub Sokołowski 24173cb374
nimbus.sepolia: open RPC ports for waku test fleets
Signed-off-by: Jakub Sokołowski <jakub@status.im>
2024-02-12 16:13:31 +01:00
Anton Iakimov 5f4bf2b90f
sepolia&eth1: bump nimbus-eth1 requirements and update flags
2024-02-09 12:03:43 +01:00
Jakub Sokołowski e2a119a673
nimbus.holesky: reuse prater macos host
Currently without any execution clients as we don't have any roles to
run Geth or other clients on a MacOS host.

Signed-off-by: Jakub Sokołowski <jakub@status.im>
2024-02-06 19:56:05 +01:00
Alexis Pentori f446bd309f
holesky: use node branch in validator client name
Signed-off-by: Alexis Pentori <alexis@status.im>
2024-02-02 16:32:25 +01:00
Alexis Pentori 9e97832b48
Revert "sepolia, holesky: temporary mapping to use correct version for Dencun"
This reverts commit cd1f42c171.
2024-02-02 16:31:58 +01:00
Jakub Sokołowski cb5176502f
nimbus.master: disable wildcard REST API origin
Suspected to be related to REST API timeout issues:
https://github.com/status-im/nimbus-eth2/issues/5838

Signed-off-by: Jakub Sokołowski <jakub@status.im>
2024-02-02 15:25:45 +01:00
Alexis Pentori c03f82b225
sepolia: update payload_builder_url to boost-relay-sepolia.flashbots.net
Signed-off-by: Alexis Pentori <alexis@status.im>
2024-01-30 10:19:46 +01:00
Alexis Pentori cd1f42c171
sepolia, holesky: temporary mapping to use correct version for Dencun
Signed-off-by: Alexis Pentori <alexis@status.im>
2024-01-29 12:02:57 +01:00
Alexis Pentori d950ebfd5f
holesky: adding MEV payload builder and updating node layout
Signed-off-by: Alexis Pentori <alexis@status.im>
2024-01-29 11:58:53 +01:00
Jakub Sokołowski 548f0f48fb
nimbus.sepolia: change layout to use all main branches
This also resolves the issue with Waku nodes depending on a Geth node
being driven by an unstable beacon node, causing RLN issues:
https://github.com/status-im/nimbus-eth2/issues/5753

Signed-off-by: Jakub Sokołowski <jakub@status.im>
2024-01-16 16:51:58 +03:00
Jakub Sokołowski c48ac78797
nimbus.prater: use stable instead of unstable branch
Avoid an RPC bug found in `unstable` for Dencun upgrade:
https://github.com/status-im/nimbus-eth2/issues/5753

Signed-off-by: Jakub Sokołowski <jakub@status.im>
2024-01-16 09:27:44 +03:00
Jakub Sokołowski 9fbf61806a
nimbus.mainnet: use all geth nodes in the fleet
To allow individual geth nodes to be re-synced gradually.

Signed-off-by: Jakub Sokołowski <jakub@status.im>
2024-01-09 09:40:14 +01:00
Jakub Sokołowski 2c766a7991
dash.nimbus: reduce indices kept to 21 from 30
Signed-off-by: Jakub Sokołowski <jakub@status.im>
2024-01-08 23:15:07 +01:00
Jakub Sokołowski 781b9e7524
nimbus.mainnet: switch exec URLs on linux-03
Necessary to allow the geth node to resync using new schema in order to
save disk space.

Signed-off-by: Jakub Sokołowski <jakub@status.im>
2024-01-08 22:09:07 +01:00
Alexis Pentori c7f77f5e32
nimbus.prater: switching MEV boost URL.
Removing invalid URL. A list of valid URLs is available at https://ethstaker.cc/mev-relay-list#mev-relay-list-for-goerli-testnet

Signed-off-by: Alexis Pentori <alexis@status.im>
2024-01-03 16:28:08 +01:00
Jakub Sokołowski 7a239812ec
nimbus.prater: remove Erigon node to save disk space
It was using 1.1 TB on `linux-05`, which is unreasonable.
And the network is deprecated anyway.

Signed-off-by: Jakub Sokołowski <jakub@status.im>
2024-01-02 13:22:54 +01:00
Jakub Sokołowski 08db65a52e
nimbus.sepolia: remove obsolete vacdev.misc ipset
Signed-off-by: Jakub Sokołowski <jakub@status.im>
2023-12-08 17:44:55 +01:00
Jakub Sokołowski 9d99b031ac
nimbus.prater: reduce Consul alerts sensitivity
Signed-off-by: Jakub Sokołowski <jakub@status.im>
2023-12-06 11:13:54 +01:00
Jakub Sokołowski 9be87e32ce
nimbus.mainnet: reduce Consul alerts sensitivity
Signed-off-by: Jakub Sokołowski <jakub@status.im>
2023-12-06 11:13:26 +01:00
Jakub Sokołowski 39e12fdc6f
nimbus.holesky: add public beacon node API endpoints
https://github.com/status-im/infra-nimbus/issues/157

Signed-off-by: Jakub Sokołowski <jakub@status.im>
2023-12-01 11:50:23 +01:00
Jakub Sokołowski 58cbfee30f
nimbus.prater: disable resyncing on all hosts
It just causes unnecessary alerts for an obsolete network.

Signed-off-by: Jakub Sokołowski <jakub@status.im>
2023-11-27 11:45:16 +01:00
Jakub Sokołowski 24020d0962
all: reduce MTR report cycle from 10 to 1
We have received a complaint from InnovaHosting about them being hit by
about 150 ICMP `ttl1` packets/s on their routers, causing excess CPU usage.
https://client.innovahosting.net/viewticket.php?tid=532874&c=8gALx9vm

By using `tcpdump` I have identified that `mtr` by default pings the
target 10 times, which means that the default value of `-c`/`--report-cycles`
is 10, although this is not documented in the manual.

We can see this when calling `mtr github.com` and watching with `tcpdump`:
```
 > sudo tcpdump -v -i eno1 icmp and src 185.181.230.78 and dst github.com | grep 'ttl 1,'
tcpdump: listening on eno1, link-type EN10MB (Ethernet), snapshot length 262144 bytes
19:54:53.981243 IP (tos 0x0, ttl 1, id 37119, offset 0, flags [none], proto ICMP (1), length 64)
...(8 packets omitted)...
19:55:03.025460 IP (tos 0x0, ttl 1, id 38226, offset 0, flags [none], proto ICMP (1), length 64)
```
We don't need to run the test 10 times to get a result for our metric.
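
For illustration, a minimal sketch of capping the cycle count explicitly. It assumes the `mtr` CLI, where `-c`/`--report-cycles` sets the number of pings and `--report` prints a summary and exits. The helper name `mtr_report_cmd` is hypothetical and not part of this repository; it only prints the command line instead of running it, so no packets are sent.

```shell
# Hypothetical helper: build an mtr invocation with an explicit cycle count.
# Usage: mtr_report_cmd TARGET [CYCLES]   (CYCLES defaults to 1)
mtr_report_cmd() {
  target="$1"
  cycles="${2:-1}"
  # --report: non-interactive summary; --report-cycles: pings per run
  printf 'mtr --report --report-cycles %s %s\n' "$cycles" "$target"
}

mtr_report_cmd github.com
# prints: mtr --report --report-cycles 1 github.com
```

Running the printed command while watching with the same `tcpdump` filter should show one `ttl 1` packet per run rather than ten.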

Related to:
https://github.com/status-im/infra-role-bootstrap-linux/commit/ea22bdfe

Signed-off-by: Jakub Sokołowski <jakub@status.im>
2023-11-20 21:07:17 +01:00
Jakub Sokołowski 6b800a5342
nimbus.fluffy: re-enable Consul healthchecks
It appears the RPC issue was resolved in:
https://github.com/status-im/nimbus-eth1/issues/1880

Most probably caused by DB size.

Signed-off-by: Jakub Sokołowski <jakub@status.im>
2023-11-20 09:57:46 +01:00
Jakub Sokołowski 2cac3081a0
layouts: add script and generate TSVs of validators
Helps developers identify which host holds which validator.

Signed-off-by: Jakub Sokołowski <jakub@status.im>
2023-11-17 15:01:57 +01:00
Jakub Sokołowski c29b23c6dc
nimbus.sepolia: open ports for waku.test fleet
Signed-off-by: Jakub Sokołowski <jakub@status.im>
2023-11-14 11:56:16 +01:00
kdeme ce37186651
all: update SSH key for kim
This one is from a YubiKey.

Signed-off-by: Jakub Sokołowski <jakub@status.im>
2023-11-07 18:48:05 +01:00
Jakub Sokołowski 10dd722e29
all: grant admin rights to kim
Necessary to run 'perf'.

Signed-off-by: Jakub Sokołowski <jakub@status.im>
2023-11-07 16:19:24 +01:00
Jakub Sokołowski c1be589960
all: add debug tools like gdb and perf
Also allow use of 'perf' without root.

Signed-off-by: Jakub Sokołowski <jakub@status.im>
2023-11-07 12:48:05 +01:00