Commit Graph

397 Commits

Author SHA1 Message Date
Etan Kissling b33a35c5e1
nimbus: set --rest-allow-origin='*' for public_api
To allow browser based clients to access our API servers.

Depends on:

- https://github.com/status-im/infra-role-beacon-node-linux/pull/14
- https://github.com/status-im/infra-role-beacon-node-linux/pull/15
- https://github.com/status-im/infra-role-beacon-node-macos/pull/7
- https://github.com/status-im/infra-role-beacon-node-macos/pull/8
- https://github.com/status-im/infra-role-beacon-node-windows/pull/6
- https://github.com/status-im/infra-role-beacon-node-windows/pull/7

Signed-off-by: Etan Kissling <etan@status.im>
Signed-off-by: Jakub Sokołowski <jakub@status.im>
2023-06-05 13:45:55 +02:00
Jakub Sokołowski e08586be04
nimbus.sepolia: move validators to new Innova host
https://github.com/status-im/infra-nimbus/issues/132

Signed-off-by: Jakub Sokołowski <jakub@status.im>
2023-06-05 12:08:01 +02:00
Jakub Sokołowski 6ae7dc00f0
nimbus.prater: add Innova host config
https://github.com/status-im/infra-nimbus/issues/132

Signed-off-by: Jakub Sokołowski <jakub@status.im>
2023-06-02 10:35:18 +02:00
Jakub Sokołowski da40f535ba
nimbus.sepolia: add Innova host config
https://github.com/status-im/infra-nimbus/issues/132

Signed-off-by: Jakub Sokołowski <jakub@status.im>
2023-06-02 10:34:20 +02:00
Jakub Sokołowski 948b5885fe
nimbus.prater: move disabling discovery to Hetzner group
Signed-off-by: Jakub Sokołowski <jakub@status.im>
2023-05-29 21:20:03 +02:00
Jakub Sokołowski e9a05d73ab
nimbus.prater: add DNS discovery for Erigon
Necessary because we disable normal discovery.

Source:
https://github.com/ethereum/discv4-dns-lists

Signed-off-by: Jakub Sokołowski <jakub@status.im>
2023-05-29 21:03:53 +02:00
Jakub Sokołowski a8741b7019
nimbus.prater: , disable discovery for Erigon
It appears peer discovery can be detected by hetzner as an attempt at
network scanning:
https://github.com/ledgerwatch/erigon/issues/7234

Which can lead to the server being shut down.
For this reason I'm lowering max peers and disabling discovery.

Signed-off-by: Jakub Sokołowski <jakub@status.im>
2023-05-29 20:06:58 +02:00
Jakub Sokołowski 3f3d763990
nimbus.prater: double nofile limit to 32768
Which needs to be more than 4 x 7899 validators.

Signed-off-by: Jakub Sokołowski <jakub@status.im>
2023-05-29 18:36:09 +02:00
Jakub Sokołowski e5324beaaa
nimbus-geth-goerli: remove one of three Geth nodes
Not nough storage space to run 3 nodes on the hosts.

Also made AWS, Windows and MacOS nodes discover available
Geth nodes using Consul catalog.

Signed-off-by: Jakub Sokołowski <jakub@status.im>
2023-05-24 18:34:03 +02:00
Jakub Sokołowski 597b85a0c8
nimbus.sepolia: open EL WebSocket to wakuv2.test
https://github.com/waku-org/nwaku/issues/1624
https://github.com/status-im/infra-nim-waku/pull/69

Signed-off-by: Jakub Sokołowski <jakub@status.im>
2023-05-17 18:15:47 +02:00
Jakub Sokołowski 2e14ecc79e
nimbus.sepolia: enable SMART metrics service
Signed-off-by: Jakub Sokołowski <jakub@status.im>
2023-05-17 18:11:27 +02:00
Jakub Sokołowski c15e1ea358
nimbus.mainnet: 10k max peers for stable-02@linux-03
Signed-off-by: Jakub Sokołowski <jakub@status.im>
2023-05-16 12:38:24 +02:00
Jakub Sokołowski a270f4044c
all: get Consul certificates from BitWarden
Signed-off-by: Jakub Sokołowski <jakub@status.im>
2023-05-16 12:14:38 +02:00
Jakub Sokołowski 0cd6de1ea6
nimbus.mainnet: use ncli_db from stable node
Signed-off-by: Jakub Sokołowski <jakub@status.im>
2023-05-16 12:14:27 +02:00
Jakub Sokołowski d04a98ea11
remove metal-01.he-eu-hel1.nimbus.mainnet
https://github.com/status-im/infra-nimbus/issues/132

Signed-off-by: Jakub Sokołowski <jakub@status.im>
2023-05-16 11:05:30 +02:00
Jakub Sokołowski a145963a2a
ih-eu-mda1: disable Consul check for SMART metrics
The service fails on InnovaHosting servers due to HBA controller:
https://github.com/prometheus-community/smartctl_exporter/issues/26

Signed-off-by: Jakub Sokołowski <jakub@status.im>
2023-05-15 19:51:41 +02:00
Jakub Sokołowski 0256a21dd0
nimbus.mainnet: add two more hosts from Innova
The `linux-06` is a new host, while `linux-07` is re-purposed `linux-01`
from the prater fleet. This is done because the Innova company is out of
Xeon E5-2690 CPUs, and we want consistent performance across the fleet.

This way all seven hosts in `nimbus.mainnet` will use the E5-2690 CPU.

https://github.com/status-im/infra-nimbus/issues/132

Signed-off-by: Jakub Sokołowski <jakub@status.im>
2023-05-09 15:07:22 +02:00
Jakub Sokołowski 406ac58f16
nimbus.mainnet: use --no-el for even number nodes
Attempted fix issues with syncing where latency on the Geth endpoint is
too high for beacon nodes to finish syncing properly.

Signed-off-by: Jakub Sokołowski <jakub@status.im>
2023-05-05 20:58:53 +02:00
Jakub Sokołowski 01cf22c9f8
get-geth-api-urls: rename variables for clarity
Signed-off-by: Jakub Sokołowski <jakub@status.im>
2023-05-05 19:21:49 +02:00
Jakub Sokołowski 5635b39e50
mainnet: decommission 6 Hetzner hosts
Being replaced by InnovaHosting servers
https://github.com/status-im/infra-nimbus/issues/132

I'm leaving old `metal-07` as `metal-01` since we don't have a server
with `libp2p` nodes on mainnet yet.

Signed-off-by: Jakub Sokołowski <jakub@status.im>
2023-04-28 12:22:46 +02:00
Jakub Sokołowski cd0841eba2
ih-eu-mda1: fix destintion DC for sending logs
Signed-off-by: Jakub Sokołowski <jakub@status.im>
2023-04-25 18:06:00 +02:00
Jakub Sokołowski e43ede93a5
nimbus.mainnet: fix exec layer URL logic
Signed-off-by: Jakub Sokołowski <jakub@status.im>
2023-04-20 22:28:26 +02:00
Jakub Sokołowski 940b2577da
nimbus.mainnet: add node layout for new Innova hosts
Once they are synced and working we'll remove the Hetzner ones.

https://github.com/status-im/infra-nimbus/issues/132

Signed-off-by: Jakub Sokołowski <jakub@status.im>
2023-04-18 18:14:53 +02:00
Jakub Sokołowski 8ddbd86139
nimbus.mainnet: revert removing of stable-02 node
This reverts commit ab266d132d.

Signed-off-by: Jakub Sokołowski <jakub@status.im>
2023-04-17 20:06:57 +02:00
Jakub Sokołowski 53ccd34927
nimbus.prater: bootstrap new Innova servers
https://github.com/status-im/infra-nimbus/issues/132

Signed-off-by: Jakub Sokołowski <jakub@status.im>
2023-04-17 17:11:27 +02:00
Jakub Sokołowski 1f5ad05893
nimbus.fluffy: 24h to make healthcheck critical
This is not a critical service and there's a LOT of sevices that cause a
lot of alerts when developers are stopping and starting them.

Signed-off-by: Jakub Sokołowski <jakub@status.im>
2023-04-12 19:28:37 +02:00
Jakub Sokołowski 9ec35f149b
nimbus.fluff: deploy smart-metrics exporter
Signed-off-by: Jakub Sokołowski <jakub@status.im>
2023-04-12 13:24:29 +02:00
Jakub Sokołowski ab266d132d
nimbus.mainnet: remove stable-02 node to save space
We can always re-add it when we deploy to our new hosts.

Signed-off-by: Jakub Sokołowski <jakub@status.im>
2023-04-07 12:00:20 +02:00
Jakub Sokołowski d41b3c89f8
dash.nimbus: increase logs retention to 30 days
Signed-off-by: Jakub Sokołowski <jakub@status.im>
2023-04-05 11:04:09 +02:00
Jakub Sokołowski d6de0faf1a
nimbus.prater: open Erigon metrics port on VPN
Signed-off-by: Jakub Sokołowski <jakub@status.im>
2023-03-23 15:14:38 +01:00
Jakub Sokołowski 0956c88d9a
nimbus.prater: enable metrics for Erigon nodes
Signed-off-by: Jakub Sokołowski <jakub@status.im>
2023-03-22 13:47:15 +01:00
Jakub Sokołowski 9aa83f8d47
nimbus.prater: use Erigon EL node if available
Beginning of work on testing multi-EL node setup:
https://github.com/status-im/infra-nimbus/issues/144

Signed-off-by: Jakub Sokołowski <jakub@status.im>
2023-03-22 12:10:34 +01:00
Jakub Sokołowski cdf362f2a9
nimbus.prater: provide private key for mining
Signed-off-by: Jakub Sokołowski <jakub@status.im>
2023-03-21 13:07:23 +01:00
Jakub Sokołowski f92bcb2802
nimbus.prater: fix erigon metrics port conflict
Signed-off-by: Jakub Sokołowski <jakub@status.im>
2023-03-21 13:07:13 +01:00
Jakub Sokołowski 4a5b7d7107
nimbus.prater: deploy two Erigon nodes
Part of work to support multiple EL nodes driven by one CL node:
https://github.com/status-im/infra-nimbus/issues/144

Disabled Geth websocket as unused and conflicting with Erigon.

Signed-off-by: Jakub Sokołowski <jakub@status.im>
2023-03-20 22:27:50 +01:00
Jakub Sokołowski 4ea58aeda7
deploy SMART metrics to all metal hosts
Signed-off-by: Jakub Sokołowski <jakub@status.im>
2023-03-18 18:02:58 +01:00
Jakub Sokołowski e1e21119da
use new exec layer variables for beacon nodes
Role changes:
- https://github.com/status-im/infra-role-beacon-node-linux/commit/99a445d1
- https://github.com/status-im/infra-role-beacon-node-macos/commit/eecf9fea
- https://github.com/status-im/infra-role-beacon-node-windows/commit/dba21b36

Signed-off-by: Jakub Sokołowski <jakub@status.im>
2023-03-16 00:59:48 +01:00
Jakub Sokołowski 5783656224
nimbus.mainnet: make AWS nodes have archive history
Signed-off-by: Jakub Sokołowski <jakub@status.im>
2023-03-02 12:22:51 +01:00
Jakub Sokołowski 768ecef371
nimbus.prater: use AWS hosts to sync first node
Signed-off-by: Jakub Sokołowski <jakub@status.im>
2023-03-02 12:22:45 +01:00
Jakub Sokołowski 090c4661a9
nimbus.mainnet: use AWS hosts to sync first node
Signed-off-by: Jakub Sokołowski <jakub@status.im>
2023-03-02 12:22:44 +01:00
Jakub Sokołowski 10f3132bcd
logs.nimbus: use G1 garbage collector
https://github.com/status-im/infra-nimbus/issues/138

Signed-off-by: Jakub Sokołowski <jakub@status.im>
2023-03-01 18:44:02 +01:00
Jakub Sokołowski 4e20fd2c14
nimbus.prater: use a new set of MEV boost URLs
The one from `flashbots.net` seems to have connectivity issues:
```
REST request was interrupted
```

Signed-off-by: Jakub Sokołowski <jakub@status.im>
2023-02-24 13:16:06 +01:00
Jakub Sokołowski 15d96ca3be
nimbus.mainnet/prater: use more threads than 1
This setting is quite old and it's time to use more than one thread.
The `libp2p` branch is the exception that will keep using 1 thread.

Signed-off-by: Jakub Sokołowski <jakub@status.im>
2023-02-24 11:37:17 +01:00
Jakub Sokołowski 3e3cff6840
nimbus.mainnet/prater: enable pruned history
For most nodes this is fine, except for nodes with publicly exposed API,
which we should keep as synced from genesis and in archive mode.

Every other node is fine being prunned. I'm keeping AWS nodes not pruned
as fallback since increasing their storage is quick and easy.

Signed-off-by: Jakub Sokołowski <jakub@status.im>
2023-02-21 22:52:41 +01:00
Jakub Sokołowski 6e05ef6eec
nimbus.prater: add linux-01.cs-eu-nld1.nimbus.prater
Part of research into alternative hosting for Nimbus:
https://github.com/status-im/infra-nimbus/issues/132

Signed-off-by: Jakub Sokołowski <jakub@status.im>
2023-02-21 21:25:42 +01:00
Jakub Sokołowski d6da1ce336
nimbus.mainnet: drop db_purge and db_sync vars
Signed-off-by: Jakub Sokołowski <jakub@status.im>
2023-02-21 12:38:17 +01:00
Jakub Sokołowski 297f347934
drop linux-01.cs-eu-nld1.nimbus.prater host
Decided against using it due to maintenance costs:
https://github.com/status-im/infra-nimbus/issues/132

Signed-off-by: Jakub Sokołowski <jakub@status.im>
2023-02-21 12:37:15 +01:00
Jakub Sokołowski 77278c2b65
logs.nimbus: lower ES JVM heap size to 40 GB
Never ending issues with `Data too big` errors:
https://github.com/status-im/infra-nimbus/issues/138

Signed-off-by: Jakub Sokołowski <jakub@status.im>
2023-02-21 00:11:55 +01:00
Jakub Sokołowski fc236d0132
remove SSH access from mamy, miran, s1fr0
Signed-off-by: Jakub Sokołowski <jakub@status.im>
2023-02-17 16:35:09 +01:00
Jakub Sokołowski c05fad0e01
add linux-01.cs-eu-nld1.nimbus.prater host
Test for replacement of Hetzner hosts:
https://github.com/status-im/infra-nimbus/issues/132

Signed-off-by: Jakub Sokołowski <jakub@status.im>
2023-01-30 21:34:21 +01:00