e08586be04
nimbus.sepolia: move validators to new Innova host
...
https://github.com/status-im/infra-nimbus/issues/132
Signed-off-by: Jakub Sokołowski <jakub@status.im>
2023-06-05 12:08:01 +02:00
6ae7dc00f0
nimbus.prater: add Innova host config
...
https://github.com/status-im/infra-nimbus/issues/132
Signed-off-by: Jakub Sokołowski <jakub@status.im>
2023-06-02 10:35:18 +02:00
da40f535ba
nimbus.sepolia: add Innova host config
...
https://github.com/status-im/infra-nimbus/issues/132
Signed-off-by: Jakub Sokołowski <jakub@status.im>
2023-06-02 10:34:20 +02:00
948b5885fe
nimbus.prater: move disabling discovery to Hetzner group
...
Signed-off-by: Jakub Sokołowski <jakub@status.im>
2023-05-29 21:20:03 +02:00
e9a05d73ab
nimbus.prater: add DNS discovery for Erigon
...
Necessary because we disable normal discovery.
Source:
https://github.com/ethereum/discv4-dns-lists
Signed-off-by: Jakub Sokołowski <jakub@status.im>
2023-05-29 21:03:53 +02:00
a8741b7019
nimbus.prater: , disable discovery for Erigon
...
It appears peer discovery can be detected by hetzner as an attempt at
network scanning:
https://github.com/ledgerwatch/erigon/issues/7234
Which can lead to the server being shut down.
For this reason I'm lowering max peers and disabling discovery.
Signed-off-by: Jakub Sokołowski <jakub@status.im>
2023-05-29 20:06:58 +02:00
3f3d763990
nimbus.prater: double nofile limit to 32768
...
Which needs to be more than 4 x 7899 validators.
Signed-off-by: Jakub Sokołowski <jakub@status.im>
2023-05-29 18:36:09 +02:00
e5324beaaa
nimbus-geth-goerli: remove one of three Geth nodes
...
Not nough storage space to run 3 nodes on the hosts.
Also made AWS, Windows and MacOS nodes discover available
Geth nodes using Consul catalog.
Signed-off-by: Jakub Sokołowski <jakub@status.im>
2023-05-24 18:34:03 +02:00
597b85a0c8
nimbus.sepolia: open EL WebSocket to wakuv2.test
...
https://github.com/waku-org/nwaku/issues/1624
https://github.com/status-im/infra-nim-waku/pull/69
Signed-off-by: Jakub Sokołowski <jakub@status.im>
2023-05-17 18:15:47 +02:00
2e14ecc79e
nimbus.sepolia: enable SMART metrics service
...
Signed-off-by: Jakub Sokołowski <jakub@status.im>
2023-05-17 18:11:27 +02:00
c15e1ea358
nimbus.mainnet: 10k max peers for stable-02@linux-03
...
Signed-off-by: Jakub Sokołowski <jakub@status.im>
2023-05-16 12:38:24 +02:00
a270f4044c
all: get Consul certificates from BitWarden
...
Signed-off-by: Jakub Sokołowski <jakub@status.im>
2023-05-16 12:14:38 +02:00
0cd6de1ea6
nimbus.mainnet: use ncli_db from stable node
...
Signed-off-by: Jakub Sokołowski <jakub@status.im>
2023-05-16 12:14:27 +02:00
d04a98ea11
remove metal-01.he-eu-hel1.nimbus.mainnet
...
https://github.com/status-im/infra-nimbus/issues/132
Signed-off-by: Jakub Sokołowski <jakub@status.im>
2023-05-16 11:05:30 +02:00
a145963a2a
ih-eu-mda1: disable Consul check for SMART metrics
...
The service fails on InnovaHosting servers due to HBA controller:
https://github.com/prometheus-community/smartctl_exporter/issues/26
Signed-off-by: Jakub Sokołowski <jakub@status.im>
2023-05-15 19:51:41 +02:00
0256a21dd0
nimbus.mainnet: add two more hosts from Innova
...
The `linux-06` is a new host, while `linux-07` is re-purposed `linux-01`
from the prater fleet. This is done because the Innova company is out of
Xeon E5-2690 CPUs, and we want consistent performance across the fleet.
This way all seven hosts in `nimbus.mainnet` will use the E5-2690 CPU.
https://github.com/status-im/infra-nimbus/issues/132
Signed-off-by: Jakub Sokołowski <jakub@status.im>
2023-05-09 15:07:22 +02:00
406ac58f16
nimbus.mainnet: use --no-el for even number nodes
...
Attempted fix issues with syncing where latency on the Geth endpoint is
too high for beacon nodes to finish syncing properly.
Signed-off-by: Jakub Sokołowski <jakub@status.im>
2023-05-05 20:58:53 +02:00
01cf22c9f8
get-geth-api-urls: rename variables for clarity
...
Signed-off-by: Jakub Sokołowski <jakub@status.im>
2023-05-05 19:21:49 +02:00
5635b39e50
mainnet: decommission 6 Hetzner hosts
...
Being replaced by InnovaHosting servers
https://github.com/status-im/infra-nimbus/issues/132
I'm leaving old `metal-07` as `metal-01` since we don't have a server
with `libp2p` nodes on mainnet yet.
Signed-off-by: Jakub Sokołowski <jakub@status.im>
2023-04-28 12:22:46 +02:00
cd0841eba2
ih-eu-mda1: fix destintion DC for sending logs
...
Signed-off-by: Jakub Sokołowski <jakub@status.im>
2023-04-25 18:06:00 +02:00
e43ede93a5
nimbus.mainnet: fix exec layer URL logic
...
Signed-off-by: Jakub Sokołowski <jakub@status.im>
2023-04-20 22:28:26 +02:00
940b2577da
nimbus.mainnet: add node layout for new Innova hosts
...
Once they are synced and working we'll remove the Hetzner ones.
https://github.com/status-im/infra-nimbus/issues/132
Signed-off-by: Jakub Sokołowski <jakub@status.im>
2023-04-18 18:14:53 +02:00
8ddbd86139
nimbus.mainnet: revert removing of stable-02 node
...
This reverts commit ab266d132d042418673861e489ee6227338851d1.
Signed-off-by: Jakub Sokołowski <jakub@status.im>
2023-04-17 20:06:57 +02:00
53ccd34927
nimbus.prater: bootstrap new Innova servers
...
https://github.com/status-im/infra-nimbus/issues/132
Signed-off-by: Jakub Sokołowski <jakub@status.im>
2023-04-17 17:11:27 +02:00
1f5ad05893
nimbus.fluffy: 24h to make healthcheck critical
...
This is not a critical service and there's a LOT of sevices that cause a
lot of alerts when developers are stopping and starting them.
Signed-off-by: Jakub Sokołowski <jakub@status.im>
2023-04-12 19:28:37 +02:00
9ec35f149b
nimbus.fluff: deploy smart-metrics exporter
...
Signed-off-by: Jakub Sokołowski <jakub@status.im>
2023-04-12 13:24:29 +02:00
ab266d132d
nimbus.mainnet: remove stable-02 node to save space
...
We can always re-add it when we deploy to our new hosts.
Signed-off-by: Jakub Sokołowski <jakub@status.im>
2023-04-07 12:00:20 +02:00
d41b3c89f8
dash.nimbus: increase logs retention to 30 days
...
Signed-off-by: Jakub Sokołowski <jakub@status.im>
2023-04-05 11:04:09 +02:00
d6de0faf1a
nimbus.prater: open Erigon metrics port on VPN
...
Signed-off-by: Jakub Sokołowski <jakub@status.im>
2023-03-23 15:14:38 +01:00
0956c88d9a
nimbus.prater: enable metrics for Erigon nodes
...
Signed-off-by: Jakub Sokołowski <jakub@status.im>
2023-03-22 13:47:15 +01:00
9aa83f8d47
nimbus.prater: use Erigon EL node if available
...
Beginning of work on testing multi-EL node setup:
https://github.com/status-im/infra-nimbus/issues/144
Signed-off-by: Jakub Sokołowski <jakub@status.im>
2023-03-22 12:10:34 +01:00
cdf362f2a9
nimbus.prater: provide private key for mining
...
Signed-off-by: Jakub Sokołowski <jakub@status.im>
2023-03-21 13:07:23 +01:00
f92bcb2802
nimbus.prater: fix erigon metrics port conflict
...
Signed-off-by: Jakub Sokołowski <jakub@status.im>
2023-03-21 13:07:13 +01:00
4a5b7d7107
nimbus.prater: deploy two Erigon nodes
...
Part of work to support multiple EL nodes driven by one CL node:
https://github.com/status-im/infra-nimbus/issues/144
Disabled Geth websocket as unused and conflicting with Erigon.
Signed-off-by: Jakub Sokołowski <jakub@status.im>
2023-03-20 22:27:50 +01:00
4ea58aeda7
deploy SMART metrics to all metal hosts
...
Signed-off-by: Jakub Sokołowski <jakub@status.im>
2023-03-18 18:02:58 +01:00
e1e21119da
use new exec layer variables for beacon nodes
...
Role changes:
- https://github.com/status-im/infra-role-beacon-node-linux/commit/99a445d1
- https://github.com/status-im/infra-role-beacon-node-macos/commit/eecf9fea
- https://github.com/status-im/infra-role-beacon-node-windows/commit/dba21b36
Signed-off-by: Jakub Sokołowski <jakub@status.im>
2023-03-16 00:59:48 +01:00
5783656224
nimbus.mainnet: make AWS nodes have archive history
...
Signed-off-by: Jakub Sokołowski <jakub@status.im>
2023-03-02 12:22:51 +01:00
768ecef371
nimbus.prater: use AWS hosts to sync first node
...
Signed-off-by: Jakub Sokołowski <jakub@status.im>
2023-03-02 12:22:45 +01:00
090c4661a9
nimbus.mainnet: use AWS hosts to sync first node
...
Signed-off-by: Jakub Sokołowski <jakub@status.im>
2023-03-02 12:22:44 +01:00
10f3132bcd
logs.nimbus: use G1 garbage collector
...
https://github.com/status-im/infra-nimbus/issues/138
Signed-off-by: Jakub Sokołowski <jakub@status.im>
2023-03-01 18:44:02 +01:00
4e20fd2c14
nimbus.prater: use a new set of MEV boost URLs
...
The one from `flashbots.net` seems to have connectivity issues:
```
REST request was interrupted
```
Signed-off-by: Jakub Sokołowski <jakub@status.im>
2023-02-24 13:16:06 +01:00
15d96ca3be
nimbus.mainnet/prater: use more threads than 1
...
This setting is quite old and it's time to use more than one thread.
The `libp2p` branch is the exception that will keep using 1 thread.
Signed-off-by: Jakub Sokołowski <jakub@status.im>
2023-02-24 11:37:17 +01:00
3e3cff6840
nimbus.mainnet/prater: enable pruned history
...
For most nodes this is fine, except for nodes with publicly exposed API,
which we should keep as synced from genesis and in archive mode.
Every other node is fine being prunned. I'm keeping AWS nodes not pruned
as fallback since increasing their storage is quick and easy.
Signed-off-by: Jakub Sokołowski <jakub@status.im>
2023-02-21 22:52:41 +01:00
6e05ef6eec
nimbus.prater: add linux-01.cs-eu-nld1.nimbus.prater
...
Part of research into alternative hosting for Nimbus:
https://github.com/status-im/infra-nimbus/issues/132
Signed-off-by: Jakub Sokołowski <jakub@status.im>
2023-02-21 21:25:42 +01:00
d6da1ce336
nimbus.mainnet: drop db_purge and db_sync vars
...
Signed-off-by: Jakub Sokołowski <jakub@status.im>
2023-02-21 12:38:17 +01:00
297f347934
drop linux-01.cs-eu-nld1.nimbus.prater host
...
Decided against using it due to maintenance costs:
https://github.com/status-im/infra-nimbus/issues/132
Signed-off-by: Jakub Sokołowski <jakub@status.im>
2023-02-21 12:37:15 +01:00
77278c2b65
logs.nimbus: lower ES JVM heap size to 40 GB
...
Never ending issues with `Data too big` errors:
https://github.com/status-im/infra-nimbus/issues/138
Signed-off-by: Jakub Sokołowski <jakub@status.im>
2023-02-21 00:11:55 +01:00
fc236d0132
remove SSH access from mamy, miran, s1fr0
...
Signed-off-by: Jakub Sokołowski <jakub@status.im>
2023-02-17 16:35:09 +01:00
c05fad0e01
add linux-01.cs-eu-nld1.nimbus.prater host
...
Test for replacement of Hetzner hosts:
https://github.com/status-im/infra-nimbus/issues/132
Signed-off-by: Jakub Sokołowski <jakub@status.im>
2023-01-30 21:34:21 +01:00
61d586fa8c
logs.nimbus: set static max heap size to 48 GB
...
Trying to fix the `Data too large` circuit breaker errors:
https://github.com/status-im/infra-nimbus/issues/138
Signed-off-by: Jakub Sokołowski <jakub@status.im>
2023-01-28 14:17:04 +01:00