Commit Graph

1013 Commits

Author SHA1 Message Date
Zahary Karadjov 4668266eb6
Specify the double-checking slashing DB mode on all large servers
Signed-off-by: Jakub Sokołowski <jakub@status.im>
2021-02-22 15:16:47 +01:00
Zahary Karadjov f887391f04 More varied validator assignments 2021-02-22 15:14:16 +01:00
Jakub Sokołowski 8dfe06a9bd
change testing branch build to 6am
Signed-off-by: Jakub Sokołowski <jakub@status.im>
2021-02-22 15:09:28 +01:00
Jakub Sokołowski 86ce49aae4
upgrade Terraform to 0.14.4
Signed-off-by: Jakub Sokołowski <jakub@status.im>
2021-02-17 11:40:17 +01:00
Jakub Sokołowski a40c0ab49c
geth-goerli: increase cache size from default 1GB to 2GB
Signed-off-by: Jakub Sokołowski <jakub@status.im>
2021-02-12 23:06:04 +01:00
Jakub Sokołowski bb1e3ff3ba
bump size of Goerli Geth node to t3a.large
It keeps hitting memory limits and failing to sync.

Signed-off-by: Jakub Sokołowski <jakub@status.im>
2021-02-12 22:58:42 +01:00
Jakub Sokołowski a5a950bb91
set become=false in ansible.cfg, set it to true in playbooks
See: https://github.com/status-im/nimbus-private/pull/9

Signed-off-by: Jakub Sokołowski <jakub@status.im>
2021-02-09 20:27:13 +01:00
Jakub Sokołowski 0e82fa7c03
beacon-node-builds: don't update git branch when deploying
Signed-off-by: Jakub Sokołowski <jakub@status.im>
2021-02-09 18:35:32 +01:00
Jakub Sokołowski 4cdd5f01ce
increase Goerli container memory limit to 80%
Signed-off-by: Jakub Sokołowski <jakub@status.im>
2021-02-09 18:32:11 +01:00
Jakub Sokołowski d8b643fd74
add SSH users to systemd-journal group
Signed-off-by: Jakub Sokołowski <jakub@status.im>
2021-02-08 12:36:14 +01:00
Jakub Sokołowski f96fe517e7
beacon-node-builds: allow staff group members to start builds
I was thinking of defining the jobs as nimbus user systemd units, but
that would not work unless the nimbus user had logged in at least once.

Also, systemd has no functionality for allowing non-root users
to start/stop/restart system services, so sudo is the simplest way.

Signed-off-by: Jakub Sokołowski <jakub@status.im>
2021-02-08 12:36:02 +01:00
Jakub Sokołowski e5037a6751
geth: bump instance data volume to 1200 GB
Signed-off-by: Jakub Sokołowski <jakub@status.im>
2021-02-06 20:38:33 +01:00
Jakub Sokołowski 8e0966fb0e
beacon-node-builds: don't fetch if we're on a detached head
Causes failures when a tag is checked out, nothing to fetch.

Signed-off-by: Jakub Sokołowski <jakub@status.im>
2021-02-04 23:48:26 +01:00
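The detached-head check described in the commit above could be sketched as follows. This is an illustration only, not the script from the repository: the function names and the `repo_dir` parameter are hypothetical. It relies on `git rev-parse --abbrev-ref HEAD` printing the literal string `HEAD` when no branch is checked out, for example after checking out a tag.

```python
import subprocess


def is_detached(abbrev_ref: str) -> bool:
    """Return True if the output of `git rev-parse --abbrev-ref HEAD`
    indicates a detached HEAD (git prints the literal string "HEAD")."""
    return abbrev_ref.strip() == "HEAD"


def fetch_if_on_branch(repo_dir: str) -> bool:
    """Run `git fetch` only when a branch is checked out.

    Returns True if a fetch was performed, False if it was skipped.
    `repo_dir` is a hypothetical path to the build checkout.
    """
    ref = subprocess.run(
        ["git", "rev-parse", "--abbrev-ref", "HEAD"],
        cwd=repo_dir, check=True, capture_output=True, text=True,
    ).stdout
    if is_detached(ref):
        # A tag or a bare commit is checked out: there is no branch to update.
        return False
    subprocess.run(["git", "fetch"], cwd=repo_dir, check=True)
    return True
```

Keeping the string check separate from the subprocess calls makes the decision easy to test without a real repository.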
zah 33ba3b1e8b Upgrade the production fleet to 1.0.7 2021-02-04 21:41:22 +01:00
Jakub Sokołowski b7226818d3
disable kibana-logtrail, it's not been updated to 7.10.2
Signed-off-by: Jakub Sokołowski <jakub@status.im>
2021-02-01 16:05:50 +01:00
Jakub Sokołowski dbac764185
upgrade ElasticSearch to 7.10.2
Signed-off-by: Jakub Sokołowski <jakub@status.im>
2021-02-01 15:57:10 +01:00
Jakub Sokołowski 921b1c3ea7
ansible.cfg: enable ansible ssh multiplexing
Signed-off-by: Jakub Sokołowski <jakub@status.im>
2021-02-01 13:21:44 +01:00
Jakub Sokołowski 157d215230
specify beacon_node_netkey for nimbus-mainnet-small bootnodes
Signed-off-by: Jakub Sokołowski <jakub@status.im>
2021-01-29 13:26:55 +01:00
Jakub Sokołowski 87a0503c22
increase volume size for Mainnet Geth to 1TB
Signed-off-by: Jakub Sokołowski <jakub@status.im>
2021-01-27 16:47:25 +01:00
Ștefan Talpalaru 37b4e558b2 fix the Makefile and a shell script
Signed-off-by: Jakub Sokołowski <jakub@status.im>
2021-01-26 15:31:26 +01:00
Jakub Sokołowski 26a31c5d63
use 8000+ UIDs for extra SSH users to not clash with core
Signed-off-by: Jakub Sokołowski <jakub@status.im>
2021-01-26 12:11:33 +01:00
Jakub Sokołowski d12ccc626b
ansible.cfg: shell instead of command as default module
Signed-off-by: Jakub Sokołowski <jakub@status.im>
2021-01-25 15:10:25 +01:00
Jakub Sokołowski 8f76c6810a
log-store: fix mounting of data volumes
Signed-off-by: Jakub Sokołowski <jakub@status.im>
2021-01-25 15:08:34 +01:00
Jakub Sokołowski 62f44b8a05
drop old fleet from group and host vars and pyrmont playbook
Signed-off-by: Jakub Sokołowski <jakub@status.im>
2021-01-22 15:59:47 +01:00
Jakub Sokołowski 49bd1fa267
remove old pyrmont fleet, move new one back to pyrmont.tf
Signed-off-by: Jakub Sokołowski <jakub@status.im>
2021-01-22 15:56:26 +01:00
Jakub Sokołowski 8aa381ca2e
update names in inventory for nimbus.mainnet hosts
Signed-off-by: Jakub Sokołowski <jakub@status.im>
2021-01-22 15:46:04 +01:00
Jakub Sokołowski 84be761116
fix hostname in nimbus-pyrmont-unstable-libp2p validators layout
Signed-off-by: Jakub Sokołowski <jakub@status.im>
2021-01-22 15:19:58 +01:00
Jakub Sokołowski 8920b57c34 rename mainnet hosts to match stable branch
Signed-off-by: Jakub Sokołowski <jakub@status.im>
2021-01-22 14:27:39 +01:00
Jakub Sokołowski 37c1c62a5d add validators distribution for the new fleet
Signed-off-by: Jakub Sokołowski <jakub@status.im>
2021-01-22 14:27:39 +01:00
Jakub Sokołowski 8ad0027936
log-store: add 2GB swap files to hosts
Signed-off-by: Jakub Sokołowski <jakub@status.im>
2021-01-20 20:35:33 +01:00
Jakub Sokołowski 1218ab1772
lower volume size for logs to 200 GB for now
Signed-off-by: Jakub Sokołowski <jakub@status.im>
2021-01-20 20:35:16 +01:00
Jakub Sokołowski 9a9619138b
add dashboard link to README
Signed-off-by: Jakub Sokołowski <jakub@status.im>
2021-01-20 19:27:04 +01:00
Jakub Sokołowski 6e138997e2
re-enable sending logs to logstash
Signed-off-by: Jakub Sokołowski <jakub@status.im>
2021-01-20 19:27:04 +01:00
Jakub Sokołowski 45a0315bb0
deploy elasticsearch and a kibana dashboard
Signed-off-by: Jakub Sokołowski <jakub@status.im>
2021-01-20 16:00:29 +01:00
Jakub Sokołowski ef8e77a1ac
re-add the ELK stack hosts
Signed-off-by: Jakub Sokołowski <jakub@status.im>
2021-01-20 15:59:11 +01:00
Jakub Sokołowski 8d939e1cd9
bump data volume IOPS to 2500
Signed-off-by: Jakub Sokołowski <jakub@status.im>
2021-01-19 16:29:53 +01:00
Jakub Sokołowski 2b4c2320ce
move pyrmont variables to pyrmont.tf
Signed-off-by: Jakub Sokołowski <jakub@status.im>
2021-01-19 10:33:27 +01:00
Jakub Sokołowski 67ebb93be3
fix wrong build nodes listed in playbook
Signed-off-by: Jakub Sokołowski <jakub@status.im>
2021-01-18 19:53:52 +01:00
Jakub Sokołowski 0bc722ca21
dist-validators: don't run any tasks if no layout is defined
Signed-off-by: Jakub Sokołowski <jakub@status.im>
2021-01-18 19:47:51 +01:00
Jakub Sokołowski 89dfe7ca5c
add new fleet hosts to inventory
Signed-off-by: Jakub Sokołowski <jakub@status.im>
2021-01-18 19:35:03 +01:00
Jakub Sokołowski 2b44636cd6
create new pyrmont fleet using new branch naming
Signed-off-by: Jakub Sokołowski <jakub@status.im>
2021-01-18 19:04:10 +01:00
Jakub Sokołowski 05684f23b1
drop unused toledo and master references
Signed-off-by: Jakub Sokołowski <jakub@status.im>
2021-01-18 16:52:40 +01:00
Jakub Sokołowski e2a2070f43
bump data volume IOPS to 2000
Signed-off-by: Jakub Sokołowski <jakub@status.im>
2021-01-18 16:14:02 +01:00
Jakub Sokołowski 2784affaa9
pyrmont: change data EBS volumes to io1 type, 1500 IOPS
The max IOPS for gp2/3 is 16,000 for 16 TB.
The max ratio of IOPS to size is 50.
This gives a maximum of 7500 IOPS for 150 GB.

Signed-off-by: Jakub Sokołowski <jakub@status.im>
2021-01-18 15:51:12 +01:00
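The arithmetic in the commit message above can be checked with a short sketch. The ratio of 50 IOPS per GB and the 150 GB volume size are taken from that message; the function name is hypothetical and this is not code from the repository.

```python
def max_provisioned_iops(size_gb: int, iops_per_gb: int = 50) -> int:
    """Return the highest IOPS value allowed for a volume of the given size,
    assuming a fixed maximum ratio of IOPS to size (50 per the commit message)."""
    return size_gb * iops_per_gb


# 150 GB * 50 IOPS/GB = 7500 IOPS, so the 1500 IOPS chosen in the commit
# is well under the ceiling for that volume size.
ceiling = max_provisioned_iops(150)
requested = 1500
within_limit = requested <= ceiling
```

The same check covers the later bumps to 2000 and 2500 IOPS, which also stay below 7500.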
Zahary Karadjov e2244166ca
Simple Makefile target for generating an SSH config file with the fleet hosts
`make ssh-config` will generate a file ~/.ssh/config.d/infra-numbus with all
hostnames from the Ansible inventory. You can `Include` this file from your
main config file and re-generate it at any time. Having such a config
file makes it easier to ssh into any host on the fleet by taking advantage of
the built-in ssh hostname autocompletion in your shell.

The names of the entries were chosen so that typing

ssh nimbus<tab>goerli<tab>

... produces the expected result.
2021-01-18 15:09:08 +01:00
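As a rough illustration of what such a generated file might contain, the sketch below renders one `Host` block per hostname. It is not the Makefile target from the commit: the function name, the `domain` parameter, and the example hostnames are all hypothetical, and the real target reads its hostnames from the Ansible inventory.

```python
from typing import Iterable


def render_ssh_config(hostnames: Iterable[str], domain: str = "") -> str:
    """Render an ssh_config fragment with one `Host` block per hostname.

    `domain` is a hypothetical suffix appended to form the HostName value;
    with an empty domain the alias and the HostName are identical.
    """
    blocks = []
    for name in sorted(hostnames):
        target = f"{name}.{domain}" if domain else name
        blocks.append(f"Host {name}\n    HostName {target}\n")
    # Separate the blocks with a blank line for readability.
    return "\n".join(blocks)


# Hypothetical hostnames, used only to show the output shape.
example = render_ssh_config(["node-02.example", "node-01.example"], "example.org")
```

The resulting text could be written to a file under ~/.ssh/config.d/ and pulled in with an `Include` line in the main SSH config, as the commit message describes.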
Jakub Sokołowski ac608401ae
pyrmont: use gp3 EBS volumes for data on devel-small fleet
Signed-off-by: Jakub Sokołowski <jakub@status.im>
2021-01-14 17:14:21 +01:00
Jakub Sokołowski 2132851b16
use bootstrap__active_extra_users variable
Signed-off-by: Jakub Sokołowski <jakub@status.im>
2021-01-14 14:01:57 +01:00
Jakub Sokołowski 701ca85795
geth: lower size of goerli host to t3a.medium
More memory is only necessary when syncing from scratch.

Signed-off-by: Jakub Sokołowski <jakub@status.im>
2021-01-14 12:08:42 +01:00
Jakub Sokołowski ab017c29bb
update inventory
Signed-off-by: Jakub Sokołowski <jakub@status.im>
2021-01-13 17:07:31 +01:00
Jakub Sokołowski 7660970894
bump size of geth mainnet node to t3a.xlarge temporarily
Signed-off-by: Jakub Sokołowski <jakub@status.im>
2021-01-13 17:07:20 +01:00