Ansible role for ElasticSearch cluster
Go to file
markoburcul aedab4dae9
config: Increase url max length for http queries
Resolves issue with Elastic Comrade which was getting 400 too_long_frame_exception
error when querying ES nodes.

Signed-off-by: markoburcul <marko@status.im>
2024-09-18 20:29:50 +02:00
defaults docker: upgrade to Docker Compose V2 2024-08-08 12:46:29 +02:00
meta meta: use full names of Ansible roles 2024-03-10 13:34:12 +01:00
tasks docker: upgrade to Docker Compose V2 2024-08-08 12:46:29 +02:00
templates config: Increase url max length for http queries 2024-09-18 20:29:50 +02:00
BACKUPS.md add /var/backup to container for snapshots 2022-02-14 14:42:36 +01:00
README.md add /var/backup to container for snapshots 2022-02-14 14:42:36 +01:00

README.md

Description

This role configures an ElasticSearch cluster as part of the ELK Stack for the purpose of storing logs for future querying. This data is aggregated by Logstash for use with the Kibana dashboard.

Usage

The recommended number of hosts in an ES cluster is at least 3. This way with number_of_replicas set to 2 means that if one host goes down we lose none of the data.

For more details read: https://www.elastic.co/guide/en/elasticsearch/guide/current/replica-shards.html

Configuration

The only mandatory settings in defaults/main.yml are:

es_cluster_name: 'my-awesome-cluster'
es_master_nodes:
  - { name: node-01.es.example.vpn, addr: 1.2.3.4, port: 9300 }
  - { name: node-02.es.example.vpn, addr: 2.3.4.5, port: 9300 }
  - { name: node-03.es.example.vpn, addr: 3.4.5.6, port: 9300 }

The only other configuration that makes any difference are the JVM options like the ones related to heap size in:

es_jvm_min_heap: 2g
es_jvm_max_heap: 2g

As the hosts are scaled up to deal with more and more logs we should adjust those in turn.

Backups

For information on how to create backups see the BACKUPS.md document.

Known Issues

Because we need to know the VPN IPs of all the nodes in the ES cluster we need to run the setup modules(gather_facts: true) on them in order to get that. So if this role is not ran for the whole cluster it will fail due to lack of value ansible_local.wireguard.vpn_ip variable.

We could use Consul for this but it would not work the first time setting up a new cluster.