consul

Commit Graph

Author	SHA1	Message	Date
Matt Keeler	9f37a218c5	Merge pull request #8035 from hashicorp/feature/auto-config/server-rpc	2020-06-17 20:08:17 +00:00
Daniel Nephin	058114e82e	Merge pull request #7762 from hashicorp/dnephin/warn-on-unknown-service-file config: warn if a config file is being skipped because of its file extension	2020-06-17 15:21:34 -04:00
Pierre Souchay	318495d1f8	gossip: Ensure that metadata of Consul Service is updated (#7903 ) While upgrading servers to a new version, I saw that metadata of existing servers are not upgraded, so the version and raft meta is not up to date in catalog. The only way to do it was to: * update Consul server * make it leave the cluster, then metadata is accurate That's because the optimization to avoid updating catalog does not take into account metadata, so no update on catalog is performed.	2020-06-17 10:17:33 +00:00
Matt Keeler	c3b348bebb	Agent Auto Configuration: Configuration Syntax Updates (#8003 )	2020-06-16 19:03:59 +00:00
Matt Keeler	3c4413cbed	ACL Node Identities (#7970 ) A Node Identity is very similar to a service identity. Its main targeted use is to allow creating tokens for use by Consul agents that will grant the necessary permissions for all the typical agent operations (node registration, coordinate updates, anti-entropy). Half of this commit is for golden file based tests of the acl token and role cli output. Another big update was to refactor many of the tests in agent/consul/acl_endpoint_test.go to use the same style of tests and the same helpers. Besides being less boilerplate in the tests it also uses a common way of starting a test server with ACLs that should operate without any warnings regarding deprecated non-uuid master tokens etc.	2020-06-16 16:55:01 +00:00
Matt Keeler	64262d22d6	Make the Agent Cache more Context aware (#8092 ) Blocking queries already issued will still be uncancellable (that cannot be helped until we get rid of net/rpc). However this makes it so that if calling getWithIndex (like during a cache Notify go routine) we can cancel the outer routine. Previously it would keep issuing more blocking queries until the result state actually changed.	2020-06-15 15:43:32 +00:00
Freddy	2af14433be	Merge pull request #8099 from hashicorp/gateway-services-endpoint	2020-06-12 21:15:25 +00:00
Freddy	c9dbb6c51a	Only pass one hostname via EDS and prefer healthy ones (#8084 ) Co-authored-by: Matt Keeler <mkeeler@users.noreply.github.com> Currently when passing hostname clusters to Envoy, we set each service instance registered with Consul as an LbEndpoint for the cluster. However, Envoy can only handle one per cluster: [2020-06-04 18:32:34.094][1][warning][config] [source/common/config/grpc_subscription_impl.cc:87] gRPC config for type.googleapis.com/envoy.api.v2.Cluster rejected: Error adding/updating cluster(s) dc2.internal.ddd90499-9b47-91c5-4616-c0cbf0fc358a.consul: LOGICAL_DNS clusters must have a single locality_lb_endpoint and a single lb_endpoint, server.dc2.consul: LOGICAL_DNS clusters must have a single locality_lb_endpoint and a single lb_endpoint Envoy is currently handling this gracefully by only picking one of the endpoints. However, we should avoid passing multiple to avoid these warning logs. This PR: * Ensures we only pass one endpoint, which is tied to one service instance. * We prefer sending an endpoint which is marked as Healthy by Consul. * If no endpoints are healthy we emit a warning and skip the cluster. * If multiple unique hostnames are spread across service instances we emit a warning and let the user know which will be resolved.	2020-06-12 19:46:51 +00:00
Chris Piraino	7f89ab990e	Allow users to set hosts to the wildcard specifier when TLS is disabled (#8083 ) This allows easier demoing/testing of ingress gateways, while still preserving the validation we have for DNSSANs	2020-06-11 15:03:46 +00:00
Chris Piraino	42c8f34788	Move ingress param to a new endpoint (#8081 ) In discussion with team, it was pointed out that query parameters tend to be filter mechanism, and that semantically the "/v1/health/connect" endpoint should return "all healthy connect-enabled endpoints (e.g. could be side car proxies or native instances) for this service so I can connect with mTLS". That does not fit an ingress gateway, so we remove the query parameter and add a new endpoint "/v1/health/ingress" that semantically means "all the healthy ingress gateway instances that I can connect to to access this connect-enabled service without mTLS"	2020-06-10 18:07:41 +00:00
Chris Piraino	ea1b54a826	Merge pull request #8064 from hashicorp/ingress/health-query-param Add API query parameter ?ingress to allow users to find ingress gateways associated to a service	2020-06-09 21:09:09 +00:00
Hans Hasselberg	a678b47c73	acl: do not resolve local tokens from remote dcs (#8068 )	2020-06-09 19:14:19 +00:00
Kyle Havlovitz	9e6718ad0f	Merge pull request #8040 from hashicorp/ingress/expose-cli Ingress expose CLI command	2020-06-09 19:11:51 +00:00
Daniel Nephin	1bfb7f3b07	Merge pull request #7964 from hashicorp/dnephin/remove-patch-slice-of-maps-forward-compat config: Use HookWeakDecodeFromSlice in place of PatchSliceOfMaps	2020-06-08 23:53:31 +00:00
Hans Hasselberg	cfc95732f3	Tokens converted from legacy ACLs get their Hash computed (#8047 ) (#8054 ) This allows new style token replication to work for legacy tokens as well when they change. Fixes #5606	2020-06-08 23:36:55 +02:00
Hans Hasselberg	b4f33b52a2	agent: add option to disable agent cache for HTTP endpoints (#8023 ) This allows the operator to disable agent caching for the http endpoint. It is on by default for backwards compatibility and if disabled will ignore the url parameter `cached`.	2020-06-08 22:49:33 +02:00
Chris Piraino	165a9af053	Always require Host header values for http services (#7990 ) Previously, we did not require the 'service-name.' host header value when only a single http service was exposed. However, this allows a user to get into a situation where, if they add another service to the listener, suddenly the previous service's traffic might not be routed correctly. Thus, we always require the Host header, even if there is only 1 service. Also, we make the default domain matching more restrictive by matching "service-name.ingress." by default. This lines up better with the namespace case and more accurately matches the Consul DNS value we expect people to use in this case.	2020-06-08 18:16:48 +00:00
Hans Hasselberg	c675166e1b	Setup intermediate_pki_path on secondary when using vault (#8001 ) Make sure to mount vault backend for intermediate_pki_path on secondary dc.	2020-06-05 19:37:21 +00:00
Hans Hasselberg	de3e68c577	Merge pull request #7966 from hashicorp/pool_improvements Agent connection pool cleanup	2020-06-05 19:03:24 +00:00
R.B. Boyer	89fc98322e	tests: ensure that the ServiceExists helper function normalizes entmeta (#8025 ) This fixes a unit test failure over in enterprise due to https://github.com/hashicorp/consul/pull/7384	2020-06-05 08:42:35 +00:00
Hans Hasselberg	0491a9301b	tests: use constructor instead init (#8024 )	2020-06-04 23:12:44 +02:00
R.B. Boyer	ebc5fc039f	server: don't activate federation state replication or anti-entropy until all servers are running 1.8.0+ (#8014 )	2020-06-04 21:05:49 +00:00
Pierre Souchay	621862606e	checks: when a service does not exist in an alias, consider it failing (#7384 ) In the current implementation of Consul, a check alias cannot determine whether a service exists or not. Because a service without any check is semantically considered as passing, when no healthchecks are found for an agent, the check was considered as passing. But this makes little sense, as the current implementation does not make any difference between: * a non-existing service (passing) * a service without any check (passing as well) In order to make it work, we have to ensure that when a check did not find any healthcheck, the service does indeed exist. If it does not, let's consider the check as failing.	2020-06-04 12:51:23 +00:00
Freddy	5d2475232a	Enable gateways to resolve hostnames to IPv4 addresses (#7999 ) The DNS resolution will be handled by Envoy and defaults to LOGICAL_DNS. This discovery type can be overridden on a per-gateway basis with the envoy_dns_discovery_type Gateway Option. If a service contains an instance with a hostname as an address we set the Envoy cluster to use DNS as the discovery type rather than EDS. Since both mesh gateways and terminating gateways route to clusters using SNI, whenever there is a mix of hostnames and IP addresses associated with a service we use the hostname + CDS rather than the IPs + EDS. Note that we detect hostnames by attempting to parse the service instance's address as an IP. If it is not a valid IP we assume it is a hostname.	2020-06-03 18:51:33 -06:00
Matt Keeler	1e2754d59c	Fix legacy management tokens in unupgraded secondary dcs (#7908 ) The ACL.GetPolicy RPC endpoint was supposed to return the “parent” policy and not always the default policy. In the case of legacy management tokens the parent policy was supposed to be “manage”. The result of us not sending this properly was that operations that required specifically a management token such as saving a snapshot would not work in secondary DCs until they were upgraded.	2020-06-03 15:42:57 +00:00
Matt Keeler	a539c5de88	Fix segfault due to race condition for checking server versions (#7957 ) The ACL monitoring routine uses c.routers to check for server version updates. Therefore it needs to be started after initializing the routers.	2020-06-03 14:37:10 +00:00
R.B. Boyer	5404155d36	acl: allow auth methods created in the primary datacenter to optionally create global tokens (#7899 )	2020-06-01 16:45:22 +00:00
R.B. Boyer	c4b875cae4	acl: remove the deprecated `acl_enforce_version_8` option (#7991 ) Fixes #7292	2020-06-01 10:40:22 -05:00
Jono Sosulska	cedcbf3299	Replace whitelist/blacklist terminology with allowlist/denylist (#7971 ) * Replace whitelist/blacklist terminology with allowlist/denylist	2020-06-01 10:40:14 -05:00
Daniel Nephin	1664067943	ci: Add staticcheck and fix most errors Three of the checks are temporarily disabled to limit the size of the diff, and allow us to enable all the other checks in CI. In a follow up we can fix the issues reported by the other checks one at a time, and enable them.	2020-06-01 10:40:04 -05:00
Daniel Nephin	1aeede5eb7	config: use the new HookTranslateKeys instead of lib.TranslateKeys With the exception of CA provider config, which will be migrated at some later time.	2020-06-01 10:39:58 -05:00
Daniel Nephin	b11a615f0c	Add alias struct tags for new decode hook	2020-06-01 10:39:51 -05:00
Raphaël Rondeau	b29c954480	connect: fix endpoints clusterName when using cluster escape hatch (#7319 ) ```changelog * fix(connect): fix endpoints clusterName when using cluster escape hatch ```	2020-06-01 10:35:31 -05:00
Pierre Souchay	0d86e802be	Stop all watches before shutting down anything during shutdown. (#7526 ) This will prevent watches from being triggered. ```changelog * fix(agent): stop all watches before shutting down ```	2020-06-01 10:35:14 -05:00
Pierre Souchay	66612e5dc6	tests: added unit test to ensure watches are not re-triggered on consul reload (#7449 ) This ensures no regression about https://github.com/hashicorp/consul/issues/7318 And ensure that https://github.com/hashicorp/consul/issues/7446 cannot happen anymore	2020-06-01 10:33:31 -05:00
Pierre Souchay	876ee89d4a	Allow to restrict servers that can join a given Serf Consul cluster. (#7628 ) Based on work done in https://github.com/hashicorp/memberlist/pull/196 this allows restricting the IP ranges that can join a given Serf cluster and be a member of the cluster. Restrictions on IPs can be set separately using 2 new flags and config options, one for LAN Serf and one for WAN Serf.	2020-06-01 10:31:32 -05:00
R.B. Boyer	c2b903b597	create lib/stringslice package (#7934 )	2020-05-27 16:48:01 +00:00
R.B. Boyer	b527e77850	agent: handle re-bootstrapping in a secondary datacenter when WAN federation via mesh gateways is configured (#7931 ) The main fix here is to always union the `primary-gateways` list with the list of mesh gateways in the primary returned from the replicated federation states list. This will allow any replicated (incorrect) state to be supplemented with user-configured (correct) state in the config file. Eventually the game of random selection whack-a-mole will pick a winning entry and re-replicate the latest federation states from the primary. If the user-configured state is actually the incorrect one, then the same eventual correct selection process will work in that case, too. The secondary fix is actually to finish making wanfed-via-mgws actually work as originally designed. Once a secondary datacenter has replicated federation states for the primary AND managed to stand up its own local mesh gateways then all of the RPCs from a secondary to the primary SHOULD go through two sets of mesh gateways to arrive in the consul servers in the primary (one hop for the secondary datacenter's mesh gateway, and one hop through the primary datacenter's mesh gateway). This was neglected in the initial implementation. While everything works, ideally we should treat communications that go around the mesh gateways as just provided for bootstrapping purposes. Now we heuristically use the success/failure history of the federation state replicator goroutine loop to determine if our current mesh gateway route is working as intended. If it is, we try using the local gateways, and if those don't work we fall back on trying the primary via the union of the replicated state and the go-discover configuration flags. This can be improved slightly in the future by possibly initializing the gateway choice to local on startup if we already have replicated state. This PR does not address that improvement. Fixes #7339	2020-05-27 16:32:22 +00:00
R.B. Boyer	1765fa854e	connect: ensure proxy-defaults protocol is used for upstreams (#7938 )	2020-05-21 21:09:51 +00:00
hashicorp-ci	7dd0a87286	update bindata_assetfs.go	2020-05-21 19:33:58 +00:00
Daniel Nephin	7925a0074c	Merge pull request #7933 from hashicorp/dnephin/state-txn-missing-errors state: fix unhandled error	2020-05-21 17:03:33 +00:00
Aleksandr Zagaevskiy	6aecf89418	Preserve ModifyIndex for unchanged entry in KVS TXN (#7832 )	2020-05-21 17:03:16 +00:00
Seth Hoenig	352ed2c13b	grpc: use default resolver scheme for grpc dialing (#7617 ) Currently checks of type gRPC will emit log messages such as, 2020/02/12 13:48:22 [INFO] parsed scheme: "" 2020/02/12 13:48:22 [INFO] scheme "" not registered, fallback to default scheme Without adding full support for using custom gRPC schemes (maybe that's the right long-term path) we can just supply the default scheme as provided by the grpc library. Fixes https://github.com/hashicorp/consul/issues/7274 and https://github.com/hashicorp/nomad/issues/7415	2020-05-21 17:01:47 +00:00
Daniel Nephin	c02d4e1390	Merge pull request #7894 from hashicorp/dnephin/add-linter-staticcheck-1 Fix some bugs/issues found by staticcheck	2020-05-21 17:01:15 +00:00
Kyle Havlovitz	0bcbed16ca	Standardize support for Tagged and BindAddresses in Ingress Gateways (#7924 ) * Standardize support for Tagged and BindAddresses in Ingress Gateways This updates the TaggedAddresses and BindAddresses behavior for Ingress to match Mesh/Terminating gateways. The `consul connect envoy` command now also allows passing an address without a port for tagged/bind addresses. * Update command/connect/envoy/envoy.go Co-authored-by: Freddy <freddygv@users.noreply.github.com> * PR comments * Check to see if address is an actual IP address * Update agent/xds/listeners.go Co-authored-by: Freddy <freddygv@users.noreply.github.com> * fix whitespace Co-authored-by: Chris Piraino <cpiraino@hashicorp.com> Co-authored-by: Freddy <freddygv@users.noreply.github.com>	2020-05-21 14:08:43 +00:00
Chris Piraino	6969d08361	Merge pull request #7898 from hashicorp/bug/update-gateways-on-config-entry-delete Remove error from GatewayServices RPC when a service is not a gateway	2020-05-18 18:03:35 +00:00
hashicorp-ci	b1c9c5c571	update bindata_assetfs.go	2020-05-14 14:33:09 +00:00
Pierre Souchay	cf55e81c06	tests: fix unstable test `TestAgentAntiEntropy_Checks`. (#7594 ) Example of failure: https://circleci.com/gh/hashicorp/consul/153932#tests/containers/2	2020-05-14 09:54:49 +02:00
Kit Patella	ad1d4d4d07	http: migrate from instrumentation in s.wrap() to an s.enterpriseHandler()	2020-05-13 15:47:05 -07:00
Matt Keeler	acccdbe45c	Fix identity resolution on clients and in secondary dcs (#7862 ) Previously this happened to be using the method on the Server/Client that was meant to allow the ACLResolver to locally resolve tokens. On Servers that had tokens (primary or secondary dc + token replication) this function would lookup the token from raft and return the ACLIdentity. On clients this was always a noop. We inadvertently used this function instead of creating a new one when we added logging accessor ids for permission denied RPC requests. With this commit, a new method is used for resolving the identity properly via the ACLResolver which may still resolve locally in the case of being on a server with tokens but also supports remote token resolution.	2020-05-13 13:00:08 -04:00

2054 Commits