consul

Commit Graph

Author	SHA1	Message	Date
Mike Morris	65be58703c	connect: remove managed proxies (#6220 ) * connect: remove managed proxies implementation and all supporting config options and structs * connect: remove deprecated ProxyDestination * command: remove CONNECT_PROXY_TOKEN env var * agent: remove entire proxyprocess proxy manager * test: remove all managed proxy tests * test: remove irrelevant managed proxy note from TestService_ServerTLSConfig * test: update ContentHash to reflect managed proxy removal * test: remove deprecated ProxyDestination test * telemetry: remove managed proxy note * http: remove /v1/agent/connect/proxy endpoint * ci: remove deprecated test exclusion * website: update managed proxies deprecation page to note removal * website: remove managed proxy configuration API docs * website: remove managed proxy note from built-in proxy config * website: add note on removing proxy subdirectory of data_dir	2019-08-09 15:19:30 -04:00
Aestek	24a0f2bba2	ae: use stale requests when performing full sync (#5873 ) Read requests performed during anti-entropy full sync currently target the leader only. This generates a non-negligible load on the leader when the DC is large enough and can be offloaded to the followers following the "eventually consistent" policy for the agent state. We switch the AE read calls to use stale requests with a small (2s) MaxStaleDuration value and make sure we do not read too fast after a write.	2019-06-17 18:05:47 +02:00
R.B. Boyer	40336fd353	agent: fix several data races and bugs related to node-local alias checks (#5876 ) The observed bug was that a full restart of a consul datacenter (servers and clients) in conjunction with a restart of a connect-flavored application with bring-your-own-service-registration logic would very frequently cause the envoy sidecar service check to never reflect the aliased service. Over the course of investigation several bugs and unfortunate interactions were corrected: (1) local.CheckState objects were only shallow copied, but the key piece of data that gets read and updated is one of the things not copied (the underlying Check with a Status field). When the stock code was run with the race detector enabled this highly-relevant-to-the-test-scenario field was found to be racy. Changes: a) update the existing Clone method to include the Check field b) copy-on-write when those fields need to change rather than incrementally updating them in place. This made the observed behavior occur slightly less often. (2) If anything about how the runLocal method for node-local alias check logic was ever flawed, there was no fallback option. Those checks are purely edge-triggered and failure to properly notice a single edge transition would leave the alias check incorrect until the next flap of the aliased check. The change was to introduce a fallback timer to act as a control loop to double check the alias check matches the aliased check every minute (borrowing the duration from the non-local alias check logic body). This made the observed behavior eventually go away when it did occur. (3) Originally I thought there were two main actions involved in the data race: A. The act of adding the original check (from disk recovery) and its first health evaluation. B. The act of the HTTP API requests coming in and resetting the local state when re-registering the same services and checks. It took a while for me to realize that there's a third action at work: C. The goroutines associated with the original check and the later checks. The actual sequence of actions that was causing the bad behavior was that the API actions result in the original check being removed and re-added _without waiting for the original goroutine to terminate_. This means for brief windows of time during check definition edits there are two goroutines that can be sending updates for the alias check status. In extremely unlikely scenarios the original goroutine sees the aliased check start up in `critical` before being removed but does not get the notification about the nearly immediate update of that check to `passing`. This is interlaced with the new goroutine coming up, initializing its base case to `passing` from the current state and then listening for new notifications of edge triggers. If the original goroutine "finishes" its update, it then commits one more write into the local state of `critical` and exits leaving the alias check no longer reflecting the underlying check. The correction here is to enforce that the old goroutines must terminate before spawning the new one for alias checks.	2019-05-24 13:36:56 -05:00
Freddy	f2213f60e0	Update alias checks on local add and remove	2019-04-24 12:17:06 -06:00
R.B. Boyer	f4a3b9d518	fix typos reported by golangci-lint:misspell (#5434 )	2019-03-06 11:13:28 -06:00
Aestek	2aac4d5168	Register and deregisters services and their checks atomically in the local state (#5012 ) Prevent race between register and deregister requests by saving them together in the local state on registration. Also adds more cleaning in case of failure when registering services / checks.	2019-03-04 09:34:05 -05:00
Aestek	5960974db1	[Fix] Services sometimes not being synced with acl_enforce_version_8 = false (#4771 ) Fixes: https://github.com/hashicorp/consul/issues/3676 This fixes a bug where registering an agent with a non-existent ACL token can prevent other services registered with a good token from being synced to the server when using `acl_enforce_version_8 = false`. ## Background When `acl_enforce_version_8` is off the agent does not check the ACL token validity before storing the service in its state. When syncing a service registered with a missing ACL token we fall into the default error handling case (https://github.com/hashicorp/consul/blob/master/agent/local/state.go#L1255) and stop the sync (https://github.com/hashicorp/consul/blob/master/agent/local/state.go#L1082) without setting its Synced property to true like in the permission denied case. This means that the sync will always stop at the faulty service(s). The order in which the services are synced is random since we iterate on a map. So eventually all services with good ACL tokens will be synced; this can however take some time and is influenced by the cluster size: the bigger the cluster, the slower the sync, because retries are less frequent. Having a service in this state also prevents all further sync of checks as they are done after the services. ## Changes This change modifies the sync process to continue even if there is an error. This fixes the issue described above as well as making the sync more error tolerant: if the server repeatedly refuses a service (the ACL token could have been deleted by the time the service is synced, the servers were upgraded to a newer version that has more strict checks on the service definition...), then all services and checks that can be synced will be, and those that can't will be marked as errors in the logs instead of blocking the whole process.	2019-01-04 10:01:50 -05:00
Paul Banks	0f27ffd163	Proxy Config Manager (#4729 ) * Proxy Config Manager This component watches for local state changes on the agent and ensures that each service registered locally with Kind == connect-proxy has its state being actively populated in the cache. This serves two purposes: 1. For the built-in proxy, it ensures that the state needed to accept connections is available in RAM shortly after registration and likely before the proxy actually starts accepting traffic. 2. For (future - next PR) xDS server and other possible future proxies that require _push_ based config discovery, this provides a mechanism to subscribe and be notified about updates to a proxy instance's config including upstream service discovery results. * Address review comments * Better comments; Better delivery of latest snapshot for slow watchers; Embed Config * Comment typos * Add upstream Stringer for funsies	2018-10-10 16:55:34 +01:00
Paul Banks	e812f5516a	Add -sidecar-for and new /agent/service/:service_id endpoint (#4691 ) - A new endpoint `/v1/agent/service/:service_id` which is a generic way to look up the service for a single instance. The primary value here is that it: - supports hash-based blocking and so; - replaces `/agent/connect/proxy/:proxy_id` as the mechanism the built-in proxy uses to read its config. - It's not proxy specific and so works for any service. - It has a temporary shim to call through to the existing endpoint to preserve current managed proxy config defaulting behaviour until that is removed entirely (tested). - The built-in proxy now uses the new endpoint exclusively for its config - The built-in proxy now has a `-sidecar-for` flag that allows the service ID of the _target_ service to be specified, on the condition that there is exactly one "sidecar" proxy (that is one that has `Proxy.DestinationServiceID` set) for the service registered. - Several fixes for edge cases for SidecarService - A fix for `Alias` checks - when running locally they didn't update their state until some external thing updated the target. If the target service has no checks registered as below, then the alias never made it past critical.	2018-10-10 16:55:34 +01:00
Paul Banks	b83bbf248c	Add Proxy Upstreams to Service Definition (#4639 ) * Refactor Service Definition ProxyDestination. This includes: - Refactoring all internal structs used - Updated tests for both deprecated and new input for: - Agent Services endpoint response - Agent Service endpoint response - Agent Register endpoint - Unmanaged deprecated field - Unmanaged new fields - Managed deprecated upstreams - Managed new - Catalog Register - Unmanaged deprecated field - Unmanaged new fields - Managed deprecated upstreams - Managed new - Catalog Services endpoint response - Catalog Node endpoint response - Catalog Service endpoint response - Updated API tests for all of the above too (both deprecated and new forms of register) TODO: - config package changes for on-disk service definitions - proxy config endpoint - built-in proxy support for new fields * Agent proxy config endpoint updated with upstreams * Config file changes for upstreams. * Add upstream opaque config and update all tests to ensure it works everywhere. * Built in proxy working with new Upstreams config * Command fixes and deprecations * Fix key translation, upstream type defaults and a spate of other subtle bugs found with end to end test scripts... TODO: tests still failing on one case that needs a fix. I think it's key translation for upstreams nested in Managed proxy struct. * Fix translated keys in API registration. * Fixes from docs - omit some empty undocumented fields in API - Bring back ServiceProxyDestination in Catalog responses to not break backwards compat - this was removed assuming it was only used internally. * Documentation updates for Upstreams in service definition * Fixes for tests broken by many refactors. * Enable travis on f-connect branch in this branch too. * Add consistent Deprecation comments to ProxyDestination uses * Update version number on deprecation notices, and correct upstream datacenter field with explanation in docs	2018-10-10 16:55:34 +01:00
Martin	feb3ce4ee0	Use target service name instead of ID as connect proxy service name (#4620 )	2018-09-05 20:33:17 +01:00
Mitchell Hashimoto	f97bfd5be8	agent: address some basic feedback	2018-07-12 09:36:11 -07:00
Mitchell Hashimoto	7543d270e2	agent/local: support local alias checks	2018-07-12 09:36:10 -07:00
Pierre Souchay	ff53648df2	Merge remote-tracking branch 'origin/master' into ACL_additional_info	2018-07-07 14:09:18 +02:00
Paul Banks	e21723a891	Persist proxy state through agent restart	2018-06-25 12:24:08 -07:00
Mitchell Hashimoto	d9bd4ffebd	agent/local: clarify the non-risk of a full buffer	2018-06-14 09:42:10 -07:00
Mitchell Hashimoto	437689e83c	agent/local: remove outdated comment	2018-06-14 09:42:10 -07:00
Mitchell Hashimoto	2bd39a84a6	agent/local: add Notify mechanism for proxy changes	2018-06-14 09:42:08 -07:00
Mitchell Hashimoto	476ea7b04a	agent: start/stop proxies	2018-06-14 09:42:08 -07:00
Mitchell Hashimoto	7355a614fe	agent/local: store proxy on local state, wip, not working yet	2018-06-14 09:42:08 -07:00
Paul Banks	e0e12e165b	TLS watching integrated into Service with some basic tests. There are also a lot of small bug fixes found when testing lots of things end-to-end for the first time and some cleanup now it's integrated with real CA code.	2018-06-14 09:42:07 -07:00
Paul Banks	2a69663448	Agent Connect Proxy config endpoint with hash-based blocking	2018-06-14 09:41:57 -07:00
Paul Banks	e6071051cf	Added connect proxy config and local agent state setup on boot.	2018-06-14 09:41:57 -07:00
Pierre Souchay	c83124a94c	Removed labels from new ACL denied metrics	2018-06-08 11:56:46 +02:00
Pierre Souchay	064f8ad170	Removed consul prefix from metrics as requested by @kyhavlov	2018-06-08 11:51:50 +02:00
Pierre Souchay	65d3a2b26e	Fixed import	2018-04-18 17:09:25 +02:00
Pierre Souchay	f13aa5ba9b	Added labels to improve new metric	2018-04-18 16:51:22 +02:00
Pierre Souchay	d9a23bb2fa	Track calls blocked by ACLs using metrics	2018-04-17 10:17:16 +02:00
Guido Iaquinti	8cd11d5888	Add package name to log output	2018-03-21 15:56:14 +00:00
Josh Soref	94835a2715	Spelling (#3958 ) * spelling: another * spelling: autopilot * spelling: beginning * spelling: circonus * spelling: default * spelling: definition * spelling: distance * spelling: encountered * spelling: enterprise * spelling: expands * spelling: exits * spelling: formatting * spelling: health * spelling: hierarchy * spelling: imposed * spelling: independence * spelling: inspect * spelling: last * spelling: latest * spelling: client * spelling: message * spelling: minimum * spelling: notify * spelling: nonexistent * spelling: operator * spelling: payload * spelling: preceded * spelling: prepared * spelling: programmatically * spelling: required * spelling: reconcile * spelling: responses * spelling: request * spelling: response * spelling: results * spelling: retrieve * spelling: service * spelling: significantly * spelling: specifies * spelling: supported * spelling: synchronization * spelling: synchronous * spelling: themselves * spelling: unexpected * spelling: validations * spelling: value	2018-03-19 16:56:00 +00:00
Frank Schroeder	52e73301f6	local state: clone check to avoid side effect	2017-10-23 10:56:05 +02:00
Frank Schroeder	6bc9d66192	local state: use synchronized access to internal maps	2017-10-23 10:56:05 +02:00
Frank Schroeder	58d52ac580	local state: rename Add{Check,Service}State to Set{Check,Service}State	2017-10-23 10:56:04 +02:00
Frank Schroeder	e144f51b29	local state: move Metadata methods together	2017-10-23 10:56:04 +02:00
Frank Schroeder	4f9e05f634	local state: update documentation of updateSyncState	2017-10-23 10:56:04 +02:00
Frank Schroeder	41c7b0927e	local state: update comments	2017-10-23 10:56:04 +02:00
Frank Schroeder	de57b16d99	local state: address review comments * move non-blocking notification mechanism into ae.Trigger * move Pause/Resume into separate type	2017-10-23 10:56:04 +02:00
Frank Schroeder	71c74e62c7	local state: fix anti-entropy state tests The anti-entropy tests relied on the side-effect of the StartSync() method to perform a full sync instead of a partial sync. This led to multiple anti-entropy goroutines being started and to unnecessary retry loops. This change changes the behavior to perform synchronous full syncs when necessary, removing the need for all of the time.Sleep and most of the retry loops.	2017-10-23 10:56:04 +02:00
Frank Schroeder	37b95ef98e	local state: fix test with updated error message	2017-10-23 10:56:04 +02:00
Frank Schroeder	b803bf3091	local state: tests compile	2017-10-23 10:56:03 +02:00
Frank Schroeder	0a9ac9749e	local state: replace multi-map state with structs The state of the service and health check records was spread out over multiple maps guarded by a single lock. Access to the maps has to happen in a coordinated effort and the tests often violated this which made them brittle and racy. This patch replaces the multiple maps with a single one for both checks and services to make the code less fragile. This is also necessary since moving the local state into its own package creates circular dependencies for the tests. To avoid this the tests can no longer access internal data structures which they should not be doing in the first place. The tests still don't compile but this is a necessary step in that direction.	2017-10-23 10:56:03 +02:00
Frank Schroeder	6027a9e2a5	local state: move to separate package This patch moves the local state to a separate package to further decouple it from the agent code. The code compiles but the tests do not yet.	2017-10-23 10:56:03 +02:00
Frank Schroeder	67a0689f71	Revert "local state: move to separate package" This reverts commit `d447e823c6`.	2017-10-23 10:08:34 +02:00
Frank Schroeder	623e07760a	Revert "local state: replace multi-map state with structs" This reverts commit `ccbae7da5b`.	2017-10-23 10:08:34 +02:00
Frank Schroeder	9ed4b2d631	Revert "local state: tests compile" This reverts commit `1af52bf7be`.	2017-10-23 10:08:34 +02:00
Frank Schroeder	5ae4c52ca6	Revert "local state: fix test with updated error message" This reverts commit `e9149f64d9`.	2017-10-23 10:08:34 +02:00
Frank Schroeder	67cdfc038e	Revert "local state: fix anti-entropy state tests" This reverts commit `f8e20cd996`.	2017-10-23 10:08:34 +02:00
Frank Schroeder	46641e44d9	Revert "local state: address review comments" This reverts commit `1d315075b1`.	2017-10-23 10:08:33 +02:00
Frank Schroeder	abd83f2d28	Revert "local state: update comments" This reverts commit `42188164f8`.	2017-10-23 10:08:33 +02:00
Frank Schroeder	c6e441dd60	Revert "local state: update documentation of updateSyncState" This reverts commit `e86521e637`.	2017-10-23 10:08:33 +02:00
Frank Schroeder	648b4da4f8	Revert "local state: move Metadata methods together" This reverts commit `9bc8127728`.	2017-10-23 10:08:33 +02:00
Frank Schroeder	fa45a6a547	Revert "local state: rename Add{Check,Service}State to Set{Check,Service}State" This reverts commit `9280841a80`.	2017-10-23 10:08:33 +02:00
Frank Schroeder	d6f52d9ed1	Revert "local state: use synchronized access to internal maps" This reverts commit `39a2d8d25e`.	2017-10-23 10:08:32 +02:00
Frank Schroeder	4e862d126d	Revert "local state: clone check to avoid side effect" This reverts commit `af1243c725`.	2017-10-23 10:08:32 +02:00
Frank Schroeder	af1243c725	local state: clone check to avoid side effect	2017-10-23 08:03:18 +02:00
Frank Schroeder	39a2d8d25e	local state: use synchronized access to internal maps	2017-10-23 08:03:18 +02:00
Frank Schroeder	9280841a80	local state: rename Add{Check,Service}State to Set{Check,Service}State	2017-10-23 08:03:18 +02:00
Frank Schroeder	9bc8127728	local state: move Metadata methods together	2017-10-23 08:03:18 +02:00
Frank Schroeder	e86521e637	local state: update documentation of updateSyncState	2017-10-23 08:03:18 +02:00
Frank Schroeder	42188164f8	local state: update comments	2017-10-23 08:03:18 +02:00
Frank Schroeder	1d315075b1	local state: address review comments * move non-blocking notification mechanism into ae.Trigger * move Pause/Resume into separate type	2017-10-23 08:03:18 +02:00
Frank Schroeder	f8e20cd996	local state: fix anti-entropy state tests The anti-entropy tests relied on the side-effect of the StartSync() method to perform a full sync instead of a partial sync. This led to multiple anti-entropy goroutines being started and to unnecessary retry loops. This change changes the behavior to perform synchronous full syncs when necessary, removing the need for all of the time.Sleep and most of the retry loops.	2017-10-23 08:03:18 +02:00
Frank Schroeder	e9149f64d9	local state: fix test with updated error message	2017-10-23 08:03:18 +02:00
Frank Schroeder	1af52bf7be	local state: tests compile	2017-10-23 08:03:18 +02:00
Frank Schroeder	ccbae7da5b	local state: replace multi-map state with structs The state of the service and health check records was spread out over multiple maps guarded by a single lock. Access to the maps has to happen in a coordinated effort and the tests often violated this which made them brittle and racy. This patch replaces the multiple maps with a single one for both checks and services to make the code less fragile. This is also necessary since moving the local state into its own package creates circular dependencies for the tests. To avoid this the tests can no longer access internal data structures which they should not be doing in the first place. The tests still don't compile but this is a necessary step in that direction.	2017-10-23 08:03:18 +02:00
Frank Schroeder	d447e823c6	local state: move to separate package This patch moves the local state to a separate package to further decouple it from the agent code. The code compiles but the tests do not yet.	2017-10-23 08:03:18 +02:00

116 Commits