consul

Commit Graph

Author	SHA1	Message	Date
freddygv	3492f9e0d6	Finish cleanup from ServiceConfigRequest changes	2021-03-15 16:38:01 -06:00
Daniel Nephin	4d456922a9	state: use runCase pattern for large test The TestServiceHealthEventsFromChanges function was over 1400 lines. Attempting to debug test failures in test functions this large is difficult. It requires scrolling to the line which defines the testcase because the failure message only includes the line number of the assertion, not the line number of the test case. This is an excellent example of where test tables stop working well, and start being a problem. To mitigate this problem, the runCase pattern can be used. When one of these tests fails, a failure message will print the line number of both the test case and the assertion. This allows a developer to quickly jump to both of the relevant lines, signficanting reducing the time it takes to debug test failures. For example, one such failure could look like this: catalog_events_test.go:1610: case: service reg, new node catalog_events_test.go:1605: assertion failed: values are not equal	2021-03-15 17:53:16 -04:00
freddygv	7df846aa24	Pass MeshGateway config in service config request ResolveServiceConfig is called by service manager before the proxy registration is in the catalog. Therefore we should pass proxy registration flags in the request rather than trying to fetch them from the state store (where they may not exist yet).	2021-03-15 14:32:13 -06:00
freddygv	8b46d8dcbb	Restore old Envoy prefix on escape hatches This is done because after removing ID and NodeName from ServiceConfigRequest we will no longer know whether a request coming in is for a Consul client earlier than v1.10.	2021-03-15 14:12:57 -06:00
freddygv	08759e46ed	Add RPC endpoint for intention upstreams	2021-03-15 08:50:35 -06:00
freddygv	08737fa606	Add state store function for intention upstreams	2021-03-15 08:50:35 -06:00
freddygv	3722ce2fff	Refactor IntentionDecision This enables it to be called for many upstreams or downstreams of a service while only querying intentions once. Additionally, decisions are now optionally denied due to L7 permissions being present. This enables the function to be used to filter for potential upstreams/downstreams of a service.	2021-03-15 08:50:35 -06:00
Daniel Nephin	f40b76af2d	proxycfg: use rpcclient/health.Client instead of passing around cache name This should allow us to swap out the implementation with something other than `agent/cache` without making further code changes.	2021-03-12 11:46:04 -05:00
Daniel Nephin	566741a143	catalog_events: set the right key for connect snapshots	2021-03-12 11:35:43 -05:00
Daniel Nephin	1a764553c0	rpcclient: use streaming for connect health	2021-03-12 11:35:42 -05:00
Kyle Havlovitz	1e87c7183a	Merge pull request #9672 from hashicorp/ca-force-skip-xc connect/ca: Allow ForceWithoutCrossSigning for all providers	2021-03-11 11:49:15 -08:00
freddygv	6fd30d0384	Add TransparentProxy opt to proxy definition	2021-03-11 11:37:21 -07:00
freddygv	e3dc2a49df	Turn Limits and PassiveHealthChecks into pointers	2021-03-11 11:04:40 -07:00
freddygv	acec711a6a	Update server-side config resolution and client-side merging	2021-03-10 21:05:11 -07:00
Daniel Nephin	9d924a81a9	Merge pull request #9797 from hashicorp/dnephin/state-index-node-id state: convert nodes.ID to the new pattern of functional indexers	2021-03-10 17:34:23 -05:00
Daniel Nephin	b06b3dd8f8	state: move ConfigEntryKindName Previously this type was defined in structs, but unlike the other types in structs this type is not used by RPC requests. By moving it to state we can better indicate that this is not an API type, but part of the state implementation.	2021-03-10 12:27:22 -05:00
Daniel Nephin	948d1a317d	Merge pull request #9796 from hashicorp/dnephin/state-cleanup-catalog-index-oss state: remove duplicate tableCheck indexes	2021-03-10 12:20:09 -05:00
Daniel Nephin	71b0f0a7a6	structs: remove EnterpriseMeta.GetNamespace I added this recently without realizing that the method already existed and was named NamespaceOrEmpty. Replace all calls to GetNamespace with NamespaceOrEmpty or NamespaceOrDefault as appropriate.	2021-03-09 15:17:26 -05:00
Daniel Nephin	23421e190c	state: adjust compare for catalog events Document that this comparison should roughly match MatchesKey Only sort by overrideKey or service name, but not both Add namespace to the sort. The client side also builds a map of these based on the namespace/node/service key, so the only order that really matters is the ordering of register/dereigster events.	2021-03-09 14:00:36 -05:00
Daniel Nephin	68ec20f66a	state: handle terminating gateway events properly in snapshot Refactored out a function that can be used for both the snapshot and stream of events to translate an event into an appropriate connect event. Previously terminating gateway events would have used the wrong key in the snapshot, which would have caused them to be filtered out later on. Also removed an unused function, and some commented out code.	2021-03-09 14:00:35 -05:00
Kyle Havlovitz	db572aca59	Add remaining terminating gateway tests for namespaces Co-Authored-By: Daniel Nephin <dnephin@hashicorp.com>	2021-03-09 14:00:35 -05:00
Daniel Nephin	701285e470	Start to setup enterprise tests for terminating gateway streaming events. Co-Authored-By: Kyle Havlovitz <kylehav@gmail.com>	2021-03-09 14:00:35 -05:00
Daniel Nephin	ae368768e5	state: Add support for override of namespace in MatchesKey also tests for MatchesKey Co-Authored-By: Kyle Havlovitz <kylehav@gmail.com>	2021-03-09 14:00:35 -05:00
Daniel Nephin	4756ff059d	state: update calls to ensureConfigEntryTxn The EnterpriseMeta paramter was removed after this code was written, but before it merged. Also the table name constant has changed.	2021-03-09 14:00:35 -05:00
Daniel Nephin	30a575dd33	state: add 2 more test cases for terminate gateway streaming events Co-Authored-By: Kyle Havlovitz <kylehav@gmail.com>	2021-03-09 14:00:34 -05:00
Kyle Havlovitz	a21be5efa8	Added 6 new test cases for terminating gateway events Co-Authored-By: Daniel Nephin <dnephin@hashicorp.com>	2021-03-09 14:00:34 -05:00
Daniel Nephin	06b1c32e25	state: Add two more tests for connect events with terminating gateways And expand one test case to cover more. Co-Authored-By: Kyle Havlovitz <kylehav@gmail.com>	2021-03-09 14:00:34 -05:00
Daniel Nephin	eb58a39738	state: Include the override key in the sorting of events Co-Authored-By: Kyle Havlovitz <kylehav@gmail.com>	2021-03-09 14:00:34 -05:00
Kyle Havlovitz	c2481ca10f	state: Add terminating gateway events on updating a config entry Co-Authored-By: Daniel Nephin <dnephin@hashicorp.com>	2021-03-09 14:00:34 -05:00
Daniel Nephin	28de159c14	state: add first terminating catalog catalog event Health of a terminating gateway instance changes - Generate an event for creating/destroying this instance of the terminating gateway, duplicate it for each affected service Co-Authored-By: Kyle Havlovitz <kylehav@gmail.com>	2021-03-09 14:00:33 -05:00
Daniel Nephin	a4e68e32d6	state: convert nodes.ID to new functional pattern In preparation for adding other identifiers to the index.	2021-03-05 12:30:40 -05:00
Daniel Nephin	6b95e8dfe2	Merge pull request #9188 from hashicorp/dnephin/more-streaming-tests Add more streaming tests	2021-02-26 12:36:55 -05:00
Daniel Nephin	5c8a6311b6	Merge pull request #9703 from pierresouchay/streaming_tags_and_case_insensitive Streaming filter tags + case insensitive lookups for Service Names	2021-02-26 12:06:26 -05:00
Daniel Nephin	55add28725	catalog_events: set the right key for connect snapshots Add a test for catalog_event snapshot on connect topic	2021-02-25 14:30:39 -05:00
Daniel Nephin	432dd2d204	consul: Add integration tests of streaming. Restored from streaming-rpc-final branch. Co-authored-by: Paul Banks <banks@banksco.de>	2021-02-25 14:30:39 -05:00
Daniel Nephin	b7f8e3bad2	state: Add a test for ServiceHealthSnapshot	2021-02-25 14:08:10 -05:00
Daniel Nephin	340d714fd6	state: add a test case for memdb indexers	2021-02-19 17:14:46 -05:00
Daniel Nephin	cc5c88213a	state: support for functional indexers These new functional indexers provide a few advantages: 1. enterprise differences can be isolated to a single function (the indexer function), making code easier to change 2. as a consequence of (1) we no longer need to wrap all the calls to Txn operations, making code easier to read. 3. by removing reflection we should increase the performance of all operations. One important change is in making all the function signatures the same. https://blog.golang.org/errors-are-values An extra boolean return value for SingleIndexer.FromObject is superfluous. The error value can indicate when the index value could not be created. By removing this extra return value we can use the same signature for both indexer functions. This has the nice properly of a function being usable for both indexing operations.	2021-02-19 17:14:46 -05:00
Daniel Nephin	b861642910	state: remove duplicate index on the checks table By using a new pattern for more specific indexes. This allows us to use the same index for both service checks and node checks. It removes the abstraction around memdb.Txn operations, and isolates all of the enterprise differences in a single place (the indexer).	2021-02-19 17:14:46 -05:00
Daniel Nephin	519bb82a00	state: remove duplicate function catalogChecksForNodeService was a duplicate of catalogListServiceChecks	2021-02-19 17:14:46 -05:00
Daniel Nephin	d18e00194a	Merge pull request #9720 from hashicorp/dnephin/ent-meta-ergo-1 structs: rename EnterpriseMeta constructor	2021-02-16 15:31:58 -05:00
Daniel Nephin	363d738fd2	Merge pull request #9772 from hashicorp/streamin-fix-bad-cached-snapshot streaming: fix snapshot cache bug	2021-02-16 15:28:00 -05:00
Daniel Nephin	89383e2d98	Merge pull request #9728 from hashicorp/dnephin/state-index-table state: document how index table is used	2021-02-16 15:27:27 -05:00
Daniel Nephin	d1772ae305	structs: rename EnterpriseMeta constructor To match the Go convention.	2021-02-16 14:45:43 -05:00
Daniel Nephin	ba3a1b95e1	stream: fix a snapshot cache bug Previously a snapshot created as part of a resumse-stream request could have incorrectly cached the newSnapshotToFollow event. This would cause clients to error because they received an unexpected framing event.	2021-02-16 12:52:23 -05:00
Daniel Nephin	9b3c6da9df	stream: test the snapshot cache is saved correctly when the cache entry is created from resuming a stream.	2021-02-16 12:08:43 -05:00
R.B. Boyer	39e4ae25ac	connect: connect CA Roots in the primary datacenter should use a SigningKeyID derived from their local intermediate (#9428 ) This fixes an issue where leaf certificates issued in primary datacenters using Vault as a Connect CA would be reissued very frequently (every ~20 seconds) because the logic meant to detect root rotation was errantly triggering. The hash of the rootCA was being compared against a hash of the intermediateCA and always failing. This doesn't apply to the Consul built-in CA provider because there is no intermediate in use in the primary DC. This is reminiscent of #6513	2021-02-08 13:18:51 -06:00
Daniel Nephin	30332ffb43	state: Use the tableIndex constant	2021-02-05 18:37:45 -05:00
Daniel Nephin	3ecbeda234	state: Document index table And move the IndexEntry (which is stored in the table) next to the table schema definition.	2021-02-05 18:37:45 -05:00
Daniel Nephin	a4690ac7d9	Merge pull request #9719 from hashicorp/oss/state-store-4 state: remove registerSchema	2021-02-05 14:02:38 -05:00
Daniel Nephin	1c4e0cfa2a	Merge pull request #9718 from hashicorp/oss/dnephin/ent-meta-in-state-store-3 state: convert all table name constants to the new prefix pattern	2021-02-05 14:02:07 -05:00
Daniel Nephin	0814f22715	Merge pull request #9665 from hashicorp/dnephin/state-store-indexes-2 state: move config-entries table definition to config_entries_schema.go	2021-02-05 14:01:08 -05:00
Daniel Nephin	912dbb4cb4	Merge pull request #9664 from hashicorp/dnephin/state-store-indexes state: move ACL schema and index definitions to acl_schema.go	2021-02-05 13:38:31 -05:00
Daniel Nephin	05d5ec4804	state: remove the need for registerSchema registerSchema creates some indirection which is not necessary in this case. newDBSchema can call each of the tables. Enterprise tables can be added from the existing withEnterpriseSchema shim.	2021-02-05 12:19:56 -05:00
Daniel Nephin	2cbf8b5fd0	state: rename table name constants to use pattern the 'table' prefix is shorter, and also reads better in queries.	2021-02-05 12:12:19 -05:00
Daniel Nephin	8ac9d54ccc	state: rename connect constants	2021-02-05 12:12:19 -05:00
Daniel Nephin	0c34e474c5	state: rename table name constants to new pattern Using Apps Hungarian Notation for these constants makes the memdb queries more readable.	2021-02-05 12:12:18 -05:00
Pierre Souchay	7a024ed074	Streaming filter tags + case insensitive lookups for Service Names Will fix: * https://github.com/hashicorp/consul/issues/9695 * https://github.com/hashicorp/consul/issues/9702	2021-02-04 11:00:51 +01:00
Daniel Nephin	2d5b5afec1	state: Remove unnecessary entMeta arg to EnsureConfigEntry	2021-02-03 18:10:38 -05:00
Kyle Havlovitz	7dac583863	connect/ca: Allow ForceWithoutCrossSigning for all providers This allows setting ForceWithoutCrossSigning when reconfiguring the CA for any provider, in order to forcibly move to a new root in cases where the old provider isn't reachable or able to cross-sign for whatever reason.	2021-01-29 13:38:11 -08:00
Daniel Nephin	5b4703f0e4	state: rename config-entries table const to match new pattern	2021-01-28 20:34:34 -05:00
Daniel Nephin	cd06b5728c	state: move config-entries table to new pattern	2021-01-28 20:34:15 -05:00
Daniel Nephin	e8931b868c	state: use indexID this change was already made to enterprise, so backporting it.	2021-01-28 20:30:08 -05:00
Daniel Nephin	1cccdc45c2	state: Move ACL schema indexes to match Ent and use constants for table and index names.	2021-01-28 20:05:09 -05:00
Matt Keeler	f561462064	Upgrade raft-autopilot and wait for autopilot it to stop when revoking leadership (#9644 ) Fixes: 9626	2021-01-27 11:14:52 -05:00
Hans Hasselberg	444cdeb8fb	Add flags to support CA generation for Connect (#9585 )	2021-01-27 08:52:15 +01:00
R.B. Boyer	c608dc0d60	server: initialize mgw-wanfed to use local gateways more on startup (#9528 ) Fixes #9342	2021-01-25 17:30:38 -06:00
Daniel Nephin	48112e6298	Merge pull request #9420 from hashicorp/dnephin/reduce-duplicate-in-catalog-schema state: reduce interface for Enterprise schema	2021-01-25 17:04:25 -05:00
R.B. Boyer	51e3ca6cbb	server: use the presense of stored federation state data as a sign that we already activated the federation state feature flag (#9519 ) This way we only have to wait for the serf barrier to pass once before we can make use of federation state APIs Without this patch every restart needs to re-compute the change.	2021-01-25 13:24:32 -06:00
R.B. Boyer	9ef3f20127	server: when wan federating via mesh gateways only do heuristic primary DC bypass on the leader (#9366 ) Fixes #9341	2021-01-22 10:03:24 -06:00
Freddy	e50019b092	Update topology mapping Refs on all proxy instance deletions (#9589 ) * Insert new upstream/downstream mapping to persist new Refs * Avoid upserting mapping copy if it's a no-op * Add test with panic repro * Avoid deleting up/downstreams from inside memdb iterator * Avoid deleting gateway mappings from inside memdb iterator * Add CHANGELOG entry * Tweak changelog entry Co-authored-by: Paul Banks <banks@banksco.de>	2021-01-20 15:17:26 +00:00
Daniel Nephin	810424a61e	state: do not delete from inside an iteration Deleting from memdb inside an interation can cause a panic from Iterator.Next. This case is technically safe (for now) because the iterator is using the root radix tree not a modified one. However this could break at any time if someone adds an insert or delete to the coordinates table before this place in the function. It also sets a bad example, because generally deletes in an interator are not safe. So this commit uses the pattern we have in other places to move the deletes out of the iteration.	2021-01-19 17:00:07 -05:00
Matt Keeler	d9d4c492ab	Ensure that CA initialization does not block leader election. After fixing that bug I uncovered a couple more: Fix an issue where we might try to cross sign a cert when we never had a valid root. Fix a potential issue where reconfiguring the CA could cause either the Vault or AWS PCA CA providers to delete resources that are still required by the new incarnation of the CA.	2021-01-19 15:27:48 -05:00
Daniel Nephin	a6000e6ad8	state: add a regression test for state store schema To allow the index to be refactored without accidental changes. To update the expected value run: 'go test ./agent/consul/state -update'	2021-01-15 18:49:55 -05:00
Daniel Nephin	24312f8c96	state: reduce interface for Enterprise schema Using withEnterpriseSchema() we can apply any enterprise schema changes with a single shim, removing the need to duplicate all of the table definitions. Also move all the catalog schemas to a new file to shrink catalog.go a bit.	2021-01-15 18:49:55 -05:00
Daniel Nephin	e66af1a559	agent/consuk: Rename RPCRate -> RPCRateLimit so that the field name is consistent across config structs.	2021-01-14 17:26:00 -05:00
Daniel Nephin	5684223e36	agent/consul: make Client/Server config reloading more obvious I believe this commit also fixes a bug. Previously RPCMaxConnsPerClient was not being re-read from the RuntimeConfig, so passing it to Server.ReloadConfig was never changing the value. Also improve the test runtime by not doing a lot of unnecessary work.	2021-01-14 17:21:10 -05:00
Daniel Nephin	8fdc789ded	Merge pull request #9460 from hashicorp/dnephin/fix-data-races Fix a couple data races in tests	2021-01-14 17:07:01 -05:00
Chris Piraino	0712e03f33	Fix bug in usage metrics when multiple service instances are changed in a single transaction (#9440 ) * Fix bug in usage metrics that caused a negative count to occur There were a couple of instances were usage metrics would do the wrong thing and result in incorrect counts, causing the count to attempt to decrement below zero and return an error. The usage metrics did not account for various places where a single transaction could delete/update/add multiple service instances at once. We also remove the error when attempting to decrement below zero, and instead just make sure we do not accidentally underflow the unsigned integer. This is a more graceful failure than returning an error and not allowing a transaction to commit. * Add changelog	2021-01-12 15:31:47 -06:00
Chris Piraino	aabdccdfa0	Log replication warnings when no error suppression is defined (#9320 ) * Log replication warnings when no error suppression is defined * Add changelog file	2021-01-08 14:03:06 -06:00
Daniel Nephin	d113f0e690	structs: Fix printing of IDs These types are used as values (not pointers) in other structs. Using a pointer receiver causes problems when the value is printed. fmt will not call the String method if it is passed a value and the String method has a pointer receiver. By using a value receiver the correct string is printed. Also remove some unused methods.	2021-01-07 18:47:38 -05:00
Daniel Nephin	d64425d2e4	Merge pull request #9213 from hashicorp/dnephin/resolve-tokens-take-2 acl: Remove some unused things and document delegate method	2021-01-06 18:51:51 -05:00
R.B. Boyer	19baf4bc25	acl: use the presence of a management policy in the state store as a sign that we already migrated to v2 acls (#9505 ) This way we only have to wait for the serf barrier to pass once before we can upgrade to v2 acls. Without this patch every restart needs to re-compute the change, and potentially if a stray older node joins after a migration it might regress back to v1 mode which would be problematic.	2021-01-05 17:04:27 -06:00
Matt Keeler	85e5da53d5	Special case the error returned when we have a Raft leader but are not tracking it in the ServerLookup (#9487 ) This can happen when one other node in the cluster such as a client is unable to communicate with the leader server and sees it as failed. When that happens its failing status eventually gets propagated to the other servers in the cluster and eventually this can result in RPCs returning “No cluster leader” error. That error is misleading and unhelpful for determing the root cause of the issue as its not raft stability but rather and client -> server networking issue. Therefore this commit will add a new error that will be returned in that case to differentiate between the two cases.	2021-01-04 14:05:23 -05:00
R.B. Boyer	d5d62d9e08	server: deletions of intentions by name using the intention API is now idempotent (#9278 ) Restoring a behavior inadvertently changed while fixing #9254	2021-01-04 11:27:00 -06:00
Daniel Nephin	71b82a7e5b	Maybe fix another data race in a test	2020-12-22 18:53:54 -05:00
Daniel Nephin	bae9125fc1	Fix one race caused by t.Parallel	2020-12-22 18:27:18 -05:00
Daniel Nephin	cb3dbc92f9	Merge pull request #9340 from hashicorp/dnephin/skip-slow-tests-with-short testing: skip slow tests with -short	2020-12-11 13:33:44 -05:00
R.B. Boyer	d921690bfe	acl: global tokens created by auth methods now correctly replicate to secondary datacenters (#9351 ) Previously the tokens would fail to insert into the secondary's state store because the AuthMethod field of the ACLToken did not point to a known auth method from the primary.	2020-12-09 15:22:29 -06:00
Daniel Nephin	b9e60c0775	testing: skip slow tests with -short Add a skip condition to all tests slower than 100ms. This change was made using `gotestsum tool slowest` with data from the last 3 CI runs of master. See https://github.com/gotestyourself/gotestsum#finding-and-skipping-slow-tests With this change: ``` $ time go test -count=1 -short ./agent ok github.com/hashicorp/consul/agent 0.743s real 0m4.791s $ time go test -count=1 -short ./agent/consul ok github.com/hashicorp/consul/agent/consul 4.229s real 0m8.769s ```	2020-12-07 13:42:55 -05:00
Kyle Havlovitz	88d669c0e0	connect: Fix a case where the active root would get unset even when there wasn't a new one	2020-12-02 11:42:23 -08:00
Kyle Havlovitz	c4eff420be	Merge pull request #9009 from hashicorp/update-secondary-ca connect: Fix an issue with updating CA config in a secondary datacenter	2020-11-30 14:49:28 -08:00
Kyle Havlovitz	781cae5809	Use a buffered channel for CA intermediate renew func	2020-11-30 14:37:24 -08:00
R.B. Boyer	d2d1b05a4e	server: fix panic when deleting a non existent intention (#9254 ) * server: fix panic when deleting a non existent intention * add changelog * Always return an error when deleting non-existent ixn Co-authored-by: freddygv <gh@freddygv.xyz>	2020-11-24 13:44:20 -05:00
Hans Hasselberg	57701695c3	add missing descriptions for metrics	2020-11-23 22:06:30 +01:00
Kit Patella	fcec25de40	add entries for missing fsm operations and mark duplicated metrics prefixes as deprecated	2020-11-23 12:42:51 -08:00
Kyle Havlovitz	13c31ccfce	Clean up the logic in persistNewRootAndConfig	2020-11-20 15:54:44 -08:00
Kyle Havlovitz	0bfda4481f	Add CA server delegate interface for testing	2020-11-19 20:08:06 -08:00
Kit Patella	5c09dc322e	add telemetry and definition help entries for missing catalog and acl metrics	2020-11-19 13:29:44 -08:00
Kit Patella	9e54e897d7	remove stale entries and rename/define acl.resolveToken	2020-11-19 13:06:28 -08:00
Freddy	fd5928fa4e	Require operator:write to get Connect CA config (#9240 ) A vulnerability was identified in Consul and Consul Enterprise (“Consul”) such that operators with `operator:read` ACL permissions are able to read the Consul Connect CA configuration when explicitly configured with the `/v1/connect/ca/configuration` endpoint, including the private key. This allows the user to effectively privilege escalate by enabling the ability to mint certificates for any Consul Connect services. This would potentially allow them to masquerade (receive/send traffic) as any service in the mesh. -- This PR increases the permissions required to read the Connect CA's private key when it was configured via the `/connect/ca/configuration` endpoint. They are now `operator:write`.	2020-11-19 10:14:48 -07:00
Kyle Havlovitz	9be7c6401c	connect: update some function comments in CA manager	2020-11-17 16:00:19 -08:00
Daniel Nephin	3885835e8c	acl: remove a test-only method	2020-11-17 18:16:34 -05:00
Daniel Nephin	0ee86935f0	Remove two unused delegate methods	2020-11-17 18:16:26 -05:00
Matt Keeler	66fd23d67f	Refactor to call non-voting servers read replicas (#9191 ) Co-authored-by: Kit Patella <kit@jepsen.io>	2020-11-17 10:53:57 -05:00
Kit Patella	d15b6fddd3	Merge pull request #9198 from hashicorp/mkcp/telemetry/add-all-metric-definitions Add metric definitions for all metrics known at Consul start	2020-11-16 15:54:50 -08:00
Matt Keeler	748d56b8ab	Prevent panic if autopilot health is requested prior to leader establishment finishing. (#9204 )	2020-11-16 17:08:17 -05:00
Daniel Nephin	b7367467f6	Merge pull request #9114 from hashicorp/dnephin/filtering-in-stream stream: improve naming of Payload methods	2020-11-16 14:20:07 -05:00
Kit Patella	15af5ead0b	trim help strings to save a few bytes	2020-11-16 11:02:11 -08:00
Kit Patella	3966ecb02f	merge master	2020-11-16 10:46:53 -08:00
Kit Patella	5da2f1efa8	finish adding static server metrics	2020-11-13 16:26:08 -08:00
Kyle Havlovitz	16e95f1d7b	Reorganize some CA manager code for correctness/readability	2020-11-13 14:46:01 -08:00
Kyle Havlovitz	6fba82a4fa	connect: Add CAManager for synchronizing CA operations	2020-11-13 14:33:44 -08:00
Kyle Havlovitz	af34b26221	connect: Add logic for updating secondary DC intermediate on config set	2020-11-13 14:33:44 -08:00
R.B. Boyer	9eb262252a	server: intentions CRUD requires connect to be enabled (#9194 ) Fixes #9123	2020-11-13 16:19:12 -06:00
Kit Patella	06d59c03b9	add the service name in the agent rather than in the definitions themselves	2020-11-13 13:18:04 -08:00
R.B. Boyer	c7233ba871	server: remove config entry CAS in legacy intention API bridge code (#9151 ) Change so line-item intention edits via the API are handled via the state store instead of via CAS operations. Fixes #9143	2020-11-13 14:42:21 -06:00
R.B. Boyer	c52bc632df	server: skip deleted and deleting namespaces when migrating intentions to config entries (#9186 )	2020-11-13 13:56:41 -06:00
Mike Morris	7af643ac37	ci: update to Go 1.15.4 and alpine:3.12 (#9036 ) * ci: stop building darwin/386 binaries Go 1.15 drops support for 32-bit binaries on Darwin https://golang.org/doc/go1.15#darwin * tls: ConnectionState::NegotiatedProtocolIsMutual is deprecated in Go 1.15, this value is always true * correct error messages that changed slightly * Completely regenerate some TLS test data Co-authored-by: R.B. Boyer <rb@hashicorp.com>	2020-11-13 13:02:59 -05:00
R.B. Boyer	c003871c54	server: break up Intention.Apply monolithic method (#9007 ) The Intention.Apply RPC is quite large, so this PR attempts to break it down into smaller functions and dissolves the pre-config-entry approach to the breakdown as it only confused things.	2020-11-13 09:15:39 -06:00
Kit Patella	24a2471029	first pass on agent-configured prometheusDefs and adding defs for every consul metric	2020-11-12 18:12:12 -08:00
R.B. Boyer	61eac21f1a	agent: return the default ACL policy to callers as a header (#9101 ) Header is: X-Consul-Default-ACL-Policy=<allow\|deny> This is of particular utility when fetching matching intentions, as the fallthrough for a request that doesn't match any intentions is to enforce using the default acl policy.	2020-11-12 10:38:32 -06:00
Matt Keeler	71da0209bf	Add a paramter in state store methods to indicate whether a resource insertion is from a snapshot restoration (#9156 ) The Catalog, Config Entry, KV and Session resources potentially re-validate the input as its coming in. We need to prevent snapshot restoration failures due to missing namespaces or namespaces that are being deleted in enterprise.	2020-11-11 11:21:42 -05:00
Matt Keeler	a3a653342b	Fix a bunch of linter warnings	2020-11-09 09:22:12 -05:00
Matt Keeler	c048e86bb2	Switch to using the external autopilot module	2020-11-09 09:22:11 -05:00
Daniel Nephin	fb70c8bac2	stream: document that Payload must be immutable If they are sent to EventPublisher.Publish. Also document that PayloadEvents is expected to come from a subscription and that it is not immutable.	2020-11-06 13:00:33 -05:00
Daniel Nephin	43af0ba7a3	stream: rename FilterByKey	2020-11-05 19:21:16 -05:00
Daniel Nephin	868cfe1eac	stream: Add HasReadPermission to Payload Required now that filter is a method on PayloadEvents instead of Event	2020-11-05 19:17:18 -05:00
Daniel Nephin	36202f7938	stream: move event filtering to PayloadEvents Removes the weirdness around PayloadEvents.FilterByKey	2020-11-05 17:50:17 -05:00
Daniel Nephin	79b5ca1ce6	stream: Remove unused method	2020-11-05 16:49:59 -05:00
Daniel Nephin	a33c50ef0d	Merge pull request #9073 from hashicorp/dnephin/backport-streaming-namespaces streaming: backport namespace changes	2020-11-05 14:19:10 -05:00
Daniel Nephin	c82f6ef2d8	Merge pull request #9061 from hashicorp/dnephin/event-fields stream: support filtering by namespace	2020-11-05 14:18:35 -05:00
Daniel Nephin	b95b14e168	state: test EventPayloadCheckServiceNode.FilterByKey Also fix a bug in that function when only one of key or namespace were the empty string.	2020-10-30 14:35:57 -04:00
Daniel Nephin	56d6079da3	stream: Add tests for filterByKey with namespace And fix a bug where a request with a Namespace but no Key would not be properly filtered	2020-10-30 14:35:42 -04:00
Daniel Nephin	2c00045161	stream: Move FilterByKey events to a table In preparation for adding new tests.	2020-10-30 14:35:28 -04:00
Daniel Nephin	43c5803a25	state: use enterprise meta for creating events	2020-10-30 14:34:04 -04:00
Daniel Nephin	0ad2406d7c	stream: include the namespace in the snap cache key Otherwise the wrong snapshot could be returned when the same key is used in different namespaces	2020-10-30 14:34:04 -04:00
Daniel Nephin	c42fe5ae43	subscribe: set the request namespace	2020-10-30 14:34:04 -04:00
R.B. Boyer	fa4b0854fb	state: ensure we unblock intentions queries upon the upgrade to config entries (#9062 ) 1. do a state store query to list intentions as the agent would do over in `agent/proxycfg` backing `agent/xds` 2. upgrade the database and do a fresh `service-intentions` config entry write 3. the blocking query inside of the agent cache in (1) doesn't notice (2)	2020-10-29 15:28:31 -05:00
R.B. Boyer	b24b4169e1	restore prior signature of test helper so enterprise compiles	2020-10-29 13:52:15 -05:00
Daniel Nephin	a5dd2001cf	stream: remove Event.Key Makes Payload a type with FilterByKey so that Payloads can implement filtering by key. With this approach we don't need to expose a Namespace field on Event, and we don't need to invest micro formats or require a bunch of code to be aware of exactly how the key field is encoded.	2020-10-28 16:48:04 -04:00
Daniel Nephin	1c094da40d	state: use go-cmp for comparison The output of the previous assertions made it impossible to debug the tests without code changes. With go-cmp comparing the entire slice we can see the full diffs making it easier to debug failures.	2020-10-28 16:33:00 -04:00
Daniel Nephin	3dfb7c224b	stream: Use a no-op event publisher if streaming is disabled	2020-10-28 13:54:19 -04:00
Daniel Nephin	23eee604c9	store: use a ReadDB for snapshots to remove the cyclic dependency between the snapshot handlers and the state.Store	2020-10-28 13:07:42 -04:00
Daniel Nephin	7b9ee25956	Merge pull request #9026 from hashicorp/dnephin/streaming-without-cache-query-param streaming: rename config and remove requirement for cache=1	2020-10-28 12:33:25 -04:00
Daniel Nephin	477d665309	Merge pull request #8618 from hashicorp/dnephin/remove-txn-readtxn state: Use ReadTxn everywhere	2020-10-28 12:32:47 -04:00
Daniel Nephin	c398a6b272	state: disable streaming connect topic	2020-10-26 11:49:47 -04:00
R.B. Boyer	58387fef0a	server: config entry replication now correctly uses namespaces in comparisons (#9024 ) Previously config entries sharing a kind & name but in different namespaces could occasionally cause "stuck states" in replication because the namespace fields were ignored during the differential comparison phase. Example: Two config entries written to the primary: kind=A,name=web,namespace=bar kind=A,name=web,namespace=foo Under the covers these both get saved to memdb, so they are sorted by all 3 components (kind,name,namespace) during natural iteration. This means that before the replication code does it's own incomplete sort, the underlying data IS sorted by namespace ascending (bar comes before foo). After one pass of replication the primary and secondary datacenters have the same set of config entries present. If "kind=A,name=web,namespace=bar" were to be deleted, then things get weird. Before replication the two sides look like: primary: [ kind=A,name=web,namespace=foo ] secondary: [ kind=A,name=web,namespace=bar kind=A,name=web,namespace=foo ] The differential comparison phase walks these two lists in sorted order and first compares "kind=A,name=web,namespace=foo" vs "kind=A,name=web,namespace=bar" and falsely determines they are the SAME and are thus cause an update of "kind=A,name=web,namespace=foo". Then it compares "<nothing>" with "kind=A,name=web,namespace=foo" and falsely determines that the latter should be DELETED. During reconciliation the deletes are processed before updates, and so for a brief moment in the secondary "kind=A,name=web,namespace=foo" is erroneously deleted and then immediately restored. Unfortunately after this replication phase the final state is identical to the initial state, so when it loops around again (rate limited) it repeats the same set of operations indefinitely.	2020-10-23 13:41:54 -05:00
Daniel Nephin	0f1fb24d19	state: convert the remaining functions to ReadTxn Required also converting some of the transaction functions to WriteTxn because TxnRO() called the same helper as TxnRW. This change allows us to return a memdb.Txn for read-only txn instead of wrapping them with state.txn.	2020-10-23 14:29:22 -04:00
Daniel Nephin	8bd1a2cd16	Merge pull request #8975 from hashicorp/dnephin/stream-close-on-unsub stream: close the subscription on Unsubscribe	2020-10-23 12:58:12 -04:00

1 2 3 4 5 ...

1279 Commits