consul

mirror of https://github.com/status-im/consul.git synced 2025-02-20 09:28:34 +00:00

Author	SHA1	Message	Date
Dan Stough	14efb28086	fix(v2dns): add node ttl to workloads, comment cleanup, and changelog (#20643 ) * fix(v2dns): add node ttl to workloads, plus comment cleanup * docs(v2dns): changelog	2024-02-14 17:38:11 -05:00
Derek Menteer	9f7626d501	Ensure all topics are refreshed on FSM restore and add supervisor loop to v1 controller subscriptions (#20642 ) Ensure all topics are refreshed on FSM restore and add supervisor loop to v1 controller subscriptions This PR fixes two issues: 1. Not all streams were force closed whenever a snapshot restore happened. This means that anything consuming data from the stream (controllers, queries, etc) were unaware that the data they have is potentially stale / invalid. This first part ensures that all topics are purged. 2. The v1 controllers did not properly handle stream errors (which are likely to appear much more often due to 1 above) and so it introduces a supervisor thread to restart the watches when these errors occur.	2024-02-14 14:17:55 -06:00
Dan Stough	137c9c0973	[CE] Misc cleanup for V2 DNS (#20640 ) * chore: gitignore zed editor * chore(v2dns): remove ent/ce split from router * fix(v2dns): v2 workloads now have tenancy in output * feat(v2dns): support 'cluster' label * chore(v2dns): less chatty debug logs	2024-02-14 12:40:38 -05:00
Melissa Kam	64cd172f30	[CC-7411] Fix environment variable precedence when linking to HCP (#20527 ) Fix so that link API values are used over env vars When a link is created via the API, those values should take precedence over the values set by environment variables. This change loads all the env vars initially as part of the config builder rather than on demand.	2024-02-13 14:06:18 -06:00
Michael Zalimeni	2c1addfd64	[NET-7015] DNS v2 + Catalog v2 int test (#20607 ) test(v2dns): Add Catalog v2 integration test Add a basic integration test covering major functionality tested against Catalog v2 resources. This complements existing tests that ensure compatibility between v1 and v2 DNS when testing against Catalog v1 resources.	2024-02-13 17:40:08 +00:00
Dan Stough	0f0b080514	[CE] feat(v2dns): add v2 style query metrics (#20608 ) feat(v2dns): add v2 style query metrics	2024-02-13 12:08:01 -05:00
Semir Patel	b716a9ef6b	resource: reconcile managed types every ~8hrs (#20606 )	2024-02-13 10:51:54 -06:00
John Murret	7e8f2e5f08	NET-7644/NET-7634 - Implement query lookup for tagged addresses on nodes and services including WAN translation. (#20583 ) NET-7644 - Implement tagged addresses and wan translation	2024-02-12 14:27:25 -05:00
Dan Stough	5802080db1	feat(v2dns): enable peering queries (#20581 )	2024-02-12 14:25:45 -05:00
Nick Cellino	5fb6ab6a3a	Move HCP Manager lifecycle management out of Link controller (#20401 ) * Add function to get update channel for watching HCP Link * Add MonitorHCPLink function This function can be called in a goroutine to manage the lifecycle of the HCP manager. * Update HCP Manager config in link monitor before starting This updates HCPMonitorLink so it updates the HCP manager with an HCP client and management token when a Link is upserted. * Let MonitorHCPManager handle lifecycle instead of link controller * Remove cleanup from Link controller and move it to MonitorHCPLink Previously, the Link Controller was responsible for cleaning up the HCP-related files on the file system. This change makes it so MonitorHCPLink handles this cleanup. As a result, we are able to remove the PlacementEachServer placement strategy for the Link controller because it no longer needs to do this per-node cleanup. * Remove HCP Manager dependency from Link Controller The Link controller does not need to have HCP Manager as a dependency anymore, so this removes that dependency in order to simplify the design. * Add Linked prefix to Linked status variables This is in preparation for adding a new status type to the Link resource. * Add new "validated" status type to link resource The link resource controller will now set a "validated" status in addition to the "linked" status. This is needed so that other components (eg the HCP manager) know when the Link is ready to link with HCP. * Fix tests * Handle new 'EndOfSnapshot' WatchList event * Fix watch test * Remove unnecessary config from TestAgent_scadaProvider Since the Scada provider is now started on agent startup regardless of whether a cloud config is provided, this removes the cloud config override from the relevant test. This change is not exactly related to the changes from this PR, but rather is something small and sort of related that was noticed while working on this PR. 
* Simplify link watch test and remove sleep from link watch This updates the link watch test so that it uses more mocks and does not require setting up the infrastructure for the HCP Link controller. This also removes the time.Sleep delay in the link watcher loop in favor of an error counter. When we receive 10 consecutive errors, we shut down the link watcher loop. * Add better logging for link validation. Remove EndOfSnapshot test. * Refactor link monitor test into a table test * Add some clarifying comments to link monitor * Simplify link watch test * Test a bunch more errors cases in link monitor test * Use exponential backoff instead of errorCounter in LinkWatch * Move link watch and link monitor into a single goroutine called from server.go * Refactor HCP link watcher to use single go-routine. Previously, if the WatchClient errored, we would've never recovered because we never retry to create the stream. With this change, we have a single goroutine that runs for the life of the server agent and if the WatchClient stream ever errors, we retry the creation of the stream with an exponential backoff.	2024-02-12 10:48:23 -05:00
John Murret	c8e4cea69c	set up ent and CE specific DNS tests to be able to run v1 and v2 (#20571 )	2024-02-09 15:53:56 -07:00
Dan Stough	01001f630e	feat(v2dns): catalog v2 service query support (#20564 )	2024-02-09 17:41:40 -05:00
Dan Stough	24e15cc24e	feat(v2dns): prepared query ttls (#20563 )	2024-02-09 11:26:02 -05:00
John Murret	7cac918811	NET-7637 / NET-7659/NET-7636/NET-7647/NET-7648/NET-7646/NET-7649/NET-7645 - Multiple DNS v2 fixes (#20556 )	2024-02-08 19:56:04 -07:00
Derek Menteer	a1c8d4dd19	Decouple xds capacity controller and raft-autopilot (#20511 ) Decouple xds capacity controller and autopilot This prevents a potential bug where autopilot deadlocks while attempting to execute `AutopilotDelegate.NotifyState()` on an xdscapacity controller that stopped consuming messages.	2024-02-08 15:31:44 -06:00
Chris S. Kim	26661a1c3b	Add default intention policy (#20544 )	2024-02-08 20:25:42 +00:00
Joshua Timmons	242b777547	Fix logging when we fail to export metrics to hcp (#20514 )	2024-02-08 11:00:47 -05:00
Joshua Timmons	c790740cc6	Fix: avoid redundant logs on failures to export metrics (#20519 )	2024-02-08 11:00:20 -05:00
John Murret	8ac54707d6	DNS v2 Multiple fixes. (#20525 ) * DNS v2 Multiple fixes. * add license header * get rid of DefaultIntentionPolicy change that was not supposed to be there.	2024-02-07 21:24:00 -07:00
Nathan Coleman	45d645471b	[NET-7414] Reconcile PST for mesh gateway workloads on change to ComputedExportedServices (#20271 ) * Reconcile ProxyStateTemplate on change to ComputedExportedServices * gofmt changeset --------- Co-authored-by: NiniOak <anita.akaeze@hashicorp.com>	2024-02-07 21:27:13 +00:00
skpratt	57bad0df85	add traffic permissions excludes and tests (#20453 ) * add traffic permissions tests * review fixes * Update internal/mesh/internal/controllers/sidecarproxy/builder/local_app.go Co-authored-by: John Landa <jonathanlanda@gmail.com> --------- Co-authored-by: John Landa <jonathanlanda@gmail.com>	2024-02-07 20:21:44 +00:00
Eric Haberkorn	1bd253021b	V1 Compat Exported Services Controller Optimizations (#20517 ) V1 compat exported services controller optimizations * Don't start the v2 exported services controller in v1 mode. * Use the controller cache.	2024-02-07 14:05:42 -05:00
Matt Keeler	49e6c0232d	Panic for unregistered types (#20476 ) * Panic when controllers attempt to make invalid requests to the resource service This will help to catch bugs in tests that could cause infinite errors to be emitted. * Disable the API GW v2 controller With the previous commit, this would cause a server to panic due to watching a type which has not yet been created/registered. * Ensure that a test server gets the full type registry instead of constructing its own * Skip TestServer_ControllerDependencies * Fix peering tests so that they use the full resource registry.	2024-02-06 11:23:06 -05:00
Dan Stough	fcc43a9a36	feat(v2dns): catalog v2 SOA and NS support (#20480 )	2024-02-06 11:12:04 -05:00
John Murret	3bf999e46b	NET-7631 - Fix Node records that point to external/ non-IP addresses (#20491 ) * NET-7630 - Fix TXT record creation on node queries * NET-7631 - Fix Node records that point to external/ non-IP addresses * NET-7630 - Fix TXT record creation on node queries	2024-02-06 15:16:02 +00:00
John Murret	7d4deda640	NET-7630 - Fix TXT record creation on node queries (#20483 )	2024-02-06 09:53:39 -05:00
Ashesh Vidyut	cffb5d7c6e	Fix audit-log encoding issue (CC-7337) (#20345 ) * add changes * added changelog * change update * CE changes * Removed gzip size fix * fix changelog * Update .changelog/20345.txt Co-authored-by: Hans Hasselberg <hans@hashicorp.com> * Adding comments --------- Co-authored-by: Abhishek Sahu <abhishek.sahu@hashicorp.com> Co-authored-by: Hans Hasselberg <hans@hashicorp.com> Co-authored-by: srahul3 <rahulsharma@hashicorp.com>	2024-02-06 16:40:07 +05:30
Tauhid Anjum	88b8a1cc36	NET-6776 - Update Routes controller to use ComputedFailoverPolicy CE (#20496 ) Update Routes controller to use ComputedFailoverPolicy	2024-02-06 13:28:18 +05:30
Derek Menteer	922844b8e0	Fix issue with persisting proxy-defaults (#20481 ) Fix issue with persisting proxy-defaults This resolves an issue introduced in hashicorp/consul#19829 where the proxy-defaults configuration entry with an HTTP protocol cannot be updated after it has been persisted once and a router exists. This occurs because the protocol field is not properly pre-computed before being passed into validation functions.	2024-02-05 16:00:19 -06:00
John Murret	0d434dafac	Do not parallelize DNS tests because they consume too many ports (#20482 )	2024-02-05 14:54:05 -07:00
John Murret	602e3c4fd5	DNS V2 - Revise discovery result to have service and node name and address fields. (#20468 ) * DNS V2 - Revise discovery result to have service and node name and address fields. * NET-7488 - dns v2 add support for prepared queries in catalog v1 data model (#20470) NET-7488 - dns v2 add support for prepared queries in catalog v1 data model.	2024-02-03 03:23:52 +00:00
Dan Stough	9602b43183	feat(v2dns): catalog v2 workload query support (#20466 )	2024-02-02 18:29:38 -05:00
R.B. Boyer	c029b20615	v2: ensure the controller caches are fully populated before first use (#20421 ) The new controller caches are initialized before the DependencyMappers or the Reconciler run, but importantly they are not populated. The expectation is that when the WatchList call is made to the resource service it will send an initial snapshot of all resources matching a single type, and then perpetually send UPSERT/DELETE events afterward. This initial snapshot will cycle through the caching layer and will catch it up to reflect the stored data. Critically the dependency mappers and reconcilers will race against the restoration of the caches on server startup or leader election. During this time it is possible a mapper or reconciler will use the cache to look up a specific relationship and not find it. That very same reconciler may choose to then recompute some persisted resource and in effect rewind it to a prior computed state. Change - Since we are updating the behavior of the WatchList RPC, it was aligned to match that of pbsubscribe and pbpeerstream using a protobuf oneof instead of the enum+fields option. - The WatchList rpc now has 3 alternating response events: Upsert, Delete, EndOfSnapshot. The initial batch of "snapshot" Upserts sent on a new watch will be followed by an EndOfSnapshot event before beginning the never-ending sequence of Upsert/Delete events. - Within the Controller startup code we will launch N+1 goroutines to execute WatchList queries for the watched types. The UPSERTs will be applied to the nascent cache only (no mappers will execute). - Upon witnessing the END operation, those goroutines will terminate. - When all cache priming routines complete, then the normal set of N+1 long lived watch routines will launch to officially witness all events in the system using the primed cache.	2024-02-02 15:11:05 -06:00
wangxinyi7	fb2b696c0e	missing prefix / (#20447 ) * missing prefix / and fix typos	2024-02-02 12:48:45 -08:00
Eric Haberkorn	543c6a30af	Trigger the V1 Compat exported-services Controller when V1 Config Entries are Updated (#20456 ) * Trigger the v1 compat exported-services controller when the v1 config entry is modified. * Hook up exported-services config entries to the event publisher. * Add tests to the v2 exported services shim. * Use the local materializer trigger updates on the v1 compat exported services controller when exported-services config entries are modified. * stop sleeping when context is cancelled	2024-02-02 15:30:04 -05:00
Eric Haberkorn	d0243b618d	Change the multicluster group to v2 (#20430 )	2024-02-01 12:08:26 -05:00
Chris S. Kim	b6f10bc58f	Skip filter chain created by permissive mtls (#20406 )	2024-01-31 16:39:12 -05:00
wangxinyi7	3b44be530d	only forwarding the resource service traffic in client agent to server agent (#20347 ) * only forwarding the resource service traffic in client agent to server agent	2024-01-31 12:05:47 -08:00
Nick Ethier	383d92e9ab	hcp.v2.TelemetryState resource and controller implementation (#20257 ) * pbhcp: add TelemetryState resource * agent/hcp: add GetObservabilitySecrets to client * internal/hcp: add TelemetryState controller logic * hcp/telemetry-state: added config options for hcp sdk and debug key to skip deletion during reconcile * pbhcp: update proto documentation * hcp: address PR feedback, additional validations and code cleanup * internal/hcp: fix type sig change in test * update testdata/v2-resource-dependencies	2024-01-31 14:47:05 -05:00
Derek Menteer	3e8ec8d18e	Fix SAN matching on terminating gateways (#20417 ) Fixes issue: hashicorp/consul#20360 A regression was introduced in hashicorp/consul#19954 where the SAN validation matching was reduced from 4 potential types down to just the URI. Terminating gateways will need to match on many fields depending on user configuration, since they make egress calls outside of the cluster. Having more than one matcher behaves like an OR operation, where any match is sufficient to pass the certificate validation. To maintain backwards compatibility with the old untyped `match_subject_alt_names` Envoy behavior, we should match on all 4 enum types. https://www.envoyproxy.io/docs/envoy/latest/api-v3/extensions/transport_sockets/tls/v3/common.proto#enum-extensions-transport-sockets-tls-v3-subjectaltnamematcher-santype	2024-01-31 12:17:45 -06:00
John Murret	c82b78b088	NET-7165 - fix address and target setting (#20403 )	2024-01-30 15:34:35 -07:00
Ronald	8799c36410	[NET-6231] Handle Partition traffic permissions when reconciling traffic permissions (#20408 ) [NET-6231] Partition traffic permissions Co-authored-by: Chris S. Kim <ckim@hashicorp.com>	2024-01-30 22:14:32 +00:00
Chris S. Kim	7cc88a1577	Handle NamespaceTrafficPermissions when reconciling TrafficPermissions (#20407 )	2024-01-30 21:31:25 +00:00
Melissa Kam	b0e87dbe13	[CC-7049] Stop the HCP manager when link is deleted (#20351 ) * Add Stop method to telemetry provider Stop the main loop of the provider and set the config to disabled. * Add interface for telemetry provider Added for easier testing. Also renamed Run to Start, which better fits with Stop. * Add Stop method to HCP manager * Add manager interface, rename implementation Add interface for easier testing, rename existing Manager to HCPManager. * Stop HCP manager in link Finalizer * Attempt to cleanup if resource has been deleted The link should be cleaned up by the finalizer, but there's an edge case in a multi-server setup where the link is fully deleted on one server before the other server reconciles. This will cover the case where the reconcile happens after the resource is deleted. * Add a delete management token function Passes a function to the HCP manager that deletes the management token that was initially created by the manager. * Delete token as part of stopping the manager * Lock around disabling config, remove descriptions	2024-01-30 09:40:36 -06:00
John Murret	7c6a3c83f2	NET-7165 - v2 - add service questions (#20390 ) * NET-7165 - v2 - add service questions * removing extraneous copied over code from autogen PR script. * fixing license checking	2024-01-29 22:33:45 +00:00
Melissa Kam	3b9bb8d6f9	[CC-7044] Start HCP manager as part of link creation (#20312 ) * Check for ACL write permissions on write Link eventually will be creating a token, so require acl:write. * Convert Run to Start, only allow to start once * Always initialize HCP components at startup * Support for updating config and client * Pass HCP manager to controller * Start HCP manager in link resource Start as part of link creation rather than always starting. Update the HCP manager with values from the link before starting as well. * Fix metrics sink leaked goroutine * Remove the hardcoded disabled hostname prefix The HCP metrics sink will always be enabled, so the length of sinks will always be greater than zero. This also means that we will also always default to prefixing metrics with the hostname, which is what our documentation states is the expected behavior anyway. * Add changelog * Check and set running status in one method * Check for primary datacenter, add back test * Clarify merge reasoning, fix timing issue in test * Add comment about controller placement * Expand on breaking change, fix typo in changelog	2024-01-29 16:31:44 -06:00
Matt Keeler	34a32d4ce5	Remove V2 PeerName field from pbresource.Tenancy (#19865 ) The peer name will eventually show up elsewhere in the resource. For now though this rips it out of where we don’t want it to be.	2024-01-29 15:08:31 -05:00
Dan Stough	0ca7313b07	feat(v2dns): add PTR query support (#20362 )	2024-01-29 11:40:10 -05:00
Tyler Wendlandt	7e08d8988c	NET-5398: Update UI server to include if v2 is enabled (#20353 ) * Update ui server to include V2 Catalog flag * Fix typo	2024-01-26 14:38:51 -07:00
Nitya Dhanushkodi	0ec7bddb9a	[Net-5594][Net-7466] v2: Only route to endpoints that implement the port being routed to, and make xdscontroller and xdsv2 golden tests use tenancy (#20356 ) * If a workload does not implement a port, it should not be included in the list of endpoints for the Envoy cluster for that port. * Adds tenancy tests for xds controller and xdsv2 resource generation, and adds all those files. * The original change in this PR was for filtering the list of endpoints by the port being routed to (bullet 1). Since I made changes to sidecarproxycontroller golden files, I realized some of the golden files were unused because of the tenancy changes, so when I deleted those, that broke xds controller tests which weren't correctly using tenancy. So when I fixed that, then the xdsv2 tests broke, so I added tenancy support there too. So now, from sidecarproxy controller -> xds controller -> xdsv2 we now have tenancy support and all the golden files are lined up.	2024-01-26 10:07:21 -08:00


5467 Commits