consul

Commit Graph

Author	SHA1	Message	Date
R.B. Boyer	c029b20615	v2: ensure the controller caches are fully populated before first use (#20421 ) The new controller caches are initialized before the DependencyMappers or the Reconciler run, but importantly they are not populated. The expectation is that when the WatchList call is made to the resource service it will send an initial snapshot of all resources matching a single type, and then perpetually send UPSERT/DELETE events afterward. This initial snapshot will cycle through the caching layer and will catch it up to reflect the stored data. Critically the dependency mappers and reconcilers will race against the restoration of the caches on server startup or leader election. During this time it is possible a mapper or reconciler will use the cache to lookup a specific relationship and not find it. That very same reconciler may choose to then recompute some persisted resource and in effect rewind it to a prior computed state. Change - Since we are updating the behavior of the WatchList RPC, it was aligned to match that of pbsubscribe and pbpeerstream using a protobuf oneof instead of the enum+fields option. - The WatchList rpc now has 3 alternating response events: Upsert, Delete, EndOfSnapshot. When set the initial batch of "snapshot" Upserts sent on a new watch, those operations will be followed by an EndOfSnapshot event before beginning the never-ending sequence of Upsert/Delete events. - Within the Controller startup code we will launch N+1 goroutines to execute WatchList queries for the watched types. The UPSERTs will be applied to the nascent cache only (no mappers will execute). - Upon witnessing the END operation, those goroutines will terminate. - When all cache priming routines complete, then the normal set of N+1 long lived watch routines will launch to officially witness all events in the system using the primed cached.	2024-02-02 15:11:05 -06:00
wangxinyi7	fb2b696c0e	missing prefix / (#20447 ) * missing prefix / and fix typos	2024-02-02 12:48:45 -08:00
Eric Haberkorn	543c6a30af	Trigger the V1 Compat exported-services Controller when V1 Config Entries are Updated (#20456 ) * Trigger the v1 compat exported-services controller when the v1 config entry is modified. * Hook up exported-services config entries to the event publisher. * Add tests to the v2 exported services shim. * Use the local materializer trigger updates on the v1 compat exported services controller when exported-services config entries are modified. * stop sleeping when context is cancelled	2024-02-02 15:30:04 -05:00
Eric Haberkorn	d0243b618d	Change the multicluster group to v2 (#20430 )	2024-02-01 12:08:26 -05:00
Chris S. Kim	b6f10bc58f	Skip filter chain created by permissive mtls (#20406 )	2024-01-31 16:39:12 -05:00
wangxinyi7	3b44be530d	only forwarding the resource service traffic in client agent to server agent (#20347 ) * only forwarding the resource service traffic in client agent to server agent	2024-01-31 12:05:47 -08:00
Nick Ethier	383d92e9ab	hcp.v2.TelemetryState resource and controller implementation (#20257 ) * pbhcp: add TelemetryState resource * agent/hcp: add GetObservabilitySecrets to client * internal/hcp: add TelemetryState controller logic * hcp/telemetry-state: added config options for hcp sdk and debug key to skip deletion during reconcile * pbhcp: update proto documentation * hcp: address PR feedback, additional validations and code cleanup * internal/hcp: fix type sig change in test * update testdata/v2-resource-dependencies	2024-01-31 14:47:05 -05:00
Derek Menteer	3e8ec8d18e	Fix SAN matching on terminating gateways (#20417 ) Fixes issue: hashicorp/consul#20360 A regression was introduced in hashicorp/consul#19954 where the SAN validation matching was reduced from 4 potential types down to just the URI. Terminating gateways will need to match on many fields depending on user configuration, since they make egress calls outside of the cluster. Having more than one matcher behaves like an OR operation, where any match is sufficient to pass the certificate validation. To maintain backwards compatibility with the old untyped `match_subject_alt_names` Envoy behavior, we should match on all 4 enum types. https://www.envoyproxy.io/docs/envoy/latest/api-v3/extensions/transport_sockets/tls/v3/common.proto#enum-extensions-transport-sockets-tls-v3-subjectaltnamematcher-santype	2024-01-31 12:17:45 -06:00
John Murret	c82b78b088	NET-7165 - fix address and target setting (#20403 )	2024-01-30 15:34:35 -07:00
Ronald	8799c36410	[NET-6231] Handle Partition traffic permissions when reconciling traffic permissions (#20408 ) [NET-6231] Partition traffic permissions Co-authored-by: Chris S. Kim <ckim@hashicorp.com>	2024-01-30 22:14:32 +00:00
Chris S. Kim	7cc88a1577	Handle NamespaceTrafficPermissions when reconciling TrafficPermissions (#20407 )	2024-01-30 21:31:25 +00:00
Melissa Kam	b0e87dbe13	[CC-7049] Stop the HCP manager when link is deleted (#20351 ) * Add Stop method to telemetry provider Stop the main loop of the provider and set the config to disabled. * Add interface for telemetry provider Added for easier testing. Also renamed Run to Start, which better fits with Stop. * Add Stop method to HCP manager * Add manager interface, rename implementation Add interface for easier testing, rename existing Manager to HCPManager. * Stop HCP manager in link Finalizer * Attempt to cleanup if resource has been deleted The link should be cleaned up by the finalizer, but there's an edge case in a multi-server setup where the link is fully deleted on one server before the other server reconciles. This will cover the case where the reconcile happens after the resource is deleted. * Add a delete mananagement token function Passes a function to the HCP manager that deletes the management token that was initially created by the manager. * Delete token as part of stopping the manager * Lock around disabling config, remove descriptions	2024-01-30 09:40:36 -06:00
John Murret	7c6a3c83f2	NET-7165 - v2 - add service questions (#20390 ) * NET-7165 - v2 - add service questions * removing extraneous copied over code from autogen PR script. * fixing license checking	2024-01-29 22:33:45 +00:00
Melissa Kam	3b9bb8d6f9	[CC-7044] Start HCP manager as part of link creation (#20312 ) * Check for ACL write permissions on write Link eventually will be creating a token, so require acl:write. * Convert Run to Start, only allow to start once * Always initialize HCP components at startup * Support for updating config and client * Pass HCP manager to controller * Start HCP manager in link resource Start as part of link creation rather than always starting. Update the HCP manager with values from the link before starting as well. * Fix metrics sink leaked goroutine * Remove the hardcoded disabled hostname prefix The HCP metrics sink will always be enabled, so the length of sinks will always be greater than zero. This also means that we will also always default to prefixing metrics with the hostname, which is what our documentation states is the expected behavior anyway. * Add changelog * Check and set running status in one method * Check for primary datacenter, add back test * Clarify merge reasoning, fix timing issue in test * Add comment about controller placement * Expand on breaking change, fix typo in changelog	2024-01-29 16:31:44 -06:00
Matt Keeler	34a32d4ce5	Remove V2 PeerName field from pbresource.Tenancy (#19865 ) The peer name will eventually show up elsewhere in the resource. For now though this rips it out of where we don’t want it to be.	2024-01-29 15:08:31 -05:00
Dan Stough	0ca7313b07	feat(v2dns): add PTR query support (#20362 )	2024-01-29 11:40:10 -05:00
Tyler Wendlandt	7e08d8988c	NET-5398: Update UI server to include if v2 is enabled (#20353 ) * Update ui server to include V2 Catalog flag * Fix typo	2024-01-26 14:38:51 -07:00
Nitya Dhanushkodi	0ec7bddb9a	[Net-5594][Net-7466] v2: Only route to endpoints that implement the port being routed to, and make xdscontroller and xdsv2 golden tests use tenancy (#20356 ) * If a workload does not implement a port, it should not be included in the list of endpoints for the Envoy cluster for that port. * Adds tenancy tests for xds controller and xdsv2 resource generation, and adds all those files. * The original change in this PR was for filtering the list of endpoints by the port being routed to (bullet 1). Since I made changes to sidecarproxycontroller golden files, I realized some of the golden files were unused because of the tenancy changes, so when I deleted those, that broke xds controller tests which weren't correctly using tenancy. So when I fixed that, then the xdsv2 tests broke, so I added tenancy support there too. So now, from sidecarproxy controller -> xds controller -> xdsv2 we now have tenancy support and all the golden files are lined up.	2024-01-26 10:07:21 -08:00
sarahalsmiller	37ebaa6920	Net 7155- Consul API Gateway Controller Stub Work (#20324 ) * API Gateway proto * fix lint issue * new line * run make proto format * checkpoint * stub * Update internal/mesh/internal/controllers/apigateways/controller.go	2024-01-25 23:16:20 +00:00
Luke Kysow	840f11a0c5	Change logging of registered v2 resource endpoints to add /api prefix (#20352 ) * Change logging of registered v2 resource endpoints to add /api prefix Previous: agent.http: Registered resource endpoint: endpoint=/demo/v1/executive New: agent.http: Registered resource endpoint: endpoint=/api/demo/v1/executive This reduces confusion when attempting to call the APIs after looking at the logs.	2024-01-25 14:18:54 -08:00
Semir Patel	efdf80413c	resource: add MutateAndValidate endpoint (#20311 )	2024-01-25 13:12:30 -06:00
Dan Stough	6828780131	feat(v2dns): add partial support for SOA records (#20320 )	2024-01-24 15:32:42 -05:00
Melissa Kam	7900544249	[CC-7063] Fetch HCP agent bootstrap config in Link reconciler (#20306 ) * Move config-dependent methods to separate package In order to reuse the fetching and file creation part of the bootstrap package, move the code that would cause cyclical dependencies to a different package. * Export needed bootstrap methods and variables Also add back validating persisted config and update tests. * Add support to check for just management token Add a new method that fetches the bootstrap configuration only if there isn't a valid management token file instead of checking for all the hcp-config files. * Pass data dir as a dependency to link controller The link controller needs to check the data directory for the hcp-config files. * Fetch bootstrap config for token in controller Load the management token when reconciling a link resource, which will fetch the agent boostrap configuration if the token is not already persisted locally. Skip this step if the cluster is in read-only mode. * Validate resource ID format in link creation * Handle unauthorized and forbidden errors Check for 401 and 403s when making GNM requests, exit bootstrap fetch loop and return specific failure statuses for link. * Move test function to a testing file * Log load and status write errors	2024-01-24 09:51:43 -06:00
aahel	3446eb3b1b	added computed failover controller (#20329 ) * added computed failover controller * removed some uncessary changes * removed uncessary changes * minor refactor * minor refactor fmt * added copyright	2024-01-24 11:50:27 +05:30
skpratt	0abf8f8426	Net 5092/internal l7 traffic permissions (#20276 ) * wire up L7 Traffic Permissions * testing * update comment	2024-01-23 20:07:58 -06:00
skpratt	44bcda8523	Net 7074/decentralized exported services management (#20318 ) * Add decentralized management of V1 exported-services config entries using V2 multicluster resources. * cleanup --------- Co-authored-by: Matt Keeler <mjkeeler7@gmail.com>	2024-01-23 19:44:10 -06:00
Tauhid Anjum	5d294b26d3	NET-5824 Exported services api (#20015 ) * Exported services api implemented * Tests added, refactored code * Adding server tests * changelog added * Proto gen added * Adding codegen changes * changing url, response object * Fixing lint error by having namespace and partition directly * Tests changes * refactoring tests * Simplified uniqueness logic for exported services, sorted the response in order of service name * Fix lint errors, refactored code	2024-01-23 10:06:59 +05:30
Lord-Y	758ddf84e9	Case sensitive route match (#19647 ) Add case insensitive param on service route match This commit adds in a new feature that allows service routers to specify that paths and path prefixes should ignore upper / lower casing when matching URLs. Co-authored-by: Derek Menteer <105233703+hashi-derek@users.noreply.github.com>	2024-01-22 09:23:24 -06:00
Nick Cellino	34b343a980	Unconditionally add Access-Control-Expose-Headers HTTP header (#20220 ) * Unconditionally add Access-Control-Expose-Headers HTTP header * Return nil instead of err	2024-01-22 10:18:35 -05:00
Dan Stough	97ae244d8a	feat(v2dns): add grpc DNS support (#20296 )	2024-01-22 10:10:03 -05:00
Semir Patel	6d9e8fdd05	resource: retry non-CAS deletes automatically (#20292 )	2024-01-22 06:45:01 -08:00
R.B. Boyer	2e08a7e1c7	v2: prevent use of the v2 experiments in secondary datacenters for now (#20299 ) Ultimately we will have to rectify wan federation with v2 catalog adjacent experiments, but for now blanket prevent usage of the resource-apis, v2dns, and v2tenancy experiments in secondary datacenters.	2024-01-19 16:31:49 -06:00
Nick Cellino	37a5fddffa	Create HCP management token in HCP manager (#19830 ) * Create HCP management token in HCP manager * Change InitializeManagementToken to ManagementTokenUpserter * Implement and use management token upsert function * Fix race condition in test * Add idea for improvement as comment * Return early in upsertManagementToken if token exists	2024-01-19 13:58:49 -05:00
Melissa Kam	98c9702ba3	[CC-7031] Add initialization support to resource controllers (#20138 ) * Add Initializer to the controller The Initializer adds support for running any required initialization steps when the controller is first started. * Implement HCP Link initializer The link initializer will create a Link resource if the cloud configuration has been set. * Simplify retry logic and testing * Remove internal retry, replace with logging logic	2024-01-19 11:47:48 -06:00
Nick Cellino	fe678e9da1	Sync cluster attributes from GNM to Link resource (#20158 ) * Add 'GetCluster' function to HCP client * Sync cluster data inside Link controller * Add access mode to HCP Link * Sync AccessLevel property * Fix imports and remove outdated comments * Switch accessMode to access level * Add comment around HCPClientFn * Fix spacing in link.proto * Add helper for writing status. Fix reconciliation loop	2024-01-19 10:02:55 -05:00
Matt Keeler	f9c04881f9	Failover policy cache (#20244 ) * Migrate the Failover controller to use the controller cache * Remove the Catalog FailoverMapper and its usage in the mesh routes controller.	2024-01-19 09:35:34 -05:00
Dan Stough	0edfa74d15	feat(v2dns): recursor support (#20249 ) * feat(v2dns): recursor support * test: fix leaking test agent in dns svc test	2024-01-18 18:30:04 -05:00
Dhia Ayachi	d641998641	Fix to not create a watch to `Internal.ServiceDump` when mesh gateway is not used (#20168 ) This add a fix to properly verify the gateway mode before creating a watch specific to mesh gateways. This watch have a high performance cost and when mesh gateways are not used is not used. This also adds an optimization to only return the nodes when watching the Internal.ServiceDump RPC to avoid unnecessary disco chain compilation. As watches in proxy config only need the nodes.	2024-01-18 16:44:53 -06:00
John Murret	938d2315e0	DNS v2 - add virtual ip questions (#20245 )	2024-01-17 23:46:18 +00:00
John Murret	bc4da5f5d6	check error in TestDNSCycleRecursorCheckAllFail before asserting response to stop panic in CI. (#20231 )	2024-01-17 07:25:35 -07:00
Dan Stough	cb384ac068	feat(v2dns): addr. query support (#20224 )	2024-01-16 22:36:02 -05:00
Melissa Kam	c112a6632d	[CC-7042] Update and enable the HCP metrics sink in the HCP manager (#20072 ) * Option to set HCP client at runtime Allows us to initially set a nil HCP client for the telemetry provider and update it later. * Set telemetry provider HCP client in HCP manager Set the telemetry provider as a dependency and pass it to the manager. Update the telemetry provider's HCP client when the HCP manager starts. * Add a provider interface for the metrics client This provider will allow us to configure and reconfigure the retryable HTTP client and the headers for the metrics client. * Move HTTP retryable client to separate file Copied directly from the metrics client. * Abstract HCP specific values in HTTP client Remove HCP specific references and instead initiate with a generic TLS configuration and authentication source. * Set up HTTP client and headers in the provider Move setup from the metrics client to the HCP telemetry provider. * Update the telemetry provider in the HCP manager Initialize the provider without the HCP configs and then update it in the HCP manager to enable it. * Improve test assertion, fix method comment * Move client provider to metrics client * Stop the manager on setup error * Add separate lock for http configuration * Start telemetry provider in HCP manager * Update HCP client and config as part of Run * Remove option to set config at initialization * Simplify and clean up setting HCP configs * Add test for telemetry provider Run method * Fix race condition * Use clone of HTTP headers * Only allow initial update and run once	2024-01-16 10:46:12 -06:00
Derek Menteer	b8b8ad46fc	Various race condition and test fixes. (#20212 ) * Increase timeouts for flakey peering test. * Various test fixes. * Fix race condition in reconcilePeering. This resolves an issue where a peering object in the state store was incorrectly mutated by a function, resulting in the test being flagged as failing when the -race flag was used.	2024-01-16 08:57:43 -06:00
John Murret	93e06b799e	v1 dns - add doc strings for functions and update function names to be consistent and more descriptive. (#20194 ) v1 dns - add doc strings for functions and update function names to be consistent and mre descriptive.	2024-01-12 22:07:42 +00:00
R.B. Boyer	7f9ed032fd	agent: remove data race in agent config (#20200 ) To fix an issue displaying the current reloaded config in the v1/agent/self endpoint #18681 caused the agent's internal config struct member to be deepcopied and replaced on reload. This is not safe because the field is not protected by a lock, nor should it be due to how it is accessed by the rest of the system. This PR does the same deepcopy, but into a new field solely for the point of capturing the current reloaded values for display purposes. If there has been no reload then the original config is used.	2024-01-12 15:11:21 -06:00
Matt Keeler	326c0ecfbe	In-Memory gRPC (#19942 ) * Implement In-Process gRPC for use by controller caching/indexing This replaces the pipe base listener implementation we were previously using. The new style CAN avoid cloning resources which our controller caching/indexing is taking advantage of to not duplicate resource objects in memory. To maintain safety for controllers and for them to be able to modify data they get back from the cache and the resource service, the client they are presented in their runtime will be wrapped with an autogenerated client which clones request and response messages as they pass through the client. Another sizable change in this PR is to consolidate how server specific gRPC services get registered and managed. Before this was in a bunch of different methods and it was difficult to track down how gRPC services were registered. Now its all in one place. * Fix race in tests * Ensure the resource service is registered to the multiplexed handler for forwarding from client agents * Expose peer streaming on the internal handler	2024-01-12 11:54:07 -05:00
John Murret	3fa4a21edd	remove the skipping of slow tests in go-tests-ce and go-test-enterprise (#20139 ) * remove the skipping of slow tests in go-tests-ce and go-test-enterprise * add license header	2024-01-10 20:39:34 -07:00
Dan Stough	d52e80b619	[OSS] feat: add experiments flag for v2 dns and skeleton interfaces (#20115 ) feat: add experiments flag for v2 dns and skeleton interfaces	2024-01-10 11:19:20 -05:00
loshz	7724bb88d5	[NET-6593] agent: check for minimum RSA key size (#20112 ) * agent: check for minimum RSA key size * add changelog * agent: add test for RSA generated key sizes * use constants in generating priv key func * update key size error message	2024-01-10 12:15:36 +00:00
Derek Menteer	131ef2a133	Fix broken tests. (#20134 )	2024-01-09 14:57:27 -06:00
Derek Menteer	6854e1e90d	Fix broken tests. (#20130 ) This fixes some tests that were broken, but not caught, due to the CICD pipeline only running a subset of the overall tests on PRs.	2024-01-09 13:45:29 -06:00
Nick Cellino	0deebaf637	Add Link resource type and controller skeleton (#19788 ) * Add HCCLink resource type * Register HCCLink resource type with basic validation * Add validation for required fields * Add test for default ACLs * Add no-op controller for HCCLink * Add resource-apis semantic validation check in hcclink controller * Add copyright headers * Rename HCCLink to Link * Add hcp_cluster_url to link proto * Update 'disabled' reason with more detail * Update link status name to consul.io/hcp/link * Change link version from v1 to v2 * Use feature flag/experiment to enable v2 resources with HCP	2024-01-09 13:57:59 -05:00
John Murret	21e2bb2a67	Make DNS test run across a matrix of dns and catalog versions. (#20114 ) * Make DNS test run across a matrix of dns and catalog versions. * node tests * add version hcl config to service lookup tests	2024-01-08 13:14:26 -07:00
Melissa Kam	5dc8eabcce	[CC-7041] Update and start the SCADA provider in HCP manager (#19976 ) * Update SCADA provider version Also update mocks for SCADA provider. * Create SCADA provider w/o HCP config, then update Adds a placeholder config option to allow us to initialize a SCADA provider without the HCP configuration. Also adds an update method to then add the HCP configuration. We need this to be able to eventually always register a SCADA listener at startup before the HCP config values are known. * Pass cloud configuration to HCP manager Save the entire cloud configuration and pass it to the HCP manager. * Update and start SCADA provider in HCP manager Move config updating and starting to the HCP manager. The HCP manager will eventually be responsible for all processes that contribute to linking to HCP.	2024-01-08 09:49:29 -06:00
Ganesh S	0d57acc549	Add sameness group references in exported services controller (#20100 )	2024-01-08 11:55:52 +05:30
John Murret	c12245be3c	Break up DNS tests into 3 files to help with GH UI and IDE issues. (#20103 )	2024-01-05 13:37:27 -07:00
cskh	15b40f36f3	Use safeio to write server metadata file (#20101 ) * Use safeio to write server metadata file * guard the conversion	2024-01-05 14:46:19 -05:00
John Murret	7a410d7c5b	NET-6945 - Replace usage of deprecated Envoy field envoy.config.core.v3.HeaderValueOption.append (#20078 ) * NET-6945 - Replace usage of deprecated Envoy field envoy.config.core.v3.HeaderValueOption.append * update proto for v2 and then update xds v2 logic * add changelog * Update 20078.txt to be consistent with existing changelog entries * swap enum values tomatch envoy.	2024-01-04 00:36:25 +00:00
Dan Stough	073959866d	feat(v2): add consul service and workloads to catalog (#20077 )	2024-01-03 15:14:42 -05:00
John Murret	d925e4b812	NET-6946 / NET-6941 - Replace usage of deprecated Envoy fields envoy.config.route.v3.HeaderMatcher.safe_regex_match and envoy.type.matcher.v3.RegexMatcher.google_re2 (#20013 ) * NET-6946 - Replace usage of deprecated Envoy field envoy.config.route.v3.HeaderMatcher.safe_regex_match * removing unrelated changes * update golden files * do not set engine type	2024-01-03 09:53:39 -07:00
John Murret	2f335113f8	NET-6943 - Replace usage of deprecated Envoy field envoy.config.router.v3.WeightedCluster.total_weight. (#20011 )	2023-12-22 19:49:44 +00:00
John Murret	90cd56c5c3	NET-4774 - replace usage of deprecated Envoy field match_subject_alt_names (#19954 )	2023-12-22 18:34:44 +00:00
John Murret	21ea5c92fd	NET-6944 - Replace usage of deprecated Envoy field envoy.extensions.filters.http.lua.v3.Lua.inline_code (#20012 )	2023-12-22 17:20:41 +00:00
Nathan Coleman	ab60fec15a	[NET-6426] Add gateway proxy controller that generates empty proxy state template (#19901 ) * NET-6426 Create ProxyStateTemplate when reconciling MeshGateway resource * Add TODO for switching fetch method based on gateway type * Use gateway-kind in workload metadata instead of owner reference * Create ProxyStateTemplate builder for gatewayproxy controller * Update to use new controller interface * Add copyright headers * Set correct name for ProxyStateTemplate identity reference * Generate empty ProxyStateTemplate by fetching MeshGateway This cheats and looks up the MeshGateway directly. In the future, we will need a Workload => xGateway mapper * Specify owner reference when writing ProxyStateTemplate * Update dependency mapper to account for multiple controllers per resource type * Regenerate v2 resource dependencies map * Add helpful trace logs, tag TODOs with ticket identifiers	2023-12-21 16:37:47 -05:00
Nitya Dhanushkodi	9975b8bd73	[NET-5455] Allow disabling request and idle timeouts with negative values in service router and service resolver (#19992 ) * add coverage for testing these timeouts	2023-12-19 15:36:07 -08:00
cskh	cff872749d	agent: prevent empty server_metadata.json (#19935 )	2023-12-19 10:01:56 -05:00
aahel	ae998a698a	added computed failover policy resource (#19975 )	2023-12-18 05:52:24 +00:00
Derek Menteer	bbdbf3e4f8	Fix bug with prepared queries using sameness-groups. (#19970 ) This commit fixes an issue where the partition was not properly set on the peering query failover target created from sameness-groups. Before this change, it was always empty, meaning that the data would be queried with respect to the default partition always. This resulted in a situation where a PQ that was attempting to use a sameness-group for failover would select peers from the default partition, rather than the partition of the sameness-group itself.	2023-12-15 11:42:13 -06:00
aahel	a6496898de	added tenancy to TestBuildL4TrafficPermissions (#19932 )	2023-12-14 10:41:24 +05:30
Matt Keeler	123bc95e1a	Add Common Controller Caching Infrastructure (#19767 ) * Add Common Controller Caching Infrastructure	2023-12-13 10:06:39 -05:00
Dhia Ayachi	f2b26ac194	Hash based config entry replication (#19795 ) * add a hash to config entries when normalizing * add GetHash and implement comparing hashes * only update if the Hash is different * only update if the Hash is different and not 0 * fix proto to include the Hash * fix proto gen * buf format * add SetHash and fix tests * fix config load tests * fix state test and config test * recalculate hash when restoring config entries * fix snapshot restore test * add changelog * fix missing normalize, fix proto indexes and add normalize test	2023-12-12 08:29:13 -05:00
Ronald	e13fbc743e	Remove warning for consul 1.17 deprecation (#19897 )	2023-12-11 23:28:04 +00:00
Derek Menteer	dfab5ade50	Fix ClusterLoadAssignment timeouts dropping endpoints. (#19871 ) When a large number of upstreams are configured on a single envoy proxy, there was a chance that it would timeout when waiting for ClusterLoadAssignments. While this doesn't always immediately cause issues, consul-dataplane instances appear to consistently drop endpoints from their configurations after an xDS connection is re-established (the server dies, random disconnect, etc). This commit adds an `xds_fetch_timeout_ms` config to service registrations so that users can set the value higher for large instances that have many upstreams. The timeout can be disabled by setting a value of `0`. This configuration was introduced to reduce the risk of causing a breaking change for users if there is ever a scenario where endpoints would never be received. Rather than just always blocking indefinitely or for a significantly longer period of time, this config will affect only the service instance associated with it.	2023-12-11 09:25:11 -06:00
Derek Menteer	0ac958f27b	Fix xDS missing endpoint race condition. (#19866 ) This fixes the following race condition: - Send update endpoints - Send update cluster - Recv ACK endpoints - Recv ACK cluster Prior to this fix, it would have resulted in the endpoints NOT existing in Envoy. This occurred because the cluster update implicitly clears the endpoints in Envoy, but we would never re-send the endpoint data to compensate for the loss, because we would incorrectly ACK the invalid old endpoint hash. Since the endpoint's hash did not actually change, they would not be resent. The fix for this is to effectively clear out the invalid pending ACKs for child resources whenever the parent changes. This ensures that we do not store the child's hash as accepted when the race occurs. An escape-hatch environment variable `XDS_PROTOCOL_LEGACY_CHILD_RESEND` was added so that users can revert back to the old legacy behavior in the event that this produces unknown side-effects. Visit the following thread for some extra context on why certainty around these race conditions is difficult: https://github.com/envoyproxy/envoy/issues/13009 This bug report and fix was mostly implemented by @ksmiley with some minor tweaks. Co-authored-by: Keith Smiley <ksmiley@salesforce.com>	2023-12-08 11:37:12 -06:00
Thomas Eckert	8125a32a4e	Add CE version of Gateway Upstream Disambiguation (#19860 ) * Add CE version of gateway-upstream-disambiguation * Use NamespaceOrDefault and PartitionOrDefault * Add Changelog entry * Remove the unneeded reassignment * Use c.ID()	2023-12-07 17:56:14 -05:00
Dhia Ayachi	d93f7f730d	parse config protocol on write to optimize disco-chain compilation (#19829 ) * parse config protocol on write to optimize disco-chain compilation * add changelog	2023-12-07 13:46:46 -05:00
Jared Kirschner	d3e658b0e7	improve client RPC metrics consistency (#19721 ) The client.rpc metric now excludes internal retries for consistency with client.rpc.exceeded and client.rpc.failed. All of these metrics now increment at most once per RPC method call, allowing for accurate calculation of failure / rate limit application occurrence. Additionally, if an RPC fails because no servers are present, client.rpc.failed is now incremented.	2023-12-06 13:21:08 -05:00
Matt Keeler	efe279f802	Retry lint fixes (#19151 ) * Add a make target to run lint-consul-retry on all the modules * Cleanup sdk/testutil/retry * Fix a bunch of retry.Run* usage to not use the outer testing.T * Fix some more recent retry lint issues and pin to v1.4.0 of lint-consul-retry * Fix codegen copywrite lint issues * Don’t perform cleanup after each retry attempt by default. * Use the common testutil.TestingTB interface in test-integ/tenancy * Fix retry tests * Update otel access logging extension test to perform requests within the retry block	2023-12-06 12:11:32 -05:00
Ronald	dc02fa695f	[NET-6251] Nomad client templated policy (#19827 )	2023-12-06 10:32:12 -05:00
Semir Patel	c1bbda8128	resource: block default namespace deletion + test refactorings (#19822 )	2023-12-05 14:00:06 -05:00
lornasong	edf4610ed9	[Cloud][CC-6925] Updates to pushing server state (#19682 ) * Upgrade hcp-sdk-go to latest version v0.73 Changes: - go get github.com/hashicorp/hcp-sdk-go - go mod tidy * From upgrade: regenerate protobufs for upgrade from 1.30 to 1.31 Ran: `make proto` Slack: https://hashicorp.slack.com/archives/C0253EQ5B40/p1701105418579429 * From upgrade: fix mock interface implementation After upgrading, there is the following compile error: cannot use &mockHCPCfg{} (value of type mockHCPCfg) as "github.com/hashicorp/hcp-sdk-go/config".HCPConfig value in return statement: mockHCPCfg does not implement "github.com/hashicorp/hcp-sdk-go/config".HCPConfig (missing method Logout) Solution: update the mock to have the missing Logout method * From upgrade: Lint: remove usage of deprecated req.ServerState.TLS Due to upgrade, linting is erroring due to usage of a newly deprecated field 22:47:56 [consul]: make lint --> Running golangci-lint (.) agent/hcp/testing.go:157:24: SA1019: req.ServerState.TLS is deprecated: use server_tls.internal_rpc instead. (staticcheck) time.Until(time.Time(req.ServerState.TLS.CertExpiry)).Hours()/24, ^ * From upgrade: adjust oidc error message From the upgrade, this test started failing: === FAIL: internal/go-sso/oidcauth TestOIDC_ClaimsFromAuthCode/failed_code_exchange (re-run 2) (0.01s) oidc_test.go:393: unexpected error: Provider login failed: Error exchanging oidc code: oauth2: "invalid_grant" "unexpected auth code" Prior to the upgrade, the error returned was: ``` Provider login failed: Error exchanging oidc code: oauth2: cannot fetch token: 401 Unauthorized\nResponse: {\"error\":\"invalid_grant\",\"error_description\":\"unexpected auth code\"}\n ``` Now the error returned is as below and does not contain "cannot fetch token" ``` Provider login failed: Error exchanging oidc code: oauth2: "invalid_grant" "unexpected auth code" ``` * Update AgentPushServerState structs with new fields HCP-side changes for the new fields are in: https://github.com/hashicorp/cloud-global-network-manager-service/pull/1195/files * Minor refactor for hcpServerStatus to abstract tlsInfo into struct This will make it easier to set the same tls-info information to both - status.TLS (deprecated field) - status.ServerTLSMetadata (new field to use instead) * Update hcpServerStatus to parse out information for new fields Changes: - Improve error message and handling (encountered some issues and was confused) - Set new field TLSInfo.CertIssuer - Collect certificate authority metadata and set on TLSInfo.CertificateAuthorities - Set TLSInfo on both server.TLS and server.ServerTLSMetadata.InternalRPC * Update serverStatusToHCP to convert new fields to GNM rpc * Add changelog * Feedback: connect.ParseCert, caCerts * Feedback: refactor and unit test server status * Feedback: test to use expected struct * Feedback: certificate with intermediate * Feedback: catch no leaf, remove expectedErr * Feedback: update todos with jira ticket * Feedback: mock tlsConfigurator	2023-12-04 10:25:18 -05:00
aahel	7936e55807	added node health resource (#19803 )	2023-12-02 11:14:03 +05:30
John Maguire	a0240e3794	[NET-5688] APIGateway UI Topology Fixes (#19657 ) * Update catalog and ui endpoints to show APIGateway in gateway service topology view * Added initial implementation for service view * updated ui * Fix topology view for gateways * Adding tests for gw controller * remove unused args * Undo formatting changes * Fix call sites for upstream/downstream gw changes * Add config entry tests * Fix function calls again * Move from ServiceKey to ServiceName, cleanup from PR review * Add additional check for length of services in bound apigateway for IsSame comparison * fix formatting for proto * gofmt * Add DeepCopy for retrieved BoundAPIGateway * gofmt * gofmt * Rename function to be more consistent	2023-11-28 21:27:14 +00:00
Thomas Eckert	419677cc9e	[NET-6420] Add MeshConfiguration Controller stub (#19745 ) * Add meshconfiguration/controller * Add MeshConfiguration Registration function * Fix the TODOs on the RegisterMeshGateway function * Call RegisterMeshConfiguration * Add comment to MeshConfigurationRegistration * Add a test for Reconcile and some comments	2023-11-28 18:56:07 +00:00
Semir Patel	5930748cb0	resource: ListByOwner returns empty list on non-existent tenancy (#19742 )	2023-11-27 14:56:08 -06:00
Ronald	eded2ff347	[NET-6249] Add templated policies description (#19735 )	2023-11-27 10:34:22 -05:00
Ronald	c1dbf00a85	NET-6251 API gateway templated policy (#19728 )	2023-11-24 17:55:05 +00:00
Poonam Jadhav	78f918a103	feat: create a default namespace (#19681 ) * feat: create a default namespace on leader * refactor: add comment and move inittenancy to leader file * refactor: rephrase comment	2023-11-22 14:32:57 -05:00
Mike Nomitch	302f994410	[NET-6640] Adds "Policy" BindType to BindingRule (#19499 ) feat: add bind type of policy Co-authored-by: Ronald Ekambi <ronekambi@gmail.com>	2023-11-20 13:11:08 +00:00
Semir Patel	75c2def1ca	resource: preserve deferred deletion metadata on non-CAS writes (#19674 )	2023-11-17 10:51:25 -06:00
Ronald	ea0caa3e0f	[NET-6103] Enable query tokens by service name using templated policy (#19666 )	2023-11-16 14:32:06 -05:00
John Murret	2591318c82	Skip tests with p95 greater than 30 seconds outside of main and release branches. (#19628 ) Skip tests with p95 greater than 30 seconds.	2023-11-15 13:43:33 -07:00
Semir Patel	1eed205286	resource: freeze resources after marked for deletion (4 of 5) (#19603 )	2023-11-15 10:58:27 -06:00
Kumar Kavish	68e7f27fd2	[NET-6438] Add tenancy to xDS Tests (#19551 ) * [NET-6438] Add tenancy to xDS Tests * [NET-6438] Add tenancy to xDS Tests - Fixing imports * [NET-6438] Add tenancy to xDS Tests - Added cleanup post test run * [NET-6356] Add tenancy to xDS Tests - Added cleanup post test run * [NET-6438] Add tenancy to xDS Tests - using t.Cleanup instead of defer delete * [NET-6438] Add tenancy to xDS Tests - rebased * [NET-6438] Add tenancy to xDS Tests - rebased	2023-11-10 15:32:36 +05:30
aahel	005e1b9926	added exported svc controller (#19589 ) * added exported svc controller * added license headers	2023-11-10 07:27:53 +05:30
Nathan Coleman	40c57f10a0	NET-6391 Initialize controller for MeshGateway resource (#19552 ) * Generate resource_types for MeshGateway by specifying spec option * Register MeshGateway type w/ TODOs for hooks * Initialize controller for MeshGateway resources * Add meshgateway to list of v2 resource dependencies for golden test * Scope MeshGateway resource to partition	2023-11-09 16:33:14 -05:00
John Murret	780e91688d	Migrate remaining individual resource tests for service mesh to TestAllResourcesFromSnapshot (#19583 ) * migrate expose checks and paths tests to resources_test.go * fix failing expose paths tests * fix the way endpoint resources get created to make expose tests pass. * remove endpoint resources that are already inlined on local_app clusters * renaiming and comments * migrate remaining service mesh tests to resources_test.go * cleanup * update proxystateconverter to skip ading alpn to clusters and listener filterto match v1 behavior	2023-11-09 20:08:37 +00:00
Kumar Kavish	f09dbb99e9	[NET-6356] Add tenancy to Failover Tests (#19547 ) * [NET-6356] Add tenancy to Failover Tests * [NET-6438] Add tenancy to xDS Tests - Added cleanup post test run * [NET-6356] Add tenancy to failover Tests - using t.Cleanup instead of defer delete	2023-11-10 01:14:09 +05:30
John Murret	f5bf256425	Migrate individual resource tests for API Gateway to TestAllResourcesFromSnapshot (#19584 ) migrate individual api gateway tests to resources_test.go	2023-11-09 17:01:54 +00:00
John Murret	a94fa4c3ed	Migrate individual resource tests for Mesh Gateway to TestAllResourcesFromSnapshot (#19502 ) migrate mesh-gateway tests to resources_test.go	2023-11-09 16:39:16 +00:00

1 2 3 4 5 ...

5485 Commits