consul

Commit Graph

Author	SHA1	Message	Date
Hans Hasselberg	b6097a99b8	Add new fields to HCP bootstrap config request and push state request To support linking cluster, HCP needs to know the datacenter and if ACLs are enabled. Otherwise hosted Consul Core UI won't work properly.	2023-05-12 21:01:56 -06:00
Eric Haberkorn	8bb16567cd	sidecar-proxy refactor (#17328 )	2023-05-12 16:49:42 -04:00
Chris Thain	b9102c295d	Add Network Filter Support for Envoy Extensions (#17325 )	2023-05-12 09:52:50 -07:00
Kyle Havlovitz	81d8332524	Attach service virtual IP info to compiled discovery chain (#17295 ) * Add v1/internal/service-virtual-ip for manually setting service VIPs * Attach service virtual IP info to compiled discovery chain * Separate auto-assigned and manual VIPs in response	2023-05-12 02:28:16 +00:00
Kyle Havlovitz	bd0eb07ed3	Add /v1/internal/service-virtual-ip for manually setting service VIPs (#17294 )	2023-05-12 00:38:52 +00:00
R.B. Boyer	cd80ea18ff	grpc: ensure grpc resolver correctly uses lan/wan addresses on servers (#17270 ) The grpc resolver implementation is fed from changes to the router.Router. Within the router there is a map of various areas storing the addressing information for servers in those areas. All map entries are of the WAN variety except a single special entry for the LAN. Addressing information in the LAN "area" are local addresses intended for use when making a client-to-server or server-to-server request. The client agent correctly updates this LAN area when receiving lan serf events, so by extension the grpc resolver works fine in that scenario. The server agent only initially populates a single entry in the LAN area (for itself) on startup, and then never mutates that area map again. For normal RPCs a different structure is used for LAN routing. Additionally when selecting a server to contact in the local datacenter it will randomly select addresses from either the LAN or WAN addressed entries in the map. Unfortunately this means that the grpc resolver stack as it exists on server agents is either broken or only accidentally functions by having servers dial each other over the WAN-accessible address. If the operator disables the serf wan port completely likely this incidental functioning would break. This PR enforces that local requests for servers (both for stale reads or leader forwarded requests) exclusively use the LAN "area" information and also fixes it so that servers keep that area up to date in the router. A test for the grpc resolver logic was added, as well as a higher level full-stack test to ensure the externally perceived bug does not return.	2023-05-11 11:08:57 -05:00
Dan Upton	5030101cdb	resource: add missing validation to the `List` and `WatchList` endpoints (#17213 )	2023-05-10 10:38:48 +01:00
Derek Menteer	5ecab506a6	Fix ent bug caused by #17241 . (#17278 ) Fix ent bug caused by #17241 All tests passed in OSS, but not ENT. This is a patch to resolve the problem for both.	2023-05-09 16:36:29 -05:00
cskh	48f7d99305	snapshot: some improvments to the snapshot process (#17236 ) * snapshot: some improvments to the snapshot process Co-authored-by: trujillo-adam <47586768+trujillo-adam@users.noreply.github.com> Co-authored-by: Chris S. Kim <ckim@hashicorp.com>	2023-05-09 15:28:52 -04:00
Semir Patel	40eefaba18	Reaper controller for cascading deletes of owner resources (#17256 )	2023-05-09 13:57:40 -05:00
Freddy	7c3e9cd862	Hash namespace+proxy ID when creating socket path (#17204 ) UNIX domain socket paths are limited to 104-108 characters, depending on the OS. This limit was quite easy to exceed when testing the feature on Kubernetes, due to how proxy IDs encode the Pod ID eg: metrics-collector-59467bcb9b-fkkzl-hcp-metrics-collector-sidecar-proxy To ensure we stay under that character limit this commit makes a couple changes: - Use a b64 encoded SHA1 hash of the namespace + proxy ID to create a short and deterministic socket file name. - Add validation to proxy registrations and proxy-defaults to enforce a limit on the socket directory length.	2023-05-09 12:20:26 -06:00
Dan Upton	d53a1d4a27	resource: add helpers for more efficiently comparing IDs etc (#17224 )	2023-05-09 19:02:24 +01:00
Derek Menteer	4f6da20fe5	Fix multiple issues related to proxycfg health queries. (#17241 ) Fix multiple issues related to proxycfg health queries. 1. The datacenter was not being provided to a proxycfg query, which resulted in bypassing agentless query optimizations and using the normal API instead. 2. The health rpc endpoint would return a zero index when insufficient ACLs were detected. This would result in the agent cache performing an infinite loop of queries in rapid succession without backoff.	2023-05-09 12:37:58 -05:00
Dan Upton	972998203e	controller: deduplicate items in queue (#17168 )	2023-05-09 18:14:20 +01:00
Dan Upton	6e1bc57469	Controller Runtime	2023-05-09 15:25:55 +01:00
Matt Keeler	34915670f2	Register new catalog & mesh protobuf types with the resource registry (#17225 )	2023-05-08 15:36:35 -04:00
Derek Menteer	50ef6a697e	Fix issue with peer stream node cleanup. (#17235 ) Fix issue with peer stream node cleanup. This commit encompasses a few problems that are closely related due to their proximity in the code. 1. The peerstream utilizes node IDs in several locations to determine which nodes / services / checks should be cleaned up or created. While VM deployments with agents will likely always have a node ID, agentless uses synthetic nodes and does not populate the field. This means that for consul-k8s deployments, all services were likely bundled together into the same synthetic node in some code paths (but not all), resulting in strange behavior. The Node.Node field should be used instead as a unique identifier, as it should always be populated. 2. The peerstream cleanup process for unused nodes uses an incorrect query for node deregistration. This query is NOT namespace aware and results in the node (and corresponding services) being deregistered prematurely whenever it has zero default-namespace services and 1+ non-default-namespace services registered on it. This issue is tricky to find due to the incorrect logic mentioned in #1, combined with the fact that the affected services must be co-located on the same node as the currently deregistering service for this to be encountered. 3. The stream tracker did not understand differences between services in different namespaces and could therefore report incorrect numbers. It was updated to utilize the full service name to avoid conflicts and return proper results.	2023-05-08 13:13:25 -05:00
Semir Patel	991a002fcc	resource: List resources by owner (#17190 )	2023-05-08 12:26:19 -05:00
Dan Upton	917afcf3c6	controller: make the `WorkQueue` generic (#16982 )	2023-05-05 15:38:22 +01:00
John Eikenberry	bd76fdeaeb	enable auto-tidy expired issuers in vault (as CA) When using vault as a CA and generating the local signing cert, try to enable the PKI endpoint's auto-tidy feature with it set to tidy expired issuers.	2023-05-03 20:30:37 +00:00
Nathan Coleman	bdef22354b	Use auth context when evaluating service read permissions (#17207 ) Co-authored-by: Blake Covarrubias <1812+blake@users.noreply.github.com>	2023-05-02 16:23:42 -04:00
Poonam Jadhav	ef5d54fd4c	feat: add no-op reporting background routine (#17178 )	2023-04-28 20:07:03 -04:00
Eric Haberkorn	2c0da88ce7	fix panic in `injectSANMatcher` when `tlsContext` is `nil` (#17185 )	2023-04-28 16:27:57 -04:00
Paul Glass	e4a341c88a	Permissive mTLS: Config entry filtering and CLI warnings (#17183 ) This adds filtering for service-defaults: consul config list -filter 'MutualTLSMode == "permissive"'. It adds CLI warnings when the CLI writes a config entry and sees that either service-defaults or proxy-defaults contains MutualTLSMode=permissive, or sees that the mesh config entry contains AllowEnablingPermissiveMutualTLSMode=true.	2023-04-28 12:51:36 -05:00
R.B. Boyer	6b4986907d	peering: ensure that merged central configs of peered upstreams for partitioned downstreams work (#17179 ) Partitioned downstreams with peered upstreams could not properly merge central config info (i.e. proxy-defaults and service-defaults things like mesh gateway modes) if the upstream had an empty DestinationPartition field in Enterprise. Due to data flow, if this setup is done using Consul client agents the field is never empty and thus does not experience the bug. When a service is registered directly to the catalog as is the case for consul-dataplane use this field may be empty and and the internal machinery of the merging function doesn't handle this well. This PR ensures the internal machinery of that function is referentially self-consistent.	2023-04-28 12:36:08 -05:00
Semir Patel	1037bf7f69	Sync .golangci.yml from ENT (#17180 )	2023-04-28 17:14:37 +00:00
John Landa	eded58b62a	Remove artificial ACLTokenMaxTTL limit for configuring acl token expiry (#17066 ) * Remove artificial ACLTokenMaxTTL limit for configuring acl token expiry * Add changelog * Remove test on default MaxTokenTTL * Change to imperitive tense for changelog entry	2023-04-28 10:57:30 -05:00
Semir Patel	9fef1c7f17	Create tombstone on resource `Delete` (#17108 )	2023-04-28 10:49:08 -05:00
Dan Upton	eff5dd1812	resource: owner references must include a uid (#17169 )	2023-04-28 11:22:42 +01:00
Freddy	e02ef16f02	Update HCP bootstrapping to support existing clusters (#16916 ) * Persist HCP management token from server config We want to move away from injecting an initial management token into Consul clusters linked to HCP. The reasoning is that by using a separate class of token we can have more flexibility in terms of allowing HCP's token to co-exist with the user's management token. Down the line we can also more easily adjust the permissions attached to HCP's token to limit it's scope. With these changes, the cloud management token is like the initial management token in that iit has the same global management policy and if it is created it effectively bootstraps the ACL system. * Update SDK and mock HCP server The HCP management token will now be sent in a special field rather than as Consul's "initial management" token configuration. This commit also updates the mock HCP server to more accurately reflect the behavior of the CCM backend. * Refactor HCP bootstrapping logic and add tests We want to allow users to link Consul clusters that already exist to HCP. Existing clusters need care when bootstrapped by HCP, since we do not want to do things like change ACL/TLS settings for a running cluster. Additional changes: * Deconstruct MaybeBootstrap so that it can be tested. The HCP Go SDK requires HTTPS to fetch a token from the Auth URL, even if the backend server is mocked. By pulling the hcp.Client creation out we can modify its TLS configuration in tests while keeping the secure behavior in production code. * Add light validation for data received/loaded. * Sanitize initial_management token from received config, since HCP will only ever use the CloudConfig.MangementToken. * Add changelog entry	2023-04-27 22:27:39 +02:00
John Maguire	391ed069c4	APIGW: Update how status conditions for certificates are handled (#17115 ) * Move status condition for invalid certifcate to reference the listener that is using the certificate * Fix where we set the condition status for listeners and certificate refs, added tests * Add changelog	2023-04-27 15:54:44 +00:00
Semir Patel	5eaeb7b8e5	Support Envoy's MaxEjectionPercent and BaseEjectionTime config entries for passive health checks (#15979 ) * Add MaxEjectionPercent to config entry * Add BaseEjectionTime to config entry * Add MaxEjectionPercent and BaseEjectionTime to protobufs * Add MaxEjectionPercent and BaseEjectionTime to api * Fix integration test breakage * Verify MaxEjectionPercent and BaseEjectionTime in integration test upstream confings * Website docs for MaxEjectionPercent and BaseEjection time * Add `make docs` to browse docs at http://localhost:3000 * Changelog entry * so that is the difference between consul-docker and dev-docker * blah * update proto funcs * update proto --------- Co-authored-by: Maliz <maliheh.monshizadeh@hashicorp.com>	2023-04-26 15:59:48 -07:00
Michael Wilkerson	80b1dbcc7d	fixed aliases for sameness group (sameness_group) (#17161 )	2023-04-26 14:53:23 -07:00
Eric Haberkorn	a87115c598	add acl filter logs (#17143 )	2023-04-26 10:57:35 -04:00
Dan Upton	faae7bb5f2	testing: `RunResourceService` helper (#17068 )	2023-04-26 11:57:10 +01:00
Semir Patel	e7bb8fdf15	Fix or disable pipeline breaking changes that made it into main in last day or so (#17130 ) * Fix straggler from renaming Register->RegisterTypes * somehow a lint failure got through previously * Fix lint-consul-retry errors * adding in fix for success jobs getting skipped. (#17132) * Temporarily disable inmem backend conformance test to get green pipeline * Another test needs disabling --------- Co-authored-by: John Murret <john.murret@hashicorp.com>	2023-04-25 15:17:48 -05:00
Dan Upton	b9c485dcb8	Controller Supervision (#17016 )	2023-04-25 12:52:35 +01:00
John Maguire	e47f3216e5	APIGW Normalize Status Conditions (#16994 ) * normalize status conditions for gateways and routes * Added tests for checking condition status and panic conditions for validating combinations, added dummy code for fsm store * get rid of unneeded gateway condition generator struct * Remove unused file * run go mod tidy * Update tests, add conflicted gateway status * put back removed status for test * Fix linting violation, remove custom conflicted status * Update fsm commands oss * Fix incorrect combination of type/condition/status * cleaning up from PR review * Change "invalidCertificate" to be of accepted status * Move status condition enums into api package * Update gateways controller and generated code * Update conditions in fsm oss tests * run go mod tidy on consul-container module to fix linting * Fix type for gateway endpoint test * go mod tidy from changes to api * go mod tidy on troubleshoot * Fix route conflicted reason * fix route conflict reason rename * Fix text for gateway conflicted status * Add valid certificate ref condition setting * Revert change to resolved refs to be handled in future PR	2023-04-24 16:22:55 -04:00
Michael Wilkerson	001d540afc	Add sameness group field to prepared queries (#17089 ) * added method for converting SamenessGroupConfigEntry - added new method `ToQueryFailoverTargets` for converting a SamenessGroupConfigEntry's members to a list of QueryFailoverTargets - renamed `ToFailoverTargets` ToServiceResolverFailoverTargets to distinguish it from `ToQueryFailoverTargets` * Added SamenessGroup to PreparedQuery - exposed Service.Partition to API when defining a prepared query - added a method for determining if a QueryFailoverOptions is empty - This will be useful for validation - added unit tests * added method for retrieving a SamenessGroup to state store * added logic for using PQ with SamenessGroup - added branching path for SamenessGroup handling in execute. It will be handled separate from the normal PQ case - added a new interface so that the `GetSamenessGroupFailoverTargets` can be properly tested - separated the execute logic into a `targetSelector` function so that it can be used for both failover and sameness group PQs - split OSS only methods into new PQ OSS files - added validation that `samenessGroup` is an enterprise only feature * added documentation for PQ SamenessGroup	2023-04-24 13:21:28 -07:00
Derek Menteer	a33b224a55	Fix virtual services being included in intention topology as downstreams. (#17099 )	2023-04-24 12:03:26 -05:00
Semir Patel	46816071df	De-scope tenenacy requirements to OSS only for now. (#17087 ) Partition and namespace must be "default" Peername must be "local"	2023-04-24 08:14:51 -05:00
Kyle Havlovitz	6d01d07cf8	Include virtual services from discovery chain in intention topology (#16862 )	2023-04-21 16:58:13 +00:00
Kyle Havlovitz	d5277af70d	Add manual virtual IP support to state store (#16815 )	2023-04-21 09:19:02 -07:00
Eric Haberkorn	53cdda8d17	Fix a bug with disco chain config entry fetching (#17078 ) Before this change, we were not fetching service resolvers (and therefore service defaults) configuration entries for services on members of sameness groups.	2023-04-21 09:18:32 -04:00
Semir Patel	53f49b2fa1	Enforce operator:write acl on `WriteStatus` endpoint (#17019 )	2023-04-20 16:25:33 +00:00
Eric Haberkorn	b1fae05983	Add sameness groups to service intentions. (#17064 )	2023-04-20 12:16:04 -04:00
hashicorp-copywrite[bot]	9f81fc01e9	[COMPLIANCE] Add Copyright and License Headers (#16854 ) Co-authored-by: hashicorp-copywrite[bot] <110428419+hashicorp-copywrite[bot]@users.noreply.github.com> Co-authored-by: Ronald <roncodingenthusiast@users.noreply.github.com>	2023-04-20 12:40:22 +00:00
Paul Glass	f4406e69b9	[NET-3091] Update service intentions to support jwt provider references (#17037 ) * [NET-3090] Add new JWT provider config entry * Add initial test cases * update validations for jwt-provider config entry fields * more validation * start improving tests * more tests * Normalize * Improve tests and move validate fns * usage test update * Add split between ent and oss for partitions * fix lint issues * Added retry backoff, fixed tests, removed unused defaults * take into account default partitions * use countTrue and add aliases * omit audiences if empty * fix failing tests * add omit-entry * Add JWT intentions * generate proto * fix deep copy issues * remove extra field * added some tests * more tests * add validation for creating existing jwt * fix nil issue * More tests, fix conflicts and improve memdb call * fix namespace * add aliases * consolidate errors, skip duplicate memdb calls * reworked iteration over config entries * logic improvements from review --------- Co-authored-by: Ronald Ekambi <ronekambi@gmail.com>	2023-04-19 18:16:39 -04:00
Paul Glass	ac200cfec8	[NET-3090] Add new JWT provider config entry (#17036 ) * [NET-3090] Add new JWT provider config entry * Add initial test cases * update validations for jwt-provider config entry fields * more validation * start improving tests * more tests * Normalize * Improve tests and move validate fns * usage test update * Add split between ent and oss for partitions * fix lint issues * Added retry backoff, fixed tests, removed unused defaults * take into account default partitions * use countTrue and add aliases * omit audiences if empty * fix failing tests * add omit-entry * update copyright headers ids --------- Co-authored-by: Ronald Ekambi <ronekambi@gmail.com> Co-authored-by: Ronald <roncodingenthusiast@users.noreply.github.com>	2023-04-19 17:54:14 -04:00
Paul Glass	77ecff3209	Permissive mTLS (#17035 ) This implements permissive mTLS , which allows toggling services into "permissive" mTLS mode. Permissive mTLS mode allows incoming "non Consul-mTLS" traffic to be forward unmodified to the application. * Update service-defaults and proxy-defaults config entries with a MutualTLSMode field * Update the mesh config entry with an AllowEnablingPermissiveMutualTLS field and implement the necessary validation. AllowEnablingPermissiveMutualTLS must be true to allow changing to MutualTLSMode=permissive, but this does not require that all proxy-defaults and service-defaults are currently in strict mode. * Update xDS listener config to add a "permissive filter chain" when MutualTLSMode=permissive for a particular service. The permissive filter chain matches incoming traffic by the destination port. If the destination port matches the service port from the catalog, then no mTLS is required and the traffic sent is forwarded unmodified to the application.	2023-04-19 14:45:00 -05:00
R.B. Boyer	d07aac8d7e	Revert "cache: refactor agent cache fetching to prevent unnecessary f… (#16818 ) (#17046 ) Revert "cache: refactor agent cache fetching to prevent unnecessary fetches on error (#14956)" Co-authored-by: Derek Menteer <105233703+hashi-derek@users.noreply.github.com>	2023-04-19 13:17:21 -05:00
John Murret	2cefa8d9bd	ci: remove test-integrations CircleCI workflow (#16928 ) * remove all CircleCI files * remove references to CircleCI * remove more references to CircleCI * pin golangci-lint to v1.51.1 instead of v1.51	2023-04-19 16:19:29 +00:00
Luke Kysow	46212cc570	Don't send updates twice (#16999 )	2023-04-18 10:41:58 -07:00
Poonam Jadhav	5d7a7ff041	feat: set up reporting agent (#16991 )	2023-04-18 11:03:05 -04:00
Dan Upton	a37a441991	server: wire up in-process Resource Service (#16978 )	2023-04-18 10:03:23 +01:00
Semir Patel	2f7d591702	Tenancy wildcard validaton for `Write`, `Read`, and `Delete` endpoints (#17004 )	2023-04-17 16:33:20 -05:00
Derek Menteer	87324c9ec8	Add PrioritizeByLocality to config entries. (#17007 ) This commit adds the PrioritizeByLocality field to both proxy-config and service-resolver config entries for locality-aware routing. The field is currently intended for enterprise only, and will be used to enable prioritization of service-mesh connections to services based on geographical region / zone.	2023-04-14 15:42:54 -05:00
Michael Wilkerson	0dd4ea2033	* added Sameness Group to proto files (#16998 ) - added Sameness Group to config entries - added Sameness Group to subscriptions * generated proto files * added Sameness Group events to the state store - added test cases * Refactored health RPC Client - moved code that is common to rpcclient under rpcclient common.go. This will help set us up to support future RPC clients * Refactored proxycfg glue views - Moved views to rpcclient config entry. This will allow us to reuse this code for a config entry client * added config entry RPC Client - Copied most of the testing code from rpcclient/health * hooked up new rpcclient in agent * fixed documentation and comments for clarity	2023-04-14 09:24:46 -07:00
Dhia Ayachi	79d4040b6c	add IP rate limiting config update (#16997 ) * add IP rate limiting config update * fix review comments	2023-04-14 09:26:38 -04:00
Semir Patel	79b30476e0	Enforce Owner rules in `Write` endpoint (#16983 )	2023-04-14 08:19:46 -05:00
Semir Patel	8611ec56f3	Fix delete when uid not provided (#16996 )	2023-04-14 08:18:24 -05:00
Eric Haberkorn	44b39240a8	move enterprise test cases out of open source (#16985 )	2023-04-13 09:07:06 -04:00
Semir Patel	b8c9e133be	Add mutate hook to `Write` endpoint (#16958 )	2023-04-12 16:50:07 -05:00
Semir Patel	3b83c7ee9a	Enforce ACLs on resource `Write` and `Delete` endpoints (#16956 )	2023-04-12 16:22:44 -05:00
Dhia Ayachi	b85a149eaf	Memdb Txn Commit race condition fix (#16871 ) * Add a test to reproduce the race condition * Fix race condition by publishing the event after the commit and adding a lock to prevent out of order events. * split publish to generate the list of events before committing the transaction. * add changelog * remove extra func * Apply suggestions from code review Co-authored-by: Dan Upton <daniel@floppy.co> * add comment to explain test --------- Co-authored-by: Dan Upton <daniel@floppy.co>	2023-04-12 13:18:01 -04:00
Poonam Jadhav	8255cc97f5	feat: add reporting config with reload (#16890 )	2023-04-11 15:04:02 -04:00
Dan Upton	d595e6ade9	resource: `WriteStatus` endpoint (#16886 )	2023-04-11 19:23:14 +01:00
Derek Menteer	1bcaeabfc3	Remove deprecated service-defaults upstream behavior. (#16957 ) Prior to this change, peer services would be targeted by service-default overrides as long as the new `peer` field was not found in the config entry. This commit removes that deprecated backwards-compatibility behavior. Now it is necessary to specify the `peer` field in order for upstream overrides to apply to a peer upstream.	2023-04-11 10:20:33 -05:00
Semir Patel	317240fca7	Resource validation hook for `Write` endpoint (#16950 )	2023-04-11 06:55:32 -05:00
Semir Patel	686f49346c	Check acls on resource `Read`, `List`, and `WatchList` (#16842 )	2023-04-11 06:10:14 -05:00
John Maguire	92be8bd762	APIGW: Routes with duplicate parents should be invalid (#16926 ) * ensure route parents are unique when creating an http route * Ensure tcp route parents are unique * Added unit tests	2023-04-10 13:20:32 -04:00
John Eikenberry	97173725b7	log warning about certificate expiring sooner and with more details The old setting of 24 hours was not enough time to deal with an expiring certificates. This change ups it to 28 days OR 40% of the full cert duration, whichever is shorter. It also adds details to the log message to indicate which certificate it is logging about and a suggested action.	2023-04-07 20:38:07 +00:00
Chris Thain	175bb1a303	Wasm Envoy HTTP extension (#16877 )	2023-04-06 14:12:07 -07:00
Semir Patel	1794484298	Resource `Delete` endpoint (#16756 )	2023-04-06 08:58:54 -05:00
Dan Upton	4fa2537b3b	Resource `Write` endpoint (#16786 )	2023-04-06 10:40:04 +01:00
Dan Upton	671d5825ca	Raft storage backend (#16619 )	2023-04-04 17:30:06 +01:00
cskh	a319953576	docs: add envoy to the proxycfg diagram (#16834 ) * docs: add envoy to the proxycfg diagram	2023-04-04 09:42:42 -04:00
Freddy	f6de5ff635	Allow dialer to re-establish terminated peering (#16776 ) Currently, if an acceptor peer deletes a peering the dialer's peering will eventually get to a "terminated" state. If the two clusters need to be re-peered the acceptor will re-generate the token but the dialer will encounter this error on the call to establish: "failed to get addresses to dial peer: failed to refresh peer server addresses, will continue to use initial addresses: there is no active peering for "<<<ID>>>"" This is because in `exchangeSecret().GetDialAddresses()` we will get an error if fetching addresses for an inactive peering. The peering shows up as inactive at this point because of the existing terminated state. Rather than checking whether a peering is active we can instead check whether it was deleted. This way users do not need to delete terminated peerings in the dialing cluster before re-establishing them.	2023-04-03 12:07:45 -06:00
Chris S. Kim	a5397b1f23	Connect CA Primary Provider refactor (#16749 ) * Rename Intermediate cert references to LeafSigningCert Within the Consul CA subsystem, the term "Intermediate" is confusing because the meaning changes depending on provider and datacenter (primary vs secondary). For example, when using the Consul CA the "ActiveIntermediate" may return the root certificate in a primary datacenter. At a high level, we are interested in knowing which CA is responsible for signing leaf certs, regardless of its position in a certificate chain. This rename makes the intent clearer. * Move provider state check earlier * Remove calls to GenerateLeafSigningCert GenerateLeafSigningCert (formerly known as GenerateIntermediate) is vestigial in non-Vault providers, as it simply returns the root certificate in primary datacenters. By folding Vault's intermediate cert logic into `GenerateRoot` we can encapsulate the intermediate cert handling within `newCARoot`. * Move GenerateLeafSigningCert out of PrimaryProvidder Now that the Vault Provider calls GenerateLeafSigningCert within GenerateRoot, we can remove the method from all other providers that never used it in a meaningful way. * Add test for IntermediatePEM * Rename GenerateRoot to GenerateCAChain "Root" was being overloaded in the Consul CA context, as different providers and configs resulted in a single root certificate or a chain originating from an external trusted CA. Since the Vault provider also generates intermediates, it seems more accurate to call this a CAChain.	2023-04-03 11:40:33 -04:00
Eric Haberkorn	a6d69adcf5	Add default resolvers to disco chains based on the default sameness group (#16837 )	2023-03-31 14:35:56 -04:00
Derek Menteer	8d40cf9858	Add sameness-group to exported-services config entries (#16836 ) This PR adds the sameness-group field to exported-service config entries, which allows for services to be exported to multiple destination partitions / peers easily.	2023-03-31 12:36:44 -05:00
Dan Upton	651549c97d	storage: fix resource leak in Watch (#16817 )	2023-03-31 13:24:19 +01:00
Eric Haberkorn	0d1d2fc4c9	add order by locality failover to Consul enterprise (#16791 )	2023-03-30 10:08:38 -04:00
Ronald	b64674623e	Copyright headers for missing files/folders (#16708 ) * copyright headers for agent folder	2023-03-28 18:48:58 -04:00
Ronald	94ec4eb2f4	copyright headers for agent folder (#16704 ) * copyright headers for agent folder * Ignore test data files * fix proto files and remove headers in agent/uiserver folder * ignore deep-copy files	2023-03-28 14:39:22 -04:00
John Maguire	c833464daf	Update normalization of route refs (#16789 ) * Use merge of enterprise meta's rather than new custom method * Add merge logic for tcp routes * Add changelog * Normalize certificate refs on gateways * Fix infinite call loop * Explicitly call enterprise meta	2023-03-28 11:23:49 -04:00
Michael Wilkerson	e5d58c59c9	changes to support new PQ enterprise fields (#16793 )	2023-03-27 15:40:49 -07:00
Semir Patel	440f11203f	Resource service List(..) endpoint (#16753 )	2023-03-27 16:25:27 -05:00
Dhia Ayachi	10df4d83aa	add ip rate limiter controller OSS parts (#16790 )	2023-03-27 17:00:25 -04:00
Kyle Havlovitz	42c5b29713	Allocate virtual ip for resolver/router/splitter config entries (#16760 )	2023-03-27 13:04:24 -07:00
Semir Patel	032aba3175	WatchList(..) endpoint for the resource service (#16726 )	2023-03-27 14:37:54 -05:00
John Maguire	351bdc3c0d	Fix struct tags for TCPService enterprise meta (#16781 ) * Fix struct tags for TCPService enterprise meta * Add changelog	2023-03-27 16:17:04 +00:00
Semir Patel	3415689eb6	Read(...) endpoint for the resource service (#16655 )	2023-03-27 10:35:39 -05:00
Derek Menteer	2236975011	Change partition for peers in discovery chain targets (#16769 ) This commit swaps the partition field to the local partition for discovery chains targeting peers. Prior to this change, peer upstreams would always use a value of default regardless of which partition they exist in. This caused several issues in xds / proxycfg because of id mismatches. Some prior fixes were made to deal with one-off id mismatches that this PR also cleans up, since they are no longer needed.	2023-03-24 15:40:19 -05:00
John Eikenberry	0b1dc4ec36	tests instantiating clients w/o shutting down (#16755 ) noticed via their port still in use messages.	2023-03-24 16:54:11 +00:00
Poonam Jadhav	3df271959c	fix: remove unused tenancy category from rate limit spec (#16740 )	2023-03-23 12:14:59 -04:00
Dhia Ayachi	3ba0eb5074	delete config when nil (#16690 ) * delete config when nil * fix mock interface implementation * fix handler test to use the right assertion * extract DeleteConfig as a separate API. * fix mock limiter implementation to satisfy the new interface * fix failing tests * add test comments	2023-03-22 15:19:54 -04:00
Eric Haberkorn	495ad4c7ef	add enterprise xds tests (#16738 )	2023-03-22 14:56:18 -04:00
Eric Haberkorn	3c5c53aa80	fix bug where pqs that failover to a cluster peer dont un-fail over (#16729 )	2023-03-22 09:24:13 -04:00
cskh	7f6f6891f7	fix: gracefully fail on invalid port number (#16721 )	2023-03-21 22:29:21 -04:00
John Maguire	8dd1d73874	Remove unused are hosts set check (#16691 ) * Remove unused are hosts set check * Remove all traces of unused 'AreHostsSet' parameter * Remove unused Hosts attribute * Remove commented out use of snap.APIGateway.Hosts	2023-03-21 16:23:23 +00:00
Nitya Dhanushkodi	b9bd2c3780	peering: peering partition failover fixes (#16673 ) add local source partition for peered upstreams	2023-03-20 10:00:29 -07:00
John Maguire	1ef9f4dade	Fix route subscription when using namespaces (#16677 ) * Fix route subscription when using namespaces * Update changelog * Fix changelog entry to reference that the bug was enterprise only	2023-03-20 12:42:30 -04:00
Melisa Griffin	606f8fbbab	Adds check to verify that the API Gateway is being created with at least one listener	2023-03-20 12:37:30 -04:00
Poonam Jadhav	9c64731a56	feat: add category annotation to RPC and gRPC methods (#16646 )	2023-03-20 11:24:29 -04:00
Eric Haberkorn	7477f52a16	add sameness groups to discovery chains (#16671 )	2023-03-20 09:12:37 -04:00
Andrew Stucki	501b87fd31	[API Gateway] Fix invalid cluster causing gateway programming delay (#16661 ) * Add test for http routes * Add fix * Fix tests * Add changelog entry * Refactor and fix flaky tests	2023-03-17 13:31:04 -04:00
Eric Haberkorn	eaa39f4ef5	add sameness group support to service resolver failover and redirects (#16664 )	2023-03-17 10:48:06 -04:00
Eric Haberkorn	57e034b746	fix confusing spiffe ids in golden tests (#16643 )	2023-03-15 14:30:36 -04:00
wangxinyi7	152c75349e	net 2731 ip config entry OSS version (#16642 ) * ip config entry * name changing * move to ent * ent version * renaming * change format * renaming * refactor * add default values	2023-03-15 11:21:24 -07:00
John Maguire	ff5887a99e	Update e2e tests for namespaces (#16627 ) * Refactored "NewGatewayService" to handle namespaces, fixed TestHTTPRouteFlattening test * Fixed existing http_route tests for namespacing * Squash aclEnterpriseMeta for ResourceRefs and HTTPServices, accept namespace for creating connect services and regular services * Use require instead of assert after creating namespaces in http_route_tests * Refactor NewConnectService and NewGatewayService functions to use cfg objects to reduce number of method args * Rename field on SidecarConfig in tests from `SidecarServiceName` to `Name` to avoid stutter	2023-03-15 17:51:36 +00:00
Freddy	724b752ca7	Backport ENT-4704 (#16612 )	2023-03-14 14:55:11 -06:00
Derek Menteer	8f75d99299	Fix issue with trust bundle read ACL check. (#16630 ) This commit fixes an issue where trust bundles could not be read by services in a non-default namespace, unless they had excessive ACL permissions given to them. Prior to this change, `service:write` was required in the default namespace in order to read the trust bundle. Now, `service:write` to a service in any namespace is sufficient.	2023-03-14 12:24:33 -05:00
Chris S. Kim	d5677e5680	Preserve CARoots when updating Vault CA configuration (#16592 ) If a CA config update did not cause a root change, the codepath would return early and skip some steps which preserve its intermediate certificates and signing key ID. This commit re-orders some code and prevents updates from generating new intermediate certificates.	2023-03-13 17:32:59 -04:00
Derek Menteer	f2902e6608	Add sameness-group configuration entry. (#16608 ) This commit adds a sameness-group config entry to the API and structs packages. It includes some validation logic and a new memdb index that tracks the default sameness-group for each partition. Sameness groups will simplify the effort of managing failovers / intentions / exports for peers and partitions. Note that this change purely to introduce the configuration entry and does not include the full functionality of sameness-groups.	2023-03-13 16:19:11 -05:00
Ashvitha	f95ffe0355	Allow HCP metrics collection for Envoy proxies Co-authored-by: Ashvitha Sridharan <ashvitha.sridharan@hashicorp.com> Co-authored-by: Freddy <freddygv@users.noreply.github.com> Add a new envoy flag: "envoy_hcp_metrics_bind_socket_dir", a directory where a unix socket will be created with the name `<namespace>_<proxy_id>.sock` to forward Envoy metrics. If set, this will configure: - In bootstrap configuration a local stats_sink and static cluster. These will forward metrics to a loopback listener sent over xDS. - A dynamic listener listening at the socket path that the previously defined static cluster is sending metrics to. - A dynamic cluster that will forward traffic received at this listener to the hcp-metrics-collector service. Reasons for having a static cluster pointing at a dynamic listener: - We want to secure the metrics stream using TLS, but the stats sink can only be defined in bootstrap config. With dynamic listeners/clusters we can use the proxy's leaf certificate issued by the Connect CA, which isn't available at bootstrap time. - We want to intelligently route to the HCP collector. Configuring its addreess at bootstrap time limits our flexibility routing-wise. More on this below. Reasons for defining the collector as an upstream in `proxycfg`: - The HCP collector will be deployed as a mesh service. - Certificate management is taken care of, as mentioned above. - Service discovery and routing logic is automatically taken care of, meaning that no code changes are required in the xds package. - Custom routing rules can be added for the collector using discovery chain config entries. Initially the collector is expected to be deployed to each admin partition, but in the future could be deployed centrally in the default partition. These config entries could even be managed by HCP itself.	2023-03-10 13:52:54 -07:00
Eric Haberkorn	e298f506a5	Add Peer Locality to Discovery Chains (#16588 ) Add peer locality to discovery chains	2023-03-10 12:59:47 -05:00
Eric Haberkorn	57e2493415	allow setting locality on services and nodes (#16581 )	2023-03-10 09:36:15 -05:00
Semir Patel	176945aa86	GRPC stub for the ResourceService (#16528 )	2023-03-09 13:40:23 -06:00
Andrew Stucki	040647e0ba	auto-updated agent/uiserver/dist/ from commit `63204b518` (#16587 ) Co-authored-by: hc-github-team-consul-core <github-team-consul-core@hashicorp.com>	2023-03-09 13:56:53 -05:00
Eric Haberkorn	89de91b263	fix bug that can lead to peering service deletes impacting the state of local services (#16570 )	2023-03-08 11:24:03 -05:00
Eric Haberkorn	dbaf8bf49c	add agent locality and replicate it across peer streams (#16522 )	2023-03-07 14:05:23 -05:00
John Eikenberry	f5641ffccc	support vault auth config for alicloud ca provider Add support for using existing vault auto-auth configurations as the provider configuration when using Vault's CA provider with AliCloud. AliCloud requires 2 extra fields to enable it to use STS (it's preferred auth setup). Our vault-plugin-auth-alicloud package contained a method to help generate them as they require you to make an http call to a faked endpoint proxy to get them (url and headers base64 encoded).	2023-03-07 03:02:05 +00:00
Melisa Griffin	fc232326a0	NET-2904 Fixes API Gateway Route Service Weight Division Error	2023-03-06 08:41:57 -05:00
Melisa Griffin	129eca8fdb	NET-2903 Normalize weight for http routes (#16512 ) * NET-2903 Normalize weight for http routes * Update website/content/docs/connect/gateways/api-gateway/configuration/http-route.mdx Co-authored-by: trujillo-adam <47586768+trujillo-adam@users.noreply.github.com>	2023-03-03 16:39:59 -05:00
R.B. Boyer	9a485cdb49	proxycfg: ensure that an irrecoverable error in proxycfg closes the xds session and triggers a replacement proxycfg watcher (#16497 ) Receiving an "acl not found" error from an RPC in the agent cache and the streaming/event components will cause any request loops to cease under the assumption that they will never work again if the token was destroyed. This prevents log spam (#14144, #9738). Unfortunately due to things like: - authz requests going to stale servers that may not have witnessed the token creation yet - authz requests in a secondary datacenter happening before the tokens get replicated to that datacenter - authz requests from a primary TO a secondary datacenter happening before the tokens get replicated to that datacenter The caller will get an "acl not found" before the token exists, rather than just after. The machinery added above in the linked PRs will kick in and prevent the request loop from looping around again once the tokens actually exist. For `consul-dataplane` usages, where xDS is served by the Consul servers rather than the clients ultimately this is not a problem because in that scenario the `agent/proxycfg` machinery is on-demand and launched by a new xDS stream needing data for a specific service in the catalog. If the watching goroutines are terminated it ripples down and terminates the xDS stream, which CDP will eventually re-establish and restart everything. For Consul client usages, the `agent/proxycfg` machinery is ahead-of-time launched at service registration time (called "local" in some of the proxycfg machinery) so when the xDS stream comes in the data is already ready to go. If the watching goroutines terminate it should terminate the xDS stream, but there's no mechanism to re-spawn the watching goroutines. If the xDS stream reconnects it will see no `ConfigSnapshot` and will not get one again until the client agent is restarted, or the service is re-registered with something changed in it. This PR fixes a few things in the machinery: - there was an inadvertent deadlock in fetching snapshot from the proxycfg machinery by xDS, such that when the watching goroutine terminated the snapshots would never be fetched. This caused some of the xDS machinery to get indefinitely paused and not finish the teardown properly. - Every 30s we now attempt to re-insert all locally registered services into the proxycfg machinery. - When services are re-inserted into the proxycfg machinery we special case "dead" ones such that we unilaterally replace them rather that doing that conditionally.	2023-03-03 14:27:53 -06:00
John Eikenberry	56ffee6d42	add provider ca support for approle auth-method Adds support for the approle auth-method. Only handles using the approle role/secret to auth and it doesn't support the agent's extra management configuration options (wrap and delete after read) as they are not required as part of the auth (ie. they are vault agent things).	2023-03-03 19:29:53 +00:00
Andrew Stucki	cc0765b87d	Fix resolution of service resolvers with subsets for external upstreams (#16499 ) * Fix resolution of service resolvers with subsets for external upstreams * Add tests * Add changelog entry * Update view filter logic	2023-03-03 14:17:11 -05:00
Eric Haberkorn	5f81662066	Add support for failover policies (#16505 )	2023-03-03 11:12:38 -05:00
Andrew Stucki	5deffbd95b	Fix issue where terminating gateway service resolvers weren't properly cleaned up (#16498 ) * Fix issue where terminating gateway service resolvers weren't properly cleaned up * Add integration test for cleaning up resolvers * Add changelog entry * Use state test and drop integration test	2023-03-03 09:56:57 -05:00
Andrew Stucki	4b661d1e0c	Add ServiceResolver RequestTimeout for route timeouts to make TerminatingGateway upstream timeouts configurable (#16495 ) * Leverage ServiceResolver ConnectTimeout for route timeouts to make TerminatingGateway upstream timeouts configurable * Regenerate golden files * Add RequestTimeout field * Add changelog entry	2023-03-03 09:37:12 -05:00
John Eikenberry	e8eec1fa80	add provider ca auth support for kubernetes Adds support for Kubernetes jwt/token file based auth. Only needs to read the file and save the contents as the jwt/token.	2023-03-02 22:05:40 +00:00
John Eikenberry	4211069080	add provider ca support for jwt file base auth Adds support for a jwt token in a file. Simply reads the file and sends the read in jwt along to the vault login. It also supports a legacy mode with the jwt string being passed directly. In which case the path is made optional.	2023-03-02 20:33:06 +00:00
Chris S. Kim	321439f5a7	Speed up test by registering services concurrently (#16509 )	2023-03-02 14:36:44 -05:00
John Eikenberry	4f2d9a91e5	add provider ca auth-method support for azure Does the required dance with the local HTTP endpoint to get the required data for the jwt based auth setup in Azure. Keeps support for 'legacy' mode where all login data is passed on via the auth methods parameters. Refactored check for hardcoded /login fields.	2023-03-01 00:07:33 +00:00
Dan Upton	73b9b407ba	grpc: fix data race in balancer registration (#16229 ) Registering gRPC balancers is thread-unsafe because they are stored in a global map variable that is accessed without holding a lock. Therefore, it's expected that balancers are registered _once_ at the beginning of your program (e.g. in a package `init` function) and certainly not after you've started dialing connections, etc. > NOTE: this function must only be called during initialization time > (i.e. in an init() function), and is not thread-safe. While this is fine for us in production, it's challenging for tests that spin up multiple agents in-memory. We currently register a balancer per- agent which holds agent-specific state that cannot safely be shared. This commit introduces our own registry that _is_ thread-safe, and implements the Builder interface such that we can call gRPC's `Register` method once, on start-up. It uses the same pattern as our resolver registry where we use the dial target's host (aka "authority"), which is unique per-agent, to determine which builder to use.	2023-02-28 10:18:38 +00:00
Andrew Stucki	801a17329e	Fix attempt for test fail panics in xDS (#16319 ) * Fix attempt for test fail panics in xDS * switch to a mutex pointer	2023-02-24 17:00:31 -05:00
Chris S. Kim	a518893685	Fix various flaky tests (#16396 )	2023-02-23 14:52:18 -05:00
Eric Haberkorn	595131fca9	Refactor the disco chain -> xds logic (#16392 )	2023-02-23 11:32:32 -05:00
Paul Banks	8ac211b427	Correct WAL metrics registrations (#16388 )	2023-02-23 14:07:17 +00:00
Dhia Ayachi	ae9c228967	Rate limiter/add ip prefix (#16342 ) * add support for prefixes in the config tree * fix to use default config when the prefix have no config	2023-02-22 15:15:51 -05:00
Andrew Stucki	641737f32b	[API Gateway] Fix infinite loop in controller and binding non-accepted routes and gateways (#16377 )	2023-02-22 14:55:40 -05:00
Andrew Stucki	0972697661	[API Gateway] Various fixes for Config Entry fields (#16347 ) * [API Gateway] Various fixes for Config Entry fields * simplify logic per PR review	2023-02-22 04:02:04 +00:00
Andrew Stucki	18e2ee77ca	[API Gateway] Fix targeting service splitters in HTTPRoutes (#16350 ) * [API Gateway] Fix targeting service splitters in HTTPRoutes * Fix test description	2023-02-22 03:48:26 +00:00
Andrew Stucki	823fc821fa	[API Gateway] Turn down controller log levels (#16348 )	2023-02-21 20:42:01 -06:00
Derek Menteer	ad865f549b	Fix issue with peer services incorrectly appearing as connect-enabled. (#16339 ) Prior to this commit, all peer services were transmitted as connect-enabled as long as a one or more mesh-gateways were healthy. With this change, there is now a difference between typical services and connect services transmitted via peering. A service will be reported as "connect-enabled" as long as any of these conditions are met: 1. a connect-proxy sidecar is registered for the service name. 2. a connect-native instance of the service is registered. 3. a service resolver / splitter / router is registered for the service name. 4. a terminating gateway has registered the service.	2023-02-21 13:59:36 -06:00
Andrew Stucki	7f9ec78932	[API Gateway] Validate listener name is not empty (#16340 ) * [API Gateway] Validate listener name is not empty * Update docstrings and test	2023-02-21 14:12:19 -05:00
cskh	8e5942f5ca	fix: add tls config to unix socket when https is used (#16301 ) * fix: add tls config to unix socket when https is used * unit test and changelog	2023-02-21 08:28:13 -05:00
Andrew Stucki	4607b535be	Fix HTTPRoute and TCPRoute expectation for enterprise metadata (#16322 )	2023-02-17 17:28:49 -05:00
Andrew Stucki	15d2684ecc	Normalize all API Gateway references (#16316 )	2023-02-17 21:37:34 +00:00

1 2 3 4 5 ...

5145 Commits