consul

Commit Graph

Author	SHA1	Message	Date
Daniel Nephin	9f27d61bee	Remove unused var The usage was removed in `8e22d80e35`, however it seems there may be a bug here because the cluster name is not updated when the target changes.	2020-05-19 16:50:14 -04:00
Daniel Nephin	4f738b462b	Handle error from template.Execute Refactored the function to make the problem more obvious, by using a guard.	2020-05-19 16:50:14 -04:00
Daniel Nephin	c662f0f0de	Fix a number of problems found by staticcheck Some of these problems are minor (unused vars), but others are real bugs (ignored errors). Co-authored-by: Matt Keeler <mkeeler@users.noreply.github.com>	2020-05-19 16:50:14 -04:00
Daniel Nephin	5c99109dd9	Remove unused var The usage of this var was removed in `b92f895c23`. Found by using staticcheck	2020-05-19 16:50:14 -04:00
Chris Piraino	9d9e23cc44	Add service id context to the proxycfg logger This is especially useful when multiple proxies are all querying the same Consul agent.	2020-05-18 09:08:05 -05:00
Chris Piraino	79468793f6	Do not return an error if requested service is not a gateway This commit converts the previous error into just a Warn-level log message. By returning an error when the requested service was not a gateway, we did not appropriately update envoy because the cache Fetch returned an error and thus did not propagate the update through proxycfg and xds packages.	2020-05-18 09:08:04 -05:00
Aleksandr Zagaevskiy	a75e3d9051	Preserve ModifyIndex for unchanged entry in KVS TXN (#7832 )	2020-05-14 13:25:04 -06:00
Pierre Souchay	cf55e81c06	tests: fix unstable test `TestAgentAntiEntropy_Checks`. (#7594 ) Example of failure: https://circleci.com/gh/hashicorp/consul/153932#tests/containers/2	2020-05-14 09:54:49 +02:00
Kit Patella	ad1d4d4d07	http: migrate from instrumentation in s.wrap() to an s.enterpriseHandler()	2020-05-13 15:47:05 -07:00
Matt Keeler	acccdbe45c	Fix identity resolution on clients and in secondary dcs (#7862 ) Previously this happened to be using the method on the Server/Client that was meant to allow the ACLResolver to locally resolve tokens. On Servers that had tokens (primary or secondary dc + token replication) this function would lookup the token from raft and return the ACLIdentity. On clients this was always a noop. We inadvertently used this function instead of creating a new one when we added logging accessor ids for permission denied RPC requests. With this commit, a new method is used for resolving the identity properly via the ACLResolver which may still resolve locally in the case of being on a server with tokens but also supports remote token resolution.	2020-05-13 13:00:08 -04:00
Chris Piraino	7a7760bfd5	Make new gateway tests compatible with enterprise (#7856 )	2020-05-12 13:48:20 -05:00
Daniel Nephin	600645b5f9	Add unconvert linter To find unnecessary type convertions	2020-05-12 13:47:25 -04:00
Drew Bailey	c9d0b83277	Value is already an int, remove type cast	2020-05-12 13:13:09 -04:00
Daniel Nephin	9d5ab443a7	Merge pull request #7689 from hashicorp/dnephin/remove-deadcode-1 Remove some dead code	2020-05-12 12:33:59 -04:00
Daniel Nephin	47238a693d	Merge pull request #7819 from hashicorp/dnephin/remove-t.Parallel-1 test: Remove t.Parallel() from agent/structs tests	2020-05-12 12:11:57 -04:00
R.B. Boyer	1efafd7523	acl: add auth method for JWTs (#7846 )	2020-05-11 20:59:29 -05:00
Kit Patella	58ee349a83	Merge pull request #7843 from hashicorp/oss-sync/auditing-config agent/config: Fix tests & include Audit struct as a pointer on Config	2020-05-11 14:23:44 -07:00
Kit Patella	10b3478a4d	agent/config: include Audit struct as a pointer on Config, fix tests	2020-05-11 14:13:05 -07:00
Kit Patella	b5564751bf	Merge pull request #7841 from hashicorp/oss-sync/auditing-config OSS sync - Auditing config	2020-05-11 13:44:38 -07:00
Kit Patella	f5030957d0	agent/config: add auditing config to OSS and add to enterpriseConfigMap exclusions	2020-05-11 13:27:35 -07:00
Chris Piraino	c21052457b	Return early from updateGatewayServices if nothing to update (#7838 ) * Return early from updateGatewayServices if nothing to update Previously, we returned an empty slice of gatewayServices, which caused us to accidentally delete everything in the memdb table * PR comment and better formatting	2020-05-11 14:46:48 -05:00
Chris Piraino	4d6751bf16	Fix TestInternal_GatewayServiceDump_Ingress (#7840 ) Protocol was added as a field on GatewayServices after GatewayServiceDump PR branch was created.	2020-05-11 14:46:31 -05:00
R.B. Boyer	7414a3fa53	cli: ensure 'acl auth-method update' doesn't deep merge the Config field (#7839 )	2020-05-11 14:21:17 -05:00
Chris Piraino	74c0543ef2	PR comment and better formatting	2020-05-11 14:04:59 -05:00
Chris Piraino	fb9ee9d892	Return early from updateGatewayServices if nothing to update Previously, we returned an empty slice of gatewayServices, which caused us to accidentally delete everything in the memdb table	2020-05-11 12:38:04 -05:00
Freddy	b3ec383d04	Gateway Services Nodes UI Endpoint (#7685 ) The endpoint supports queries for both Ingress Gateways and Terminating Gateways. Used to display a gateway's linked services in the UI.	2020-05-11 11:35:17 -06:00
Kyle Havlovitz	136549205c	Merge pull request #7759 from hashicorp/ingress/tls-hosts Add TLS option for Ingress Gateway listeners	2020-05-11 09:18:43 -07:00
Kyle Havlovitz	8d140ce9af	Disallow the blanket wildcard prefix from being used as custom host	2020-05-08 20:24:18 -07:00
Chris Piraino	a0e1f57ac2	Remove development log line	2020-05-08 20:24:18 -07:00
Chris Piraino	26f92e74f6	Compute all valid DNSSANs for ingress gateways For DNSSANs we take into account the following and compute the appropriate wildcard values: - source datacenter - namespaces - alt domains	2020-05-08 20:23:17 -07:00
Daniel Nephin	5655d7f34e	Add outlier_detection check to integration test Fix decoding of time.Duration types.	2020-05-08 14:56:57 -04:00
Daniel Nephin	eaa05d623a	xds: Add passive health check config for upstreams	2020-05-08 14:56:57 -04:00
Chris Piraino	429d0cedd2	Restoring config entries updates the gateway-services table (#7811 ) - Adds a new validateConfigEntryEnterprise function - Also fixes some state store tests that were failing in enterprise	2020-05-08 13:24:33 -05:00
Daniel Nephin	e60bb9f102	test: Remove t.Parallel() from agent/structs tests go test will only run tests in parallel within a single package. In this case the package test run time is exactly the same with or without t.Parallel() (~0.7s). In generally we should avoid t.Parallel() as it causes a number of problems with `go test` not reporting failure messages correctly. I encountered one of these problems, which is what prompted this change. Since `t.Parallel` is not providing any benefit in this package, this commit removes it. The change was automated with: git grep -l 't.Parallel' \| xargs sed -i -e '/t.Parallel/d'	2020-05-08 14:06:10 -04:00
Freddy	c32a4f1ece	Fix up enterprise compatibility for gateways (#7813 )	2020-05-08 09:44:34 -06:00
Jono Sosulska	9b363e9f23	Fix spelling of deregister (#7804 )	2020-05-08 10:03:45 -04:00
Chris Piraino	f55e20a2f7	Allow ingress gateways to send empty clusters, routes, and listeners (#7795 ) This is useful when updating an config entry with no services, and the expected behavior is that envoy closes all listeners and clusters. We also allow empty routes because ingress gateways name route configurations based on the port of the listener, so it is important we remove any stale routes. Then, if a new listener with an old port is added, we will not have to deal with stale routes hanging around routing to the wrong place. Endpoints are associated with clusters, and thus by deleting the clusters we don't have to care about sending empty endpoint responses.	2020-05-07 16:19:25 -05:00
Chris Piraino	0bd5618cb2	Cleanup proxycfg for TLS - Use correct enterprise metadata for finding config entry - nil out cancel functions on config snapshot copy - Look at HostsSet when checking validity	2020-05-07 10:22:57 -05:00
Chris Piraino	5105bf3d67	Require individual services in ingress entry to match protocols (#7774 ) We require any non-wildcard services to match the protocol defined in the listener on write, so that we can maintain a consistent experience through ingress gateways. This also helps guard against accidental misconfiguration by a user. - Update tests that require an updated protocol for ingress gateways	2020-05-06 16:09:24 -05:00
Freddy	b069887b2a	Remove timeout and call to Fatal from goroutine (#7797 )	2020-05-06 14:33:17 -06:00
Chris Piraino	0c22eacca8	Add TLS field to ingress API structs - Adds test in api and command/config/write packages	2020-05-06 15:12:02 -05:00
Chris Piraino	30792e933b	Add test for adding DNSSAN for ConnectCALeaf cache type	2020-05-06 15:12:02 -05:00
Chris Piraino	0b9ba9660d	Validate hosts input in ingress gateway config entry We can only allow host names that are valid domain names because we put these hosts into a DNSSAN. In addition, we validate that the wildcard specifier '*' is only present as the leftmost label to allow for a wildcard DNSSAN and associated wildcard Host routing in the ingress gateway proxy.	2020-05-06 15:12:02 -05:00
Kyle Havlovitz	f14c54e25e	Add TLS option and DNS SAN support to ingress config xds: Only set TLS context for ingress listener when requested	2020-05-06 15:12:02 -05:00
Chris Piraino	905279f5d1	A proxy-default config entry only exists in the default namespace	2020-05-06 15:06:14 -05:00
Chris Piraino	d498a0afc9	Correctly set a namespace label in the required domain for xds routes If an upstream is not in the default namespace, we expect DNS requests to be served over "<service-name>.ingress.<namespace>.*"	2020-05-06 15:06:14 -05:00
Chris Piraino	114a18e890	Remove outdated comment	2020-05-06 15:06:14 -05:00
Chris Piraino	d8517bd6fd	Better document wildcard specifier interactions	2020-05-06 15:06:14 -05:00
Chris Piraino	45e635286a	Re-add comment on connect-proxy virtual hosts	2020-05-06 15:06:14 -05:00
Kyle Havlovitz	f9672f9bf1	Make sure IngressHosts isn't parsed during JSON decode	2020-05-06 15:06:14 -05:00
Chris Piraino	c44f877758	Comment why it is ok to expect upstreams slice to not be empty	2020-05-06 15:06:13 -05:00
Chris Piraino	881760f701	xds: Use only the port number as the configured route name This removes duplication of protocol from the stats_prefix	2020-05-06 15:06:13 -05:00
Kyle Havlovitz	89e6b16815	Filter wildcard gateway services to match listener protocol This now requires some type of protocol setting in ingress gateway tests to ensure the services are not filtered out. - small refactor to add a max(x, y) function - Use internal configEntryTxn function and add MaxUint64 to lib	2020-05-06 15:06:13 -05:00
Chris Piraino	f40833d094	Allow Hosts field to be set on an ingress config entry - Validate that this cannot be set on a 'tcp' listener nor on a wildcard service. - Add Hosts field to api and test in consul config write CLI - xds: Configure envoy with user-provided hosts from ingress gateways	2020-05-06 15:06:13 -05:00
Chris Piraino	b73a13fc9e	Remove service_subset field from ingress config entry We decided that this was not a useful MVP feature, and just added unnecessary complexity	2020-05-06 15:06:13 -05:00
Kyle Havlovitz	711d1389aa	Support multiple listeners referencing the same service in gateway definitions	2020-05-06 15:06:13 -05:00
Kyle Havlovitz	247f9eaf13	Allow ingress gateways to route traffic based on Host header This commit adds the necessary changes to allow an ingress gateway to route traffic from a single defined port to multiple different upstream services in the Consul mesh. To do this, we now require all HTTP requests coming into the ingress gateway to specify a Host header that matches "<service-name>.*" in order to correctly route traffic to the correct service. - Differentiate multiple listener's route names by port - Adds a case in xds for allowing default discovery chains to create a route configuration when on an ingress gateway. This allows default services to easily use host header routing - ingress-gateways have a single route config for each listener that utilizes domain matching to route to different services.	2020-05-06 15:06:13 -05:00
R.B. Boyer	a854e4d9c5	acl: oss plumbing to support auth method namespace rules in enterprise (#7794 ) This includes website docs updates.	2020-05-06 13:48:04 -05:00
R.B. Boyer	3242d0816d	test: make the kube auth method test helper use freeport (#7788 )	2020-05-05 16:55:21 -05:00
Hans Hasselberg	096a2f2f02	network_segments: stop advertising segment tags	2020-05-05 21:32:05 +02:00
Hans Hasselberg	995a24b8e4	agent: refactor to use a single addrFn	2020-05-05 21:08:10 +02:00
Hans Hasselberg	6994c0d47f	agent: rename local/global to src/dst	2020-05-05 21:07:34 +02:00
Chris Piraino	69b44fb942	Construct a default destination if one does not exist for service-router (#7783 )	2020-05-05 10:49:50 -05:00
R.B. Boyer	22eb016153	acl: add MaxTokenTTL field to auth methods (#7779 ) When set to a non zero value it will limit the ExpirationTime of all tokens created via the auth method.	2020-05-04 17:02:57 -05:00
R.B. Boyer	ca52ba7068	acl: add DisplayName field to auth methods (#7769 ) Also add a few missing acl fields in the api.	2020-05-04 15:18:25 -05:00
Hans Hasselberg	c4093c87cc	agent: don't let left nodes hold onto their node-id (#7747 )	2020-05-04 18:39:08 +02:00
Matt Keeler	daec810e34	Merge pull request #7714 from hashicorp/oss-sync/msp-agent-token	2020-05-04 11:33:50 -04:00
Matt Keeler	cbe3a70f56	Update enterprise configurations to be in OSS This will emit warnings about the configs not doing anything but still allow them to be parsed. This also added the warnings for enterprise fields that we already had in OSS but didn’t change their enforcement behavior. For example, attempting to use a network segment will cause a hard error in OSS.	2020-05-04 10:21:05 -04:00
R.B. Boyer	9533451a63	acl: refactor the authmethod.Validator interface (#7760 ) This is a collection of refactors that make upcoming PRs easier to digest. The main change is the introduction of the authmethod.Identity struct. In the one and only current auth method (type=kubernetes) all of the trusted identity attributes are both selectable and projectable, so they were just passed around as a map[string]string. When namespaces were added, this was slightly changed so that the enterprise metadata can also come back from the login operation, so login now returned two fields. Now with some upcoming auth methods it won't be true that all identity attributes will be both selectable and projectable, so rather than update the login function to return 3 pieces of data it seemed worth it to wrap those fields up and give them a proper name.	2020-05-01 17:35:28 -05:00
R.B. Boyer	54ba8e3868	acl: change authmethod.Validator to take a logger (#7758 )	2020-05-01 15:55:26 -05:00
R.B. Boyer	8927b54121	test: move some test helpers over from enterprise (#7754 )	2020-05-01 14:52:15 -05:00
R.B. Boyer	b282268408	sdk: extracting testutil.RequireErrorContains from various places it was duplicated (#7753 )	2020-05-01 11:56:34 -05:00
Hans Hasselberg	51549bd232	rpc: oss changes for network area connection pooling (#7735 )	2020-04-30 22:12:17 +02:00
Freddy	021f0ee36e	Watch fallback channel for gateways that do not exist (#7715 ) Also ensure that WatchSets in tests are reset between calls to watchFired. Any time a watch fires, subsequent calls to watchFired on the same WatchSet will also return true even if there were no changes.	2020-04-29 16:52:27 -06:00
Matt Keeler	7a4c73acaf	Updates to allow for using an enterprise specific token as the agents token This is needed to allow for managed Consul instances to register themselves in the catalog with one of the managed service provider tokens.	2020-04-28 09:44:26 -04:00
Matt Keeler	bec3fb7c18	Some boilerplate to allow for ACL Bootstrap disabling configurability	2020-04-28 09:42:46 -04:00
Freddy	137a2c32c6	TLS Origination for Terminating Gateways (#7671 )	2020-04-27 16:25:37 -06:00
freddygv	4710410cb5	Remove fallthrough	2020-04-27 12:00:14 -06:00
freddygv	d1e6d668c2	Add authz filter when creating filterchain	2020-04-27 11:08:41 -06:00
freddygv	034d7d83d4	Fix snapshot IsEmpty	2020-04-27 11:08:41 -06:00
freddygv	3afe816a94	Clean up dead code, issue addressed by passing ws to serviceGatewayNodes	2020-04-27 11:08:41 -06:00
Freddy	3b1b24c2ce	Update agent/proxycfg/state_test.go	2020-04-27 11:08:41 -06:00
freddygv	eddd5bd73b	PR comments	2020-04-27 11:08:41 -06:00
freddygv	77bb2f1002	Fix internal endpoint test	2020-04-27 11:08:41 -06:00
freddygv	d82e7e8c2a	Fix listener error handling	2020-04-27 11:08:41 -06:00
freddygv	6abc71f915	Skip filter chain creation if no client cert	2020-04-27 11:08:41 -06:00
freddygv	915db10903	Avoid deleting mappings for services linked to other gateways on dereg	2020-04-27 11:08:41 -06:00
freddygv	cd28d4125d	Re-fix bug in CheckConnectServiceNodes	2020-04-27 11:08:41 -06:00
freddygv	09a8e5f36d	Use golden files for gateway certs and fix listener test flakiness	2020-04-27 11:08:41 -06:00
freddygv	840d27a9d5	Un-nest switch in gateway update handler	2020-04-27 11:08:40 -06:00
freddygv	c0e1751878	Allow terminating-gateway to setup listener before servicegroups are known	2020-04-27 11:08:40 -06:00
freddygv	913b13f31f	Add subset support	2020-04-27 11:08:40 -06:00
freddygv	9f233dece2	Fix ConnectQueryBlocking test	2020-04-27 11:08:40 -06:00
freddygv	86342e4bca	Fix bug in CheckConnectServiceNodes Previously, if a blocking query called CheckConnectServiceNodes before the gateway-services memdb table had any entries, a nil watchCh would be returned when calling serviceTerminatingGatewayNodes. This means that the blocking query would not fire if a gateway config entry was added after the watch started. In cases where the blocking query started on proxy registration, the proxy could potentially never become aware of an upstream endpoint if that upstream was going to be represented by a gateway.	2020-04-27 11:08:40 -06:00
freddygv	219c78e586	Add xds cluster/listener/endpoint management	2020-04-27 11:08:40 -06:00
freddygv	24207226ca	Add proxycfg state management for terminating-gateways	2020-04-27 11:07:06 -06:00
freddygv	c9385129ae	Require service:read to read terminating-gateway config	2020-04-27 11:07:06 -06:00
Matt Keeler	a1648c61ae	A couple testing helper updates (#7694 )	2020-04-27 12:17:38 -04:00
Kit Patella	df14a7c694	Merge pull request #7699 from pierresouchay/fix_comment_misplaced Fixed comment on wrong line	2020-04-24 10:09:58 -07:00
Chris Piraino	ecc8a2d6f7	Allow ingress gateways to route through mesh gateways - Adds integration test for mesh gateways local + remote modes with ingress - ingress golden files updated for mesh gateway endpoints	2020-04-24 09:31:32 -05:00
Chris Piraino	cb9df538d5	Add all the xds ingress tests This commit copies many of the connect-proxy xds testcases and reuses for ingress gateways. This allows us to more easily see changes to the envoy configuration when make updates to ingress gateways.	2020-04-24 09:31:32 -05:00
Chris Piraino	0ca9b606e8	Pull out setupTestVariationConfigEntriesAndSnapshot in proxycfg This allows us to reuse the same variations for ingress gateway testing	2020-04-24 09:31:32 -05:00
Kyle Havlovitz	e7b1ee55de	Add http routing support and integration test to ingress gateways	2020-04-24 09:31:32 -05:00
Hans Hasselberg	1194fe441f	auto_encrypt: add validations for auto_encrypt.{tls,allow_tls} (#7704 ) Fixes https://github.com/hashicorp/consul/issues/7407.	2020-04-24 15:51:38 +02:00
Pierre Souchay	5e79efc80f	Fixed comment on wrong line. While investigating and fixing an issue on our 1.5.1 branch, I saw you also/already fixed the bug I found (tags not updated for existing servers), but comment is misplaced.	2020-04-24 01:15:15 +02:00
Freddy	3956cff60f	Fix check deletion in anti-entropy sync (#7690 ) * Incorporate entMeta into service equality check	2020-04-23 10:16:50 -06:00
Daniel Nephin	d6e22a77e3	Remove deadcode This UnmarshalJSON was never called. The decode function is passed a map[string]interface so it has no way of knowing that this function exists. Tested by adding a panic to this function and watching the tests pass. I attempted to use this Unmarshal function by passing in the type, however the tests showed that it does not work. The test was failing to parse the request. If the performance of this endpoint is indeed critical we can solve the problem by adding all the fields to the request struct and handling the normalziation without a custom Unmarshal.	2020-04-22 16:48:28 -04:00
Daniel Nephin	ff0d894101	agent: remove deadcode that called lib.TranslateKeys Move the last remaining function from agent/config.go to the one place it was called.	2020-04-22 13:41:43 -04:00
Chris Piraino	115d2d5db5	Expect default enterprise metadata in gateway tests (#7664 ) This makes it so that both OSS and enterprise tests pass correctly In the api tests, explicitly set namespace to empty string so that tests can be shared.	2020-04-20 09:02:35 -05:00
Kit Patella	ccece5cd21	http: rename paresTokenResolveProxy to parseTokenWithDefault	2020-04-17 13:35:24 -07:00
Kit Patella	e2467f4b2c	Merge pull request #7656 from hashicorp/feature/audit/oss-merge agent: stub out auditing functionality in OSS	2020-04-17 13:33:06 -07:00
Kit Patella	3b105435b8	agent,config: port enterprise only fields to embedded enterprise structs	2020-04-17 13:27:39 -07:00
Daniel Nephin	67d14d8349	Merge pull request #7641 from hashicorp/dnephin/agent-cache-request-info agent/cache: reduce function arguments by removing duplicates	2020-04-17 14:10:49 -04:00
Chris Piraino	6ef8ae9965	Fix bug where non-typical services are associated with gateways (#7662 ) On every service registration, we check to see if a service should be assassociated to a wildcard gateway-service. This fixes an issue where we did not correctly check to see if the service being registered was a "typical" service or not.	2020-04-17 11:24:34 -05:00
Daniel Nephin	81755c860a	agent/cache: remove error return from fetch A previous change removed the only error, so the return value can be removed now.	2020-04-17 11:55:01 -04:00
Daniel Nephin	4ef9fc9f27	agent/cache: reduce function arguments by removing duplicates A few of the unexported functions in agent/cache took a large number of arguments. These arguments were effectively overrides for values that were provided in RequestInfo. By using a struct we can not only reduce the number of arguments, but also simplify the logic by removing the need for overrides.	2020-04-17 11:35:07 -04:00
Kit Patella	4a86cb12c1	config/runtime: fix an extra field in config sanitize	2020-04-16 16:37:25 -07:00
Daniel Nephin	5fe7043439	agent/cache: Make all cache options RegisterOptions Previously the SupportsBlocking option was specified by a method on the type, and all the other options were specified from RegisterOptions. This change moves RegisterOptions to a method on the type, and moves SupportsBlocking into the options struct. Currently there are only 2 cache-types. So all cache-types can implement this method by embedding a struct with those predefined values. In the future if a cache type needs to be registered more than once with different options it can remove the embedded type and implement the method in a way that allows for paramaterization.	2020-04-16 18:56:34 -04:00
Kit Patella	927f584761	agent: stub out auditing functionality in OSS	2020-04-16 15:07:52 -07:00
Kyle Havlovitz	e9e8c0e730	Ingress Gateways for TCP services (#7509 ) * Implements a simple, tcp ingress gateway workflow This adds a new type of gateway for allowing Ingress traffic into Connect from external services. Co-authored-by: Chris Piraino <cpiraino@hashicorp.com>	2020-04-16 14:00:48 -07:00
Daniel Nephin	f46d1b5c94	agent/structs: Remove ServiceID.Init and CheckID.Init The Init method provided the same functionality as the New constructor. The constructor is both more widely used, and more idiomatic, so remove the Init method. This change is in preparation for fixing printing of these IDs.	2020-04-15 12:09:56 -04:00
sasha	ac9b330f6b	add DNSSAN and IPSAN to cache key (#7597 )	2020-04-15 10:11:11 -05:00
Matt Keeler	6a78c24d67	Update the Client code to use the common version checking infra… (#7558 ) Also reduce the log level of some version checking messages on the server as they can be pretty noisy during upgrades and really are more for debugging purposes.	2020-04-14 11:54:27 -04:00
Matt Keeler	da893c36a1	Allow the bootstrap endpoint to be disabled in enterprise. (#7614 )	2020-04-14 11:45:39 -04:00
Daniel Nephin	89f41bddfe	Remove TTL from cacheEntryExpiry This should very slightly reduce the amount of memory required to store each item in the cache. It will also enable setting different TTLs based on the type of result. For example we may want to use a shorter TTL when the result indicates the resource does not exist, as storing these types of records could easily lead to a DOS caused by OOM.	2020-04-13 13:10:38 -04:00
Daniel Nephin	7246d8b6cb	agent/cache: Reduce differences between notify implementations These two notify functions are very similar. There appear to be just enough differences that trying to parameterize the differences may not improve things. For now, reduce some of the cosmetic differences so that the material differences are more obvious.	2020-04-13 13:10:38 -04:00
Daniel Nephin	66fbb13976	agent/cache: Inline the refresh function to make recursion more obvious fetch is already an exceptionally long function, but hiding the recrusion in a function call likely does not help.	2020-04-13 13:10:38 -04:00
Daniel Nephin	faeaed5d0c	agent/cache: Make the return values of getEntryLocked more obvious Use named returned so that the caller has a better idea of what these bools mean. Return early to reduce the scope, and make it more obvious what values are returned in which cases. Also reduces the number of conditional expressions in each case.	2020-04-13 13:10:38 -04:00
Daniel Nephin	e9e45545dd	agent/cache: Small formatting improvements to improve readability Remove Cache.entryKey which called a single function. Format multiline struct creation one field per line.	2020-04-13 12:34:11 -04:00
Daniel Nephin	329d76fd0e	Remove SnapshotRPC passthrough The caller has access to the delegate, so we do not gain anything by wrapping the call in Agent.	2020-04-13 12:32:57 -04:00
Daniel Nephin	1f25bf88b8	Merge pull request #7596 from hashicorp/dnephin/agent-cache-type-entry agent/cache: move typeEntry lookup to the edge	2020-04-13 12:24:07 -04:00
Pierre Souchay	1b4218a068	fix flaky TestReplication_FederationStates test due to race conditions (#7612 ) The test had two racy bugs related to memdb references. The first was when we initially populated data and retained the FederationState objects in a slice. Due to how the `inmemCodec` works these were actually the identical objects passed into memdb. The second was that the `checkSame` assertion function was reading from memdb and setting the RaftIndexes to zeros to aid in equality checks. This was mutating the contents of memdb which is a no-no. With this fix, the command: ``` i=0; while /usr/local/bin/go test -count=1 -timeout 30s github.com/hashicorp/consul/agent/consul -run '^(TestReplication_FederationStates)$'; do i=$((i + 1)); printf "$i "; done ``` That used to break on my machine in less than 20 runs is now running 150+ times without any issue. Might also fix #7575	2020-04-09 15:42:41 -05:00
Pierre Souchay	4a6569a4e3	tests: change default http_max_conns_per_client to 250 to ease tests (#7625 ) On recent Mac OS versions, the ulimit defaults to 256 by default, but many systems (eg: some Linux distributions) often limit this value to 1024. On validation of configuration, Consul now validates that the number of allowed files descriptors is bigger than http_max_conns_per_client. This make some unit tests failing on Mac OS. Use a less important value in unit test, so tests runs well by default on Mac OS without need for tuning the OS.	2020-04-09 11:11:42 +02:00
Freddy	9eb1867fbb	Terminating gateway discovery (#7571 ) * Enable discovering terminating gateways * Add TerminatingGatewayServices to state store * Use GatewayServices RPC endpoint for ingress/terminating	2020-04-08 12:37:24 -06:00
Freddy	aae14b3951	Add decode rules for Expose cfg in service-defaults (#7611 )	2020-04-07 19:37:47 -06:00
Matt Keeler	0e7d3d93b3	Enable filtering language support for the v1/connect/intentions… (#7593 ) * Enable filtering language support for the v1/connect/intentions listing API * Update website for filtering of Intentions * Update website/source/api/connect/intentions.html.md	2020-04-07 11:48:44 -04:00
Daniel Nephin	8549cc2d99	Merge pull request #7598 from pierresouchay/preallocation_of_dns_meta Pre-allocations of DNS meta to avoid several allocations	2020-04-06 14:00:32 -04:00
Pierre Souchay	d1d016d61d	[LINT] Close resp.Body to avoid linter complaining (#7600 )	2020-04-06 09:11:04 -04:00
Pierre Souchay	c9e01ed0a3	Pre-allocations of DNS meta to avoid several allocations	2020-04-05 11:12:41 +02:00
Daniel Nephin	c9a87be6ee	agent/cache: move typeEntry lookup to the edge This change moves all the typeEntry lookups to the first step in the exported methods, and makes unexporter internals accept the typeEntry struct. This change is primarily intended to make it easier to extract the container of caches from the Cache type. It may incidentally reduce locking in fetch, but that was not a goal.	2020-04-03 16:01:56 -04:00
Pierre Souchay	73056fecf8	Fixed unstable test TestForwardSignals() Sometimes, in the CI, it could receive a SIGURG, producing this line: FAIL: TestForwardSignals/signal-interrupt (0.06s) util_test.go:286: expected to read line "signal: interrupt" but got "signal: urgent I/O condition" Only forward the signals we test to avoid this kind of false positive Example of such unstable errors in CI: https://circleci.com/gh/hashicorp/consul/153571	2020-04-03 14:23:03 +02:00
Pierre Souchay	09e638a9c6	tests: more tolerance to latency for unstable test `TestCacheNotifyPolling()`. (#7574 )	2020-04-03 10:29:38 +02:00
Matt Keeler	8aec09aa8f	Ensure that token clone copies the roles (#7577 )	2020-04-02 12:09:35 -04:00
Chris Piraino	584f90bbeb	Fix flapping of mesh gateway connect-service watches (#7575 )	2020-04-02 10:12:13 -05:00
Pierre Souchay	2a8bf45e38	agent: show warning when enable_script_checks is enabled without safty net (#7437 ) In order to enforce a bit security on Consul agents, add a new method in agent to highlight possible security issues. This does not return an error for now, but might in the future. For now, it detects issues such as: https://www.hashicorp.com/blog/protecting-consul-from-rce-risk-in-specific-configurations/ This would display this kind of messages: ``` 2020-03-11T18:27:49.873+0100 [ERROR] agent: [SECURITY] issue: error="using enable-script-checks without ACLs and without allow_write_http_from is DANGEROUS, use enable-local-script-checks instead see https://www.hashicorp.com/blog/protecting-consul-from-rce-risk-in-specific-configurations/" ```	2020-04-02 09:59:23 +02:00
Andy Lindeman	fb0a990e4d	agent: rewrite checks with proxy address, not local service address (#7518 ) Exposing checks is supposed to allow a Consul agent bound to a different IP address (e.g., in a different Kubernetes pod) to access healthchecks through the proxy while the underlying service binds to localhost. This is an important security feature that makes sure no external traffic reaches the service except through the proxy. However, as far as I can tell, this is subtly broken in the case where the Consul agent cannot reach the proxy over localhost. If a proxy is configured with: `{ LocalServiceAddress: "127.0.0.1", Checks: true }`, as is typical with a sidecar proxy, the Consul checks are currently rewritten to `127.0.0.1:<random port>`. A Consul agent that does not share the loopback address cannot reach this address. Just to make sure I was not misunderstanding, I tried configuring the proxy with `{ LocalServiceAddress: "<pod ip>", Checks: true }`. In this case, while the checks are rewritten as expected and the agent can reach the dynamic port, the proxy can no longer reach its backend because the traffic is no longer on the loopback interface. I think rewriting the checks to use `proxy.Address`, the proxy's own address, is more correct in this case. That is the IP where the proxy can be reached, both by other proxies and by a Consul agent running on a different IP. The local service address should continue to use `127.0.0.1` in most cases.	2020-04-02 09:35:43 +02:00
Andy Lindeman	c1cb18c648	proxycfg: support path exposed with non-HTTP2 protocol (#7510 ) If a proxied service is a gRPC or HTTP2 service, but a path is exposed using the HTTP1 or TCP protocol, Envoy should not be configured with `http2ProtocolOptions` for the cluster backing the path. A situation where this comes up is a gRPC service whose healthcheck or metrics route (e.g. for Prometheus) is an HTTP1 service running on a different port. Previously, if these were exposed either using `Expose: { Checks: true }` or `Expose: { Paths: ... }`, Envoy would still be configured to communicate with the path over HTTP2, which would not work properly.	2020-04-02 09:35:04 +02:00
Pierre Souchay	be1c5c4b48	config: validate system limits against limits.http_max_conns_per_client (#7434 ) I spent some time today on my local Mac to figure out why Consul 1.6.3+ was not accepting limits.http_max_conns_per_client. This adds an explicit check on number of file descriptors to be sure it might work (this is no guarantee as if many clients are reaching the agent, it might consume even more file descriptors) Anyway, many users are fighting with RLIMIT_NOFILE, having a clear message would allow them to figure out what to fix. Example of message (reload or start): ``` 2020-03-11T16:38:37.062+0100 [ERROR] agent: Error starting agent: error="system allows a max of 512 file descriptors, but limits.http_max_conns_per_client: 8192 needs at least 8212" ```	2020-04-02 09:22:17 +02:00
Shaker Islam	ac309d55f4	docs: document exported functions in agent.go (closes #7101 ) (#7366 ) and fix one linter error	2020-04-01 22:52:23 +02:00
Pierre Souchay	08acd6a03e	[FIX BUILD] fix build due to merge of #7562 Due to merge #7562, upstream does not compile anymore. Error is: ERRO Running error: gofmt: analysis skipped: errors in package: [/Users/p.souchay/go/src/github.com/hashicorp/consul/agent/config_endpoint_test.go:188:33: too many arguments]	2020-04-01 18:29:45 +02:00
Daniel Nephin	0d8edc3e27	Merge pull request #7562 from hashicorp/dnephin/remove-tname-from-name testing: Remove old default value from NewTestAgent() calls	2020-04-01 11:48:45 -04:00
Daniel Nephin	190fc3c732	Merge pull request #7533 from hashicorp/dnephin/xds-server-1 agent/xds: small cleanup	2020-04-01 11:24:50 -04:00
Emre Savcı	2083b7b04d	agent: add len, cap while initializing arrays	2020-04-01 10:54:51 +02:00
Daniel Nephin	e759daafdd	Rename NewTestAgentWithFields to StartTestAgent This function now only starts the agent. Using: git grep -l 'StartTestAgent(t, true,' \| \ xargs sed -i -e 's/StartTestAgent(t, true,/StartTestAgent(t,/g'	2020-03-31 17:14:55 -04:00
Daniel Nephin	f9f6b14533	Convert the remaining calls to NewTestAgentWithFields After removing the t.Name() parameter with sed, convert the last few tests which use a custom name to call NewTestAgentWithFields instead.	2020-03-31 17:14:55 -04:00
Daniel Nephin	0c409d460f	Merge pull request #7470 from hashicorp/dnephin/dns-unused-params dns: Remove a few unused function parameters	2020-03-31 16:56:19 -04:00
Pierre Souchay	54b22c638d	config: allow running `consul agent -dev -ui-dir=some_path` (#7525 ) When run in with `-dev` in DevMode, it is not possible to replace the embeded UI with another one because `-dev` implies `-ui`. This commit allows this an slightly change the error message about Consul 0.7.0 which is very old and does not apply to current version anyway.	2020-03-31 22:36:20 +02:00
Daniel Nephin	475659a132	Remove name from NewTestAgent Using: git grep -l 'NewTestAgent(t, t.Name(),' \| \ xargs sed -i -e 's/NewTestAgent(t, t.Name(),/NewTestAgent(t,/g'	2020-03-31 16:13:44 -04:00
Freddy	90576060bc	Add config entry for terminating gateways (#7545 ) This config entry will be used to configure terminating gateways. It accepts the name of the gateway and a list of services the gateway will represent. For each service users will be able to specify: its name, namespace, and additional options for TLS origination. Co-authored-by: Kyle Havlovitz <kylehav@gmail.com> Co-authored-by: Chris Piraino <cpiraino@hashicorp.com>	2020-03-31 13:27:32 -06:00
Kyle Havlovitz	c911174327	Add config entry/state for Ingress Gateways (#7483 ) * Add Ingress gateway config entry and other relevant structs * Add api package tests for ingress gateways * Embed EnterpriseMeta into ingress service struct * Add namespace fields to api module and test consul config write decoding * Don't require a port for ingress gateways * Add snakeJSON and camelJSON cases in command test * Run Normalize on service's ent metadata Sadly cannot think of a way to test this in OSS. * Every protocol requires at least 1 service * Validate ingress protocols * Update agent/structs/config_entry_gateways.go Co-authored-by: Chris Piraino <cpiraino@hashicorp.com> Co-authored-by: Freddy <freddygv@users.noreply.github.com>	2020-03-31 11:59:10 -05:00
Daniel Nephin	9d959907a4	Merge pull request #7485 from hashicorp/dnephin/do-not-skip-tests-on-ci ci: Make it harder to accidentally skip tests on CI, and doc why some are skipped	2020-03-31 11:15:44 -04:00
Daniel Nephin	ad7c78f134	Remove t.Name() from TestAgent.Name And re-add the name to the logger so that log messages from different agents in a single can be identified.	2020-03-30 16:47:24 -04:00
Daniel Nephin	231c99f7b4	Document Agent.LogOutput	2020-03-30 14:32:13 -04:00
Daniel Nephin	dd40a1535e	testing: reduce verbosity of output log Previously the log output included the test name twice and a long date format. The test output is already grouped by test, so adding the test name did not add any new information. The date and time are only useful to understand elapsed time, so using a short format should provide succident detail. Also fixed a bug in NewTestAgentWithFields where nil was returned instead of the test agent.	2020-03-30 13:23:13 -04:00
Daniel Nephin	1d90ecc31d	Remove unused token parameter	2020-03-27 17:57:16 -04:00
Daniel Nephin	ef68e404a5	A little less 'just'	2020-03-27 16:08:25 -04:00
Daniel Nephin	002fc85ef2	Remove unused customEDSClusterJSON	2020-03-27 15:38:16 -04:00
Matt Keeler	028654410c	Ensure server requirements checks are done against ALL known se… (#7491 ) Co-authored-by: Paul Banks <banks@banksco.de>	2020-03-27 12:31:43 -04:00
Matt Keeler	74a665afc3	Add information about which services are proxied to ui services… (#7417 )	2020-03-27 10:57:46 -04:00
Daniel Nephin	b5c7d292e4	Merge pull request #7516 from hashicorp/dnephin/remove-unused-method agent: Remove unused method Encrypted from delegate interface	2020-03-26 14:17:58 -04:00
Daniel Nephin	bb8833a2d5	agent: Remove unused Encrypted from interface It appears to be unused. It looks like it has been around a while, I geuss at some point we stopped using this method.	2020-03-26 12:34:31 -04:00
Freddy	18d356899c	Enable CLI to register terminating gateways (#7500 ) * Enable CLI to register terminating gateways * Centralize gateway proxy configuration	2020-03-26 10:20:56 -06:00
Daniel Nephin	33c7894123	Merge pull request #7498 from hashicorp/dnephin/small-cleanup envoy: small cleanup in cmd and server	2020-03-25 13:24:44 -04:00
Alejandro Baez	bafa69bb69	Add PolicyReadByName for API (#6615 )	2020-03-25 10:34:24 -04:00
Chris Piraino	136099d834	Fix flakey health check reload test (#7490 ) This test would occasionally fail because we checked for a status of "critical" initially. This races with the actual healthcheck being run and declared passing. We instead use a ttl health check so that we don't rely on timing at all.	2020-03-25 09:09:13 -05:00
Daniel Nephin	266bdf7465	agent: Remove xdsServer field The field is only referenced from a single method, it can be a local var	2020-03-24 18:05:14 -04:00
Daniel Nephin	326453eaa1	dns: Remove a few unused params	2020-03-24 15:56:41 -04:00
Daniel Nephin	61ec7aa5c9	ci: Run all connect/ca tests from the integration suite To reduce the chance of some tests not being run because it does not match the regex passed to '-run'. Also document why some tests are allowed to be skipped on CI.	2020-03-24 15:22:01 -04:00
Daniel Nephin	f4a35dfd84	ci: Do not skip tests because of missing binaries on CI If the CI environment is not correct for running tests the tests should fail, so that we don't accidentally stop running some tests because of a change to our CI environment. Also removed a duplicate delcaration from init. I believe one was overriding the other as they are both in the same package.	2020-03-24 14:34:13 -04:00
Kim Ngo	bef693df9c	agent/xds: Update mesh gateway to use service router timeout (#7444 ) * website/connect/proxy/envoy: specify timeout precedence for services behind mesh gateway	2020-03-17 14:50:14 -05:00
Matt Keeler	80db61193c	Fix ACL mode advertisement and detection (#7451 ) These changes are necessary to ensure advertisement happens correctly even when datacenters are connected via network areas in Consul enterprise. This also changes how we check if ACLs can be upgraded within the local datacenter. Previously we would iterate through all LAN members. Now we just use the ServerLookup type to iterate through all known servers in the DC.	2020-03-16 12:54:45 -04:00
Freddy	709932f088	Update MSP token and filtering (#7431 )	2020-03-11 12:08:49 -06:00
Hans Hasselberg	7777891aa6	tls: remove old ciphers (#7282 ) Following advice from: https://github.com/ssllabs/research/wiki/SSL-and-TLS-Deployment-Best-Practices, this PR removes old ciphers.	2020-03-10 21:44:26 +01:00
R.B. Boyer	85a08bf8ed	server: strip local ACL tokens from RPCs during forwarding if crossing datacenters (#7419 ) Fixes #7414	2020-03-10 11:15:22 -05:00
Kyle Havlovitz	955ee64b95	Merge pull request #7373 from hashicorp/acl-segments-fix Add stub methods for ACL/segment bug fix from enterprise	2020-03-09 14:25:49 -07:00
R.B. Boyer	6adad71125	wan federation via mesh gateways (#6884 ) This is like a Möbius strip of code due to the fact that low-level components (serf/memberlist) are connected to high-level components (the catalog and mesh-gateways) in a twisty maze of references which make it hard to dive into. With that in mind here's a high level summary of what you'll find in the patch: There are several distinct chunks of code that are affected: * new flags and config options for the server * retry join WAN is slightly different * retry join code is shared to discover primary mesh gateways from secondary datacenters * because retry join logic runs in the agent and the results of that operation for primary mesh gateways are needed in the server there are some methods like `RefreshPrimaryGatewayFallbackAddresses` that must occur at multiple layers of abstraction just to pass the data down to the right layer. * new cache type `FederationStateListMeshGatewaysName` for use in `proxycfg/xds` layers * the function signature for RPC dialing picked up a new required field (the node name of the destination) * several new RPCs for manipulating a FederationState object: `FederationState:{Apply,Get,List,ListMeshGateways}` * 3 read-only internal APIs for debugging use to invoke those RPCs from curl * raft and fsm changes to persist these FederationStates * replication for FederationStates as they are canonically stored in the Primary and replicated to the Secondaries. * a special derivative of anti-entropy that runs in secondaries to snapshot their local mesh gateway `CheckServiceNodes` and sync them into their upstream FederationState in the primary (this works in conjunction with the replication to distribute addresses for all mesh gateways in all DCs to all other DCs) * a "gateway locator" convenience object to make use of this data to choose the addresses of gateways to use for any given RPC or gossip operation to a remote DC. This gets data from the "retry join" logic in the agent and also directly calls into the FSM. * RPC (`:8300`) on the server sniffs the first byte of a new connection to determine if it's actually doing native TLS. If so it checks the ALPN header for protocol determination (just like how the existing system uses the type-byte marker). * 2 new kinds of protocols are exclusively decoded via this native TLS mechanism: one for ferrying "packet" operations (udp-like) from the gossip layer and one for "stream" operations (tcp-like). The packet operations re-use sockets (using length-prefixing) to cut down on TLS re-negotiation overhead. * the server instances specially wrap the `memberlist.NetTransport` when running with gateway federation enabled (in a `wanfed.Transport`). The general gist is that if it tries to dial a node in the SAME datacenter (deduced by looking at the suffix of the node name) there is no change. If dialing a DIFFERENT datacenter it is wrapped up in a TLS+ALPN blob and sent through some mesh gateways to eventually end up in a server's :8300 port. * a new flag when launching a mesh gateway via `consul connect envoy` to indicate that the servers are to be exposed. This sets a special service meta when registering the gateway into the catalog. * `proxycfg/xds` notice this metadata blob to activate additional watches for the FederationState objects as well as the location of all of the consul servers in that datacenter. * `xds:` if the extra metadata is in place additional clusters are defined in a DC to bulk sink all traffic to another DC's gateways. For the current datacenter we listen on a wildcard name (`server.<dc>.consul`) that load balances all servers as well as one mini-cluster per node (`<node>.server.<dc>.consul`) * the `consul tls cert create` command got a new flag (`-node`) to help create an additional SAN in certs that can be used with this flavor of federation.	2020-03-09 15:59:02 -05:00
Matt Keeler	e3891db55b	Gather instance counts of aggregated services (#7415 )	2020-03-09 11:56:19 -04:00
Pierre Souchay	864f7efffa	agent: configuration reload preserves check's statuses for services (#7345 ) This fixes issue #7318 Between versions 1.5.2 and 1.5.3, a regression has been introduced regarding health of services. A patch #6144 had been issued for HealthChecks of nodes, but not for healthchecks of services. What happened when a reload was: 1. save all healthcheck statuses 2. cleanup everything 3. add new services with healthchecks In step 3, the state of healthchecks was taken into account locally, so at step 3, but since we cleaned up at step 2, state was lost. This PR introduces the snap parameter, so step 3 can use information from step 1	2020-03-09 12:59:41 +01:00
Hans Hasselberg	c46e2ae59b	docs: add docs for kv_max_value_size (#7405 ) Apart from the added docs, the error messages are similar now and are pointing to the corresponding options. Fixes #6708.	2020-03-09 11:13:40 +01:00
Kim Ngo	a8f4123d37	agent/txn_endpoint: configure max txn request length (#7388 ) configure max transaction size separately from kv limit	2020-03-05 15:42:37 -06:00
Matt Keeler	7584dfe8c8	Fix session backwards incompatibility with 1.6.x and earlier.	2020-03-05 15:34:55 -05:00
John Cowen	e83fb1882c	Adds http_config.response_headers to the UI headers plus tests (#7369 )	2020-03-03 13:18:35 +00:00
Pierre Souchay	2300e2d4ba	agent: take Prometheus MIME-type header into account (#7371 ) This will avoid adding format=prometheus in request and to parse more easily metrics using Prometheus. This commit follows https://github.com/hashicorp/consul/pull/6514 as the PR has been closed and extends it by accepting old Prometheus mime-type.	2020-03-03 14:18:19 +01:00
Kyle Havlovitz	7c57837908	Add stub methods for ACL/segment bug fix from enterprise	2020-03-02 10:30:23 -08:00
Hans Hasselberg	e05ac57e8f	tls: support tls 1.3 (#7325 )	2020-02-19 23:22:31 +01:00
Matt Keeler	861f754dad	Properly detect no alt domain set (#7323 )	2020-02-19 14:41:43 -05:00
Matt Keeler	4c9577678e	xDS Mesh Gateway Resolver Subset Fixes (#7294 ) * xDS Mesh Gateway Resolver Subset Fixes The first fix was that clusters were being generated for every service resolver subset regardless of there being any service instances of the associated service in that dc. The previous logic didn’t care at all but now it will omit generating those clusters unless we also have service instances that should be proxied. The second fix was to respect the DefaultSubset of a service resolver so that mesh-gateways would configure the endpoints of the unnamed subset cluster to only those endpoints matched by the default subsets filters. * Refactor the gateway endpoint generation to be a little easier to read	2020-02-19 11:57:55 -05:00
rerorero	2630a949f7	fix: Destroying a session that doesn't exist returns status cod… (#6905 ) fix #6840	2020-02-18 11:13:15 -05:00
Wim	3a2c865ff6	Fix high cpu usage with IPv6 recursor address. Closes #6120 (#6128 )	2020-02-18 11:09:11 -05:00
Chris Piraino	47ff532735	Fixes envoy config when both RetryOn* values are set (#7280 )	2020-02-18 09:25:47 -06:00

... 2 3 4 5 6 ...

2164 Commits