consul

Commit Graph

Author	SHA1	Message	Date
Aestek	4afbe792df	Improve blocking queries on services that do not exist (#4810) ## Background When making a blocking query on a missing service (was never registered, or is not registered anymore) the query returns as soon as any service is updated. On clusters with frequent updates (5~10 updates/s in our DCs) these queries virtually do not block, and clients with no protections against this waste resources on the agent and server side. Clients that do protect against this get updates later than they should because of the backoff time they implement between requests. ## Implementation While reducing the number of unnecessary updates we still want: * Clients to be notified as soon as the last instance of a service disappears. * Clients to be notified whenever there is an update for the service. * Clients to be notified as soon as the first instance of the requested service is added. To reduce the number of unnecessary updates we need to block when a request to a missing service is made. However, in the following case: 1. Client `client1` makes a query for service `foo`, gets back a node and X-Consul-Index 42 2. `foo` is unregistered 3. `client1` makes a query for `foo` with `index=42` -> `foo` does not exist, the query blocks and `client1` is not notified of the change on `foo` We could store the last raft index when each service was last alive to know whether we should block on the incoming query or not, but that list could grow indefinitely. We instead store the last raft index when a service was unregistered and use it when a query targets a service that does not exist. When a service `srv` is unregistered this "missing service index" is always greater than any X-Consul-Index held by the clients while `srv` was up, allowing us to immediately notify them. 1. Client `client1` makes a query for service `foo`, gets back a node and `X-Consul-Index: 42` 2. `foo` is unregistered, we set the "missing service index" to 43 3. `client1` makes a blocking query for `foo` with `index=42` -> `foo` does not exist, we check against the "missing service index" and return immediately with `X-Consul-Index: 43` 4. `client1` makes a blocking query for `foo` with `index=43` -> we block 5. Other changes happen in the cluster, but foo still doesn't exist and "missing service index" hasn't changed, the query is still blocked 6. `foo` is registered again on index 62 -> `foo` exists and its index is greater than 43, we unblock the query	2019-01-11 09:26:14 -05:00
Matt Keeler	1048f3d5e7	acl: Prevent tokens from deleting themselves (#5210) Fixes #4897 Also apparently token deletion could segfault in secondary DCs when attempting to delete nonexistent tokens. For that reason both checks are wrapped within the non-nil check.	2019-01-10 09:22:51 -05:00
Kyle Havlovitz	c07c5446a8	txn: clean up some state store/acl code	2019-01-09 11:59:23 -08:00
Pierre Souchay	ae7f88f995	Avoid infinite recursion in DNS lookups when resolving CNAMEs (#4918) * Avoid infinite recursion in DNS lookups when resolving CNAMEs This will avoid killing Consul when a Service.Address is using CNAME to a Consul CNAME that creates an infinite recursion. This will fix https://github.com/hashicorp/consul/issues/4907 * Use maxRecursionLevel = 3 to allow several recursions	2019-01-07 16:53:54 -05:00
Paul Banks	b29bc906ee	bugfix: use ServiceTags to generate cache key hash (#4987) * bugfix: use ServiceTags to generate cache key hash * update unit test * update * remote print log * Update .gitignore * Completely deprecate ServiceTag field internally for clarity * Add explicit test for CacheInfo cases	2019-01-07 21:30:47 +00:00
Kyle Havlovitz	995e728ea0	txn: fix an issue with querying nodes by name instead of ID	2018-12-12 12:46:33 -08:00
Pierre Souchay	f4dc8b42e0	[Travis][UnstableTests] Fixed unstable tests in travis (#5013) * [Travis][UnstableTests] Fixed unstable tests in travis as seen in https://travis-ci.org/hashicorp/consul/jobs/460824602 * Fixed unstable tests in https://travis-ci.org/hashicorp/consul/jobs/460857687	2018-12-12 12:09:42 -08:00
Kyle Havlovitz	67bac7a815	api: add support for new txn operations	2018-12-12 10:54:09 -08:00
Kyle Havlovitz	de4dbf583e	txn: add tests for RPC endpoint	2018-12-12 10:04:10 -08:00
Kyle Havlovitz	6a512e5c0f	txn: add ACL enforcement/validation to new txn ops	2018-12-12 10:04:10 -08:00
Kyle Havlovitz	9467067432	state: add tests for new txn ops	2018-12-12 10:04:10 -08:00
Kyle Havlovitz	7759e9ea8b	txn: add service operations	2018-12-12 10:04:10 -08:00
Kyle Havlovitz	ab58986ac3	txn: add node operations	2018-12-12 10:04:10 -08:00
Kyle Havlovitz	01e1b5b1df	txn: add pre-check operations to txn endpoint	2018-12-12 10:04:10 -08:00
Kyle Havlovitz	b371ea8783	Add check operations to transaction api	2018-12-12 10:04:10 -08:00
Kyle Havlovitz	4f2715d4e2	connect/ca: prevent blank CA config in snapshot This PR both prevents a blank CA config from being written out to a snapshot and allows Consul to gracefully recover from a snapshot with an invalid CA config. Fixes #4954.	2018-12-06 17:40:53 -08:00
R.B. Boyer	c1eccfd1db	agent: remove some stray fmt.Print* calls (#5015)	2018-11-29 09:45:51 -06:00
Pierre Souchay	c5ae9caa28	Fixed another list of unstable unit tests in travis (#4915) * Fixed another list of unstable unit tests in travis Fixed failing tests in https://travis-ci.org/hashicorp/consul/jobs/451357061 * Fixed another list of unstable unit tests in travis. Fixed failing tests in https://travis-ci.org/hashicorp/consul/jobs/451357061	2018-11-20 11:27:26 +00:00
Kyle Havlovitz	76f102a1e0	Merge pull request #4952 from hashicorp/test-version tests: Bump test server version to 1.4.0	2018-11-13 13:37:10 -08:00
R.B. Boyer	934fae659f	acl: add stub hooks to support some plumbing in enterprise (#4951)	2018-11-13 15:35:54 -06:00
Kyle Havlovitz	269354c61d	oss: bump test server version to 1.4.0	2018-11-13 13:13:26 -08:00
Aestek	4942e66440	Fix catalog tag filter backward compat (#4944) Fix catalog service node filtering (e.g. /v1/catalog/service/srv?tag=tag1) between agent version <=v1.2.3 and server >=v1.3.0. The new server version did not account for the old field when filtering, hence requests made from old agents were not tag-filtered.	2018-11-13 14:44:36 +00:00
Kyle Havlovitz	4a73a59d70	Merge pull request #4917 from hashicorp/replication-token-cleanup Use acl replication_token for connect	2018-11-12 09:12:54 -08:00
Kyle Havlovitz	972177071d	update non-voting server test to fix enterprise diff	2018-11-09 12:50:24 -08:00
Kyle Havlovitz	643bd13aed	oss: do a proper check-and-set on the CA roots/config fsm operation	2018-11-09 12:36:23 -08:00
R.B. Boyer	2afc2a3c3b	acl: fixes ACL replication for legacy tokens without AccessorIDs (#4885)	2018-11-07 07:59:44 -08:00
Kyle Havlovitz	e8dd89359a	agent: fix formatting	2018-11-07 02:16:03 -08:00
R.B. Boyer	9211d2701d	fix comment typos (#4890)	2018-11-02 12:00:39 -05:00
Kyle Havlovitz	8337e3d8c0	Merge pull request #4872 from hashicorp/node-snapshot-fix Node ID/datacenter snapshot fix	2018-10-31 15:51:07 -07:00
Matt Keeler	db2cf01406	Adds documentation for the new ACL APIs (#4851) * Update the ACL API docs * Add a CreateTime to the anon token Also require acl:read permissions at least to perform rule translation. Don’t want someone DoSing the system with an open endpoint that actually does a bit of work. * Fix one place where I was referring to id instead of AccessorID * Add godocs for the API package additions. * Minor updates: removed some extra commas and updated the acl intro paragraph * minor tweaks * Updated the language to be clearer * Updated the language to be clearer for policy page * I was also confused by that! Your updates are much clearer. Co-Authored-By: kaitlincarter-hc <43049322+kaitlincarter-hc@users.noreply.github.com> * Sounds much better. Co-Authored-By: kaitlincarter-hc <43049322+kaitlincarter-hc@users.noreply.github.com> * Updated sidebar layout and deprecated warning	2018-10-31 15:11:51 -07:00
Matt Keeler	f9cf0eb36e	Remaining ACL Unit Tests (#4852) * Add leader token upgrade test and fix various ACL enablement bugs * Update the leader ACL initialization tests. * Add StateStore ACL tests for ACLTokenSet and ACLTokenGetBy* functions * Advertise the agents acl support status with the agent/self endpoint. * Make batch token upsert CAS’able to prevent consistency issues with token auto-upgrade * Finish up the ACL state store token tests * Finish the ACL state store unit tests Also rename some things to make them more consistent. * Do as much ACL replication testing as I can.	2018-10-31 13:00:46 -07:00
Kyle Havlovitz	bd6d0e598f	fsm: update snapshot/restore test to include ID and datacenter	2018-10-30 15:53:14 -07:00
Kyle Havlovitz	6483356329	fsm: add missing ID/datacenter to persistNodes	2018-10-30 15:52:54 -07:00
Matt Keeler	790cf90ee5	Fix the NonVoter Bootstrap test (#4786)	2018-10-24 10:23:50 -04:00
Kyle Havlovitz	819566f6b7	fsm: add Intention operations to transactions for internal use	2018-10-19 10:02:28 -07:00
Matt Keeler	34b53e7099	A few misc fixes found by go vet	2018-10-19 12:28:36 -04:00
Matt Keeler	18b29c45c4	New ACLs (#4791) This PR is almost a complete rewrite of the ACL system within Consul. It brings the features more in line with other HashiCorp products. Obviously there is quite a bit left to do here but most of it is related to docs, testing and finishing the last few commands in the CLI. I will update the PR description and check off the todos as I finish them over the next few days/week. Description At a high level this PR is mainly to split ACL tokens from Policies and to split the concepts of Authorization from Identities. A lot of this PR is mostly just to support CRUD operations on ACLTokens and ACLPolicies. These in and of themselves are not particularly interesting. The bigger conceptual changes are in how tokens get resolved, how backwards compatibility is handled and the separation of policy from identity which could lead the way to allowing for alternative identity providers. On the surface and with a new cluster the ACL system will look very similar to that of Nomad. Both have tokens and policies. Both have local tokens. The ACL management APIs for both are very similar. I even ripped off Nomad's ACL bootstrap resetting procedure. There are a few key differences though. Nomad requires token and policy replication where Consul only requires policy replication with token replication being opt-in. In Consul local tokens only work with token replication being enabled though. All policies in Nomad are globally applicable. In Consul all policies are stored and replicated globally but can be scoped to a subset of the datacenters. This allows for more granular access management. Unlike Nomad, Consul has legacy baggage in the form of the original ACL system. The ramifications of this are: A server running the new system must still support other clients using the legacy system. A client running the new system must be able to use the legacy RPCs when the servers in its datacenter are running the legacy system. The primary ACL DC's servers running in legacy mode need to be a gate that keeps everything else in the entire multi-DC cluster running in legacy mode. So not only does this PR implement the new ACL system but has a legacy mode built in for when the cluster isn't ready for new ACLs. Also detecting that new ACLs can be used is automatic and requires no configuration on the part of administrators. This process is detailed more in the "Transitioning from Legacy to New ACL Mode" section below.	2018-10-19 12:04:07 -04:00
Pierre Souchay	fab55bee2b	dns: implements prefix lookups for DNS TTL (#4605) This will fix https://github.com/hashicorp/consul/issues/4509 and allow for instance lb-* to match services lb-001 or lb-service-007.	2018-10-19 08:41:04 -07:00
Kyle Havlovitz	c617326470	re-add Connect multi-dc config changes This reverts commit `8bcfbaffb6`.	2018-10-19 08:41:03 -07:00
Jack Pearkes	8bcfbaffb6	Revert "Connect multi-dc config" (#4784)	2018-10-11 17:32:45 +01:00
Aestek	25f04fbd21	[Security] Add finer control over script checks (#4715) * Add -enable-local-script-checks options These options allow for finer control over when script checks are enabled by giving the option to only allow them when they are declared from the local file system. * Add documentation for the new option * Nitpick doc wording	2018-10-11 13:22:11 +01:00
Rebecca Zanzig	34e5516834	Support multiple tags for health and catalog http api endpoints (#4717) * Support multiple tags for health and catalog api endpoints Fixes #1781. Adds a `ServiceTags` field to the ServiceSpecificRequest to support multiple tags, updates the filter logic in the catalog store, and propagates these changes through to the health and catalog endpoints. Note: Leaves `ServiceTag` in the struct, since it is being used as part of the DNS lookup, which in turn uses the health check. * Update the api package to support multiple tags Includes additional tests. * Update new tests to use the `require` library * Update HealthConnect check after a bad merge	2018-10-11 12:50:05 +01:00
Pierre Souchay	51b33ef015	[Performance On Large clusters] Reduce updates on large services (#4720) * [Performance On Large clusters] Checks do update services/nodes only when really modified to avoid too many updates on very large clusters In a large cluster, when having a few thousands of nodes, the anti-entropy mechanism performs lots of changes (several per second) while there is no real change. This patch wants to improve this in order to increase Consul scalability when using many blocking requests on health for instance. * [Performance for large clusters] Only updates index of service if service is really modified * [Performance for large clusters] Only updates index of nodes if node is really modified * Added comments / ensure IsSame() has clear semantics * Avoid having modified boolean, return nil directly if structures are Same * Fixed unstable unit tests TestLeader_ChangeServerID * Rewrite TestNode_IsSame() for better readability as suggested by @banks * Rename ServiceNode.IsSame() into IsSameService() + added unit tests * Do not duplicate TestStructs_ServiceNode_Conversions() and increase test coverage of IsSameService * Clearer documentation in IsSameService * Take into account ServiceProxy into ServiceNode.IsSameService() * Fixed IsSameService() with all new structures	2018-10-11 12:42:39 +01:00
Pierre Souchay	251156eb68	Added SOA configuration for DNS settings. (#4714) This allows fine-tuning the SOA settings sent by Consul in DNS responses, for instance to be able to control negative ttl. Will fix: https://github.com/hashicorp/consul/issues/4713 # Example Override all settings: * min_ttl: 0 => 60s * retry: 600 (10m) => 300s (5 minutes), * expire: 86400 (24h) => 43200 (12h) * refresh: 3600 (1h) => 1800 (30 minutes) ``` consul agent -dev -hcl 'dns_config={soa={min_ttl=60,retry=300,expire=43200,refresh=1800}}' ``` Result: ``` dig +multiline @localhost -p 8600 service.consul ; <<>> DiG 9.12.1 <<>> +multiline @localhost -p 8600 service.consul ; (2 servers found) ;; global options: +cmd ;; Got answer: ;; ->>HEADER<<- opcode: QUERY, status: NXDOMAIN, id: 36557 ;; flags: qr aa rd; QUERY: 1, ANSWER: 0, AUTHORITY: 1, ADDITIONAL: 1 ;; WARNING: recursion requested but not available ;; OPT PSEUDOSECTION: ; EDNS: version: 0, flags:; udp: 4096 ;; QUESTION SECTION: ;service.consul. IN A ;; AUTHORITY SECTION: consul. 0 IN SOA ns.consul. hostmaster.consul. ( 1537959133 ; serial 1800 ; refresh (30 minutes) 300 ; retry (5 minutes) 43200 ; expire (12 hours) 60 ; minimum (1 minute) ) ;; Query time: 4 msec ;; SERVER: 127.0.0.1#8600(127.0.0.1) ;; WHEN: Wed Sep 26 12:52:13 CEST 2018 ;; MSG SIZE rcvd: 93 ```	2018-10-10 15:50:56 -04:00
Kyle Havlovitz	e4349c5710	connect/ca: more OSS split for multi-dc	2018-10-10 12:17:59 -07:00
Kyle Havlovitz	0da4f2b2e8	connect/ca: split CA initialization logic between oss/enterprise	2018-10-10 12:17:59 -07:00
Kyle Havlovitz	56dc426227	agent: add primary_datacenter and connect replication config options	2018-10-10 12:17:59 -07:00
Kyle Havlovitz	98d95cfa80	connect: add ExternalTrustDomain to CARoot fields	2018-10-10 12:16:47 -07:00
Kyle Havlovitz	46c829b879	docs: deprecate acl_datacenter and replace it with primary_datacenter	2018-10-10 12:16:47 -07:00
Paul Banks	b83bbf248c	Add Proxy Upstreams to Service Definition (#4639) * Refactor Service Definition ProxyDestination. This includes: - Refactoring all internal structs used - Updated tests for both deprecated and new input for: - Agent Services endpoint response - Agent Service endpoint response - Agent Register endpoint - Unmanaged deprecated field - Unmanaged new fields - Managed deprecated upstreams - Managed new - Catalog Register - Unmanaged deprecated field - Unmanaged new fields - Managed deprecated upstreams - Managed new - Catalog Services endpoint response - Catalog Node endpoint response - Catalog Service endpoint response - Updated API tests for all of the above too (both deprecated and new forms of register) TODO: - config package changes for on-disk service definitions - proxy config endpoint - built-in proxy support for new fields * Agent proxy config endpoint updated with upstreams * Config file changes for upstreams. * Add upstream opaque config and update all tests to ensure it works everywhere. * Built in proxy working with new Upstreams config * Command fixes and deprecations * Fix key translation, upstream type defaults and a spate of other subtle bugs found with end to end test scripts... TODO: tests still failing on one case that needs a fix. I think it's key translation for upstreams nested in Managed proxy struct. * Fix translated keys in API registration. * Fixes from docs - omit some empty undocumented fields in API - Bring back ServiceProxyDestination in Catalog responses to not break backwards compat - this was removed assuming it was only used internally. * Documentation updates for Upstreams in service definition * Fixes for tests broken by many refactors. * Enable travis on f-connect branch in this branch too. * Add consistent Deprecation comments to ProxyDestination uses * Update version number on deprecation notices, and correct upstream datacenter field with explanation in docs	2018-10-10 16:55:34 +01:00
Alex Dadgar	43d0f96c42	do not bootstrap with non voters	2018-09-19 17:41:36 -07:00
Kyle Havlovitz	d515d25856	Merge pull request #4644 from hashicorp/ca-refactor connect/ca: rework initialization/root generation in providers	2018-09-13 13:08:34 -07:00
Paul Banks	74f2a80a42	Fix CA pruning when CA config uses string durations. (#4669) * Fix CA pruning when CA config uses string durations. The tl;dr here is: - Configuring LeafCertTTL with a string like "72h" is how we do it by default and should be supported - Most of our tests managed to escape this by defining them as time.Duration directly - Our actual default value is a string - Since this is stored in a map[string]interface{} config, when it is written to Raft it goes through a msgpack encode/decode cycle (even though it's written from server not over RPC). - msgpack decode leaves the string as a `[]uint8` - Some of our parsers required string and failed - So after 1 hour, a default configured server would throw an error about pruning old CAs - If a new CA was configured that set LeafCertTTL as a time.Duration, things might be OK after that, but if a new CA was just configured from config file, initialization would cause same issue but always fail still so would never prune the old CA. - Mostly this is just a janky error that got past tests due to many levels of complicated encoding/decoding. tl;dr of the tl;dr: Yay for type safety. Map[string]interface{} combined with msgpack always goes wrong but we somehow get bitten every time in a new way :D We already fixed this once! The main CA config had the same problem so @kyhavlov already wrote the mapstructure DecodeHook that fixes it. It wasn't used in several places it needed to be and one of those is now in `structs` which caused a dependency cycle so I've moved them. This adds a whole new test that explicitly tests the case that broke here. It also adds tests that would have failed in other places before (Consul and Vault provider parsing functions). I'm not sure if they would ever be affected as it is now as we've not seen things broken with them but it seems better to explicitly test that and support it to not be bitten a third time! * Typo fix * Fix bad Uint8 usage	2018-09-13 15:43:00 +01:00
Pierre Souchay	1a906ef34e	Fix more unstable tests in agent and command	2018-09-12 14:49:27 +01:00
Kyle Havlovitz	c112a72880	connect/ca: some cleanup and reorganizing of the new methods	2018-09-11 16:43:04 -07:00
Pierre Souchay	22500f242e	Fix unstable tests in agent, api, and command/watch	2018-09-10 16:58:53 +01:00
Pierre Souchay	eddcf228ea	Implementation of Weights Data structures (#4468) * Implementation of Weights Data structures Adding this data structure will allow us to resolve the issues #1088 and #4198 This new structure defaults to values: ``` { Passing: 1, Warning: 0 } ``` Which means, use weight of 0 for a Service in Warning State while use Weight 1 for a Healthy Service. Thus it remains compatible with previous Consul versions. * Implemented weights for DNS SRV Records * DNS properly support agents with weight support while server does not (backwards compatibility) * Use Warning value of Weights of 1 by default When using DNS interface with only_passing = false, all nodes with non-Critical healthcheck used to have a weight value of 1. While having weight.Warning = 0 as default value, this is probably a bad idea as it breaks ascending compatibility. Thus, we put a default value of 1 to be consistent with existing behaviour. * Added documentation for new weight field in service description * Better documentation about weights as suggested by @banks * Return weight = 1 for unknown Check states as suggested by @banks * Fixed typo (of -> or) in error message as requested by @mkeeler * Fixed unstable unit test TestRetryJoin * Fixed unstable tests * Fixed wrong Fatalf format in `testrpc/wait.go` * Added notes regarding DNS SRV lookup limitations regarding number of instances * Documentation fixes and clarification regarding SRV records with weights as requested by @banks * Rephrase docs	2018-09-07 15:30:47 +01:00
Kyle Havlovitz	546bdf8663	connect/ca: add Configure/GenerateRoot to provider interface	2018-09-06 19:18:59 -07:00
Pierre Souchay	9a2ae6e8eb	Fixed more flaky tests in ./agent/consul (#4617)	2018-09-04 14:02:47 +01:00
Freddy	d7a404f2ee	Bugfix: Use "%#v" when formatting structs (#4600)	2018-08-28 12:37:34 -04:00
Pierre Souchay	b898131723	[BUGFIX] Avoid returning empty data on startup of a non-leader server (#4554) Ensure that DB is properly initialized when performing stale queries Addresses: - https://github.com/hashicorp/consul-replicate/issues/82 - https://github.com/hashicorp/consul/issues/3975 - https://github.com/hashicorp/consul-template/issues/1131	2018-08-23 12:06:39 -04:00
Kyle Havlovitz	e5e1f867e5	Merge branch 'master' into ca-snapshot-fix	2018-08-16 13:00:54 -07:00
Kyle Havlovitz	f186edc42c	fsm: add connect service config to snapshot/restore test	2018-08-16 12:58:54 -07:00
nickmy9729	beddf03b26	Added code to allow snapshot inclusion of NodeMeta (#4527)	2018-08-16 15:33:35 -04:00
Kyle Havlovitz	b51d76f469	fsm: add missing CA config to snapshot/restore logic	2018-08-16 11:58:50 -07:00
Kyle Havlovitz	4b35d877ca	autopilot: don't follow the normal server removal rules for nonvoters	2018-08-14 14:24:51 -07:00
Kyle Havlovitz	ea14482376	Fix stats fetcher healthcheck RPCs not being independent	2018-08-14 14:23:52 -07:00
Pierre Souchay	0d6de257a2	Display more information about check being not properly added when it fails (#4405) * Display more information about check being not properly added when it fails It follows an incident where we had lots of error messages: [WARN] consul.fsm: EnsureRegistration failed: failed inserting check: Missing service registration That seems related to Consul failing to restart on respective agents. Having Node information as well as service information would help diagnose the issue. * Renamed ensureCheckIfNodeMatches() as requested by @banks	2018-08-14 17:45:33 +01:00
Pierre Souchay	ef3b81ab13	Allow to rename nodes with IDs, will fix #3974 and #4413 (#4415) * Allow to rename nodes with IDs, will fix #3974 and #4413 This change allows any well-behaving recent agent with an ID to be renamed safely, i.e. without taking the name of another one with case insensitive comparison. Deprecated behaviour warning ---------------------------- Due to ascending compatibility, it is still possible however to "take" the name of another node by not providing any ID. Note that when not providing any ID, it is possible to have 2 nodes having similar names with case differences, i.e. myNode and mynode, which might lead to DB corruption on Consul server side and lead to server not properly restarting. See #3983 and #4399 for Context about this change. Disabling registration of nodes without IDs as specified in #4414 should probably be the way to go eventually. * Removed the case-insensitive search when adding a node within the else block since it breaks the test TestAgentAntiEntropy_Services While the else case is probably legit, it will be fixed with #4414 in a later release. * Added again the test in the else to avoid duplicated names, but enforce this test only for nodes having IDs. Thus most tests without any ID will work, and allows us fixing * Added more tests regarding request with/without IDs. `TestStateStore_EnsureNode` now test registration and renaming with IDs `TestStateStore_EnsureNodeDeprecated` tests registration without IDs and tests removing an ID from a node as well as updated a node without its ID (deprecated behaviour kept for backwards compatibility) * Do not allow renaming in case of conflict, including when other node has no ID * Fixed function GetNodeID that was not working due to wrong type when searching node from its ID Thus, all tests about renaming were not working properly. Added the full test case that allowed me to detect it. * Better error messages, more tests when nodeID is not a valid UUID in GetNodeID() * Added separate TestStateStore_GetNodeID to test GetNodeID. More complete test coverage for GetNodeID * Added new unit test `TestStateStore_ensureNoNodeWithSimilarNameTxn` Also fixed comments to be clearer after remarks from @banks * Fixed error message in unit test to match test case * Use uuid.ParseUUID to parse Node.ID as requested by @mkeeler	2018-08-10 11:30:45 -04:00
Siva Prasad	c88900aaa9	PR to fix TestAgent_IndexChurn and TestPreparedQuery_Wrapper. (#4512) * Fixes TestAgent_IndexChurn * Fixes TestPreparedQuery_Wrapper * Increased sleep in agent_test for IndexChurn to 500ms * Made the comment about joinWAN operation much less of a cliffhanger	2018-08-09 12:40:07 -04:00
Armon Dadgar	4f1fd34e9e	consul: Update buffer sizes	2018-08-08 10:26:58 -07:00
Siva Prasad	288d350a73	Revert "CA initialization while bootstrapping and TestLeader_ChangeServerID fix." (#4497) * Revert "BUGFIX: Unit test relying on WaitForLeader() did not work due to wrong test (#4472)" This reverts commit `cec5d72396`. * Revert "CA initialization while bootstrapping and TestLeader_ChangeServerID fix. (#4493)" This reverts commit `589b589b53`.	2018-08-07 08:29:48 -04:00
Pierre Souchay	cec5d72396	BUGFIX: Unit test relying on WaitForLeader() did not work due to wrong test (#4472) - Improve resilience of testrpc.WaitForLeader() - Add additional retry to CI - Increase "go test" timeout to 8m - Add wait for cluster leader to several tests in the agent package - Add retry to some tests in the api and command packages	2018-08-06 19:46:09 -04:00
Siva Prasad	589b589b53	CA initialization while bootstrapping and TestLeader_ChangeServerID fix. (#4493) * connect: fix an issue with Consul CA bootstrapping being interrupted * streamline change server id test	2018-08-06 16:15:24 -04:00
Kyle Havlovitz	fa0d8aff33	fix inconsistency in TestConnectCAConfig_GetSet	2018-07-26 07:46:47 -07:00
Kyle Havlovitz	ed87949385	Merge pull request #4400 from hashicorp/leaf-cert-ttl Add configurable leaf cert TTL to Connect CA	2018-07-25 17:53:25 -07:00
Paul Banks	8cbeb29e73	Fixes #4421: General solution to stop blocking queries with index 0 (#4437) * Fix theoretical cache collision bug if/when we use more cache types with same result type * Generalized fix for blocking query handling when state store methods return zero index * Refactor test retry to only affect CI * Undo make file merge * Add hint to error message returned to end-user requests if Connect is not enabled when they try to request cert * Explicit error for Roots endpoint if connect is disabled * Fix tests that were asserting old behaviour	2018-07-25 20:26:27 +01:00
Kyle Havlovitz	ce10de036e	connect/ca: check LeafCertTTL when rotating expired roots	2018-07-20 16:04:04 -07:00
Kyle Havlovitz	d6ca015a42	connect/ca: add configurable leaf cert TTL	2018-07-16 13:33:37 -07:00
Matt Keeler	63d5c069fc	Merge pull request #4379 from hashicorp/persist-intermediates connect: persist intermediate CAs on leader change	2018-07-12 12:09:13 -04:00
Matt Keeler	0e83059d1f	Revert "Allow changing Node names since Node now have IDs"	2018-07-12 11:19:21 -04:00
Matt Keeler	91150cca59	Fixup formatting	2018-07-12 10:14:26 -04:00
Matt Keeler	3807e04de9	Revert PR 4294 - Catalog Register: Generate UUID for services registered without one UUID auto-generation here causes trouble in a few cases. The biggest being older nodes reregistering will fail when the UUIDs are different and the names match This reverts commit `0f70034082`. This reverts commit `d1a8f9cb3f`. This reverts commit `cf69ec42a4`.	2018-07-12 10:06:50 -04:00
Kyle Havlovitz	f95c6807e7	connect: use reflect.DeepEqual instead for test	2018-07-11 13:10:58 -07:00
Matt Keeler	98ead2a8f8	Merge pull request #3983 from pierresouchay/node_renaming Allow changing Node names since Node now have IDs	2018-07-11 16:03:02 -04:00
Kyle Havlovitz	4e5fb6bc19	connect: add provider state to snapshots	2018-07-11 11:34:49 -07:00
Kyle Havlovitz	462ace4867	connect: update leader initializeCA comment	2018-07-11 10:00:42 -07:00
Kyle Havlovitz	1d3f4b5099	connect: persist intermediate CAs on leader change	2018-07-11 09:44:30 -07:00
Pierre Souchay	fecae3de21	When renaming a node, ensure the name is not taken by another node. Since DNS is case insensitive and the DB has issues when similar names with different cases are added, check for uniqueness based on case insensitivity. Following another big incident we had in our cluster, we also validate that adding/renaming a node does not conflict with case insensitive matches. We had the following error once: - one node called: mymachine.MYDC.mydomain was shut off - another node (different ID) was added with name: mymachine.mydc.mydomain before 72 hours When restarting the consul server of domain, the restarted consul server failed to start since it detected an issue in RAFT database because mymachine.MYDC.mydomain and mymachine.mydc.mydomain had the same names. Checking at registration time with case insensitivity should definitely fix those issues and avoid Consul DB corruption.	2018-07-11 14:42:54 +02:00
Matt Keeler	d19c7d8882	Merge pull request #4303 from pierresouchay/non_blocking_acl Only send one single ACL cache refresh across network when TTL is over	2018-07-10 08:57:33 -04:00
MagnumOpus21	300330e24b	Agent/Proxy: Formatting and test cases fix	2018-07-09 12:46:10 -04:00
Kyle Havlovitz	401b206a2e	Store the time CARoot is rotated out instead of when to prune	2018-07-06 16:05:25 -07:00
Kyle Havlovitz	1492243e0a	connect/ca: add logic for pruning old stale RootCA entries	2018-07-02 10:35:05 -07:00
Pierre Souchay	bd023f352e	Updated switch case to use same branch for async-cache and extend-cache	2018-07-02 17:39:34 +02:00
Pierre Souchay	1e7665c0d5	Updated documentation and adding more test case for async-cache	2018-07-01 23:50:30 +02:00
Pierre Souchay	abde81a3e7	Added async-cache with similar behaviour as extend-cache but asynchronously	2018-07-01 23:50:30 +02:00
Pierre Souchay	9406ca1c95	Only send one single ACL cache refresh across network when TTL is over It will allow the following: * when connectivity is limited (saturated links between DCs), only one single request to refresh an ACL will be sent to ACL master DC instead of stacking ACL refresh queries * when extend-cache is used for ACL, do not wait for result, but refresh the ACL asynchronously, so no delay impacts the slave DC * When extend-cache is not used, keep the existing blocking mechanism, but only send a single refresh request. This will fix https://github.com/hashicorp/consul/issues/3524	2018-07-01 23:50:30 +02:00
Matt Keeler	22b7b688a3	Move starting enterprise functionality	2018-06-29 17:38:29 -04:00
Matt Keeler	0f70034082	Move default uuid test into the consul package	2018-06-27 09:21:58 -04:00
Matt Keeler	d1a8f9cb3f	go fmt changes	2018-06-27 09:07:22 -04:00
Matt Keeler	cf69ec42a4	Make sure to generate UUIDs when services are registered without one This makes the behavior line up with the docs and expected behavior	2018-06-26 17:04:08 -04:00
mkeeler	6813a99081	Merge remote-tracking branch 'connect/f-connect'	2018-06-25 19:42:51 +00:00
Kyle Havlovitz	3baa67cdef	connect/ca: pull the cluster ID from config during a rotation	2018-06-25 12:25:42 -07:00
Kyle Havlovitz	b4ef7bb64d	connect/ca: leave blank root key/cert out of the default config (unnecessary)	2018-06-25 12:25:42 -07:00
Kyle Havlovitz	050da22473	connect/ca: undo the interface changes and use sign-self-issued in Vault	2018-06-25 12:25:42 -07:00
Kyle Havlovitz	bc997688e3	connect/ca: update Consul provider to use new cross-sign CSR method	2018-06-25 12:25:41 -07:00
Kyle Havlovitz	226a59215d	connect/ca: fix vault provider URI SANs and test	2018-06-25 12:25:41 -07:00
Kyle Havlovitz	1a8ac686b2	connect/ca: add the Vault CA provider	2018-06-25 12:25:41 -07:00
Paul Banks	e33bfe249e	Note leadership issues in comments	2018-06-25 12:25:41 -07:00
Paul Banks	e514570dfa	Actually return Intermediate certificates bundled with a leaf!	2018-06-25 12:25:40 -07:00
Paul Banks	2e223ea2b7	Fix hot loop in cache for RPC returning zero index.	2018-06-25 12:25:37 -07:00
Paul Banks	05a8097c5d	Fix misc test failures (some from other PRs)	2018-06-25 12:25:13 -07:00
Paul Banks	382ce8f98a	Only set precedence on write path	2018-06-25 12:25:13 -07:00
Paul Banks	4a54f8f7e3	Fix some test failures caused by the sorting change and some caused by previous UpdatePrecedence() change	2018-06-25 12:25:13 -07:00
Paul Banks	bf7a62e0e0	Sort intention list by precedence	2018-06-25 12:25:13 -07:00
Kyle Havlovitz	edbeeeb23c	agent: update accepted CA config fields and defaults	2018-06-25 12:25:09 -07:00
Mitchell Hashimoto	028aa78e83	agent/consul: set precedence value on struct itself	2018-06-25 12:24:16 -07:00
Mitchell Hashimoto	daf46c9cfa	agent/consul: support a Connect option on prepared query request	2018-06-25 12:24:12 -07:00
Mitchell Hashimoto	440b1b2d97	agent/consul: prepared query supports "Connect" field	2018-06-25 12:24:11 -07:00
Mitchell Hashimoto	1830c6b308	agent: switch ConnectNative to an embedded struct	2018-06-25 12:24:10 -07:00
Mitchell Hashimoto	eb3fcb39b3	agent/consul/state: support querying by Connect native	2018-06-25 12:24:08 -07:00
Mitchell Hashimoto	d6a823ad0d	agent/consul: support catalog registration with Connect native	2018-06-25 12:24:07 -07:00
Matt Keeler	af910bda39	Merge pull request #4216 from hashicorp/rpc-limiting Make RPC limits reloadable	2018-06-20 09:05:28 -04:00
Mitchell Hashimoto	1906fe1c0d	agent: address feedback	2018-06-14 09:42:20 -07:00
Mitchell Hashimoto	0accfc1628	agent: rename test to check	2018-06-14 09:42:18 -07:00
Mitchell Hashimoto	2a29679e9d	agent/consul: forward request if necessary	2018-06-14 09:42:17 -07:00
Mitchell Hashimoto	54ac5adb08	agent: comments to point to differing logic	2018-06-14 09:42:17 -07:00
Mitchell Hashimoto	d68462fca6	agent/consul: implement Intention.Test endpoint	2018-06-14 09:42:17 -07:00
Paul Banks	f4b8e8c96d	Add default CA config back - I didn't add it and causes nil panics	2018-06-14 09:42:17 -07:00
Paul Banks	1228a5839a	Ooops remove the CA stuff from actual server defaults and make it test server only	2018-06-14 09:42:16 -07:00
Paul Banks	4aeab3897c	Fixed many tests after rebase. Some still failing and seem unrelated to any connect changes.	2018-06-14 09:42:16 -07:00
Paul Banks	b4803eca59	Generate CSR using real trust-domain	2018-06-14 09:42:16 -07:00
Paul Banks	622a475eb1	Add CSR signing verification of service ACL, trust domain and datacenter.	2018-06-14 09:42:16 -07:00
Paul Banks	c1f2025d96	Return TrustDomain from CARoots RPC	2018-06-14 09:42:15 -07:00
Kyle Havlovitz	e00088e8ee	Rename some of the CA structs/files	2018-06-14 09:42:15 -07:00
Kyle Havlovitz	6e9f1f8acb	Add more metadata to structs.CARoot	2018-06-14 09:42:15 -07:00
Kyle Havlovitz	627aa80d5a	Use provider state table for a global serial index	2018-06-14 09:42:15 -07:00
Kyle Havlovitz	de72834b8c	Move connect CA provider to separate package	2018-06-14 09:42:15 -07:00
Mitchell Hashimoto	bc605a1576	agent/consul: change provider wait from goto to a loop	2018-06-14 09:42:14 -07:00
Mitchell Hashimoto	c8b65217c3	agent/consul: check nil on getCAProvider result	2018-06-14 09:42:14 -07:00
Mitchell Hashimoto	9b3495dddb	agent/consul: retry reading provider a few times	2018-06-14 09:42:14 -07:00
Paul Banks	90c574ebaa	Wire up agent leaf endpoint to cache framework to support blocking.	2018-06-14 09:42:07 -07:00
Kyle Havlovitz	a4d18f0eaa	Fill out connect CA rpc endpoint tests	2018-06-14 09:42:06 -07:00
Kyle Havlovitz	cce7f1cca1	Add tests for the built in CA's state store table	2018-06-14 09:42:06 -07:00
Kyle Havlovitz	15fbc2fd97	Add more tests for built-in provider	2018-06-14 09:42:06 -07:00
Kyle Havlovitz	edcfdb37af	Fix some inconsistencies around the CA provider code	2018-06-14 09:42:06 -07:00
Kyle Havlovitz	daa8dd1779	Add CA config to connect section of agent config	2018-06-14 09:42:05 -07:00
Kyle Havlovitz	32d1eae28b	Move ConsulCAProviderConfig into structs package	2018-06-14 09:42:04 -07:00
Kyle Havlovitz	315b8bf594	Simplify the CAProvider.Sign method	2018-06-14 09:42:04 -07:00
Kyle Havlovitz	c6e1b72ccb	Simplify the CA provider interface by moving some logic out	2018-06-14 09:42:04 -07:00
Kyle Havlovitz	a325388939	Clarify some comments and names around CA bootstrapping	2018-06-14 09:42:04 -07:00
Kyle Havlovitz	33418afd3c	Add cross-signing mechanism to root rotation	2018-06-14 09:42:00 -07:00
Kyle Havlovitz	d83fbfc766	Add the root rotation mechanism to the CA config endpoint	2018-06-14 09:41:59 -07:00
Kyle Havlovitz	f9d92d795e	Have the built in CA store its state in raft	2018-06-14 09:41:59 -07:00
Kyle Havlovitz	30c1973e8b	Fix the testing endpoint's root set op	2018-06-14 09:41:59 -07:00
Kyle Havlovitz	ab737ef0f8	Hook the CA RPC endpoint into the provider interface	2018-06-14 09:41:59 -07:00
Kyle Havlovitz	1f6501895f	Add CA bootstrapping on establishing leadership	2018-06-14 09:41:59 -07:00
Kyle Havlovitz	682f105c7c	Add the bootstrap config for the CA	2018-06-14 09:41:59 -07:00
Kyle Havlovitz	1787f88618	Add CA config set to fsm operations	2018-06-14 09:41:58 -07:00
Kyle Havlovitz	6b3416e480	Add the Connect CA config to the state store	2018-06-14 09:41:58 -07:00
Paul Banks	730da74369	Fix various test failures and vet warnings. Intention de-duplication in previously merged PR actually failed some tests that were not caught by me or CI. I ran the test files for state changes but they happened not to trigger this case so I made sure they did first and then fixed. That fixed some upstream intention endpoint tests that I'd not run as part of testing the previous fix.	2018-06-14 09:41:58 -07:00
Paul Banks	88541bba17	Add tests all the way up through the endpoints to ensure duplicate src/destination is supported and so ultimately deny/allow nesting works. Also adds a sanity check test for `api.Agent().ConnectAuthorize()` and a fix for a trivial bug in it.	2018-06-14 09:41:57 -07:00
Paul Banks	ed9f07c361	Allow duplicate source or destination, but enforce uniqueness across all four.	2018-06-14 09:41:57 -07:00
Mitchell Hashimoto	845f7cd8ad	agent/consul/state: ensure exactly one active CA exists when setting	2018-06-14 09:41:54 -07:00
Mitchell Hashimoto	17ca8ad083	agent/connect: rename SpiffeID to CertURI	2018-06-14 09:41:53 -07:00
Mitchell Hashimoto	0cbcb07d61	agent/connect: use proper keyusage fields for CA and leaf	2018-06-14 09:41:53 -07:00
Mitchell Hashimoto	a54d1af421	agent/consul: encode issued cert serial number as hex encoded	2018-06-14 09:41:53 -07:00
Mitchell Hashimoto	63d674d07d	agent: /v1/connect/ca/configuration PUT for setting configuration	2018-06-14 09:41:52 -07:00
Mitchell Hashimoto	1c3dbc83ff	agent/consul/fsm,state: snapshot/restore for CA roots	2018-06-14 09:41:52 -07:00
Mitchell Hashimoto	90f423fd02	agent/consul/fsm,state: tests for CA root related changes	2018-06-14 09:41:52 -07:00
Mitchell Hashimoto	1c72639d60	agent/consul: set more fields on the issued cert	2018-06-14 09:41:52 -07:00
Mitchell Hashimoto	c2588262b7	agent: /v1/connect/ca/leaf/:service_id	2018-06-14 09:41:52 -07:00
Mitchell Hashimoto	e40afd6a73	agent/consul: CAS operations for setting the CA root	2018-06-14 09:41:51 -07:00
Mitchell Hashimoto	578db06600	agent/consul: tests for CA endpoints	2018-06-14 09:41:51 -07:00
Mitchell Hashimoto	891cd22ad9	agent/consul: key the public key of the CSR, verify in test	2018-06-14 09:41:51 -07:00
Mitchell Hashimoto	d768d5e9a7	agent/consul: test for ConnectCA.Sign	2018-06-14 09:41:51 -07:00
Mitchell Hashimoto	f4ec28bfe3	agent/consul: basic sign endpoint not tested yet	2018-06-14 09:41:51 -07:00
Mitchell Hashimoto	5a950190f3	agent/consul: RPC endpoints to list roots	2018-06-14 09:41:50 -07:00
Mitchell Hashimoto	130098b7b5	agent/consul/state: CARoot structs and initial state store	2018-06-14 09:41:49 -07:00
Mitchell Hashimoto	4d852e62a3	agent: address PR feedback	2018-06-14 09:41:49 -07:00
Mitchell Hashimoto	6313bc5615	agent: clarified a number of comments per PR feedback	2018-06-14 09:41:49 -07:00
Mitchell Hashimoto	353953fcd2	agent/consul: Health.ServiceNodes ACL check for Connect	2018-06-14 09:41:49 -07:00
Mitchell Hashimoto	b6c0cb7115	agent/consul: Catalog endpoint ACL requirements for Connect proxies	2018-06-14 09:41:49 -07:00
Mitchell Hashimoto	2feef5f7a3	agent/consul: require name for proxies	2018-06-14 09:41:48 -07:00
Mitchell Hashimoto	44ec8d94d2	agent: clean up connect/non-connect duplication by using shared methods	2018-06-14 09:41:48 -07:00
Mitchell Hashimoto	7d79f9c46f	agent/consul: implement Health.ServiceNodes for Connect, DNS works	2018-06-14 09:41:47 -07:00
Mitchell Hashimoto	e01914a025	agent/consul: Catalog.ServiceNodes supports Connect filtering	2018-06-14 09:41:47 -07:00
Mitchell Hashimoto	2062e37270	agent/consul/state: ConnectServiceNodes	2018-06-14 09:41:47 -07:00
Mitchell Hashimoto	7ed26e2c64	agent/consul: enforce ACL on ProxyDestination	2018-06-14 09:41:47 -07:00
Mitchell Hashimoto	0c0c0a58e7	agent/consul: proxy registration and tests	2018-06-14 09:41:46 -07:00
Mitchell Hashimoto	4d4a8443e8	agent: test /v1/catalog/node/:node to list connect proxies	2018-06-14 09:41:46 -07:00
Mitchell Hashimoto	6e257ea51c	agent: /v1/catalog/service/:service works with proxies	2018-06-14 09:41:46 -07:00
Mitchell Hashimoto	63e4a35827	agent/consul/state: convert proxy test to testify/assert	2018-06-14 09:41:46 -07:00
Mitchell Hashimoto	21c6fc623a	agent/consul/state: service registration with proxy works	2018-06-14 09:41:46 -07:00
Mitchell Hashimoto	a621afe72c	agent/consul: convert intention ACLs to testify/assert	2018-06-14 09:41:46 -07:00
Mitchell Hashimoto	9dc8aa0fb3	agent/consul,structs: add tests for ACL filter and prefix for intentions	2018-06-14 09:41:45 -07:00
Mitchell Hashimoto	5ac649af7f	agent/consul: Intention.Match ACLs	2018-06-14 09:41:45 -07:00
Mitchell Hashimoto	4d87601bf4	agent/consul: Intention.Get ACLs	2018-06-14 09:41:45 -07:00
Mitchell Hashimoto	9bbbb73734	agent/consul: Intention.Apply ACL on rename	2018-06-14 09:41:45 -07:00
Mitchell Hashimoto	01b644e213	agent/consul: tests for ACLs on Intention.Apply update/delete	2018-06-14 09:41:45 -07:00

601 Commits