consul

Commit Graph

Author	SHA1	Message	Date
Frank Schröder	94f58199b1	agent: add option to discard health output (#3562 ) * agent: add option to discard health output In high volatile environments consul will have checks with "noisy" output which changes every time even though the status does not change. Since the output is stored in the raft log every health check update unblocks a blocking call on health checks since the raft index has changed even though the status of the health checks may not have changed at all. By discarding the output of the health checks the users can choose a different tradeoff. Less visibility on why a check failed in exchange for a reduced change rate on the raft log. * agent: discard output also when adding a check * agent: add test for discard check output * agent: update docs * go vet * Adds discard_check_output to reloadable config table. * Updates the change log.	2017-10-10 17:04:52 -07:00
preetapan	77c972f594	Fixes agent error handling when check definition is invalid. Distingu… (#3560 ) * Fixes agent error handling when check definition is invalid. Distinguishes between empty checks vs invalid checks * Made CheckTypes return Checks from service definition struct rather than a new copy, and other changes from code review. This also errors when json payload contains empty structs * Simplify and improve validate method, and make sure that CheckTypes always returns a new copy of validated check definitions * Tweaks some small style things and error messages. * Updates the change log.	2017-10-10 16:54:06 -07:00
Frank Schröder	759ef8a1d4	config: add generic method to translate between CamelCase and snake_case (#3557 ) * doc: document discrepancy between id and CheckID * doc: document enable_tag_override change * config: add TranslateKeys helper TranslateKeys makes it easier to map between different representations of internal structures. It allows to recursively map alias keys to canonical keys in structured maps. * config: use TranslateKeys for config file This also adds support for 'enabletagoverride' and removes the need for a separate CheckID alias field. * config: remove dead code * agent: use TranslateKeys for FixupCheckType * agent: translate enable_tag_override during service registration * doc: add '.hcl' as valid extension * config: map ScriptArgs to args * config: add comment for TranslateKeys	2017-10-10 16:40:59 -07:00
James Phillips	bb12368eac	Makes RPC handling more robust when rolling servers. (#3561 ) * Adds client-side retry for no leader errors. This paves over the case where the client was connected to the leader when it loses leadership. * Adds a configurable server RPC drain time and a fail-fast path for RPCs. When a server leaves it gets removed from the Raft configuration, so it will never know who the new leader server ends up being. Without this we'd be doomed to wait out the RPC hold timeout and then fail. This makes things fail a little quicker while a sever is draining, and since we added a client retry AND since the server doing this has already shut down and left the Serf LAN, clients should retry against some other server. * Makes the RPC hold timeout configurable. * Reorders struct members. * Sets the RPC hold timeout default for test servers. * Bumps the leave drain time up to 5 seconds. * Robustifies retries with a simpler client-side RPC hold. * Reverts untended delete.	2017-10-10 15:19:50 -07:00
Preetha Appan	e7dc345cfa	Fix unit test after dns library upgrade to account for correct data length	2017-10-06 17:40:17 -05:00
James Phillips	4dab70cb93	Fixes handling of stop channel and failed barrier attempts. (#3546 ) * Fixes handling of stop channel and failed barrier attempts. There were two issues here. First, we needed to not exit when there was a timeout trying to write the barrier, because Raft might not step down, so we'd be left as the leader but having run all the step down actions. Second, we didn't close over the stopCh correctly, so it was possible to nil that out and have the leaderLoop never exit. We close over it properly AND sequence the nil-ing of it AFTER the leaderLoop exits for good measure, so the code is more robust. Fixes #3545 * Cleans up based on code review feedback. * Tweaks comments. * Renames variables and removes comments.	2017-10-06 07:54:49 -07:00
Victor Boivie	8e361beb7a	Minor typo (boostrap)	2017-10-05 16:28:48 +02:00
James Phillips	3bc6df5f0e	Adds script warning and fixes Docker args recognition.	2017-10-04 21:41:27 -07:00
Kyle Havlovitz	adf29675f3	Merge pull request #3535 from hashicorp/metric-docs Update metric names and add a legacy config flag	2017-10-04 17:39:16 -07:00
Kyle Havlovitz	a3e9ac5840	Add a test for legacy metrics with a whitelist filter	2017-10-04 17:27:57 -07:00
Kyle Havlovitz	198ed6076d	Clean up subprocess handling and make shell use optional (#3509 ) * Clean up handling of subprocesses and make using a shell optional * Update docs for subprocess changes * Fix tests for new subprocess behavior * More cleanup of subprocesses * Minor adjustments and cleanup for subprocess logic * Makes the watch handler reload test use the new path. * Adds check tests for new args path, and updates existing tests to use new path. * Adds support for script args in Docker checks. * Fixes the sanitize unit test. * Adds panic for unknown watch type, and reverts back to Run(). * Adds shell option back to consul lock command. * Adds shell option back to consul exec command. * Adds shell back into consul watch command. * Refactors signal forwarding and makes Windows-friendly. * Adds a clarifying comment. * Changes error wording to a warning. * Scopes signals to interrupt and kill. This avoids us trying to send SIGCHILD to the dead process. * Adds an error for shell=false for consul exec. * Adds notes about the deprecated script and handler fields. * De-nests an if statement.	2017-10-04 16:48:00 -07:00
Kyle Havlovitz	c728564994	Update metric names and add a legacy config flag	2017-10-04 16:43:27 -07:00
Frank Schröder	ce887a0c45	Provide stable config for agent/self (#3532 ) * config: provide stable config for /v1/agent/self (#3530) This patch adds a stable subset of the previous Config struct to the agent/self response. The actual runtime configuration is moved into DebugConfig and will be documented to change. Fixes #3530 * config: fix tests * doc: update api documentation for /v1/agent/self	2017-10-04 10:43:17 -07:00
James Phillips	4f2dccc2a9	Merge pull request #3531 from hashicorp/pr-3521-slackpad ui: Use monospace font for textarea controls.	2017-10-04 09:53:41 -07:00
James Phillips	b34d576193	Updates checked in web assets to pick up CSS change. Closes #3521	2017-10-04 09:52:15 -07:00
Preetha Appan	8dcd7e700c	Remove extra newline	2017-10-03 15:19:31 -05:00
Preetha Appan	26accb3b8a	Only allow 'list' policies within 'key' policy definitions. Consolidated two similar tests into one and fixed alignment.	2017-10-03 15:15:56 -05:00
Preetha Appan	51a04ec87d	Introduces new 'list' permission that applies to KV store recursive reads, and enforced only when opted in.	2017-10-02 17:10:21 -05:00
Frank Schroeder	1944218492	use ports from derived addresses	2017-09-29 20:26:43 +02:00
Frank Schroeder	42f8ff7b3c	config: drop advertise_addrs Fixes #3516	2017-09-29 20:26:43 +02:00
Frank Schroeder	abe41d231c	Fix tests after config refactor	2017-09-28 12:32:46 +02:00
Patrick Sodré	7501331d13	Implement encodeKVasRFC1464 function	2017-09-28 12:32:46 +02:00
Patrick Sodré	2cc6ac542c	Add RFC1464 tests	2017-09-28 12:32:45 +02:00
Patrick Sodré	865f087ec9	Turn encodeKVasRFC1464 into a plain function	2017-09-28 12:32:45 +02:00
Patrick Sodré	12fb0bfd5b	Use verify for NodeLookup CNAME, and TXT tests	2017-09-28 12:32:45 +02:00
Patrick Sodré	d5e3b9d843	Refactor formatTxtRecords as encodeKVasRFC1464 - Move the logic of rfc1035 out of the encoding function - Left basic version of encodingKV as 'k=v'	2017-09-28 12:32:45 +02:00
Patrick Sodré	655c89ee10	Fix editorial suggestions	2017-09-28 12:32:45 +02:00
Patrick Sodré	afb0c92334	Remove redundant check of Node.Meta size	2017-09-28 12:32:45 +02:00
Patrick Sodré	53e812e759	Return Node.Meta info using the DNS interface	2017-09-28 12:32:45 +02:00
Patrick Sodré	ab90865865	Add test for NoteLookup ANY request	2017-09-28 12:32:45 +02:00
Patrick Sodré	4c6b8022c2	Add test for querying Node.Meta with DNS TXT - Lookup TXT records using recursive lookups - Expect TXT record equal to value if key starts with rfc1035- - Expect TXT record in rfc1464 otherwise, i.e. (k=v) ref #2709	2017-09-28 12:32:45 +02:00
Frank Schröder	07dea89f31	fail early when advertise addr is set to ANY (#3507 )	2017-09-27 13:57:55 -07:00
Frank Schröder	9a67556bb5	only detect advertise address if derived value is any (#3506 ) * only detect advertise address if derived value is any * determine detect function only when advertise addr is any	2017-09-27 12:59:47 -07:00
James Phillips	98850322c0	Adds a comment about Datacenter and NodeName being stable interfaces in the runtime config strucutre.	2017-09-27 11:59:22 -07:00
Frank Schröder	21118cafeb	Recursive sanitize (#3505 ) * vendor: add github.com/sergi/go-diff/diffmatchpatch for diff'ing test output * config: refactor Sanitize to recursively clean runtime config and format complex fields * Removes an extra int cast. * Adds a top-level check test case for sanitization.	2017-09-27 11:47:40 -07:00
James Phillips	0190c4a081	Gets rid of flaky clause in stats fetcher unit test. Given how the rutine is coded we can still get data so this wasn't a reliable thing to check.	2017-09-26 20:53:06 -07:00
preetapan	4d9fc638b4	Issue 3452 (#3500 ) * Make sure that id and address are set in member created during reaping of catalog nodes that have been removed from serf * Get address from node table in the state store rather than from service address * Fix incorrect lookup by checkname instead of node name * Make sure that serverlookup is called with the right address format, added unit test. * Address code review comments * Tweaks style stuff.	2017-09-26 20:49:41 -07:00
Frank Schröder	e84c2b2edd	Metrics service prefix (#3498 ) * metrics: replace statsite_prefix with service_prefix The metrics prefix isn't statsite specific and is in fact used for all metrics providers. Since we are deprecating fields anyway we should fix this one as well. Fixes #3293 * Updates docs and sorts telemetry section. * Renames to "metrics_prefix" to disambiguate with Consul services. * Updates the change log.	2017-09-26 17:49:55 -07:00
James Phillips	49058fee11	Merge pull request #3501 from hashicorp/snapshot-test-hang Cleans up some edge cases in TestSnapshot_Forward_Leader.	2017-09-26 14:08:33 -07:00
James Phillips	5fa2322e0b	Cleans up some edge cases in TestSnapshot_Forward_Leader. These could cause the tests to hang.	2017-09-26 14:07:28 -07:00
Kyle Havlovitz	bfa70a10ca	Fix watch error when http & https are disabled (#3493 ) Remove an error in watch reloading that happens when http and https are both disabled, and use an https address for running watches if no http addresses are present. Fixes #3425.	2017-09-26 13:47:27 -07:00
Preetha Appan	3c4a108769	Move Raft protocol version for list peers end point to server side, fix unit tests. This fixes #3449	2017-09-26 09:35:39 -05:00
Frank Schroeder	56e6439be9	fix data race Since state.Checks() returns a shallow copy its elements must not be modified. Copying the elements in the handler does not guarantee consistency since that list is guarded by a different lock. Therefore, the only solution is to have state.Checks() return a deep copy.	2017-09-26 13:42:10 +02:00
Frank Schroeder	7bd85792b2	config: do not clobber multiple check and service definitions This patch ensures that multiple files with single 'check' or 'service' definitions result in the combination of them.	2017-09-26 10:24:18 +02:00
James Phillips	a75a779469	Renames `enable_ui` to `ui` to keep compatibility with existing configs.	2017-09-26 00:05:55 -07:00
Frank Schröder	1e461110e6	agent: consolidate handling of 405 Method Not Allowed (#3405 ) * agent: consolidate http method not allowed checks This patch uses the error handling of the http handlers to handle HTTP method not allowed errors across all available endpoints. It also adds a test for testing whether the endpoints respond with the correct status code. * agent: do not panic on metrics tests * agent: drop other tests for MethodNotAllowed * agent: align /agent/join with reality /agent/join uses PUT instead of GET as documented. * agent: align /agent/check/{fail,warn,pass} with reality /agent/check/{fail,warn,pass} uses PUT instead of GET as documented. * fix some tests * Drop more tests for method not allowed * Align TestAgent_RegisterService_InvalidAddress with reality * Changes API client join to use PUT instead of GET. * Fixes agent endpoint verbs and removes obsolete tests. * Updates the change log.	2017-09-25 23:11:19 -07:00
preetapan	73951d8319	Merge pull request #3494 from hashicorp/enforce_json_extension Enforce json or hcl extension to Consul config files, updated unit tests	2017-09-25 17:30:33 -05:00
James Phillips	45646ac3f4	Bumps default Raft protocol to version 3. (#3477 ) * Changes default Raft protocol to 3. * Changes numPeers() to report only voters. This should have been there before, but it's more obvious that this is incorrect now that we default the Raft protocol to 3, which puts new servers in a read-only state while Autopilot waits for them to become healthy. * Fixes TestLeader_RollRaftServer. * Fixes TestOperator_RaftRemovePeerByAddress. * Fixes TestServer_. Relaxed the check for a given number of voter peers and instead do a thorough check that all servers see each other in their Raft configurations. Fixes TestACL_. These now just check for Raft replication to be set up, and don't care about the number of voter peers. Fixes TestOperator_Raft_ListPeers. * Fixes TestAutopilot_CleanupDeadServerPeriodic. * Fixes TestCatalog_ListNodes_ConsistentRead_Fail. * Fixes TestLeader_ChangeServerID and adjusts the conn pool to throw away sockets when it sees io.EOF. * Changes version to 1.0.0 in the options doc. * Makes metrics test more deterministic with autopilot metrics possible.	2017-09-25 15:27:04 -07:00
Preetha Appan	a286ad7533	Enforce json or hcl extension to Consul config files, updated unit tests	2017-09-25 17:17:12 -05:00
James Phillips	f42e85ce22	Removes unused imports in agent_test.go.	2017-09-25 13:42:15 -07:00
Preetha Appan	d7e27e67c1	Introduce Code Policy validation via sentinel, with a noop implementation	2017-09-25 13:44:55 -05:00
Frank Schröder	12216583a1	New config parser, HCL support, multiple bind addrs (#3480 ) * new config parser for agent This patch implements a new config parser for the consul agent which makes the following changes to the previous implementation: * add HCL support * all configuration fragments in tests and for default config are expressed as HCL fragments * HCL fragments can be provided on the command line so that they can eventually replace the command line flags. * HCL/JSON fragments are parsed into a temporary Config structure which can be merged using reflection (all values are pointers). The existing merge logic of overwrite for values and append for slices has been preserved. * A single builder process generates a typed runtime configuration for the agent. The new implementation is more strict and fails in the builder process if no valid runtime configuration can be generated. Therefore, additional validations in other parts of the code should be removed. The builder also pre-computes all required network addresses so that no address/port magic should be required where the configuration is used and should therefore be removed. * Upgrade github.com/hashicorp/hcl to support int64 * improve error messages * fix directory permission test * Fix rtt test * Fix ForceLeave test * Skip performance test for now until we know what to do * Update github.com/hashicorp/memberlist to update log prefix * Make memberlist use the default logger * improve config error handling * do not fail on non-existing data-dir * experiment with non-uniform timeouts to get a handle on stalled leader elections * Run tests for packages separately to eliminate the spurious port conflicts * refactor private address detection and unify approach for ipv4 and ipv6. Fixes #2825 * do not allow unix sockets for DNS * improve bind and advertise addr error handling * go through builder using test coverage * minimal update to the docs * more coverage tests fixed * more tests * fix makefile * cleanup * fix port conflicts with external port server 'porter' * stop test server on error * do not run api test that change global ENV concurrently with the other tests * Run remaining api tests concurrently * no need for retry with the port number service * monkey patch race condition in go-sockaddr until we understand why that fails * monkey patch hcl decoder race condidtion until we understand why that fails * monkey patch spurious errors in strings.EqualFold from here * add test for hcl decoder race condition. Run with go test -parallel 128 * Increase timeout again * cleanup * don't log port allocations by default * use base command arg parsing to format help output properly * handle -dc deprecation case in Build * switch autopilot.max_trailing_logs to int * remove duplicate test case * remove unused methods * remove comments about flag/config value inconsistencies * switch got and want around since the error message was misleading. * Removes a stray debug log. * Removes a stray newline in imports. * Fixes TestACL_Version8. * Runs go fmt. * Adds a default case for unknown address types. * Reoders and reformats some imports. * Adds some comments and fixes typos. * Reorders imports. * add unix socket support for dns later * drop all deprecated flags and arguments * fix wrong field name * remove stray node-id file * drop unnecessary patch section in test * drop duplicate test * add test for LeaveOnTerm and SkipLeaveOnInt in client mode * drop "bla" and add clarifying comment for the test * split up tests to support enterprise/non-enterprise tests * drop raft multiplier and derive values during build phase * sanitize runtime config reflectively and add test * detect invalid config fields * fix tests with invalid config fields * use different values for wan sanitiziation test * drop recursor in favor of recursors * allow dns_config.udp_answer_limit to be zero * make sure tests run on machines with multiple ips * Fix failing tests in a few more places by providing a bind address in the test * Gets rid of skipped TestAgent_CheckPerformanceSettings and adds case for builder. * Add porter to server_test.go to make tests there less flaky * go fmt	2017-09-25 11:40:42 -07:00
James Phillips	d84c0b1a01	Robustifies check in TestCatalog_ListNodes_ConsistentRead_Fail test. Fixes #3469	2017-09-13 21:22:53 -07:00
James Phillips	828be5771a	Revert "Manages segments list via a pointer." This reverts commit `c277a42504`.	2017-09-07 16:37:11 -07:00
James Phillips	c277a42504	Manages segments list via a pointer.	2017-09-07 16:21:07 -07:00
James Phillips	96a89a3381	Cleans up formatting.	2017-09-07 12:26:58 -07:00
James Phillips	00605c0214	Shows the segment name in the keyring API and command output.	2017-09-07 12:17:39 -07:00
James Phillips	aa5ef4a098	Populates the segment keyrings based on the LAN keyring.	2017-09-07 12:17:20 -07:00
James Phillips	88a150cee1	Moves reconcile loop into segment stub.	2017-09-06 18:01:53 -07:00
James Phillips	5c03cb571d	Takes the skip out of the client check. Without this the merge delegate won't check the segment for non-servers a little below here.	2017-09-06 17:05:40 -07:00
James Phillips	3418c7ff93	Merge pull request #3447 from hashicorp/issue-3070 Skips unique node ID check for old versions of Consul.	2017-09-06 13:24:15 -07:00
James Phillips	520060e138	Fixes incorrect comment.	2017-09-06 13:23:19 -07:00
James Phillips	084679ab65	Pulls down some code for the check loop.	2017-09-06 13:07:42 -07:00
James Phillips	3535652595	Uses the Raft configuration for the self-add skip check.	2017-09-06 13:05:51 -07:00
Preetha Appan	5f2e1c9b07	Change member join reconcile step to process joining itself, to handle node IP address changes correctly when number of servers < 3	2017-09-06 13:53:01 -05:00
James Phillips	1333fa57a1	Skips unique node ID check for old versions of Consul. Fixes #3070.	2017-09-05 22:57:29 -07:00
James Phillips	67b19ac065	Allow _all for WAN as a no-op.	2017-09-05 13:40:19 -07:00
James Phillips	1a117ba0a8	Makes the all segments query explict, and the default for `consul members`.	2017-09-05 12:22:20 -07:00
James Phillips	9258506dab	Adds simple rate limiting for client agent RPC calls to Consul servers. (#3440 ) * Added rate limiting for agent RPC calls. * Initializes the rate limiter based on the config. * Adds the rate limiter into the snapshot RPC path. * Adds unit tests for the RPC rate limiter. * Groups the RPC limit parameters under "limits" in the config. * Adds some documentation about the RPC limiter. * Sends a 429 response when the rate limiter kicks in. * Adds docs for new telemetry. * Makes snapshot telemetry look like RPC telemetry and cleans up comments.	2017-09-01 15:02:50 -07:00
Kyle Havlovitz	220db48aa7	Merge pull request #3431 from hashicorp/network-segments-oss	2017-09-01 10:24:58 -07:00
Kyle Havlovitz	0e33e2ecab	Pass listeners into setupSegments	2017-08-31 17:56:43 -07:00
Kyle Havlovitz	62102a537e	Organize segments for a cleaner split between enterprise and OSS	2017-08-31 17:39:46 -07:00
Kyle Havlovitz	baa501e0c5	Fill in the segment in the QuerySource for prepared query lookups	2017-08-31 03:35:59 -07:00
Kyle Havlovitz	7e565d7338	Fix some inconsistencies with segment logic and comments	2017-08-30 17:43:46 -07:00
Kyle Havlovitz	16aaf27208	Default bind/advertise for segments to BindAddr/AdvertiseAddr	2017-08-30 12:51:10 -07:00
Preetha Appan	2386214655	Wire server provider for raft layer only on protocol version 3 and above, and update changelog	2017-08-30 14:36:47 -05:00
Kyle Havlovitz	21513b0393	Update coord display in ui to account for segments	2017-08-30 11:58:29 -07:00
Kyle Havlovitz	14b027a3c2	Add segment addr field to tags for LAN flood joiner	2017-08-30 11:58:29 -07:00
Kyle Havlovitz	d129767657	Add agent.segment interpolation to prepared queries	2017-08-30 11:58:29 -07:00
Kyle Havlovitz	2ada0439d4	Add rpc_listener option to segment config	2017-08-30 11:58:29 -07:00
Kyle Havlovitz	a30e7657af	Add segment config validation	2017-08-30 11:58:29 -07:00
James Phillips	b1a15e0c3d	Adds open source side of network segments (feature is Enterprise-only).	2017-08-30 11:58:29 -07:00
Preetha Appan	a231eea0e7	More cleanup from code review	2017-08-30 12:31:36 -05:00
Preetha Appan	c6ee9bfa69	Remove copy pasted duplicate line, update documentation.	2017-08-30 10:02:10 -05:00
Preetha Appan	0f4e24f72c	Consolidate server lookup into one place and replace usages of localConsuls.	2017-08-30 09:30:33 -05:00
Preetha Appan	0f418a1bcf	Remove unused function	2017-08-30 09:30:33 -05:00
Preetha Appan	e639154abd	Remove stray commented line	2017-08-30 09:30:33 -05:00
Preetha Appan	00836a6aab	Remove server address tracking logic from manager/router and maintain it as part of lan event listener instead. Used sync.Map to track this, and added unit tests	2017-08-30 09:30:33 -05:00
Preetha Appan	830aca958a	ServerAddressProvider interface also returns an error now	2017-08-30 09:30:33 -05:00
Preetha Appan	c68fce89b5	Use config struct to create NetworkTransport layer when setting up raft	2017-08-30 09:30:33 -05:00
Preetha Appan	393ce1581b	Implement AddressProvider and wire that up to raft transport layer to support server nodes changing their IP addresses in containerized environments	2017-08-30 09:30:33 -05:00
Frank Schroeder	831d84c940	build: make tests independent of build tags When the metadata server is scanning the agents for potential servers it is parsing the version number which the agent provided when it joined. This version number has to conform to a certain format, i.e. 'n.n.n'. Without this version number properly set some tests fail with error messages that disguise the root cause. The default version number is currently set to 'unknown' in version/version.go which does not parse and triggers the tests to fail. The work around is to use a build tag 'consul' which will use the version number set in version_base.go instead which has the correct format and is set to the current release version. In addition, some parts of the code also require the version number to be of a certain value. Setting it to '0.0.0' for example makes some tests pass and others fail since they don't pass the semantic check. When using go build/install/test one has to remember to use '-tags consul' or tests will fail with non-obvious error messages. Using build tags makes the build process more complex and error prone since it prevents the use of the plain go toolchain and - at least in its current form - introduces subtle build and test issues. We should try to eliminate build tags for anything else but platform specific code. This patch removes all references to specific version numbers in the code and tests and sets the default version to '9.9.9' which is syntactically correct and passes the semantic check. This solves the issue of running go build/install/test without tags for the OSS build.	2017-08-30 13:40:18 +02:00
Frank Schroeder	d8195b3a4d	agent: drop status code comments	2017-08-23 22:36:23 +02:00
Frank Schroeder	f09a8bb1b6	agent: use http.StatusRequestEntityTooLarge instead of 413	2017-08-23 22:36:23 +02:00
Frank Schroeder	bc5dc32c1d	agent: use http.StatusInternalServerError instead of 500	2017-08-23 22:36:23 +02:00
Frank Schroeder	fa121be33f	agent: use http.StatusMethodNotAllowed instead of 405	2017-08-23 22:36:23 +02:00
Frank Schroeder	ad5c1d9e72	agent: use http.StatusNotFound instead of 404	2017-08-23 22:36:23 +02:00
Frank Schroeder	1a557ee9e9	agent: use http.StatusForbidden instead of 403	2017-08-23 22:36:23 +02:00
Frank Schroeder	7e2bc1b411	agent: use http.StatusUnauthorized instead of 401	2017-08-23 22:36:23 +02:00
Frank Schroeder	5d1546b052	agent: use http.StatusBadRequest instead of 400	2017-08-23 22:36:23 +02:00
Frank Schroeder	14ab5c7641	agent: support go-discover retry-join for wan	2017-08-23 21:23:34 +02:00
Frank Schröder	a3934c263c	acl: consolidate error handling (#3401 ) The error handling of the ACL code relies on the presence of certain magic error messages. Since the error values are sent via RPC between older and newer consul agents we cannot just replace the magic values with typed errors and switch to type checks since this would break compatibility with older clients. Therefore, this patch moves all magic ACL error messages into the acl package and provides default error values and helper functions which determine the type of error.	2017-08-23 16:52:48 +02:00
Frank Schroeder	16c58da27d	agent: drop unused code This code from http://github.com/hashicorp/consul/pull/3353 is no longer required.	2017-08-22 00:02:46 +02:00
Frank Schroeder	bf96857b17	dns: replace nameserver lookup with consistent rpc call This patch replaces the code which determines the list of servers in the current cluster with an RPC call to get the list of active consul service instances which only run on servers. This replaces the previous implementation which was more complex and relied on serf messages which can provide a different view than the consistent response from the raft log. As a side effect it makes the implementation independent of the server and the agent which means it works consistently across both. Different behavior for server and agent was the root cause for the bug in http://github.com/hashicorp/consul/issue/3047. Fixes #3407	2017-08-22 00:02:46 +02:00
Frank Schroeder	4052c6d2d2	dns: split node lookup from request handling	2017-08-22 00:02:46 +02:00
Frank Schroeder	d4e3d4344a	dns: refactor label by unrolling loop	2017-08-22 00:02:46 +02:00
Frank Schroeder	70be1ab635	dns: move ttl closer to usage	2017-08-22 00:02:46 +02:00
James Phillips	f51d56c80c	Switches to using a read lock for the agent's RPC dispatcher. This prevents RPC calls from getting serialized in this spot. Fixes #3376	2017-08-09 18:51:55 -07:00
Frank Schröder	4b642fed2f	agent: honor deprecated flags for retry-join-{ec2,azure,gce} (#3384 )	2017-08-09 16:18:30 -07:00
James Phillips	e8a83bb463	Revert "Return 403 rather than a 404 when acls cause all results to be filter…"	2017-08-09 15:06:57 -07:00
James Phillips	02a87df044	Revert "Ensure that we return a permission denied only if the list of keys/en…"	2017-08-09 15:06:20 -07:00
Preetha Appan	42fb49c00b	Added unit test case to kvs_endpointtest	2017-08-09 15:50:22 -05:00
Preetha Appan	3276891142	Ensure that we return a permission denied only if the list of keys/entries prior to filtering by ACL is non empty	2017-08-09 15:32:18 -05:00
Frank Schroeder	7cff50a4df	agent: move agent/consul/agent to agent/metadata	2017-08-09 14:36:52 +02:00
Frank Schroeder	c395599cea	agent: move agent/consul/servers to agent/router	2017-08-09 14:36:37 +02:00
Frank Schroeder	1acff3533e	agent: move agent/consul/structs to agent/structs	2017-08-09 14:32:12 +02:00
James Phillips	cb618918b3	Cleans up some go fmt issues.	2017-08-08 21:52:50 -07:00
James Phillips	7442039c2d	Fixes a vet error.	2017-08-08 16:00:18 -07:00
Kyle Havlovitz	cf02e3bc22	Merge pull request #3369 from hashicorp/metrics-enhancements Add support for labels/filters from go-metrics	2017-08-08 13:55:30 -07:00
Kyle Havlovitz	c1c883f441	Add doc links for metrics endpoint	2017-08-08 13:05:38 -07:00
Kyle Havlovitz	0428e9fe9e	Update docs for metrics endpoint	2017-08-08 12:33:30 -07:00
Frank Schroeder	9fa237ddb6	dns: minor cleanups	2017-08-08 13:55:58 +02:00
Kyle Havlovitz	d5634fe2a8	Add support for labels/filters from go-metrics	2017-08-08 01:45:10 -07:00
Preetha Appan	72ae8c8f33	Go back to using <nodename>.node.dc.consul as the name of the ns record being returned.	2017-08-07 16:02:33 -05:00
Frank Schroeder	8a9653bdf8	dns: keep NS names in consul domain	2017-08-07 11:11:55 +02:00
Frank Schroeder	f17bf78bb1	dns: postmaster -> hostmaster	2017-08-07 11:11:55 +02:00
Frank Schroeder	60608b455d	dns: we do not support zone transfers	2017-08-07 11:11:55 +02:00
Frank Schroeder	76b2538915	dns: drop CNAME for primary name server	2017-08-07 11:11:55 +02:00
Preetha Appan	7f34dc08a5	Added test case with IPV6 bind address for NS records, rewrote tests to use verify library and other code review feedback	2017-08-07 11:11:55 +02:00
Preetha Appan	76319f751d	Added back glue records in NS response, expanded unit test. Also reused same function used in node lookup for adding A/AAAA records in the extra section of the NS response	2017-08-07 11:11:55 +02:00
Preetha Appan	f01f17bda3	Don't add A records for NS requests, because the record being returned already resolves correctly. Also fixed all the unit tests, and ignored hostnames that don't meet valid dns hostname criteria	2017-08-07 11:11:55 +02:00
Frank Schroeder	7ea11c2f45	dns: provide correct SOA and NS responses This patch changes the behavior of the DNS server as follows: * The SOA response contains the SOA record in the Answer section instead of the Authority section. It also contains NS records in the Authority and the corresponding A glue records in the Extra section. In addition, CNAMEs are added to the Extra section to make the MNAME of the SOA record resolvable. AAAA glue records are not yet supported. * The NS response returns up to three random servers from the consul cluster in the Answer section and the glue A records in the Extra section. AAAA glue records are not yet supported.	2017-08-07 11:11:55 +02:00
Preetha Appan	824fc4ee20	Unify regex used to identify invalid dns characters	2017-08-07 11:11:55 +02:00
Preetha Appan	37f75a393e	Use sanitized version of node name of server in NS record, and start with "server" rather than "ns"	2017-08-07 11:11:55 +02:00
Preetha Appan	794d1afe44	Removed a copy pasted irrelevant comment, and other code review feedback	2017-08-07 11:11:54 +02:00
Preetha Appan	f9db387097	Add NS records and A records for each server. Constructs ns host names using the advertise address of the server.	2017-08-07 11:11:54 +02:00
James Phillips	4bee2e49f5	Adds secure introduction for the ACL replication token. (#3357 ) Adds secure introduction for the ACL replication token, as well as a separate enable config for ACL replication.	2017-08-03 15:39:31 -07:00
Frank Schroeder	9ffeba18ee	agent: fix code for updated go-discover signature Closes #3351	2017-08-03 21:32:11 +02:00
James Phillips	c0a5ad7903	Adds a new /v1/acl/bootstrap API (#3349 )	2017-08-02 17:05:18 -07:00
Miguel Prokop	6852dec3f2	agent: Fix script quoting on windows (#1875 ) This patch fixes the quoting for executing scripts on windows and splits the platform dependent code. Fixes #1875	2017-08-02 17:01:21 +02:00
Frank Schroeder	2fac427cd4	agent: use github.com/hashicorp/go-discover Replace the provider specific node discovery code with go-discover to support AWS, Azure and GCE. Fixes #3282	2017-08-01 11:41:43 +02:00
Preetha Appan	4076c0d741	Return nil instead of empty list when returning a PermissionDenied error, updated unit test	2017-07-31 17:23:20 -05:00
Preetha Appan	6336014a86	Return 403 rather than a 404 when acls cause all results to be filtered out. This fixes #2637	2017-07-31 13:50:29 -05:00
preetapan	0f494d8b86	Merge pull request #3332 from hashicorp/issue_3322 This fixes #3322	2017-07-28 17:54:30 -05:00
Preetha Appan	2d84cd2330	Tweaked parsing error message to quote properly	2017-07-28 17:52:35 -05:00
James Phillips	10b660d77a	Adds missing autopilot snapshot test and avoids snapshotting nil. (#3333 )	2017-07-28 15:48:42 -07:00
Preetha Appan	5aeab1463b	Validate unix sockets and ip addresses as needed, more test cases	2017-07-28 17:18:10 -05:00
Preetha Appan	4cec55e8db	Modify ResolveTmplAddrs to parse advertise IPs, added test cases that fail to parse correctly	2017-07-28 15:01:32 -05:00
Preetha Appan	13c118ea51	Removed extra newlines	2017-07-28 10:51:11 -05:00
Preetha Appan	840749db7e	Fix comments, and remove redundant TestConfig init from a couple of unit tests	2017-07-28 10:40:43 -05:00
Frank Schroeder	b19b062194	add tests for go-sockaddr template parsing	2017-07-28 15:40:22 +02:00
Frank Schroeder	ac9602e798	agent: unix sockets are not ip addrs	2017-07-28 14:53:21 +02:00
Frank Schroeder	2fcdb35cbb	config: refactor tmpl resolution fn	2017-07-28 12:20:49 +02:00
Preetha Appan	aa98aeb4b1	Moved handling advertise address to readConfig and out of the agent's constructor, plus unit test fixes	2017-07-27 22:06:31 -05:00
Preetha Appan	25acd1534a	Move go-socketaddr template parsing into config package to make it happen before creating a new agent. Also removed redundant parsetemplate calls from agent.go.	2017-07-27 16:17:35 -05:00
James Phillips	6250cd70f5	Adds option to prepared queries to remove empty tags. (#3330 )	2017-07-26 22:46:43 -07:00
James Phillips	496b0bcf07	Adds support for agent-side ACL token management via API instead of config files. (#3324 ) * Adds token store and removes all runtime use of config for ACL tokens. * Adds a new API for changing agent tokens on the fly.	2017-07-26 11:03:43 -07:00
Preetha Appan	b94617b281	Add extra test case for deleting entire tree with empty prefix	2017-07-26 09:42:07 -05:00
Preetha Appan	4498814843	Don't insert tombstone for empty prefix delete. Other minor unit test fixes	2017-07-25 21:54:11 -05:00
Preetha Appan	fee418d378	Removed redundant comments and unit test	2017-07-25 20:39:33 -05:00
Preetha Appan	b772c477c2	Removed redundant call to reap tombstone from unit test	2017-07-25 19:39:05 -05:00
Preetha Appan	ae443e21d6	Improved unit test per code review	2017-07-25 19:17:40 -05:00
Preetha Appan	36acf8d6a4	Use new DeletePrefixMethod for implementing KVSDeleteTree operation. This makes deletes on sub trees larger than one million nodes about 100 times faster. Added unit tests.	2017-07-25 17:21:18 -05:00
James Phillips	c413a9161e	Removes an unnecessary close.	2017-07-24 21:41:18 -07:00
Preetha Appan	f8b633c69e	Removed redundant logging	2017-07-24 21:07:48 -05:00
Preetha Appan	c26fd66edd	Clean up temporary files on write errors, and ignore any temporary service files on load with a warning. This fixes #3207	2017-07-24 12:42:51 -05:00
James Phillips	1774fdc237	Tweaks the error when scripts are disabled. This will hopefully help people self-serve if they upgrade without accounting for this.	2017-07-19 22:15:04 -07:00
Kyle Havlovitz	d74390ef86	Fix UpgradeVersionTag field not being passed correctly (#3304 )	2017-07-19 17:39:48 -07:00
Preetha Appan	1f35aa6ff2	Made unit test for AddCheck error check the actual error string	2017-07-19 11:00:56 -05:00
Preetha Appan	c32e4ebe26	Unit test for failure case of AddCheck	2017-07-19 10:28:52 -05:00
Frank Schroeder	0047b7d3f0	fix spelling in filenames Fixes #3301	2017-07-19 13:16:38 +02:00
Frank Schroeder	83577e0daa	agent: make docker client work on windows	2017-07-19 12:03:59 +02:00
Frank Schroeder	b97ab92d87	build: add missing build tags	2017-07-19 05:17:01 +02:00
preetapan	fb43953894	Merge pull request #3296 from hashicorp/ensure_registration_race Fix race condition between removing a service and adding a check for …	2017-07-18 18:36:47 -05:00
Preetha Appan	e50f0e6722	Clean up any watch monitors associated with a failed AddCheck	2017-07-18 16:54:20 -05:00
Preetha Appan	6a257f242e	Removed unit test, added clarifying comment and returned a friendlier error message similar to the one in agent's AddService method Fixes #3297	2017-07-18 16:15:47 -05:00
Preetha Appan	9f048afe29	Fix race condition between removing a service and adding a check for the same service, which was causing orphaned checks	2017-07-18 16:15:47 -05:00
Kyle Havlovitz	19eae3d14b	Add UpgradeVersionTag to autopilot config	2017-07-18 13:35:41 -07:00
Frank Schroeder	0d9b53730f	agent: stop docker checks on shutdown	2017-07-18 20:59:24 +02:00
Frank Schroeder	60540c2417	agent: stop and remove docker checks Note that there is no test since the correct way to solve (and test) this is to replace the different maps with a single one or to hide that functionality behind a separate data structure. This will be addressed in #3294. Fixes #3265	2017-07-18 20:59:24 +02:00
Frank Schroeder	2123700056	agent: replace docker check This patch replaces the Docker client which is used for health checks with a simplified version tailored for that purpose. See #3254 See #3257 Fixes #3270	2017-07-18 20:24:38 +02:00
James Phillips	fff0f9698f	Prevents disabling gossip keyring file from disabling gossip encryption. (#3278 )	2017-07-17 12:48:45 -07:00
James Phillips	1791d99a10	Adds new config to make script checks opt-in, updates documentation. (#3284 )	2017-07-17 11:20:35 -07:00
James Phillips	780e68a753	Changes remote exec KV read to call GetTokenForAgent(). (#3283 ) * Changes remote exec KV read to call GetTokenForAgent(), which can use the acl_agent_token instead of the acl_token. Fixes #3160. * Fixes remote exec unit test with ACLs. * Adds unhappy ACL path to unit tests for remote exec.	2017-07-16 21:12:16 -07:00
James Phillips	1004d0ec0e	Adds node read privileges to the acl_agent_master_token. (#3277 ) Fixes #3113.	2017-07-16 20:08:26 -07:00
Frank Schröder	c001722848	azure: tag map can return nil (#3280 ) Fixes #3193	2017-07-16 14:29:43 -07:00
James Phillips	218ac4cb1e	Obfuscates ACL tokens appearing in /v1/acl/<verb>/<token> APIs. (#3276 ) * Obfuscates ACL tokens appearing in /v1/acl APIs. * Makes test positively identify the desired strings. * Adds an example and explanation of the regular expression.	2017-07-15 00:07:08 -07:00
James Phillips	872cf9ff95	Changes ACL clone response to 403 if not authorized, or if token doesn't exist. (#3275 ) Fixes #1113	2017-07-14 20:43:30 -07:00
Kyle Havlovitz	78c3a86405	Add TLS setting to router areas	2017-07-14 17:38:08 -07:00
James Phillips	0881e46111	Cleans up version 8 ACLs in the agent and the docs. (#3248 ) * Moves magic check and service constants into shared structs package. * Removes the "consul" service from local state. Since this service is added by the leader, it doesn't really make sense to also keep it in local state (which requires special ACLs to configure), and requires a bunch of special cases in the local state logic. This requires fewer special cases and makes ACL bootstrapping cleaner. * Makes coordinate update ACL log message a warning, similar to other AE warnings. * Adds much more detailed examples for bootstrapping ACLs. This can hopefully replace https://gist.github.com/slackpad/d89ce0e1cc0802c3c4f2d84932fa3234.	2017-07-13 22:33:47 -07:00
Frank Schroeder	764dabfcf7	agent: fix go vet issue	2017-07-11 07:13:46 -07:00
James Phillips	66edec5dfd	Adds the ability to blacklist specific HTTP endpoints. (#3252 )	2017-07-10 13:51:25 -07:00
James Phillips	7200b8cda8	UI cleanup follow up from #3245 . (#3251 ) * Removes unnecessary set for model component which will be null. * Returns a 404 for a missing node, not a 200 with an empty response. * Updates built-in web assets.	2017-07-10 09:40:00 -07:00
James Phillips	aa11956d63	Changes the default ACL token type to "client" in web UI. (#3246 ) * Changes the default ACL token type to "client". * Updates built-in web assets.	2017-07-08 17:28:04 -07:00
James Phillips	86b1e64a33	Cleans up web UI and fixes ACL token "stuckness" issue. (#3245 ) * Removes GitHub reference. * Doesn't display ACL token on the unauthorized page. * Removes useless fetch for nodes and cleans up comments. * Provides a path to reset the ACL token when it's invalid. This included making the settings page global so it's reachable, and adding some more information about an error on the error page. * Updates built-in web assets.	2017-07-08 17:16:05 -07:00
Frank Schroeder	1781fd311f	address review comments	2017-07-07 09:22:34 +02:00
Frank Schroeder	e4b40acc7e	agent: remove unused code	2017-07-07 09:22:34 +02:00
Frank Schroeder	8c792ad57d	agent: make TestClient_RPC_ConsulServerPing more robust	2017-07-07 09:22:34 +02:00
Frank Schroeder	4a4b91a2db	agent: fix data races with registerEndpoint Only register a different endpoint after it has been fully created.	2017-07-07 09:22:34 +02:00
Frank Schroeder	19b937ba80	agent: make Reap test timing less aggressive	2017-07-07 09:22:34 +02:00

... 2 3 4 5 6 ...

423 Commits