consul

mirror of https://github.com/status-im/consul.git synced 2025-02-10 04:45:05 +00:00

Author	SHA1	Message	Date
R.B. Boyer	58c0c101af	update changelog	2019-05-24 13:38:00 -05:00
R.B. Boyer	40336fd353	agent: fix several data races and bugs related to node-local alias checks (#5876 ) The observed bug was that a full restart of a consul datacenter (servers and clients) in conjunction with a restart of a connect-flavored application with bring-your-own-service-registration logic would very frequently cause the envoy sidecar service check to never reflect the aliased service. Over the course of investigation several bugs and unfortunate interactions were corrected: (1) local.CheckState objects were only shallow copied, but the key piece of data that gets read and updated is one of the things not copied (the underlying Check with a Status field). When the stock code was run with the race detector enabled this highly-relevant-to-the-test-scenario field was found to be racy. Changes: a) update the existing Clone method to include the Check field b) copy-on-write when those fields need to change rather than incrementally updating them in place. This made the observed behavior occur slightly less often. (2) If anything about how the runLocal method for node-local alias check logic was ever flawed, there was no fallback option. Those checks are purely edge-triggered and failure to properly notice a single edge transition would leave the alias check incorrect until the next flap of the aliased check. The change was to introduce a fallback timer to act as a control loop to double check the alias check matches the aliased check every minute (borrowing the duration from the non-local alias check logic body). This made the observed behavior eventually go away when it did occur. (3) Originally I thought there were two main actions involved in the data race: A. The act of adding the original check (from disk recovery) and its first health evaluation. B. The act of the HTTP API requests coming in and resetting the local state when re-registering the same services and checks. It took a while for me to realize that there's a third action at work: C. The goroutines associated with the original check and the later checks. The actual sequence of actions that was causing the bad behavior was that the API actions result in the original check being removed and re-added _without waiting for the original goroutine to terminate_. This means for brief windows of time during check definition edits there are two goroutines that can be sending updates for the alias check status. In extremely unlikely scenarios the original goroutine sees the aliased check start up in `critical` before being removed but does not get the notification about the nearly immediate update of that check to `passing`. This is interlaced with the new goroutine coming up, initializing its base case to `passing` from the current state and then listening for new notifications of edge triggers. If the original goroutine "finishes" its update, it then commits one more write into the local state of `critical` and exits leaving the alias check no longer reflecting the underlying check. The correction here is to enforce that the old goroutines must terminate before spawning the new one for alias checks.	2019-05-24 13:36:56 -05:00
Freddy	6b31482333	Increase reliability of TestResetSessionTimerLocked_Renew	2019-05-24 13:54:51 -04:00
Hans Hasselberg	27f05b16a0	Update CHANGELOG.md	2019-05-24 16:51:44 +02:00
Pierre Souchay	e892981418	agent: Improve startup message to avoid confusing users when no error occurs (#5896 ) * Improve startup message to avoid confusing users when no error occurs Several times, some users not very familiar with Consul get confused by error message at startup: `[INFO] agent: (LAN) joined: 1 Err: <nil>` Having `Err: <nil>` seems weird to many users, I propose to have the following instead: * Success: `[INFO] agent: (LAN) joined: 1` * Error: `[WARN] agent: (LAN) couldn't join: %d Err: ERROR`	2019-05-24 16:50:18 +02:00
Freddy	17e74985b0	Run TestServer_Expect on its own (#5890 )	2019-05-23 19:52:33 -04:00
Freddy	6c19cacd42	Flaky test: ACLReplication_Tokens (#5891 ) * Exclude non-go workflows while testing * Wait for s2 global-management policy * Revert "Exclude non-go workflows while testing" This reverts commit 47a83cbe9f19d0e1e475eabaa223d61fb4c56019.	2019-05-23 19:52:02 -04:00
Freddy	d4ea163b0b	Add retries to StatsFetcherTest (#5892 )	2019-05-23 19:51:31 -04:00
Jack Pearkes	7e3cd36709	Putting source back into Dev Mode	2019-05-23 12:03:07 -07:00
Jack Pearkes	40cec98468	Release v1.5.1 v1.5.1	2019-05-22 20:19:12 +00:00
Jack Pearkes	ff899d689d	Update CHANGELOG.md	2019-05-22 20:16:54 +00:00
Jack Pearkes	5a637318d8	Update CHANGELOG to include 1.4.5 release	2019-05-22 20:15:32 +00:00
Jack Pearkes	36ebca1fd0	Fix to prevent allowing recursive KV deletions when we shouldn’t	2019-05-22 20:13:30 +00:00
Kyle Havlovitz	5457bca10c	Update CHANGELOG.md	2019-05-22 09:54:10 -07:00
freddygv	5d7c257698	Revert "Exclude non-go workflows while testing" This reverts commit 47a83cbe9f19d0e1e475eabaa223d61fb4c56019.	2019-05-21 19:17:39 -06:00
freddygv	40b809bce3	Wait for s2 global-management policy	2019-05-21 17:58:37 -06:00
freddygv	47a83cbe9f	Exclude non-go workflows while testing	2019-05-21 17:13:13 -06:00
Freddy	e9259ca97a	Change log line used for verification	2019-05-21 17:07:06 -06:00
Freddy	d1c315fad9	Stop running TestLeader_ChangeServerID in parallel	2019-05-21 15:28:08 -06:00
Alvin Huang	bb602bf719	exclude agent/proxyprocess tests since it will be removed in 1.6 (#5878 )	2019-05-21 17:19:37 -04:00
Sarah Christoff	d60f3c15f8	Merge pull request #5867 from hashicorp/bug/test-flaky-TestCoordinate_Node Update type assertion logic	2019-05-21 14:56:05 -05:00
Sarah Christoff	32b5992d0f	Add retries around `obj`	2019-05-21 13:36:52 -05:00
Sarah Christoff	73d73e0e20	Add retries to all `obj`	2019-05-21 13:31:37 -05:00
Paul Banks	2d47b28722	Envoy integration test improvements (#5797 ) * Grab consul logs on integration test failures too and don't remove .gitignore * Don't wipe logs so we have some artifacts to upload at the end	2019-05-21 14:17:41 +01:00
Sarah Christoff	2a018e5e0a	Update agent/coordinate_endpoint_test.go Co-Authored-By: Freddy <freddygv@users.noreply.github.com>	2019-05-17 14:32:50 -05:00
Sarah Christoff	b96d9b01bd	Update type assertion logic Logic updated to evaluate with a boolean after the type assertion. This allows us to check if the type assertion succeeded and be more clear with errors.	2019-05-17 13:32:36 -05:00
Alvin Huang	f452de1c50	bump middleman-hashicorp to 0.3.40 and exclude guide rendering (#5847 ) * bump middleman-hashicorp to 0.3.40 and exclude guide rendering * add notes to Makefile for volume mounts hack PR#5847 * make note of the PR number in the comment	2019-05-17 14:07:25 -04:00
Alvin Huang	c8dc8049c4	enable go tests (#5863 )	2019-05-17 11:42:56 -04:00
John Cowen	ee393e34fb	ui: Adds tick whilst editing the link template in the Settings area (#5820 ) 1. Amends our `base` animation placeholder to always reset transition-duration. This has no effect on other components that are already using this animation. 2. Adds a confirming class whenever a key is pressed, to show the green tick. Uses CSS via `transition-delay` for debouncing.	2019-05-17 12:33:12 +01:00
Justin Weissig	8473fc1476	docs: Fixed typo: authorative/authoritative.	2019-05-16 15:52:32 -06:00
Jack Pearkes	45cdc80767	website: clarify removal for managed proxies (#5849 ) We didn't end up removing these completely in 1.5.0. See also #5848.	2019-05-16 12:48:32 +01:00
Kyle Havlovitz	b15cb60851	Merge pull request #5485 from hashicorp/change-node-id Allow nodes to change IDs when replacing a dead node	2019-05-15 12:18:13 -07:00
Kyle Havlovitz	31bb9d67df	Set the dead node reclaim timer at 30s	2019-05-15 11:59:33 -07:00
Kyle Havlovitz	1d9b8e1860	vendor: update memberlist	2019-05-15 11:10:40 -07:00
Kyle Havlovitz	29eb83c9c2	Merge branch 'master' into change-node-id	2019-05-15 10:51:04 -07:00
Justin Weissig	9e33bb626d	docs: fixed typo (#5844 ) Fixed typo: reqired/required.	2019-05-15 10:55:29 -05:00
kaitlincarter-hc	c2c22a35fb	[docs] Updating links to guides (#5795 ) * fixing links in the docs post guide migration. * fixed one more * Update website/source/docs/acl/acl-legacy.html.md Co-Authored-By: kaitlincarter-hc <43049322+kaitlincarter-hc@users.noreply.github.com> * Update website/source/docs/enterprise/connect-multi-datacenter/index.html.md * Updating based on comments and fixing word wrap * Update website/source/api/acl-legacy.html.md * Update website/source/api/acl/acl.html.md * Update website/source/docs/agent/options.html.md * Update website/source/docs/faq.html.md * Update website/source/docs/internals/architecture.html.md * Update website/source/docs/agent/encryption.html.md	2019-05-15 10:49:41 -05:00
Justin Weissig	ae5900e652	Fixed typo: registratered/registered.	2019-05-15 09:01:19 -06:00
Rebecca Zanzig	dc341df27e	Merge pull request #5843 from hashicorp/docs/helmtypos [docs] Update to correct parameter name in docs	2019-05-14 15:56:57 -07:00
Rebecca Zanzig	803a84bf83	Update to correct parameter name in docs	2019-05-14 15:30:59 -07:00
Rebecca Zanzig	8895d9c2d5	Merge pull request #5825 from hashicorp/docs/helming Add docs for Helm chart features introduced in v0.8.0	2019-05-14 12:13:59 -07:00
Rebecca Zanzig	ca01f2a93a	Add docs for Helm chart features introduced in v0.8.0 Additionally defines a new annotation that is used with these new features.	2019-05-14 11:28:08 -07:00
Justin Weissig	460aa38c15	docs: fixed typos (#5839 ) Fixed spelling: defintions/definitions, Speficied/Specifies.	2019-05-14 10:54:51 -05:00
Justin Weissig	f5ec74f0e1	docs: fixed spelling (#5840 ) Spelling: namepace/namespace.	2019-05-14 10:54:22 -05:00
Judith Malnick	effb069645	[docs] Add K8s L7 Observability Guide (#5826 ) * add l7 observability guide * fix urls	2019-05-13 13:32:39 -07:00
Justin Weissig	02be3d9c6b	docs: spelling and wording (#5830 ) Fixed spelling and wording: avaliable/available, upto/up to, an other/another, recieved/received, transfered/transferred (x2)	2019-05-13 16:39:25 +01:00
Justin Weissig	8d62353d00	docs: fixed typos (#5832 ) Fixed typos: persmissions/permissions + regardles/regardless	2019-05-13 10:22:17 -05:00
Christoph Puhl	87bf4c026b	Update index.html.md (#5813 ) Missing dot in Consul Kubernetes landing page.	2019-05-13 10:06:05 -05:00
Brian Flad	1fbe44d2c5	docs/agent/options: Add go-sockaddr template examples for -bind (#5701 ) Reference: https://github.com/hashicorp/consul/issues/4090 Examples covering a variety of potential use cases. Verified via `sockaddr eval` and `consul agent -bind` on a test machine: ```console # Baseline $ sockaddr eval 'GetAllInterfaces' [127.0.0.1/8 {1 65536 lo up|loopback} ::1 {1 65536 lo up|loopback} 10.0.0.10/8 {2 1500 eth0 b8:27:eb:7b:36:95 up|broadcast|multicast} fe80::12dc:5e4d:8ff8:2d96/64 {2 1500 eth0 b8:27:eb:7b:36:95 up|broadcast|multicast} 192.168.1.10/24 {3 1500 wlan0 b8:27:eb:2e:63:c0 up|broadcast|multicast} fe80::b6dc:5758:c306:b15b/64 {3 1500 wlan0 b8:27:eb:2e:63:c0 up|broadcast|multicast}] # Using address within a specific CIDR $ sockaddr eval 'GetPrivateInterfaces | include "network" "10.0.0.0/8" | attr "address"' 10.0.0.10 # Using a static network interface name $ sockaddr eval 'GetInterfaceIP "eth0"' 10.0.0.10 # Using regular expression matching for network interface name that is forwardable and up $ sockaddr eval 'GetAllInterfaces | include "name" "^eth" | include "flags" "forwardable|up" | attr "address"' 10.0.0.10 ```	2019-05-13 14:08:05 +01:00
Nathan Johnson	521fd89497	Updating documentation of example systemd service to be of type notify. (#5689 ) https://github.com/hashicorp/consul/issues/2121 https://www.freedesktop.org/software/systemd/man/systemd.service.html When set to notify, systemd will not attempt to start any dependent services until after consul sends the notify signal. This is useful in cases where there are services that rely on the local consul agent to be up and functional before they can start. The default is simple, which will immediately mark the service as up and functioning even if consul has not yet joined the cluster and has started listening for connections.	2019-05-13 14:04:45 +01:00


9947 Commits