Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Change l4 healthcheck timeout #1828

Merged
merged 1 commit into from
Oct 6, 2022

Conversation

code-elinka
Copy link
Contributor

@code-elinka code-elinka commented Sep 26, 2022

According to l4 healthcheck constants the default set up has timeout = 8 seconds and threshold = 3 times. As result it can lead to 24 (8*3) seconds of lost requests. This change is supposed to decrease this time to only 6 seconds for sharedHc=false.

@k8s-ci-robot k8s-ci-robot added cncf-cla: yes Indicates the PR's author has signed the CNCF CLA. needs-ok-to-test Indicates a PR that requires an org member to verify it is safe to test. labels Sep 26, 2022
@k8s-ci-robot
Copy link
Contributor

Hi @code-elinka. Thanks for your PR.

I'm waiting for a kubernetes member to verify that this patch is reasonable to test. If it is, they should reply with /ok-to-test on its own line. Until that is done, I will not automatically test new commits in this PR, but the usual testing commands by org members will still work. Regular contributors should join the org to skip this step.

Once the patch is verified, the new status will be reflected by the ok-to-test label.

I understand the commands that are listed here.

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository.

@k8s-ci-robot k8s-ci-robot added the size/XS Denotes a PR that changes 0-9 lines, ignoring generated files. label Sep 26, 2022
@panslava
Copy link
Contributor

yeh, trash is hard to hold for 3 times consecutive

@panslava
Copy link
Contributor

We want to introduce this change only to services with external traffic policy local, so this needs a bit more changes

@k8s-ci-robot k8s-ci-robot added size/L Denotes a PR that changes 100-499 lines, ignoring generated files. and removed size/XS Denotes a PR that changes 0-9 lines, ignoring generated files. labels Sep 27, 2022
@k8s-ci-robot k8s-ci-robot added size/XS Denotes a PR that changes 0-9 lines, ignoring generated files. and removed size/L Denotes a PR that changes 100-499 lines, ignoring generated files. labels Sep 27, 2022
@code-elinka code-elinka marked this pull request as draft September 27, 2022 09:15
@k8s-ci-robot k8s-ci-robot added the do-not-merge/work-in-progress Indicates that a PR should not merge because it is a work in progress. label Sep 27, 2022
@code-elinka code-elinka force-pushed the change-l4-healthcheck branch from fc000d3 to 235301b Compare October 3, 2022 20:17
@k8s-ci-robot k8s-ci-robot added size/L Denotes a PR that changes 100-499 lines, ignoring generated files. and removed size/XS Denotes a PR that changes 0-9 lines, ignoring generated files. labels Oct 3, 2022
@code-elinka
Copy link
Contributor Author

code-elinka commented Oct 3, 2022

Wrong change. ExternalTrafficPolicy is not the same with XLB 🤡

@code-elinka code-elinka force-pushed the change-l4-healthcheck branch from 235301b to 0367e93 Compare October 3, 2022 21:47
@code-elinka code-elinka marked this pull request as ready for review October 3, 2022 21:48
@k8s-ci-robot k8s-ci-robot removed the do-not-merge/work-in-progress Indicates that a PR should not merge because it is a work in progress. label Oct 3, 2022
@code-elinka
Copy link
Contributor Author

/unassign @bowei

@code-elinka
Copy link
Contributor Author

/assign @cezarygerard

@code-elinka
Copy link
Contributor Author

/uncc @bowei

@k8s-ci-robot k8s-ci-robot removed the request for review from bowei October 3, 2022 21:53
@code-elinka
Copy link
Contributor Author

code-elinka commented Oct 3, 2022

/cc @cezarygerard
/unassign @cezarygerard
/assign @code-elinka

@cezarygerard Wow, that's really nice that I did not push the change earlier and did not ask you to review. I made a stupid mistake, read Slavik comment in a wrong way and added new consts for XLB, not ExternalTrafficPolicies=Local 😀

@cezarygerard
Copy link
Contributor

/ok-to-test

@k8s-ci-robot k8s-ci-robot added ok-to-test Indicates a non-member PR verified by an org member that is safe to test. and removed needs-ok-to-test Indicates a PR that requires an org member to verify it is safe to test. labels Oct 5, 2022
@code-elinka code-elinka force-pushed the change-l4-healthcheck branch from 0367e93 to 168fc9c Compare October 6, 2022 09:47
@k8s-ci-robot k8s-ci-robot added size/XL Denotes a PR that changes 500-999 lines, ignoring generated files. and removed size/L Denotes a PR that changes 100-499 lines, ignoring generated files. labels Oct 6, 2022
@code-elinka code-elinka force-pushed the change-l4-healthcheck branch from 168fc9c to 8b7aba1 Compare October 6, 2022 09:47
@k8s-ci-robot k8s-ci-robot added size/L Denotes a PR that changes 100-499 lines, ignoring generated files. and removed size/XL Denotes a PR that changes 500-999 lines, ignoring generated files. labels Oct 6, 2022
@code-elinka code-elinka force-pushed the change-l4-healthcheck branch from 8b7aba1 to aa0136f Compare October 6, 2022 09:50
@code-elinka code-elinka requested review from cezarygerard and removed request for panslava and swetharepakula October 6, 2022 09:52
@code-elinka code-elinka force-pushed the change-l4-healthcheck branch from aa0136f to ae443dc Compare October 6, 2022 15:33
According to l4 healthcheck constants
the default set up has timeout = 8 seconds
and threshold = 3 times. As result it can
lead to 24 (8*3) seconds of lost requests.
This change is supposed to decrease this
time to only 6 seconds for sharedHc=false.

Signed-off-by: Elina Akhmanova <elinka@google.com>
@code-elinka code-elinka force-pushed the change-l4-healthcheck branch from ae443dc to 22247d2 Compare October 6, 2022 15:37
@code-elinka
Copy link
Contributor Author

/retest

@cezarygerard
Copy link
Contributor

/lgtm

@k8s-ci-robot k8s-ci-robot added the lgtm "Looks good to me", indicates that a PR is ready to be merged. label Oct 6, 2022
@k8s-ci-robot
Copy link
Contributor

[APPROVALNOTIFIER] This PR is APPROVED

This pull-request has been approved by: cezarygerard, code-elinka

The full list of commands accepted by this bot can be found here.

The pull request process is described here

Needs approval from an approver in each of these files:

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

@k8s-ci-robot k8s-ci-robot added the approved Indicates a PR has been approved by an approver from all required OWNERS files. label Oct 6, 2022
@k8s-ci-robot k8s-ci-robot merged commit 6b01d42 into kubernetes:master Oct 6, 2022
@code-elinka code-elinka deleted the change-l4-healthcheck branch October 6, 2022 19:37
wantUnhealthyThreshold: gceLocalHcUnhealthyThreshold + 1,
},
}
for _, tc := range testCases {
t.Run(tc.desc, func(t *testing.T) {
// healthcheck intervals and thresholds are common for Global and Regional healthchecks. Hence testing only Global case.
wantHC := newL4HealthCheck("hc", types.NamespacedName{Name: "svc", Namespace: "default"}, false, "/", 12345, utils.ILB, meta.Global, "")
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

this doesn't exercise shared/local that is the thing we just changed,no?

@@ -27,7 +27,7 @@ import (

func TestMergeHealthChecks(t *testing.T) {
t.Parallel()
for _, tc := range []struct {
testCases := []struct {
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

if we add a shared bool var here we could this new dimension

Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

let me do that, seems a low hanging fruit I can use to participate here

code-elinka added a commit to akhmanova/ingress-gce that referenced this pull request Oct 13, 2022
This change adds changed l4 healthcheck timeouts
to the v1.14 release.
According to l4 healthcheck constants
the default set up has timeout = 8 seconds
and threshold = 3 times. As result it can
lead to 24 (8*3) seconds of lost requests.
This change is supposed to decrease this
time to only 6 seconds for sharedHc=false.

Signed-off-by: Elina Akhmanova <elinka@google.com>
k8s-ci-robot added a commit that referenced this pull request Oct 13, 2022
[Cherrypick #1828] l4 healthcheck timeouts change
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
approved Indicates a PR has been approved by an approver from all required OWNERS files. cncf-cla: yes Indicates the PR's author has signed the CNCF CLA. lgtm "Looks good to me", indicates that a PR is ready to be merged. ok-to-test Indicates a non-member PR verified by an org member that is safe to test. size/L Denotes a PR that changes 100-499 lines, ignoring generated files.
Projects
None yet
Development

Successfully merging this pull request may close these issues.

5 participants