
Communication with services sometimes failed #127

Closed
vitaliy-leschenko opened this issue Nov 24, 2020 · 10 comments
Labels
lifecycle/rotten Denotes an issue or PR that has aged beyond stale and will be auto-closed.

Comments

@vitaliy-leschenko
Contributor

vitaliy-leschenko commented Nov 24, 2020

Describe the bug
Windows pods can't communicate with services when the target pod runs on a different Windows node.
This may be a duplicate of other issues, but the symptoms are slightly different.

Windows pods can communicate with services backed by pods running on Linux without any problem.
Linux pods can communicate with services backed by pods running on Windows without any problem.

To Reproduce
Steps to reproduce the behavior:

  1. Set up a new cluster with 2 Linux and 2 Windows nodes using PrepareNode.ps1.
  2. Apply the YAML. It creates two DaemonSets, iis (on Windows) and nginx (on Linux), and two Services, one for each DaemonSet (a minimal sketch of an equivalent manifest is shown after these steps).
  3. Connect to one of the iis pods and run curl:
    curl -Ik nginx.issues.svc.cluster.local <-- succeeds in 100% of cases
    curl -Ik iis.issues.svc.cluster.local <-- succeeds only when the target pod runs on the same node
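
The original YAML isn't included here; a minimal equivalent manifest might look roughly like the sketch below. The names, images, ports, and the assumption of one Service per DaemonSet in an issues namespace are mine, not the exact contents of the referenced file.

```yaml
# Hypothetical minimal equivalent of the referenced manifest: two DaemonSets
# (iis on Windows, nginx on Linux) and one Service per DaemonSet in the
# "issues" namespace. Names, images, and ports are assumptions.
apiVersion: v1
kind: Namespace
metadata:
  name: issues
---
apiVersion: apps/v1
kind: DaemonSet
metadata:
  name: iis
  namespace: issues
spec:
  selector:
    matchLabels:
      app: iis
  template:
    metadata:
      labels:
        app: iis
    spec:
      nodeSelector:
        kubernetes.io/os: windows
      containers:
        - name: iis
          image: mcr.microsoft.com/windows/servercore/iis:windowsservercore-ltsc2019
          ports:
            - containerPort: 80
---
apiVersion: v1
kind: Service
metadata:
  name: iis
  namespace: issues
spec:
  selector:
    app: iis
  ports:
    - port: 80
      targetPort: 80
---
apiVersion: apps/v1
kind: DaemonSet
metadata:
  name: nginx
  namespace: issues
spec:
  selector:
    matchLabels:
      app: nginx
  template:
    metadata:
      labels:
        app: nginx
    spec:
      nodeSelector:
        kubernetes.io/os: linux
      containers:
        - name: nginx
          image: nginx
          ports:
            - containerPort: 80
---
apiVersion: v1
kind: Service
metadata:
  name: nginx
  namespace: issues
spec:
  selector:
    app: nginx
  ports:
    - port: 80
      targetPort: 80
```

Apply it with kubectl apply -f and then exec into the pods for step 3.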

Expected behavior
I expect curl -Ik iis.issues.svc.cluster.local to succeed in 100% of cases.

Kubernetes (please complete the following information):

  • Windows Server version: 1809 (10.0.17763.1554)
  • Kubernetes Version: 1.19.0 - 1.19.4
  • CNI: flannel 0.13 vxlan

Additional context
All pods can ping each other by IP address.
All Windows pods can curl sites on the Internet and ping each other.
Windows pods can curl Windows pods on other nodes by pod IP address, but the request fails when I curl by Service name or Service cluster IP.
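For example (the pod name, pod IP, and Service cluster IP below are placeholders for whatever kubectl get pods -o wide and kubectl get svc report; the target pod is on a different node than the source pod):

    kubectl -n issues exec iis-xxxxx -- curl -Ik http://10.244.3.15                   <-- direct pod IP: succeeds
    kubectl -n issues exec iis-xxxxx -- curl -Ik http://iis.issues.svc.cluster.local  <-- Service DNS name: fails
    kubectl -n issues exec iis-xxxxx -- curl -Ik http://<iis service cluster IP>      <-- Service cluster IP: fails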

My PR #111 doesn't affect this; the issue reproduces without it.

@vitaliy-leschenko
Contributor Author

I tried the YAML from step 2 on a bare-metal cluster (1.17.0) created by https://github.com/kubernetes-sigs/sig-windows-tools/blob/master/kubeadm/KubeCluster.ps1 and the issue doesn't reproduce there.

@vitaliy-leschenko
Contributor Author

Interestingly, the issue doesn't reproduce on Windows Server 1809 (10.0.17763.1294).

@vitaliy-leschenko
Contributor Author

It looks like a Windows issue.

@daschott

daschott commented Dec 5, 2020

Is this still active?

@vitaliy-leschenko
Contributor Author

@daschott Yes, it is.

@fejta-bot

Issues go stale after 90d of inactivity.
Mark the issue as fresh with /remove-lifecycle stale.
Stale issues rot after an additional 30d of inactivity and eventually close.

If this issue is safe to close now please do so with /close.

Send feedback to sig-contributor-experience at kubernetes/community.
/lifecycle stale

@k8s-ci-robot k8s-ci-robot added the lifecycle/stale Denotes an issue or PR has remained open with no activity and has become stale. label Mar 5, 2021
@fejta-bot

Stale issues rot after 30d of inactivity.
Mark the issue as fresh with /remove-lifecycle rotten.
Rotten issues close after an additional 30d of inactivity.

If this issue is safe to close now please do so with /close.

Send feedback to sig-contributor-experience at kubernetes/community.
/lifecycle rotten

@k8s-ci-robot k8s-ci-robot added lifecycle/rotten Denotes an issue or PR that has aged beyond stale and will be auto-closed. and removed lifecycle/stale Denotes an issue or PR has remained open with no activity and has become stale. labels Apr 4, 2021
@fejta-bot

Rotten issues close after 30d of inactivity.
Reopen the issue with /reopen.
Mark the issue as fresh with /remove-lifecycle rotten.

Send feedback to sig-contributor-experience at kubernetes/community.
/close

@k8s-ci-robot
Contributor

@fejta-bot: Closing this issue.

In response to this:

Rotten issues close after 30d of inactivity.
Reopen the issue with /reopen.
Mark the issue as fresh with /remove-lifecycle rotten.

Send feedback to sig-contributor-experience at kubernetes/community.
/close

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository.

@vitaliy-leschenko
Contributor Author

It looks like this issue was fixed in 10.0.17763.1999, possibly in one of the earlier updates, but it doesn't matter. With the latest updates applied we no longer see the issue.
