Test force_update_version variable for better EKS version upgrade #5037

poornima-krishnasamy · 2023-11-21T16:21:57Z

Background

When the EKS version upgrade happens and the node are drained, if any of the pods are crashloopBackoff and have a poddisruptionBudget which cant be violated, then the draining of node cannot happen. Currently we have a cloudwatch script which runs and check for any errors and force delete the pod in order to proceed with the upgrade.

This is because of open issue which can be found in EKS container roadmap and kubernetes

In the EKS terraform module 1.18, there is a flag which can help to drain the nodes forcefully for these kind of errors.
cloudposse/terraform-aws-eks-node-group#151

Approach

Create a cluster and add workload which has pdb violations and crashlooping.
Update the variable and do an eks version upgrade.
Test if the upgrade is successful without the need of cloudwatch script

Which part of the user docs does this impact

https://runbooks.cloud-platform.service.justice.gov.uk/upgrade-eks-cluster.html#upgrade-eks-cluster

Communicate changes

post for #cloud-platform-update
Weeknotes item
Show the Thing/P&A All Hands/User CoP
Announcements channel

Questions / Assumptions

Definition of done

Reference

How to write good user stories

poornima-krishnasamy · 2024-05-24T16:53:52Z

Tha variable didnt improve the stuck pod eviction during node group upgrade. Probably that is used when doing upgrade via terraform and not apply to the one in console

ChikC added this to Cloud Platform Feb 19, 2024

sablumiah moved this to Todo in Cloud Platform Mar 11, 2024

poornima-krishnasamy added spike Firebreak labels Mar 25, 2024

tmahmood72 moved this from Todo to 🏗 In Progress in Cloud Platform May 14, 2024

poornima-krishnasamy assigned tmahmood72 May 14, 2024

tmahmood72 assigned poornima-krishnasamy May 15, 2024

poornima-krishnasamy closed this as completed May 24, 2024

github-project-automation bot moved this from 🏗 In Progress to 🥇 Done in Cloud Platform May 24, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Test force_update_version variable for better EKS version upgrade #5037

Test force_update_version variable for better EKS version upgrade #5037

poornima-krishnasamy commented Nov 21, 2023

poornima-krishnasamy commented May 24, 2024

Test force_update_version variable for better EKS version upgrade #5037

Test force_update_version variable for better EKS version upgrade #5037

Comments

poornima-krishnasamy commented Nov 21, 2023

Background

Approach

Which part of the user docs does this impact

Communicate changes

Questions / Assumptions

Definition of done

Reference

poornima-krishnasamy commented May 24, 2024