
Fixes for node slice IPAM #503

Merged
merged 2 commits into k8snetworkplumbingwg:master on Oct 15, 2024

Conversation

xagent003
Contributor

@xagent003 xagent003 commented Sep 13, 2024

This fixes some issues seen with the new node slice IPAM feature.

  1. Disable some controllers that are not needed. We don't need an informer on NodeSlicePool, since that is an internal CR we manage ourselves, and we don't need to reconcile NADs when the resource version has not changed. For nodes, only listen for add and delete events: nodes are updated frequently for status changes and conditions we don't care about, and we only need to allocate a new slice pool when a node is created and remove its allocation when the node is deleted.

  2. We have multiple NADs (which map to multiple NICs) that share the same CIDR and network_name (because they are really one L2 network). With the node slice pool feature enabled and a Pod requesting multiple networks, the same podRef and containerID appear multiple times in each IP pool, once per ifName (one for each NAD). Deallocation therefore also needs to match on ifName so it deletes the correct entry rather than the first one it finds (see the matching sketch after this list).

  3. When the node slice size or CIDR is reconfigured, the wrong range was being passed: ipam.Range instead of ipamConf.IPRanges[0].Range. Since ipam.Range is cleared after being copied into ipamConf.IPRanges[0].Range, the reconcile was erroring out with an empty CIDR.

  4. nodeSlice.Spec was not being written back (persisted) when the node slice size or CIDR was reconfigured.

  5. The subnet mask/CIDR used was incorrect. We should use the CIDR from the NAD rather than the range assigned to the node. The NAD's range defines the cluster-wide subnet, whereas the NodeSlicePool's IPRange is only used for grouping IP allocations. Without this fix, each node was on a different subnet, so traffic went through an IP lookup via the default route (primary CNI) instead of over the NAD/Multus network. For example, suppose our NAD range is 10.0.0.0/8 and the node slice size is a /24: if we take the prefix from the NodeSlicePool, the Pods on each node end up on a different /24 instead of all sharing the same /8 (a sketch follows this list).
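
To illustrate item 5, here is a minimal, hypothetical Go sketch (not the actual whereabouts code; buildPodIP and the example values are made up): the address is still taken from the node's slice, but the prefix length attached to it comes from the NAD's cluster-wide range, so pods on every node land on the same subnet.

```go
package main

import (
	"fmt"
	"net"
)

// buildPodIP attaches the NAD's cluster-wide prefix length to an address that
// was allocated out of the node's slice. Using the slice's own /24 here is the
// bug described above: pods on different nodes would end up on different subnets.
func buildPodIP(allocated net.IP, nadRange string) (*net.IPNet, error) {
	_, clusterNet, err := net.ParseCIDR(nadRange)
	if err != nil {
		return nil, err
	}
	return &net.IPNet{IP: allocated, Mask: clusterNet.Mask}, nil
}

func main() {
	// NAD range (cluster-wide) is 10.0.0.0/8; this node's slice is 10.0.5.0/24.
	podIP, _ := buildPodIP(net.ParseIP("10.0.5.12"), "10.0.0.0/8")
	fmt.Println(podIP) // 10.0.5.12/8 — same subnet as pods on every other node
}
```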

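For item 2, a minimal sketch of the matching rule using made-up types (the real whereabouts pool structures differ): the release path keys on both podRef and ifName, so a pod attached to several NADs that share a CIDR and network_name only loses the reservation for the interface actually being torn down.

```go
package main

import "fmt"

// Reservation is a simplified stand-in for one entry in an IP pool.
type Reservation struct {
	IP     string
	PodRef string
	IfName string
}

// releaseMatching drops the reservation for a given pod and interface.
// Matching on podRef alone would delete whichever entry happens to come
// first, which may belong to a different NAD/interface of the same pod.
func releaseMatching(pool []Reservation, podRef, ifName string) []Reservation {
	kept := make([]Reservation, 0, len(pool))
	for _, r := range pool {
		if r.PodRef == podRef && r.IfName == ifName {
			continue // the entry for the interface being deleted
		}
		kept = append(kept, r)
	}
	return kept
}

func main() {
	pool := []Reservation{
		{IP: "10.0.5.10", PodRef: "default/pod-a", IfName: "net1"},
		{IP: "10.0.5.11", PodRef: "default/pod-a", IfName: "net2"},
	}
	fmt.Println(releaseMatching(pool, "default/pod-a", "net2")) // only the net1 entry remains
}
```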
@coveralls

Pull Request Test Coverage Report for Build 10855353642

Details

  • 8 of 24 (33.33%) changed or added relevant lines in 4 files are covered.
  • 8 unchanged lines in 1 file lost coverage.
  • Overall coverage decreased (-0.4%) to 54.134%

Changes Missing Coverage            Covered Lines   Changed/Added Lines   %
pkg/iphelpers/iphelpers.go          1               2                     50.0%
pkg/storage/kubernetes/ipam.go      0               2                     0.0%
pkg/allocate/allocate.go            0               4                     0.0%
pkg/node-controller/controller.go   7               16                    43.75%

Files with Coverage Reduction       New Missed Lines   %
pkg/node-controller/controller.go   8                  57.85%

Totals
Change from base Build 10831942310: -0.4%
Covered Lines: 1434
Relevant Lines: 2649

💛 - Coveralls

@ivelichkovich
Contributor

ivelichkovich commented Sep 13, 2024

/lgtm

Thank you!

cc @dougbtv

@ivelichkovich
Contributor

fixes: #498

@dougbtv
Member

@dougbtv dougbtv left a comment


Thank you Arjun!!!

@dougbtv dougbtv merged commit 57d5ac3 into k8snetworkplumbingwg:master Oct 15, 2024
10 checks passed