Equalize promoted #64206

PeterSolMS · 2022-01-24T15:29:01Z

I have observed that in server GC scenarios, the amount of promoted memory is often very uneven between heaps, which leads to suboptimal work distribution between server GC threads.

The PR introduces a new method equalize_promoted_bytes that attempts to even out the promoted memory between heaps by moving regions from heaps with lots of promotion to heaps with less promotion.

The algorithm used removes regions from heaps that have more than average promotion. These surplus regions are arranged into size classes by the amount of promoted memory in them. The heaps are also arranged into size classes by how much their promoted memory is short of the average promoted memory per heap.

Then we repeatedly move the surplus region with the most promoted memory to the heap with the biggest deficit. If the heap still has a deficit, it is reconsidered under its new deficit size class. Otherwise, it now has average or more promoted memory and is removed from further consideration.
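
The two paragraphs above can be illustrated with a minimal sketch. This is not the code in gc.cpp: `Region`, `Heap`, `promoted_bytes` and `equalize_promoted_sketch` are hypothetical names, and two `std::priority_queue`s stand in for the size-class buckets the PR describes. The rule for which regions leave a surplus heap, and the handling of leftover regions, are assumptions made so that the example is complete.

```cpp
#include <cassert>
#include <cstddef>
#include <queue>
#include <utility>
#include <vector>

// Hypothetical stand-ins, not the runtime's heap_segment / gc_heap types.
struct Region { size_t promoted; };
struct Heap { std::vector<Region> regions; };

size_t promoted_bytes(const Heap& h)
{
    size_t total = 0;
    for (const Region& r : h.regions) total += r.promoted;
    return total;
}

void equalize_promoted_sketch(std::vector<Heap>& heaps)
{
    if (heaps.empty()) return;

    size_t total = 0;
    for (const Heap& h : heaps) total += promoted_bytes(h);
    const size_t avg = total / heaps.size();

    // (promoted bytes, original heap index); the largest region is on top.
    std::priority_queue<std::pair<size_t, size_t>> surplus;
    // (deficit, heap index); the biggest deficit is on top.
    std::priority_queue<std::pair<size_t, size_t>> deficits;

    for (size_t i = 0; i < heaps.size(); i++)
    {
        const size_t p = promoted_bytes(heaps[i]);
        if (p < avg)
        {
            deficits.push({avg - p, i});
            continue;
        }
        // Assumption: a region is only taken if the heap stays at or above average.
        size_t excess = p - avg;
        std::vector<Region> kept;
        for (const Region& r : heaps[i].regions)
        {
            if ((r.promoted > 0) && (r.promoted <= excess))
            {
                excess -= r.promoted;
                surplus.push({r.promoted, i});
            }
            else
            {
                kept.push_back(r);
            }
        }
        heaps[i].regions = std::move(kept);
    }

    // Move the largest surplus region to the heap with the biggest deficit.
    while (!surplus.empty() && !deficits.empty())
    {
        const std::pair<size_t, size_t> region = surplus.top();
        const std::pair<size_t, size_t> needy = deficits.top();
        surplus.pop();
        deficits.pop();
        heaps[needy.second].regions.push_back(Region{region.first});
        // A heap that is still short is reconsidered under its new deficit.
        if (needy.first > region.first)
            deficits.push({needy.first - region.first, needy.second});
    }

    // Assumption: regions left over once no deficits remain return to their origin.
    while (!surplus.empty())
    {
        heaps[surplus.top().second].regions.push_back(Region{surplus.top().first});
        surplus.pop();
    }

    size_t after = 0;
    for (const Heap& h : heaps) after += promoted_bytes(h);
    assert(after == total); // regions are moved, never dropped or duplicated
}
```

With three heaps holding regions of {40, 30, 20, 10}, {} and {20} promoted bytes, the average is 40: the 40 and 20 regions leave the first heap and land on the second and third heap respectively, so each heap ends up with 40. Like the PR's algorithm, the sketch is greedy and only aims for rough equality.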

Because regions may now move between heaps, it is no longer true that the finalization queue entry for an object is always on the same heap as the object itself. This necessitated moving the call to finalize_queue->UpdatePromotedGenerations to a later point in time when all threads have finished updating the final generation for a region.

ghost · 2022-01-24T15:29:11Z

Tagging subscribers to this area: @dotnet/gc
See info in area-owners.md if you want to be subscribed.

Author:	PeterSolMS
Assignees:	PeterSolMS
Labels:	`area-GC-coreclr`
Milestone:	-

Maoni0 · 2022-01-26T07:03:52Z

src/coreclr/gc/gc.cpp

+        int deficit_heaps[MAX_SUPPORTED_CPUS];
+        int num_deficit_heaps = 0;
+        int surplus_heaps[MAX_SUPPORTED_CPUS];
+        int num_surplus_heaps = 0;


you might want to make deficit_heaps and surplus_heaps arrays of shorts. we are taking quite a bit of stack space with these arrays.
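
To put rough numbers on the stack-space concern, here is a small sketch. It assumes MAX_SUPPORTED_CPUS is 1024, which is an illustrative value rather than one stated in this thread, and it uses `int32_t` and `int16_t` as stand-ins for `int` and `short` on common platforms. `kAssumedMaxCpus` and `index_arrays_bytes` are hypothetical names.

```cpp
#include <cstddef>
#include <cstdint>

// Assumption: illustrative value for MAX_SUPPORTED_CPUS, not taken from this PR.
constexpr size_t kAssumedMaxCpus = 1024;

// Bytes of stack used by the two index arrays (deficit_heaps and surplus_heaps)
// when each element has type Index.
template <typename Index>
constexpr size_t index_arrays_bytes()
{
    return 2 * kAssumedMaxCpus * sizeof(Index);
}

// A 16-bit index is wide enough for every heap number below the assumed maximum.
static_assert(kAssumedMaxCpus - 1 <= INT16_MAX, "heap index must fit in 16 bits");
```

Under these assumptions the two arrays take 8192 bytes with 32-bit elements and 4096 bytes with 16-bit elements.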

Maoni0 · 2022-01-26T07:04:15Z

src/coreclr/gc/gc.cpp

+                    surplus_heaps[num_surplus_heaps++] = i;
+                }
+            }
+            //  all other heaps are not looked at further


not sure what this comment means

Maoni0 · 2022-01-26T07:04:37Z

src/coreclr/gc/gc.cpp

+        // step 3:
+        //  as long as we have surplus heaps and deficit heaps,
+        //  move regions from surplus heaps to deficit heaps
+        while (num_surplus_heaps > 0 && num_deficit_heaps > 0)


nit - formatting

`while ((num_surplus_heaps > 0) && (num_deficit_heaps > 0))`

Maoni0 · 2022-01-26T07:06:57Z

src/coreclr/gc/gc.cpp

+        heap_segment* basic_region = get_region_info (basic_region_start);
+        heap_segment_heap (basic_region) = nullptr;
+    }
+


would make sense to make a little util method of this as it's duplicated in thread_rw_region_front...

PeterSolMS · 2022-02-15

Good point, I made a method set_heap_for_contained_basic_regions that sets the heap_segment_heap for the contained basic regions.

Maoni0 · 2022-01-26T07:43:33Z

I do think the more sophisticated way is worthwhile so you can ignore my comments in the #if 0 block.

Maoni0

LGTM! the only minor thing if you don't actually need this dprintf it'd be good to get rid of it since it isn't related to this feature -

dprintf (3, ("uoh alloc: could not release lock on %Ix", obj));

PeterSolMS added 6 commits December 6, 2021 11:39

Equalize promoted bytes between heaps by moving regions between heaps.

4ac6df9

Algorithm used is simple and pretty efficient, but only aims for rough equality rather than trying to do an optimal job.

Work around issue with r/o segments in gen 2, add comment describing …

19e9a34

…more sophisticated algorithm.

Implemented more sophisticaed balancing algorithm, miscellaneous clea…

af4cb7a

…nup, added dprintf for the case where uoh_alloc_done cannot find the entry to release.

Add more dprintfs so once can better follow what the balancing algori…

50d07ce

…thm is doing.

Move calls to finalize_queue->UpdatePromotedGenerations.

9a76fc0

Undo instrumentation change.

fedf8a4

PeterSolMS requested review from cshung, Maoni0 and mangod9 January 24, 2022 15:29

ghost assigned PeterSolMS Jan 24, 2022

dotnet-issue-labeler bot added the area-GC-coreclr label Jan 24, 2022

Maoni0 reviewed Jan 26, 2022

PeterSolMS added 2 commits February 15, 2022 13:41

Address code review feedback:

f56bc91

- remove code for first attempt - factor out common code

Merge branch 'main' into Equalize_promoted

7c620d2

Maoni0 approved these changes Feb 23, 2022

Remove extra dprintf, fix GCC build issue, fix comment.

af3bca3

runfoapp bot mentioned this pull request Feb 23, 2022

System.IO.Tests work item failing with SIGKILL #65791

Closed

PeterSolMS merged commit 85b4f0e into dotnet:main Feb 24, 2022

ghost locked as resolved and limited conversation to collaborators Mar 26, 2022
