Graph aggregation refactoring #8082

sokra · 2024-05-03T15:19:21Z

Description

Deletes the aggregation tree
Adds a new graph aggregation algorithm which is more efficient

The graph aggregation works as following:

For the graph aggregation: Every task is a node in the graph. Every parent-child relationship is an edge in the graph.

Every node has an "aggregation number" N.
There are 2 kinds of nodes: Leaf nodes and aggregating nodes.
If a node has N < LEAF_NUMBER, it's a leaf node, otherwise an aggregating node.
A higher N for a node usually means that a larger subgraph is aggregated into that node.
Next to normal edges there are two extra kind of edges for the graph aggregation: Upper edges and follower edges.
A node is considered as "inner" to another node when it has an "upper" edge pointing towards it.
The inner node has a lower N than the upper node. (This invariant might be temporarily violated while tree balancing is scheduled but not executed yet)
Aggregating nodes store an aggregated version of the state of all inner nodes and transitively inner nodes.
Changes in nodes are propagated to all upper nodes.
Every node has at least one upper node which is more aggregated than the node. Except for the root node of the graph, which doesn't have upper edges.
An aggregating node also has follower edges. They point to the nodes that are one normal edge after all inner and transitively inner nodes.
An leaf node doesn't have follower edges. For all purposes the normal edges of leaf nodes are considered as follower edges.
Follower nodes have a higher N than the origin node. (This invariant might be temporarily violated while tree balancing is scheduled but not executed yet)
This means large and larger subgraphs are aggregated.
Graph operations will ensure that these invariants (Higher N on upper and follower edges) are not violated.
The N of a node can only increase. So graph operations need to "fix" the invariants by increasing N or changing upper/follower edges. That later one is preferred. N is usually only increased if two nodes have equal N.
When new edges between leaf nodes are added, the target node's N is increased to the origin node's N + 4 if it's smaller. This adds a small tolerance range so increasing N doesn't cause long chains of N += 1 between leaf nodes.

Testing Instructions

Closes PACK-3036

vercel · 2024-05-03T15:19:34Z

The latest updates on your projects. Learn more about Vercel for Git ↗︎

Name	Status	Preview	Comments	Updated (UTC)
examples-nonmonorepo	✅ Ready (Inspect)	Visit Preview	💬 Add feedback	May 8, 2024 5:52pm
examples-svelte-web	🔄 Building (Inspect)	Visit Preview	💬 Add feedback	May 8, 2024 5:52pm
rust-docs	✅ Ready (Inspect)	Visit Preview	💬 Add feedback	May 8, 2024 5:52pm

7 Ignored Deployments

Name	Status	Preview	Updated (UTC)
examples-basic-web	⬜️ Ignored (Inspect)	Visit Preview	May 8, 2024 5:52pm
examples-designsystem-docs	⬜️ Ignored (Inspect)	Visit Preview	May 8, 2024 5:52pm
examples-gatsby-web	⬜️ Ignored (Inspect)	Visit Preview	May 8, 2024 5:52pm
examples-kitchensink-blog	⬜️ Ignored (Inspect)	Visit Preview	May 8, 2024 5:52pm
examples-native-web	⬜️ Ignored (Inspect)	Visit Preview	May 8, 2024 5:52pm
examples-tailwind-web	⬜️ Ignored (Inspect)	Visit Preview	May 8, 2024 5:52pm
examples-vite-web	⬜️ Ignored (Inspect)	Visit Preview	May 8, 2024 5:52pm

github-actions · 2024-05-03T15:21:25Z

🟢 Turbopack Benchmark CI successful 🟢

Thanks

github-actions · 2024-05-03T15:22:20Z

✅ This change can build next-swc

github-actions · 2024-05-03T15:26:54Z

⚠️ CI failed ⚠️

The following steps have failed in CI:

Turbopack Rust tests (mac/win, non-blocking)

See workflow summary for details

arlyon · 2024-05-08T10:21:44Z

crates/turbo-tasks-memory/src/aggregation/balance_edge.rs

+                    let count = extra_followers + extra_uppers;
+                    let target = ctx.node(target_id);
+                    if is_in_progress(ctx, upper_id) {
+                        drop(target);


What is the significance of borrowing before the branch and then dropping?

We need to check in progress (which is an atomic) while either holding the target or the upper lock. In progress is only set during the target lock, so when we read it it need to be under the target lock. If not in progress we can continue working with the target in the else branch.

Otherwise we want to enqueue our work to the upper node. So we acquire a upper lock. In the meantime the in_progress flag might have changed, so we need to check that again.

crates/turbo-tasks-memory/src/aggregation/in_progress.rs

Co-authored-by: Alexander Lyon <arlyon@me.com>

ForsakenHarmony · 2024-05-08T17:31:48Z

Can we maybe add a Markdown doc next to the code describing the overall approach (or just copy the PR description in there)

sokra · 2024-05-08T18:00:54Z

Can we maybe add a Markdown doc next to the code describing the overall approach (or just copy the PR description in there)

I'll add this in a follow-up PR

* vercel/turborepo#8082

### Description * Deletes the aggregation tree * Adds a new graph aggregation algorithm which is more efficient The graph aggregation works as following: For the graph aggregation: Every task is a node in the graph. Every parent-child relationship is an edge in the graph. * Every node has an "aggregation number" N. * There are 2 kinds of nodes: Leaf nodes and aggregating nodes. * If a node has N < LEAF_NUMBER, it's a leaf node, otherwise an aggregating node. * A higher N for a node usually means that a larger subgraph is aggregated into that node. * Next to normal edges there are two extra kind of edges for the graph aggregation: Upper edges and follower edges. * A node is considered as "inner" to another node when it has an "upper" edge pointing towards it. * The inner node has a lower N than the upper node. (This invariant might be temporarily violated while tree balancing is scheduled but not executed yet) * Aggregating nodes store an aggregated version of the state of all inner nodes and transitively inner nodes. * Changes in nodes are propagated to all upper nodes. * Every node has at least one upper node which is more aggregated than the node. Except for the root node of the graph, which doesn't have upper edges. * An aggregating node also has follower edges. They point to the nodes that are one normal edge after all inner and transitively inner nodes. * An leaf node doesn't have follower edges. For all purposes the normal edges of leaf nodes are considered as follower edges. * Follower nodes have a higher N than the origin node. (This invariant might be temporarily violated while tree balancing is scheduled but not executed yet) * This means large and larger subgraphs are aggregated. * Graph operations will ensure that these invariants (Higher N on upper and follower edges) are not violated. * The N of a node can only increase. So graph operations need to "fix" the invariants by increasing N or changing upper/follower edges. That later one is preferred. N is usually only increased if two nodes have equal N. * When new edges between leaf nodes are added, the target node's N is increased to the origin node's N + 4 if it's smaller. This adds a small tolerance range so increasing N doesn't cause long chains of N += 1 between leaf nodes. ### Testing Instructions  Closes PACK-3036 --------- Co-authored-by: Alexander Lyon <arlyon@me.com>

* vercel/turborepo#8082

turbo-orchestrator bot added created-by: turbopack labels May 3, 2024

vercel bot deployed to Preview – examples-nonmonorepo May 3, 2024 15:32 View deployment

vercel bot deployed to Preview – rust-docs May 3, 2024 15:41 View deployment

sokra marked this pull request as ready for review May 3, 2024 15:57

sokra requested a review from a team as a code owner May 3, 2024 15:57

sokra force-pushed the sokra/aggregation-refactor branch from 40c155f to 759d075 Compare May 6, 2024 16:27

vercel bot deployed to Preview – examples-nonmonorepo May 6, 2024 16:27 View deployment

vercel bot deployed to Preview – rust-docs May 6, 2024 16:37 View deployment

vercel bot deployed to Preview – examples-nonmonorepo May 7, 2024 06:07 View deployment

sokra force-pushed the sokra/aggregation-refactor branch from 7317f43 to 31eb100 Compare May 7, 2024 06:07

vercel bot deployed to Preview – examples-nonmonorepo May 7, 2024 06:08 View deployment

vercel bot deployed to Preview – rust-docs May 7, 2024 06:26 View deployment

vercel bot deployed to Preview – examples-nonmonorepo May 7, 2024 14:45 View deployment

vercel bot deployed to Preview – rust-docs May 7, 2024 14:55 View deployment

vercel bot deployed to Preview – examples-nonmonorepo May 7, 2024 15:12 View deployment

vercel bot deployed to Preview – rust-docs May 7, 2024 15:22 View deployment

vercel bot deployed to Preview – examples-nonmonorepo May 7, 2024 15:34 View deployment

vercel bot deployed to Preview – rust-docs May 7, 2024 15:44 View deployment

sokra force-pushed the sokra/aggregation-refactor branch from eedf18b to 4d25ad5 Compare May 7, 2024 16:31

vercel bot deployed to Preview – examples-nonmonorepo May 7, 2024 16:32 View deployment

vercel bot deployed to Preview – examples-nonmonorepo May 7, 2024 16:37 View deployment

vercel bot deployed to Preview – rust-docs May 7, 2024 16:51 View deployment

sokra added 3 commits May 8, 2024 09:27

refactor aggregation tree, aggregation test cases passing

546ca88

turbo-tasks-memory tests passing

b6b653a

balance with only overcounting

f5e3d88

fix imports

3c70eab

vercel bot deployed to Preview – examples-nonmonorepo May 8, 2024 09:10 View deployment

vercel bot deployed to Preview – rust-docs May 8, 2024 09:20 View deployment

arlyon approved these changes May 8, 2024

View reviewed changes

Apply suggestions from code review

e63d7e8

Co-authored-by: Alexander Lyon <arlyon@me.com>

vercel bot deployed to Preview – examples-nonmonorepo May 8, 2024 10:54 View deployment

vercel bot deployed to Preview – rust-docs May 8, 2024 11:04 View deployment

sokra added 9 commits May 8, 2024 19:40

fix loom test case

9ee48da

more room for performance on CI

b285297

disable loom tests

7374fca

fix expensive node handling for leaf nodes

706bbe2

use production number for test cases too

424fb75

optimize uppers instead of followers for new edges

c0fc04e

upgrading node is considered as optimizing

60c516d

optimize when high affected nodes during notify_new_follower

eac2c7f

select avg aggregation number when optimizing

ec2df26

vercel bot deployed to Preview – examples-nonmonorepo May 8, 2024 17:42 View deployment

vercel bot deployed to Preview – rust-docs May 8, 2024 17:52 View deployment

sokra merged commit adfb599 into main May 8, 2024
46 of 47 checks passed

sokra deleted the sokra/aggregation-refactor branch May 8, 2024 18:01

sokra mentioned this pull request May 8, 2024

Turbopack: new graph aggregation vercel/next.js#65206

Merged

sokra added a commit to vercel/next.js that referenced this pull request May 8, 2024

Turbopack: new graph aggregation (#65206)

a7ebbde

* vercel/turborepo#8082

ForsakenHarmony pushed a commit to vercel/next.js that referenced this pull request Aug 16, 2024

Turbopack: new graph aggregation (#65206)

fe976cc

* vercel/turborepo#8082

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Graph aggregation refactoring #8082

Graph aggregation refactoring #8082

sokra commented May 3, 2024 •

edited

Loading

vercel bot commented May 3, 2024 •

edited

Loading

github-actions bot commented May 3, 2024 •

edited

Loading

github-actions bot commented May 3, 2024

github-actions bot commented May 3, 2024 •

edited

Loading

arlyon May 8, 2024

sokra May 8, 2024

ForsakenHarmony commented May 8, 2024

sokra commented May 8, 2024

Graph aggregation refactoring #8082

Graph aggregation refactoring #8082

Conversation

sokra commented May 3, 2024 • edited Loading

Description

Testing Instructions

vercel bot commented May 3, 2024 • edited Loading

github-actions bot commented May 3, 2024 • edited Loading

🟢 Turbopack Benchmark CI successful 🟢

github-actions bot commented May 3, 2024

github-actions bot commented May 3, 2024 • edited Loading

⚠️ CI failed ⚠️

arlyon May 8, 2024

Choose a reason for hiding this comment

sokra May 8, 2024

Choose a reason for hiding this comment

ForsakenHarmony commented May 8, 2024

sokra commented May 8, 2024

sokra commented May 3, 2024 •

edited

Loading

vercel bot commented May 3, 2024 •

edited

Loading

github-actions bot commented May 3, 2024 •

edited

Loading

github-actions bot commented May 3, 2024 •

edited

Loading