Fix regional dependencies for sub-`TaskList`s and make solvers work for abitrary partitioning #1132

lroberts36 · 2024-06-26T02:21:06Z

PR Summary

Currently, sublists of TaskLists do not have their regional dependencies set correctly. This results in the bug described in #1125. This PR changes TaskRegion::BuildGraph() to correctly register regional dependencies among all TaskLists and closes #1125.

Additionally, it takes a second step to make the multigrid solver work when parthenon/mesh/pack_size != -1 by including "self-communication" between coarse blocks on a finer two-level composite grid and fine leaf blocks on the next coarser two-level composite grid. Although no data is communicated in these communication channels, this introduces a dependency (outside of the tasking framework) that ensures operations on these blocks that are part of multiple gmg levels occur in the correct order.

Getting the multigrid stuff to work was a painful debugging process. As a result, I introduced some capability for more customized naming of tasks and the ability to write task labels to screen as the tasks are called. This turned out to be fairly useful for debugging what was going on.

PR Checklist

Yurlungur

I really feel like @jdolence should look at this one.

…tion

…ltigrid

lroberts36 · 2024-07-22T15:37:08Z

src/bvals/comms/bnd_info.cpp

@@ -177,7 +177,7 @@ CalcIndices(const NeighborBlock &nb, MeshBlock *pmb,
                                (neighbor_bounds[dir].e - neighbor_bounds[dir].s + 1);
        s[dir] += nb.origin_loc.l(dir) % 2 == 1 ? extra_zones - interior_offset : 0;
        e[dir] -= nb.origin_loc.l(dir) % 2 == 0 ? extra_zones - interior_offset : 0;
-        if (ir_type == IndexRangeType::InteriorSend) {
+        if (ir_type == IndexRangeType::InteriorSend && !prores) {


Changes to CalcIndices are required since we are actually passing the correct neighbor information in GetInterior*.

lroberts36 · 2024-07-22T15:38:29Z

src/bvals/comms/bnd_info.hpp

@@ -59,6 +59,7 @@ struct BndInfo {
  bool allocated = true;
  bool buf_allocated = true;
  int alloc_status;
+  bool same_to_same = false;


This flag denotes that this buffer represents a communication from this block to itself and that the buffer does not need to be filled (a message just needs to be sent).

lroberts36 · 2024-07-22T15:40:02Z

src/bvals/comms/bvals_utils.hpp

+  int other = (nb.gid == pmb->gid && (btype == BoundaryType::gmg_restrict_recv ||
+                                      btype == BoundaryType::gmg_restrict_send))
+                  ? 1
+                  : 0;
+  return {sender_id, receiver_id, pcv->label(), location_idx, other};


Add more information to the communication key so that we can represent same-to-same restriction and same-to-same prolongation as different channels.

lroberts36 · 2024-07-22T15:42:56Z

src/tasks/tasks.hpp

+      : Task(std::forward<TID>(dep), label, 0, func, limits) {}
+  template <typename TID>
+  Task(TID &&dep, const std::string &label, int verbose_level,
+       const std::function<TaskStatus()> &func, std::pair<int, int> limits = {1, 1})
+      : label_(label), verbose_level_(verbose_level), f(func), exec_limits(limits) {


Add verbosity flag to tasks, if verbose_level_ > 0 the task will print its name out to stdout before starting the task. Useful for debugging.

This is a useful feature. I wonder if the verbosity should be blanket applied to all tasks based off a verbose flag set in pin, rather than enrolled via each AddTask?

EDIT: this proposal could actually be handled via the downstream app

I agree that would be nice, but I think that it requires a bit of threading to make it work (since the tasking stuff doesn't directly access pin currently). I would leave this for a future PR.

In practice, I just manually comment out the verbose_level_ > 0 test and recompile if I want to see everything...

lroberts36 · 2024-07-22T15:44:26Z

src/tasks/tasks.hpp

+      return AddTaskImpl(tq, dep, std::forward<Arg1>(arg1), 0,
+                         std::forward<Args>(args)...);
+    } else if constexpr (is_tuple_t<Arg1>::value) {
+      return AddTaskImpl(tq, dep, std::get<0>(arg1), std::get<1>(arg1), std::get<2>(arg1),
+                         std::forward<Args>(args)...);


Allow passing a tuple containing the name of the task, the verbosity level of the task, and the task function itself. Allows easy print out of the order in which tasks are run.

lroberts36 · 2024-07-22T15:45:24Z

src/tasks/tasks.hpp

+  std::vector<TaskList *> GetAllTaskLists() {
+    std::vector<TaskList *> list;
+    GetAllTaskListsInternal(list);
+    return list;
+  }


Necessary for linking regional tasks in iterative task lists.

Yurlungur

Very minor nitpicks on the debugging code. Otherwise LGTM.

src/bvals/comms/bnd_info.cpp

src/tasks/tasks.hpp

pdmullen

LGTM! I have tested this in a downstream app invoking iterative tasking. This MR fixes (1) hangs and (2) undefined behavior when invoking pack_size = 1 that currently exist with parthenon/develop.

src/interface/mesh_data.hpp

pdmullen · 2024-07-25T17:16:17Z

src/tasks/tasks.hpp

+      : Task(std::forward<TID>(dep), label, 0, func, limits) {}
+  template <typename TID>
+  Task(TID &&dep, const std::string &label, int verbose_level,
+       const std::function<TaskStatus()> &func, std::pair<int, int> limits = {1, 1})
+      : label_(label), verbose_level_(verbose_level), f(func), exec_limits(limits) {


This is a useful feature. I wonder if the verbosity should be blanket applied to all tasks based off a verbose flag set in pin, rather than enrolled via each AddTask?

EDIT: this proposal could actually be handled via the downstream app

tst/regression/test_suites/poisson_gmg/parthinput.poisson

lroberts36 added 4 commits June 25, 2024 19:25

start

aa78114

Actually add regional dependencies of sublists

0425f96

Merge branch 'develop' into lroberts36/fix-iterative-task-qualifiers

4bbe65b

try to fix solver task lists

012381b

lroberts36 changed the title ~~WIP: Fix regional dependencies for sublists~~ WIP: Fix regional dependencies for sub-TaskLists Jun 26, 2024

Yurlungur requested a review from jdolence June 26, 2024 21:14

Yurlungur reviewed Jun 26, 2024

View reviewed changes

lroberts36 added 21 commits July 3, 2024 11:37

Extended debug output

76f915b

more shit

699e44f

Work on making neighbor block get passed correctly

7a4d480

actually use neighbor info for full interior prolongation and restric…

781c8df

…tion

Add same to same flag

762ebfd

Apparently working communication

6ffa6af

make buffer size zero for same to same

8ab4c62

Merge branch 'add-self-communication' into debug-multigrid

390192d

remove unecessary change

e545ade

also self communicate when going to finer

a9ee23d

small

36b1150

almost there...

20ec5cf

Merge branch 'develop' into lroberts36/fix-iterative-task-qualifiers

0cfdfd1

Merge branch 'lroberts36/fix-iterative-task-qualifiers' into debug-mu…

8b4e6f4

…ltigrid

correctly build neighbor block after merge

63b46e5

Add another key indicator

c2c644b

remove debug output

09228c9

fix tasking bug

9ed160c

use pack_size=1 for tests, fails with MPI

9478366

format

e9a5253

lint

92e14b1

lroberts36 commented Jul 22, 2024

View reviewed changes

lroberts36 added 8 commits July 23, 2024 10:02

Add a little more error output

3dc19c3

Add label method for grid identifier

652eacc

add method for querying MeshData about gids

a135717

change verbose output to include status

37f1edf

fix MG tasking(?)

cf78353

format and lint

ccd7e7b

revert unecessary changes in Mesh

754e6bd

changelog

92c6633

lroberts36 changed the title ~~WIP: Fix regional dependencies for sub-TaskLists~~ Fix regional dependencies for sub-TaskLists and make solvers work for abitrary partitioning Jul 23, 2024

lroberts36 requested review from bprather, pgrete, pdmullen and Yurlungur July 23, 2024 16:34

Merge branch 'develop' into lroberts36/fix-iterative-task-qualifiers

9334be2

Yurlungur approved these changes Jul 25, 2024

View reviewed changes

src/bvals/comms/bnd_info.cpp Outdated Show resolved Hide resolved

src/tasks/tasks.hpp Outdated Show resolved Hide resolved

pdmullen approved these changes Jul 25, 2024

View reviewed changes

respond to Jonah comments

63f9b08

lroberts36 enabled auto-merge July 25, 2024 17:32

lroberts36 merged commit ea1039f into develop Jul 25, 2024
53 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix regional dependencies for sub-`TaskList`s and make solvers work for abitrary partitioning #1132

Fix regional dependencies for sub-`TaskList`s and make solvers work for abitrary partitioning #1132

lroberts36 commented Jun 26, 2024 •

edited

Loading

Yurlungur left a comment

lroberts36 Jul 22, 2024

lroberts36 Jul 22, 2024

lroberts36 Jul 22, 2024

lroberts36 Jul 22, 2024

pdmullen Jul 25, 2024 •

edited

Loading

lroberts36 Jul 25, 2024

lroberts36 Jul 22, 2024

lroberts36 Jul 22, 2024

Yurlungur left a comment

pdmullen left a comment

pdmullen Jul 25, 2024 •

edited

Loading

Fix regional dependencies for sub-TaskLists and make solvers work for abitrary partitioning #1132

Fix regional dependencies for sub-TaskLists and make solvers work for abitrary partitioning #1132

Conversation

lroberts36 commented Jun 26, 2024 • edited Loading

PR Summary

PR Checklist

Yurlungur left a comment

Choose a reason for hiding this comment

lroberts36 Jul 22, 2024

Choose a reason for hiding this comment

lroberts36 Jul 22, 2024

Choose a reason for hiding this comment

lroberts36 Jul 22, 2024

Choose a reason for hiding this comment

lroberts36 Jul 22, 2024

Choose a reason for hiding this comment

pdmullen Jul 25, 2024 • edited Loading

Choose a reason for hiding this comment

lroberts36 Jul 25, 2024

Choose a reason for hiding this comment

lroberts36 Jul 22, 2024

Choose a reason for hiding this comment

lroberts36 Jul 22, 2024

Choose a reason for hiding this comment

Yurlungur left a comment

Choose a reason for hiding this comment

pdmullen left a comment

Choose a reason for hiding this comment

pdmullen Jul 25, 2024 • edited Loading

Choose a reason for hiding this comment

Fix regional dependencies for sub-`TaskList`s and make solvers work for abitrary partitioning #1132

Fix regional dependencies for sub-`TaskList`s and make solvers work for abitrary partitioning #1132

lroberts36 commented Jun 26, 2024 •

edited

Loading

pdmullen Jul 25, 2024 •

edited

Loading

pdmullen Jul 25, 2024 •

edited

Loading