Async IO for Particles #1058
Conversation
```cpp
if (AsyncOut::UseAsyncOut()) {
    WriteBinaryParticleDataAsync(*this, dir, name,
                                 write_real_comp, write_int_comp,
                                 real_comp_names, int_comp_names);
```
Any reason why the async version does not take `F`?
I plan to remove `F` soon. WarpX will implement this differently, by copying to a tmp particle container and applying a filter during the copy. That functionality will be useful in other places, and it will let us remove all those extra overloads of `WritePlotFile` and `Checkpoint`.
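A minimal sketch of the "filter during copy" idea described above, assuming a simplified stand-in particle type; the names here are illustrative and not AMReX API. Instead of threading a filter functor `F` through every I/O overload, the particles that pass the predicate are copied into a temporary container, which can then be handed to the unfiltered write path:

```cpp
#include <cassert>
#include <vector>

// Minimal stand-in for a particle; real AMReX particles carry more state.
struct Particle {
    double x;
    int id;
};

// Copy only the particles that satisfy the predicate into a temporary
// container. The unfiltered I/O routines can then operate on `tmp`, so
// no filter parameter needs to appear in their signatures.
template <typename Pred>
std::vector<Particle> filtered_copy(const std::vector<Particle>& src, Pred&& keep)
{
    std::vector<Particle> tmp;
    tmp.reserve(src.size());
    for (const auto& p : src) {
        if (keep(p)) tmp.push_back(p);
    }
    return tmp;
}
```

The same filtered temporary could be reused by checkpointing, plotfiles, and diagnostics alike, which is why it removes the need for the extra overloads.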
```cpp
if (MyProc == IOProcNumber)
{
    if ( ! amrex::UtilCreateDirectory(LevelDir, 0755))
    {
        amrex::CreateDirectoryFailed(LevelDir);
    }
}
ParallelDescriptor::Barrier();
```
Can we try to merge this with lines 340-347?
```cpp
Vector<Vector<Long> > np_per_grid(pc.finestLevel()+1);
for (int lev = 0; lev <= pc.finestLevel(); lev++)
{
    np_per_grid[lev] = pc.NumberOfParticlesInGrid(lev);
```
We could also do only a local reduction here and then do a single mpi_allreduce after the for loop.
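A sketch of the restructuring suggested above, under the assumption that the per-level counts can be flattened into one buffer: count locally for every level inside the loop, then issue a single reduction over the flattened buffer afterwards, rather than one reduction per level. The MPI/AMReX call is shown only as a comment so the pattern itself runs standalone; all names are illustrative:

```cpp
#include <cstddef>
#include <vector>

// Flatten per-level local counts into one contiguous buffer so that a
// single global reduction can replace one reduction per level.
std::vector<long> flatten_counts(const std::vector<std::vector<long>>& np_per_grid)
{
    std::vector<long> flat;
    for (const auto& level_counts : np_per_grid) {
        flat.insert(flat.end(), level_counts.begin(), level_counts.end());
    }
    // One reduction over the whole buffer, e.g. (AMReX call, assumption):
    //   ParallelAllReduce::Sum(flat.data(), flat.size(),
    //                          ParallelContext::CommunicatorSub());
    return flat;
}
```

Batching this way trades several small collectives for one larger one, which typically reduces latency cost at scale.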
And for the following code in `NumberOfParticlesInGrid`:

```cpp
    ParallelAllReduce::Sum(&nparticles[0], ngrids, ParallelContext::CommunicatorSub());
}
```
I think @mrowan137 has shown it's much faster to gather and broadcast. @mrowan137 Can you comment on that? There is a function for the gather, `ParallelDescriptor::GatherLayoutDataToVector`.
@atmyers if you look at `DistributionMapping::makeKnapSack (const LayoutData<Real>& rcost_local, ...)` in AMReX_DistributionMapping.cpp you can see the alternate gather/broadcast sequence; it should probably be faster than the `ParallelAllReduce::Sum` here.
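A standalone sketch of why gather-then-broadcast can beat an allreduce in this situation: when each grid's count is owned by exactly one rank, an allreduce sums mostly-zero full-size buffers from every rank, while gathering only the owned entries to the I/O rank and broadcasting the assembled vector moves far less data. The actual MPI calls (e.g. `MPI_Gatherv`/`MPI_Bcast`, or AMReX's `ParallelDescriptor::GatherLayoutDataToVector`) are simulated here so the sketch runs without MPI; names are illustrative:

```cpp
#include <cstddef>
#include <utility>
#include <vector>

// Each rank contributes only (grid index, count) pairs for grids it owns.
// The "gather" step assembles the full vector on the I/O rank; a "broadcast"
// would then ship the assembled vector to every rank.
std::vector<long> gather_then_broadcast(
    const std::vector<std::vector<std::pair<int, long>>>& owned_by_rank,
    std::size_t ngrids)
{
    // "Gather": assemble counts from each rank's owned grids on the I/O rank.
    std::vector<long> assembled(ngrids, 0);
    for (const auto& rank_entries : owned_by_rank) {
        for (const auto& entry : rank_entries) {
            assembled[static_cast<std::size_t>(entry.first)] = entry.second;
        }
    }
    // "Broadcast": every rank would now receive `assembled` unchanged.
    return assembled;
}
```

The payoff grows with the number of ranks, since the gathered payload scales with the number of grids rather than ranks times grids.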
```cpp
auto wrc = std::make_shared<Vector<int> >(write_real_comp);
auto wic = std::make_shared<Vector<int> >(write_int_comp);
auto rcn = std::make_shared<Vector<std::string> >(real_comp_names);
auto icn = std::make_shared<Vector<std::string> >(int_comp_names);
```
Are they needed? Can we just use `write_real_comp` etc. directly in the lambda below?
Indeed they are not. Fixed.
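A small sketch of the C++ point behind this exchange, with illustrative names rather than the PR's actual code: the `shared_ptr` wrappers are unnecessary because a lambda that captures a vector by value stores its own independent copy inside the closure, and that copy stays alive for the closure's lifetime, which is exactly what a deferred (async) writer needs:

```cpp
#include <cstddef>
#include <functional>
#include <string>
#include <vector>

// Build a deferred task. Capturing `names` by value means the closure owns
// its own copy, so the task remains valid even after the caller's vector
// is cleared or destroyed -- no shared_ptr indirection required.
std::function<std::size_t()> make_async_task(std::vector<std::string> names)
{
    return [names]() { return names.size(); };
}
```

The by-value capture copies once at task creation, the same cost the `make_shared` wrappers paid, but with less indirection and no reference-count traffic.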
It's a big PR. Although I made some comments, I can approve it and then you can decide whether you want to improve it later in follow-up PRs.
Thanks for your comments - there's no hurry to merge this right now. I'll try your suggestions.