
Broken pipe error when processing top boundary conditions #9

Open
Gabrielsvc opened this issue Apr 4, 2022 · 4 comments
Labels
bug Something isn't working

Comments

@Gabrielsvc

Processing top boundary conditions...
0%| | 0/5 [00:00<?, ?it/s]
Killed
(wrf4palm) isi-user@eolic-support:~/Repos/WRF4PALM$ Process ForkPoolWorker-121:
Traceback (most recent call last):
File "/home/isi-user/miniconda3/envs/wrf4palm/lib/python3.9/site-packages/multiprocess/pool.py", line 131, in worker
put((job, i, result))
File "/home/isi-user/miniconda3/envs/wrf4palm/lib/python3.9/site-packages/multiprocess/queues.py", line 381, in put
self._writer.send_bytes(obj)
File "/home/isi-user/miniconda3/envs/wrf4palm/lib/python3.9/site-packages/multiprocess/connection.py", line 208, in send_bytes
self._send_bytes(m[offset:offset + size])
File "/home/isi-user/miniconda3/envs/wrf4palm/lib/python3.9/site-packages/multiprocess/connection.py", line 413, in _send_bytes
self._send(buf)
File "/home/isi-user/miniconda3/envs/wrf4palm/lib/python3.9/site-packages/multiprocess/connection.py", line 376, in _send
n = write(self._handle, buf)
BrokenPipeError: [Errno 32] Broken pipe

This happened with both max_pool=4 and max_pool=12, using the wrf4palm environment set up with conda. I will look further into why this is happening. Our namelist is attached to the issue.
namelist_wrf4palm.txt
And here is the wrfout file, which contains the first day of January with 48 timesteps, one every 30 minutes.
Drive link for wrfout file

@dongqi-DQ dongqi-DQ added the bug Something isn't working label Apr 4, 2022
@Gabrielsvc
Author

Huh, running on a more powerful machine got me one more iteration before the process was killed (2 out of 5, instead of 0 or 1 out of 5 iterations). Maybe it's breaking due to insufficient hardware?

@Gabrielsvc
Author

Found the culprit. Memory was blowing up during this step, as shown by running htop alongside run_config_wrf4palm.py. The extra iteration was only reached thanks to the extra 8 GB of RAM on the new machine. I've increased the swap size from 4 GB to 18 GB.

Maybe add a check at some point in the code so that future users will know what caused the problem?
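For example, a minimal sketch of such a check, assuming psutil is available in the environment (the function name, threshold estimate, and message below are illustrative, not existing WRF4PALM code):

```python
# Hypothetical pre-flight memory check (illustrative only, not WRF4PALM code).
# Warns the user before a memory-heavy step if available RAM looks too small.
import psutil


def check_available_memory(required_bytes, label="top boundary processing"):
    """Warn if the estimated memory for a step exceeds what is currently free."""
    available = psutil.virtual_memory().available
    if available < required_bytes:
        print(
            f"WARNING: {label} may need ~{required_bytes / 1e9:.1f} GB of RAM, "
            f"but only {available / 1e9:.1f} GB is available. The process may be "
            "killed by the OOM killer; consider adding swap or reducing max_pool."
        )
        return False
    return True


# Example usage: estimate the size of the array about to be built and check it.
# estimated_bytes = n_timesteps * nz * ny * nx * 4  # float32
# check_available_memory(estimated_bytes)
```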

@dongqi-DQ
Owner

The RAM usage has long been an issue... I will try to figure out how to optimise this further and will add more info to the code and the documentation.

@dongqi-DQ
Owner

I've modified the code for top boundary processing (see commit d6fc6e2). Previously, the code loaded the entire dataset, which could be very large, into RAM. Testing on my side shows a 40% drop in RAM usage. When very fine grid spacing is used, the problem might still exist, so I will leave this issue open for now and see if I can figure out how to optimise this further.
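Roughly, the idea is to avoid pulling the full wrfout variables into memory at once and instead work through them one timestep at a time. A minimal sketch of that pattern with xarray (not the actual change in d6fc6e2; the file name and variable name are placeholders, and lazy chunking requires dask to be installed):

```python
# Illustrative only: process the top boundary timestep by timestep instead of
# loading the whole dataset into RAM. Names are placeholders, not WRF4PALM code.
import numpy as np
import xarray as xr

# Open lazily and chunk along the time dimension so only one step is read at a time.
ds = xr.open_dataset("wrfout_d01_2022-01-01.nc", chunks={"Time": 1})

w = ds["W"]  # vertical velocity, dims (Time, bottom_top_stag, south_north, west_east)
n_time = ds.sizes["Time"]
out = np.empty((n_time,) + w.shape[2:], dtype=np.float32)

for t in range(n_time):
    # .isel(...).values materialises only this single timestep at the model top,
    # keeping peak memory roughly 1/n_time of the eager approach.
    out[t] = w.isel(Time=t, bottom_top_stag=-1).values

ds.close()
```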
