RuntimeError #7

Closed
demonlj opened this issue Jul 7, 2019 · 13 comments
demonlj commented Jul 7, 2019

System: Ubuntu
Linux pve-ubuntu 4.15.0-43-generic #46-Ubuntu SMP Thu Dec 6 14:45:28 UTC 2018 x86_64 x86_64 x86_64 GNU/Linux

python train.py --data_path data/pubmed_abstract --model_dp abstract_model/

Epoch 0/99
Traceback (most recent call last):
  File "train.py", line 236, in <module>
    batch_o_t, teacher_forcing_ratio=1)
  File "/usr/local/lib/python3.6/dist-packages/torch/nn/modules/module.py", line 493, in __call__
    result = self.forward(*input, **kwargs)
  File "/mnt/sync/ubuntu/PaperRobot-master/New paper writing/memory_generator/seq2seq.py", line 18, in forward
    stopwords, sflag)
  File "/usr/local/lib/python3.6/dist-packages/torch/nn/modules/module.py", line 493, in __call__
    result = self.forward(*input, **kwargs)
  File "/mnt/sync/ubuntu/PaperRobot-master/New paper writing/memory_generator/Decoder.py", line 134, in forward
    max_source_oov, term_output, term_id, term_mask)
  File "/mnt/sync/ubuntu/PaperRobot-master/New paper writing/memory_generator/Decoder.py", line 68, in decode_step
    term_context, term_attn = self.memory(_h.unsqueeze(0), term_output, term_mask, cov_mem)
  File "/usr/local/lib/python3.6/dist-packages/torch/nn/modules/module.py", line 493, in __call__
    result = self.forward(*input, **kwargs)
  File "/mnt/sync/ubuntu/PaperRobot-master/New paper writing/memory_generator/utils.py", line 32, in forward
    e_t = self.vt_layers[i](torch.tanh(enc_proj + dec_proj).view(batch_size * max_enc_len, -1))
RuntimeError: [enforce fail at CPUAllocator.cpp:56] posix_memalign(&data, gAlignment, nbytes) == 0. 12 vs 0

EagleW (Owner) commented Jul 7, 2019

Thank you for your interest. Which PyTorch version did you use? Are you using CUDA?

demonlj (Author) commented Jul 7, 2019

Thanks a lot for your kind reply.
PyTorch version: torch-1.1.0-cp36-cp36m-manylinux1_x86_64.whl
I'm not sure whether CUDA is used. Following your instructions, I did not find any requirement on CUDA.

EagleW (Owner) commented Jul 7, 2019

I mean, are you using a GPU?

EagleW (Owner) commented Jul 7, 2019

The default setting is to use the GPU if your system has one.
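
For context, the usual PyTorch idiom for this kind of default looks like the sketch below (a generic pattern, not necessarily PaperRobot's exact device-selection code):

import torch

# Use CUDA when a GPU is present, otherwise fall back to CPU
device = torch.device("cuda" if torch.cuda.is_available() else "cpu")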

demonlj (Author) commented Jul 7, 2019

Maybe I need to disable the GPU option: the script takes a long time to run, so I moved it from macOS to a remote server.

demonlj (Author) commented Jul 7, 2019

How do I disable the GPU option?

EagleW (Owner) commented Jul 7, 2019

Hmm, I haven't tried running on CPU; I recommend using the GPU. You can disable it by adding --gpu 0.
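
Combined with the command from the original report, a CPU-only run would look like this (assuming --gpu 0 is the flag described above):

python train.py --data_path data/pubmed_abstract --model_dp abstract_model/ --gpu 0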

demonlj (Author) commented Jul 7, 2019

No luck.

Traceback (most recent call last):
  File "train.py", line 236, in <module>
    batch_o_t, teacher_forcing_ratio=1)
  File "/usr/local/lib/python3.6/dist-packages/torch/nn/modules/module.py", line 493, in __call__
    result = self.forward(*input, **kwargs)
  File "/mnt/sync/ubuntu/PaperRobot-master/New paper writing/memory_generator/seq2seq.py", line 18, in forward
    stopwords, sflag)
  File "/usr/local/lib/python3.6/dist-packages/torch/nn/modules/module.py", line 493, in __call__
    result = self.forward(*input, **kwargs)
  File "/mnt/sync/ubuntu/PaperRobot-master/New paper writing/memory_generator/Decoder.py", line 134, in forward
    max_source_oov, term_output, term_id, term_mask)
  File "/mnt/sync/ubuntu/PaperRobot-master/New paper writing/memory_generator/Decoder.py", line 68, in decode_step
    term_context, term_attn = self.memory(_h.unsqueeze(0), term_output, term_mask, cov_mem)
  File "/usr/local/lib/python3.6/dist-packages/torch/nn/modules/module.py", line 493, in __call__
    result = self.forward(*input, **kwargs)
  File "/mnt/sync/ubuntu/PaperRobot-master/New paper writing/memory_generator/utils.py", line 32, in forward
    e_t = self.vt_layers[i](torch.tanh(enc_proj + dec_proj).view(batch_size * max_enc_len, -1))
RuntimeError: [enforce fail at CPUAllocator.cpp:56] posix_memalign(&data, gAlignment, nbytes) == 0. 12 vs 0

EagleW (Owner) commented Jul 7, 2019

Let me test on my computer without a GPU.

EagleW (Owner) commented Jul 7, 2019

I haven't encountered the same problem. Maybe you can try a smaller batch size by setting --batch_size 10?
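
With both workarounds from this thread applied, the invocation becomes:

python train.py --data_path data/pubmed_abstract --model_dp abstract_model/ --gpu 0 --batch_size 10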

EagleW closed this as completed Jul 7, 2019
EagleW reopened this Jul 7, 2019
demonlj (Author) commented Jul 7, 2019

Luckily for me, it's running now with --batch_size 10.

EagleW (Owner) commented Jul 7, 2019

Great. I think it may have been running out of memory.

EagleW closed this as completed Jul 7, 2019
EagleW (Owner) commented Jul 7, 2019

I found that the previous error is a CPU out-of-memory (OOM) message: pytorch/pytorch#20618
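
For reference, the trailing "12 vs 0" in the enforce message is posix_memalign's return value: 0 means success, and 12 is errno ENOMEM (out of memory). You can confirm the mapping in Python:

import errno, os

print(errno.ENOMEM)               # 12 on Linux
print(os.strerror(errno.ENOMEM))  # "Cannot allocate memory"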
