You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Traceback (most recent call last):
File "train.py", line 146, in
torch.distributed.init_process_group(backend="nccl", world_size=args.world_size)
AttributeError: module 'torch.distributed' has no attribute 'init_process_group'
Traceback (most recent call last):
File "C:\ProgramData\Anaconda3\envs\rife\lib\runpy.py", line 194, in _run_module_as_main
return _run_code(code, main_globals, None,
File "C:\ProgramData\Anaconda3\envs\rife\lib\runpy.py", line 87, in _run_code
exec(code, run_globals)
File "C:\ProgramData\Anaconda3\envs\rife\lib\site-packages\torch\distributed\launch.py", line 261, in
main()
File "C:\ProgramData\Anaconda3\envs\rife\lib\site-packages\torch\distributed\launch.py", line 256, in main
raise subprocess.CalledProcessError(returncode=process.returncode,
subprocess.CalledProcessError: Command '['C:\ProgramData\Anaconda3\envs\rife\python.exe', '-u', 'train.py', '--local_rank=0', '--world_size=1']' returned non-zero exit status 1.
The text was updated successfully, but these errors were encountered:
I think your work is great. Unfortunately I do not manage to train the NN. The following error message appears in the console:
(rife) PS C:\Users\C\PycharmProjects\RIFE> python -m torch.distributed.launch --nproc_per_node=1 train.py --world_size=1
Traceback (most recent call last):
File "train.py", line 146, in
torch.distributed.init_process_group(backend="nccl", world_size=args.world_size)
AttributeError: module 'torch.distributed' has no attribute 'init_process_group'
Traceback (most recent call last):
File "C:\ProgramData\Anaconda3\envs\rife\lib\runpy.py", line 194, in _run_module_as_main
return _run_code(code, main_globals, None,
File "C:\ProgramData\Anaconda3\envs\rife\lib\runpy.py", line 87, in _run_code
exec(code, run_globals)
File "C:\ProgramData\Anaconda3\envs\rife\lib\site-packages\torch\distributed\launch.py", line 261, in
main()
File "C:\ProgramData\Anaconda3\envs\rife\lib\site-packages\torch\distributed\launch.py", line 256, in main
raise subprocess.CalledProcessError(returncode=process.returncode,
subprocess.CalledProcessError: Command '['C:\ProgramData\Anaconda3\envs\rife\python.exe', '-u', 'train.py', '--local_rank=0', '--world_size=1']' returned non-zero exit status 1.
The text was updated successfully, but these errors were encountered: