Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

ValueError: too many values to unpack (expected 4) - Training the backbone model. #3

Open
navarmn opened this issue Dec 21, 2018 · 1 comment

Comments

@navarmn
Copy link

navarmn commented Dec 21, 2018

Epoch: [1][20577/20578] Time 0.761 Data 0.001 Loss 136.7847
Epoch: [1][20578/20578] Time 0.761 Data 0.001 Loss 157.6949
Traceback (most recent call last):
File "train.py", line 401, in
trainer.run(train_loader, args.config.training.num_epochs)
File "/usr/local/lib/python3.6/dist-packages/pytorch_ignite-0.1.2-py3.6.egg/ignite/engine/engine.py", line 326, in run
File "/usr/local/lib/python3.6/dist-packages/pytorch_ignite-0.1.2-py3.6.egg/ignite/engine/engine.py", line 291, in _handle_exception
File "/usr/local/lib/python3.6/dist-packages/pytorch_ignite-0.1.2-py3.6.egg/ignite/engine/engine.py", line 317, in run
File "/usr/local/lib/python3.6/dist-packages/pytorch_ignite-0.1.2-py3.6.egg/ignite/engine/engine.py", line 226, in _fire_event
File "train.py", line 269, in log_epoch
evaluator.run(train_loader)
File "/usr/local/lib/python3.6/dist-packages/pytorch_ignite-0.1.2-py3.6.egg/ignite/engine/engine.py", line 326, in run
File "/usr/local/lib/python3.6/dist-packages/pytorch_ignite-0.1.2-py3.6.egg/ignite/engine/engine.py", line 291, in _handle_exception
File "/usr/local/lib/python3.6/dist-packages/pytorch_ignite-0.1.2-py3.6.egg/ignite/engine/engine.py", line 313, in run
File "/usr/local/lib/python3.6/dist-packages/pytorch_ignite-0.1.2-py3.6.egg/ignite/engine/engine.py", line 280, in _run_once_on_dataset
File "/usr/local/lib/python3.6/dist-packages/pytorch_ignite-0.1.2-py3.6.egg/ignite/engine/engine.py", line 291, in _handle_exception
File "/usr/local/lib/python3.6/dist-packages/pytorch_ignite-0.1.2-py3.6.egg/ignite/engine/engine.py", line 273, in _run_once_on_dataset
File "/usr/local/lib/python3.6/dist-packages/pytorch_ignite-0.1.2-py3.6.egg/ignite/engine/engine.py", line 226, in _fire_event
File "/usr/local/lib/python3.6/dist-packages/torch/autograd/grad_mode.py", line 43, in decorate_no_grad
return func(*args, **kwargs)
File "/usr/local/lib/python3.6/dist-packages/pytorch_ignite-0.1.2-py3.6.egg/ignite/metrics/metric.py", line 65, in iteration_completed
File "/home/navar/savoz/aes-lac-2018/codes/metrics.py", line 17, in update
self.metrics[i].update(output)
File "/home/navar/savoz/aes-lac-2018/codes/metrics.py", line 46, in update
out, targets, out_sizes, target_sizes = output
ValueError: too many values to unpack (expected 4)

@navarmn
Copy link
Author

navarmn commented Dec 21, 2018

A turn-around, not a solution:
I just realized the --checkpoint argument is not working. I am using --checkpoint-per-batch instead.

The model saves fine at the end of each batch, and the code brokes at the end of the training.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant