
[Retiarii] add validation in base trainers #3184

Merged
merged 26 commits into microsoft:dev-retiarii on Dec 15, 2020

Conversation

hzhua (Contributor) commented Dec 11, 2020

No description provided.

self._val_dataset = getattr(datasets, dataset_cls)(train=False,
                                                   transform=get_default_transform(dataset_cls),
                                                   **(dataset_kwargs or {}))
self._optimizer = getattr(torch.optim, optimizer_cls)(model.parameters(), **(optimizer_kwargs or {}))
self._trainer_kwargs = trainer_kwargs or {'max_epochs': 10}

# TODO: we will need at least two (maybe three) data loaders in future.
Contributor:

Remove TODO

Contributor Author:

removed
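
For context on the TODO, the trainer will eventually need separate loaders for training and validation. A minimal sketch of how the two datasets resolved above might be wrapped, assuming torchvision's MNIST and an arbitrary batch size (both are illustrative, not taken from this PR):

import torchvision.datasets as datasets
import torchvision.transforms as transforms
from torch.utils.data import DataLoader

# Illustrative defaults; the trainer resolves these from dataset_cls /
# dataset_kwargs instead of hard-coding them.
transform = transforms.ToTensor()
train_dataset = datasets.MNIST('data', train=True, download=True, transform=transform)
val_dataset = datasets.MNIST('data', train=False, download=True, transform=transform)

# One loader per split -- the "at least two data loaders" from the TODO.
train_loader = DataLoader(train_dataset, batch_size=32, shuffle=True)
val_loader = DataLoader(val_dataset, batch_size=32, shuffle=False)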

def training_step(self, batch: Tuple[torch.Tensor, torch.Tensor], batch_idx: int) -> Dict[str, Any]:
    x, y = self.training_step_before_model(batch, batch_idx)
    y_hat = self.model(x)
    return self.training_step_after_model(x, y, y_hat)

-def training_step_before_model(self, batch: Tuple[torch.Tensor, torch.Tensor], batch_idx: int, device = None):
+def training_step_before_model(self, batch: Tuple[torch.Tensor, torch.Tensor], batch_idx: int, device=None):
Contributor:

Suggest using self.device

Contributor Author:

In MultiModel, different models' inputs may need to be placed on different devices (this method is called in _train). Currently, the trainer hard-codes one GPU per model.

BTW, train_step and validation_step are not used in PyTorchImageClassificationTrainer, so they have been removed.
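
To make the before/after hook split concrete, here is a hedged single-model sketch; SketchTrainer and its _device attribute are hypothetical names for illustration, and a multi-model trainer would pass a different device per model as described above:

from typing import Any, Dict, Tuple

import torch
import torch.nn as nn
import torch.nn.functional as F

class SketchTrainer:
    def __init__(self, model: nn.Module, device: torch.device):
        # Hypothetical per-trainer device; the PR keeps device as an explicit
        # argument because each model in MultiModel may live on a different GPU.
        self.model = model.to(device)
        self._device = device

    def training_step(self, batch: Tuple[torch.Tensor, torch.Tensor], batch_idx: int) -> Dict[str, Any]:
        x, y = self.training_step_before_model(batch, batch_idx, self._device)
        y_hat = self.model(x)
        return self.training_step_after_model(x, y, y_hat)

    def training_step_before_model(self, batch, batch_idx, device=None):
        # Pre-forward hook: move the raw batch onto the model's device.
        x, y = batch
        if device is not None:
            x, y = x.to(device), y.to(device)
        return x, y

    def training_step_after_model(self, x, y, y_hat) -> Dict[str, Any]:
        # Post-forward hook: compute the loss once the forward pass has run.
        return {'loss': F.cross_entropy(y_hat, y)}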

summed_loss = sum(losses)
summed_loss.backward()
for opt in self._optimizers:
    opt.step()
if batch_idx % 50 == 0:
    nni.report_intermediate_result(report_loss)
# if batch_idx % 50 == 0:
Contributor:

Why comment this?

Contributor Author:

It was for debugging; training_loss is not reported. Removed.
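
For readers following the loop above, a hedged sketch of the multi-model pattern this snippet comes from; the loss function, reporting cadence, and payload shape are assumptions, not taken from the PR:

import nni
import torch.nn.functional as F

def train_batch(models, optimizers, batch, batch_idx):
    # One loss per model; summing them lets a single backward() populate
    # gradients for every model before each optimizer steps independently.
    x, y = batch
    for opt in optimizers:
        opt.zero_grad()
    losses = [F.cross_entropy(model(x), y) for model in models]
    summed_loss = sum(losses)
    summed_loss.backward()
    for opt in optimizers:
        opt.step()
    if batch_idx % 50 == 0:
        # Report per-model losses as an intermediate result (cadence assumed).
        nni.report_intermediate_result([loss.item() for loss in losses])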

@ultmaster (Contributor) commented:

NNI's line limit is 140. You might need to configure your autopep8 to avoid unwanted linebreaks. :)
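
For reference, one way to align autopep8 with that limit; the setup.cfg section name is an assumption and may vary across autopep8 versions:

autopep8 --max-line-length 140 --in-place <file>.py

# or persistently, e.g. in setup.cfg:
[pep8]
max_line_length = 140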

@ultmaster ultmaster merged commit a0e2f8e into microsoft:dev-retiarii Dec 15, 2020