Port trial examples' config file to v2 #3721

liuzhe-lz · 2021-06-03T04:01:37Z

Notes:

config_pai and other non-local training services in most examples are removed. Instead added a comment asking users to check mnist-pytorch for training service examples.
Shared storage example is not ported. I don't have environment to test.
system_auto_tuning is not ported. I don't know how to test.
mnist-tfv1 and -keras are not ported. They are deprecated.
Non-reusable k8s training services recommends v1 config for now.
Added remote example for mnist-pytorch.
Added "config_detailed.yml" for mnist-pytorch and -tfv2.
mnist-distributed is renamed to mnist-distributed-tfv1, to match mnist-distributed-pytorch.

Fixed bugs:

When GPU indices contains only one index, the YAML field becomes a int instead of string. This causes problems.
Custom tuner's field codeDirectory has different name in code and doc.
Expanding annotation only uses v1 trial code directory.

SparkSnail · 2021-06-04T06:58:17Z

Add example for config_hybrid.yml and config_windows_v2.yml?

liuzhe-lz · 2021-06-04T07:23:18Z

Add example for config_hybrid.yml and config_windows_v2.yml?

I added a comment line in every "basic" example to inform windows users. I think a separate Windows example is bad because it makes me think NNI behaves differently on Windows and Linux. Let's discuss it in the meeting.
A hybrid example should make sense. I'm testing it now.

SparkSnail · 2021-06-04T08:21:00Z

examples/trials/efficientnet/config.yml

@@ -0,0 +1,15 @@
+searchSpaceFile: search_net.json
+trialCodeDirectory: EfficientNet-PyTorch
+trialCommand: python main.py /data/imagenet -j 12 -a efficientnet --batch-size 48 --lr 0.048 --wd 1e-5 --epochs 5 --request-from-nni


QuanluZhang · 2021-06-08T01:33:45Z

examples/trials/cifar10_pytorch/config.yml

@@ -1,23 +1,14 @@
-authorName: default
-experimentName: example_pytorch_cifar10
+searchSpaceFile: search_space.json


suggest at least for this simple example, we add "experimentName", because if users really use nni to run experiments, they want to give each experiment an easy-to-remember name (not experiment ID). If this field is not added in the example, users have to check config references, which is not friendly

QuanluZhang · 2021-06-08T01:39:04Z

examples/trials/mnist-pytorch/config.yml

-experimentName: example_mnist_pytorch
+# This is the minimal config file for an NNI experiment.
+# Use "nnictl create --config config.yml" to launch this experiment.
+# Afterwards, you can check "config_detailed.yml" for more explaination. 


explaination -> explanation

QuanluZhang · 2021-06-08T01:41:33Z

examples/trials/mnist-pytorch/config_detailed.yml

@@ -0,0 +1,42 @@
+# This example shows more configurable fields comparing to the minimal "config.yml"
+# You can use "nnictl create --config config_detailed.yml" to launch this experiment.
+# If you see an error message saying "port 8080 is used", use "nnictl stop --all" to stop previous experiment.


-> use "nnictl stop --port 8080" to stop that experiment, or use "nnictl stop --all" to stop all the previous experiments.

I prefer not to provide that much details in example. At this point the user does not need to know how to manage multiple experiments.
I'm afraid that "--port 8080" might threaten newbie users.

QuanluZhang · 2021-06-08T01:42:34Z

examples/trials/mnist-pytorch/config_detailed.yml

+
+trialCommand: python3 mnist.py  # The command to launch a trial. NOTE: change "python3" to "python" if you are using Windows.
+trialCodeDirectory: .           # The path of trial code. By default it's ".", which means the same directory of this config file.
+trialGpuNumber: 1               # How many GPUs should each trial use. CUDA is required when it's greator than zero.


greator -> greater

QuanluZhang · 2021-06-08T01:44:28Z

examples/trials/mnist-pytorch/config_detailed.yml

+  momentum:
+    _type: uniform
+    _value: [0, 1]
+


yeah, "experimentName" should be put here, we can tell users that they can omit it if they don't want to write it.

QuanluZhang · 2021-06-08T01:57:23Z

looks great!

liuzhe added 4 commits June 3, 2021 11:47

port examples to v2 and fix bugs

73044ae

add removed example back

357fa15

bugfix

cb449e4

remove debug code

d5ca817

ultmaster approved these changes Jun 4, 2021

View reviewed changes

ultmaster requested review from QuanluZhang, J-shang and SparkSnail June 4, 2021 06:36

J-shang approved these changes Jun 4, 2021

View reviewed changes

SparkSnail reviewed Jun 4, 2021

View reviewed changes

QuanluZhang requested a review from scarlett2018 June 4, 2021 11:36

liuzhe added 2 commits June 7, 2021 13:50

python -> python3

42dc350

add hybrid example

1df4f66

SparkSnail approved these changes Jun 7, 2021

View reviewed changes

QuanluZhang reviewed Jun 8, 2021

View reviewed changes

fix typo and add name field to detailed example

e43f044

scarlett2018 approved these changes Jun 8, 2021

View reviewed changes

ultmaster merged commit eb65bc3 into microsoft:master Jun 8, 2021

liuzhe-lz deleted the v2-example branch June 9, 2021 03:44

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Port trial examples' config file to v2 #3721

Port trial examples' config file to v2 #3721

liuzhe-lz commented Jun 3, 2021 •

edited

Loading

SparkSnail commented Jun 4, 2021 •

edited

Loading

liuzhe-lz commented Jun 4, 2021

SparkSnail Jun 4, 2021

QuanluZhang Jun 8, 2021

QuanluZhang Jun 8, 2021

QuanluZhang Jun 8, 2021

liuzhe-lz Jun 8, 2021

QuanluZhang Jun 8, 2021

QuanluZhang Jun 8, 2021

QuanluZhang commented Jun 8, 2021

Port trial examples' config file to v2 #3721

Port trial examples' config file to v2 #3721

Conversation

liuzhe-lz commented Jun 3, 2021 • edited Loading

SparkSnail commented Jun 4, 2021 • edited Loading

liuzhe-lz commented Jun 4, 2021

SparkSnail Jun 4, 2021

Choose a reason for hiding this comment

QuanluZhang Jun 8, 2021

Choose a reason for hiding this comment

QuanluZhang Jun 8, 2021

Choose a reason for hiding this comment

QuanluZhang Jun 8, 2021

Choose a reason for hiding this comment

liuzhe-lz Jun 8, 2021

Choose a reason for hiding this comment

QuanluZhang Jun 8, 2021

Choose a reason for hiding this comment

QuanluZhang Jun 8, 2021

Choose a reason for hiding this comment

QuanluZhang commented Jun 8, 2021

liuzhe-lz commented Jun 3, 2021 •

edited

Loading

SparkSnail commented Jun 4, 2021 •

edited

Loading