Convert protobuf model to native format #550

Merged
merged 11 commits into securefederatedai:develop
Nov 18, 2022

Conversation

psfoley
Contributor

@psfoley psfoley commented Oct 27, 2022

This PR:

  • Adds a new command, fx model save, that takes as input the .pbuf model file produced by a federated experiment and converts it to the native PyTorch / TensorFlow model representation for future use.
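A typical invocation from inside an experiment workspace might look like this (the paths are illustrative; per the option definitions reviewed further down, -i/--input is required while -o/--output is optional with a default):

fx model save -i save/model.pbuf -o native_model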

This PR is a WIP. The core functionality is to be refactored by @igor-davidyuk into the utilities folder so that it can also be used via the API.

closes #552

@psfoley psfoley requested a review from igor-davidyuk October 27, 2022 21:59
@igor-davidyuk igor-davidyuk self-assigned this Oct 31, 2022
@igor-davidyuk igor-davidyuk marked this pull request as draft October 31, 2022 07:54
@igor-davidyuk igor-davidyuk removed their request for review October 31, 2022 07:54
@igor-davidyuk
Contributor

igor-davidyuk commented Oct 31, 2022

We want to introduce an additional piece of functionality in two forms:

  • CLI: the fx model save command, called from the workspace, must save the model to disk.
  • Python API: openfl.model_save(path) must return the model object.

We start with the path to a model.pbuf snapshot: we need to decompress it and load its weights into a model object, which we can then save or return (see the sketch after the list below).

  1. To save the model in a native format, we need a save_native method implemented in the TaskRunner.
    This method is not implemented in all of our template TaskRunners, but that is acceptable, as we can ask users to implement it.
  2. To return the model as a Python object, we need access to that object.
    Models are stored in different ways across our template TaskRunners, and there is a particular problem with the TF1 examples. We may need to introduce an additional get_model method to all of our template TaskRunners.
  3. To get access to the model, we need to initialize the TaskRunner.
    Model initialization happens in TaskRunner code and often requires feature_shape information to build the model.
  4. To initialize the TaskRunner, we need the plan.yaml path and a DataLoader.
    Again, model initialization is coupled with DataLoader initialization.
  5. To initialize the DataLoader, we need data.yaml and cols.yaml.
    Here we use the first collaborator's name to get its data_path and load the data to obtain the feature_shape. This works for our templates, which use synthetic or downloaded datasets, but in real-world use cases users often do not have access to any data from their machine. One way to overcome this obstacle would be to define feature_shape in plan.yaml or hardcode it inside the DataLoader.
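To make the chain above concrete, the whole path might be sketched roughly as follows (Plan.parse, get_data_loader, get_task_runner, and rebuild_model are modeled on OpenFL's TaskRunner workflow, but the exact signatures are assumptions here, and decompress_model_proto is a hypothetical helper):

from pathlib import Path
from openfl.federated import Plan

# 5. The DataLoader is configured from plan.yaml, cols.yaml, and data.yaml
plan = Plan.parse(plan_config_path=Path('plan/plan.yaml'),
                  cols_config_path=Path('plan/cols.yaml'),
                  data_config_path=Path('plan/data.yaml'))

# We borrow the first collaborator's data_path to obtain the feature_shape
first_collaborator = plan.authorized_cols[0]  # assumed attribute name
data_loader = plan.get_data_loader(first_collaborator)

# 4. / 3. The TaskRunner (and with it the model) is built from plan + DataLoader
task_runner = plan.get_task_runner(data_loader)

# 2. / 1. Decompress the snapshot, load the weights, then save or return
tensor_dict = decompress_model_proto(Path('save/model.pbuf'))  # hypothetical helper
task_runner.rebuild_model(round_num=0, input_tensor_dict=tensor_dict)
task_runner.save_native('native_model')  # CLI path; the Python API would return task_runner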

Given all of the above, at this point we cannot guarantee that the model save API will work as intended for all users.

@igor-davidyuk
Contributor

igor-davidyuk commented Nov 3, 2022

As a solution to the problems mentioned above, I propose the following approach:

  1. For the CLI command, we will rely on the TaskRunner's save_native method, which users can implement or override if needed.
  2. For the Python API, we will return a TaskRunner object, thus allowing users to interact with the linked model.

These signatures let us mitigate the issues caused by the diversity of model definition forms in our TaskRunners while still delivering the expected functionality. The data problem mentioned under point 5 is deliberately ignored, as we know it is already solved before fx plan initialize is executed, which follows a similar path.
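For illustration, a minimal save_native for a PyTorch-based TaskRunner could be little more than a wrapper around torch.save (a sketch under the assumption that the runner holds its network in self.model; the shipped implementations may also handle optimizer state):

import torch

class PyTorchTaskRunnerSketch:
    """Illustrative fragment, not the actual OpenFL class."""

    def __init__(self, model: torch.nn.Module):
        self.model = model

    def save_native(self, filepath):
        # Persist the wrapped model's weights in PyTorch's native format
        torch.save(self.model.state_dict(), filepath)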

Work left in this PR:

  • Update the docs
  • Write tests

@igor-davidyuk igor-davidyuk marked this pull request as ready for review November 3, 2022 13:56
@igor-davidyuk igor-davidyuk changed the title WIP: Convert protobuf model to native format Convert protobuf model to native format Nov 3, 2022
@option('-i', '--input', 'model_protobuf_path', required=True,
help='The model protobuf to convert',
type=ClickPath(exists=True))
@option('-o', '--output', 'output_filepath', required=False,
Contributor

We could make this option required as well, to make sure users provide a path for the saved model. If someone forgot to specify it, it would be easier to flag the incorrect command call and ask for the path. Otherwise, they have to search for the save path in the console output.

Contributor

I understand the intention, but I would argue that we should make arguments required only if they are literally required.

Contributor

output_filepath is required by the _save function for calling TaskRunner.save_native.

For example, torch.save and tf.keras.Model.save_weights do require a file argument.
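Indeed, the framework-level call takes a mandatory file argument (toy model purely for illustration):

import torch
import torch.nn as nn

model = nn.Linear(4, 2)  # toy model for illustration
torch.save(model.state_dict(), 'model.pth')  # the file argument cannot be omitted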

Contributor

Yes, but those are Python function calls; in our case we have a CLI command that is called from a specific place, namely an experiment workspace.
Honestly, I am not against making this argument required. @mansishr, what do you think?

Contributor

I believe the approach used here, with a default value, is a good one. The command checks whether the file already exists and asks the user for confirmation before overwriting it. At the end, it prints the path where the model was saved.

Collaborator

I agree, specifying a default path is helpful for our CLI command.
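A sketch of the default-plus-confirmation pattern being endorsed here (the option names follow the diff above, but the default value and messages are illustrative, not the exact merged code):

from pathlib import Path
import click

@click.command()
@click.option('-o', '--output', 'output_filepath', required=False,
              default='output_model',
              help='Where to save the converted model.')
def save(output_filepath):
    # Ask for confirmation before overwriting an existing file
    if Path(output_filepath).exists():
        click.confirm(f'{output_filepath} exists. Overwrite?', abort=True)
    # ... convert the protobuf and save the model here ...
    click.echo(f'Saved model to {output_filepath}')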

@igor-davidyuk igor-davidyuk self-requested a review November 9, 2022 13:27
Signed-off-by: igor-davidyuk <igor.davidyuk@intel.com>
Signed-off-by: igor-davidyuk <igor.davidyuk@intel.com>
Co-authored-by: Ilya Trushkin <ilya.trushkin@intel.com>
Collaborator

@mansishr mansishr left a comment

Looks good to me

@psfoley psfoley merged commit 3f2337d into securefederatedai:develop Nov 18, 2022
@itrushkin itrushkin mentioned this pull request Nov 23, 2022
aleksandr-mokrov pushed a commit to aleksandr-mokrov/openfl that referenced this pull request Nov 28, 2022
* Initial implementation of CLI command to save model in native format (to be refactored)

* method functioning

* updated function and docs

* Add check for overwriting output file

* add unit and integration tests

* fix test

* signed-off commit

Signed-off-by: igor-davidyuk <igor.davidyuk@intel.com>

* support calling python command outside workspace

Signed-off-by: igor-davidyuk <igor.davidyuk@intel.com>

* Ilya's typo fix

Co-authored-by: Ilya Trushkin <ilya.trushkin@intel.com>

* Update tests/openfl/interface/test_model_api.py

* Update tests/openfl/interface/test_model_api.py

Signed-off-by: igor-davidyuk <igor.davidyuk@intel.com>
Co-authored-by: igor-davidyuk <igor.davidyuk@intel.com>
Co-authored-by: Ilya Trushkin <ilya.trushkin@intel.com>
Signed-off-by: Aleksandr Mokrov <aleksandr.mokrov@intel.com>
Successfully merging this pull request may close these issues.

Load a trained pbuf model into general pytorch env