
FedOpt with batchnorm #1851

Merged

merged 4 commits on Jul 12, 2023
Conversation

holgerroth (Collaborator)

Fixes #1718.

Description

Support using FedOpt with models containing BatchNorm layers. BatchNorm running statistics (running_mean, running_var) are registered as buffers rather than parameters, so they are not included in self.model.named_parameters() and are therefore not updated by the server optimizer. With this PR, those entries are updated using FedAvg instead.
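A minimal pure-Python sketch of the idea (function and variable names are hypothetical, not the actual NVFlare API): keys covered by the server optimizer get an SGD-style step on the aggregated model diff, while remaining keys, such as BatchNorm running statistics, are updated FedAvg-style by applying the averaged diff directly.

```python
def fedopt_update(global_model, aggregated_diff, optimizer_param_keys, lr=1.0):
    """Hypothetical sketch of a server-side FedOpt update that also handles
    non-parameter entries such as BatchNorm running statistics.

    global_model: dict mapping key -> value (scalars here, for illustration)
    aggregated_diff: dict mapping key -> averaged client diff (client - global)
    optimizer_param_keys: keys that appear in model.named_parameters()
    """
    new_model = {}
    for key, value in global_model.items():
        diff = aggregated_diff[key]
        if key in optimizer_param_keys:
            # Server optimizer step (plain SGD shown; FedOpt may use
            # momentum or an adaptive optimizer here).
            new_model[key] = value + lr * diff
        else:
            # FedAvg-style update for buffers (e.g. BN running_mean/var):
            # apply the averaged diff directly, bypassing the optimizer.
            new_model[key] = value + diff
    return new_model
```

With lr below 1, only the optimizer-managed keys are damped; the BN buffers still move all the way to the client average, which is the behavior this PR adds.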

By setting lr=1 and momentum=0 with no learning rate scheduler, FedOpt achieves the same global model performance as FedAvg using torchvision's mobilenet_v2, as expected.
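The equivalence holds because with lr=1 and momentum=0 the server SGD step, global + lr * diff, reduces to global + diff, which is exactly the plain average of the client models. A tiny numeric check (illustrative values only):

```python
# Two clients send locally trained weights; the server holds w_global.
w_global = 2.0
client_weights = [2.6, 3.0]

# FedAvg: new global weight is the (unweighted) mean of client weights.
fedavg = sum(client_weights) / len(client_weights)

# FedOpt with SGD(lr=1, momentum=0): step along the averaged model diff.
avg_diff = sum(w - w_global for w in client_weights) / len(client_weights)
fedopt = w_global + 1.0 * avg_diff

assert abs(fedavg - fedopt) < 1e-9  # identical updates
```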

Types of changes

  • Non-breaking change (fix or new feature that would not break existing functionality).
  • Breaking change (fix or new feature that would cause existing functionality to change).
  • New tests added to cover the changes.
  • Quick tests passed locally by running ./runtest.sh.
  • In-line docstrings updated.
  • Documentation updated.

@holgerroth force-pushed the fedopt_with_batchnorm branch from 04edcaf to 4465fde on July 12, 2023 02:18
@holgerroth (Collaborator, Author)

/build

@YuanTingHsieh (Collaborator)

@holgerroth, thank you for the fix.

One thing here: since your implementation now differs from the original paper (because of this workaround to support batch norm), could we add this information to the class's docstring to make it clear?

Thanks!

@holgerroth (Collaborator, Author)


Good point. Added a description of the new behavior.
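The added note might read along these lines (class name and wording are hypothetical; see the PR diff for the actual text):

```python
class FedOptShareableGenerator:  # hypothetical name, for illustration only
    """Server-side FedOpt model update.

    Note: this implementation deviates from the original FedOpt paper
    (Reddi et al., "Adaptive Federated Optimization") in that model entries
    not returned by model.named_parameters(), such as BatchNorm running
    statistics, are not updated by the server optimizer but via FedAvg
    instead.
    """
```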



@holgerroth holgerroth enabled auto-merge (squash) July 12, 2023 21:59

@guopengf (Contributor) left a comment

LGTM

@holgerroth holgerroth merged commit 75536cd into NVIDIA:dev Jul 12, 2023
@holgerroth holgerroth deleted the fedopt_with_batchnorm branch July 12, 2023 22:17
holgerroth added a commit to holgerroth/NVFlare that referenced this pull request Dec 4, 2023
* support FedOpt with batch norm layers

* support filtered model diff
Successfully merging this pull request may close these issues.

[BUG] FedOpt algorithm not working as expected in cifar10 example