
Add mmsegmentation DeepLabv3(+) #684

Merged Mar 10, 2022 (25 commits)
Conversation

@Landanjs (Contributor) commented Mar 8, 2022

Motivation

We were using torchvision's DeepLabv3 model for segmentation results, but most papers use DeepLabv3+ as a baseline for semantic segmentation tasks.

Implementation

This PR uses the DeepLabv3+ model from mmsegmentation, a commonly used segmentation library (used in at least ConvNeXt, Swin, and SegFormer). One caveat is that mmsegmentation makes adjustments to the original DeepLabv3(+) model, but this may be justified since ADE20k should be more difficult than the datasets DeepLabv3 was originally proposed for.

Instead of completely removing DeepLabv3, I've added a flag to toggle between the two implementations in mmsegmentation.
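The toggle can be pictured as a simple dispatch. This is an illustrative sketch, not this PR's actual signature: the flag name `use_plus` is invented, though the head class names mirror the mmsegmentation decode heads imported later in the review.

```python
# Hypothetical sketch of the implementation toggle. `use_plus` is an
# invented flag name; the class names match mmsegmentation's decode heads.

def select_head_name(use_plus: bool = True) -> str:
    """Return the mmsegmentation decode-head class name for the chosen variant."""
    # DeepLabv3 uses the standard ASPP head; DeepLabv3+ swaps in the
    # depthwise-separable ASPP head, which adds a low-level feature decoder.
    return 'DepthwiseSeparableASPPHead' if use_plus else 'ASPPHead'
```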

Somewhat unrelated: I added an argument to specify the URL to download the backbone weights from, to make some future experiments easier.

Side notes:

  • In the future, we should have a more general backbone-head interface when we have more backbones and heads.
  • One alternative to mmsegmentation is detectron2, but it seems less friendly. Please let me know if that would be a better alternative.

Results

mmsegmentation one-run results: 45 mIoU and 45.47 mIoU for DeepLabv3 and DeepLabv3+, respectively. There are several differences that make it difficult to compare our numbers to theirs:

  1. Auxiliary loss
  2. Different pre-trained weights
  3. Different ResNet stem (v1c)
| Model | mIoU | TTT (time to train) |
| --- | --- | --- |
| Old DeepLabv3 Unoptimized | 44.21 +/- 0.30 | 6.8 hours |
| DeepLabv3 Unoptimized | 44.82 +/- 0.19 | 7.8 hours |
| DeepLabv3+ Unoptimized | 44.69 +/- 0.34 | 7.3 hours |
| Old DeepLabv3 Optimized | 45.62 +/- 0.16 | 4.9 hours |
| DeepLabv3 Optimized | 45.95 +/- 0.20 | 6.0 hours |
| DeepLabv3+ Optimized | 45.75 +/- 0.30 | TBD |

@Landanjs Landanjs requested a review from A-Jacobson March 8, 2022 01:14
@A-Jacobson (Contributor)

> Somewhat unrelated, I added an argument to specify the url to download the backbone weights from to make some future experiments easier.

I like the thought, but is this necessary right now? Does mmseg support something like this, or are the URLs aliased?

> Side notes:
>
>   • In the future, we should have a more general backbone-head interface when we have more backbones and heads.

Agreed. I'm thinking directories can specify whole model families, sort of like our ResNets. We should discuss this elsewhere.

>   • One alternative to mmsegmentation is to use detectron2, but this seems less friendly. Please let me know if this would be a better alternative

All the OpenMMLab stuff is more actively developed. Additionally, it looks like mmseg/mmdet are supersets of detectron2 with similar APIs, so if we're going to pick one, I think mmseg is a good bet.

@Landanjs (Contributor, Author) commented Mar 8, 2022

> Somewhat unrelated, I added an argument to specify the url to download the backbone weights from to make some future experiments easier.

> I like the thought but is this necessary right now? does mmseg support something like this or are the urls aliased?

As of now, we are only using the heads from mmsegmentation, but still using torchvision backbones. The URL argument is for specifying a particular torchvision weight URL, i.e., swapping between weights from the old and new torchvision training recipes. This isn't necessary, but I foresee running experiments that swap between different pretrained weights.
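One way to picture the argument: keep a small registry of checkpoint URLs and let the caller pick one by name. Everything below is a hedged sketch, not the PR's actual code; the recipe keys and the helper are hypothetical, and the URLs are illustrative placeholders for the two torchvision ResNet-50 recipes.

```python
# Hypothetical sketch of the URL argument idea. The registry keys and the
# helper are invented names; the URLs are illustrative placeholders.

TORCHVISION_RESNET50_URLS = {
    'old_recipe': 'https://download.pytorch.org/models/resnet50-0676ba61.pth',
    'new_recipe': 'https://download.pytorch.org/models/resnet50-11ad3fa6.pth',
}

def backbone_url(recipe: str = 'old_recipe') -> str:
    """Return the checkpoint URL for the requested pretraining recipe."""
    if recipe not in TORCHVISION_RESNET50_URLS:
        raise ValueError(f'unknown recipe {recipe!r}; '
                         f'choose from {sorted(TORCHVISION_RESNET50_URLS)}')
    return TORCHVISION_RESNET50_URLS[recipe]
```

The experiment config would then carry the recipe name (or a raw URL override) instead of hard-coding one checkpoint.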

@Landanjs (Contributor, Author) commented Mar 8, 2022

@A-Jacobson Updated the numbers in the first table based on recent runs. DeepLabv3 looks really good, while DeepLabv3+ may be falling a bit short. One seed reaches the target mIoU the epoch before; not sure if that is sufficient 😬

@Landanjs Landanjs requested a review from A-Jacobson March 8, 2022 20:55
@A-Jacobson (Contributor)

> @A-Jacobson Updated numbers in first table based off of recent runs. DeepLabv3 looks really good while DeepLabv3+ may be falling a bit short. One seed reaches the target mIoU the epoch before, not sure if that is sufficient 😬

I'm good with this as the baseline wasn't significantly different!

Comment on lines +82 to +90
```python
import textwrap

try:
    from mmseg.models import ASPPHead, DepthwiseSeparableASPPHead  # type: ignore
except ImportError as e:
    raise ImportError(
        textwrap.dedent("""\
            Either mmcv or mmsegmentation is not installed. To install mmcv, please run
            pip install mmcv-full==1.4.4 -f https://download.openmmlab.com/mmcv/dist/{cu_version}/{torch_version}/index.html
            where {cu_version} and {torch_version} refer to your CUDA and PyTorch versions,
            respectively. To install mmsegmentation, please run
            pip install mmsegmentation==0.22.0 on the command line.""")) from e
```
@Landanjs (Contributor, Author)
@ravi-mosaicml Just double checking... does this look good to you?

@Landanjs Landanjs merged commit ad827e2 into mosaicml:dev Mar 10, 2022
@ExtReMLapin commented Mar 22, 2022

Since you mentioned detectron2: do you have any quick solution to easily integrate this into detectron2 and possibly speed up training?

@dblalock (Contributor)

@Landanjs would be the right person to answer that, but he's on vacation this week. Maybe @florescl can comment?

@A-Jacobson (Contributor)

Hi @ExtReMLapin! Thanks for checking out Composer. We don't yet have any convenience wrappers for detectron2 like we do for timm, but you could still train a detectron2 model using Composer methods. I can see two ways to accomplish this right now.

  1. Choose methods from our functional interface and drop them directly into your training loop. You may have to copy and modify the training loop from detectron2 if that's what you plan to use.

  2. Create a ComposerModel from your detectron2 model and use our trainer. This may be a bit tricky, as detectron2 models can behave differently during training and inference, and they calculate loss internally. Some of the Hugging Face models and the SSD models in composer.models may serve as good examples for dealing with this.

Regarding which methods to use, SAM and SWA are likely to play nicely with detection/instance segmentation. Augmentation-based methods and mixup-style methods likely won't. Yet.
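Option 2 above can be sketched structurally. This is a torch-free illustration of the forward/loss split Composer's trainer expects; real code would subclass `composer.models.ComposerModel` (a `torch.nn.Module`), and `DummyDetector` below is an invented stand-in for a detectron2 model that computes its loss internally.

```python
# Structural sketch only: torch and detectron2 are elided, and both class
# names below are hypothetical stand-ins, not Composer or detectron2 APIs.

class DummyDetector:
    """Stand-in for a detectron2 model that returns a dict of losses in training."""
    def __call__(self, batch):
        # detectron2-style training output: a mapping from loss name to value.
        return {'loss_cls': 0.5, 'loss_box_reg': 0.25}

class ComposerDetector:
    """Adapter exposing the forward/loss split a Composer-style trainer expects."""
    def __init__(self, module):
        self.module = module

    def forward(self, batch):
        # The wrapped model already computes its losses internally,
        # so forward just passes the loss dict through as "outputs".
        return self.module(batch)

    def loss(self, outputs, batch):
        # The trainer calls loss(outputs, batch); reduce the internal
        # loss dict to a single scalar by summing its values.
        return sum(outputs.values())
```

The awkward part this illustrates: because the detector computes loss inside `forward`, the adapter's `loss` method only aggregates, rather than comparing outputs to targets itself.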

@ExtReMLapin
Thank you for your answer; I'll give it a try as soon as I'm confident enough with the PyTorch/detectron2 ecosystem.

@Landanjs Landanjs deleted the landan/deeplabv3+ branch June 20, 2022 22:48