
Timm support #262

Merged: 38 commits merged into dev on Feb 1, 2022
Conversation

A-Jacobson (Contributor):

  • Timm model wrapper with MosaicClassifier
  • Timm Hparams interface
  • Timm resnet50 yaml
  • working W&B run

@hanlint , @Landanjs
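
(For context, a rough sketch of the wrapper pattern listed above; MosaicClassifier's import path and constructor, and the helper name create_timm_classifier, are assumptions for illustration rather than the code in this PR.)

import timm
from composer.models import MosaicClassifier  # assumed import path for the wrapper named in the PR description

def create_timm_classifier(model_name: str, pretrained: bool = False, num_classes: int = 1000):
    # timm.create_model is timm's model factory; wrapping the resulting torch
    # module in MosaicClassifier gives the trainer a standard classifier interface.
    module = timm.create_model(model_name, pretrained=pretrained, num_classes=num_classes)
    return MosaicClassifier(module)

model = create_timm_classifier('resnet50')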

hanlint (Contributor) left a comment:

Overall, this design looks good. We might want a common schema across HuggingFace and TIMM, though. For HuggingFace, our models would then be:

model:
  hf:
    model_name: '<name_here>'

Would that work for you @moinnadeem ?

A-Jacobson (Contributor, Author) commented Jan 21, 2022:

Timm is all image classification on ImageNet, so it was straightforward.

Looking at HF, they have a model factory such as AutoModelForMaskedLM for each task. We can use the same pattern, but we'd probably need a wrapper for each factory. @moinnadeem

So hf may look like:

masked language modeling

model:
  hf-maskedlm:
    model_name: '<name_here>'

question answering

model:
  hf-qa:
    model_name: '<name_here>'

and so on.
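
(A hypothetical sketch of how those per-task keys could dispatch to factories; the registry, builder names, and key strings here are illustrative, not composer's actual implementation.)

from typing import Any, Callable, Dict

import timm
from transformers import AutoModelForMaskedLM, AutoModelForQuestionAnswering

# Illustrative builders keyed by the top-level schema names discussed above.
def build_timm(model_name: str, **kwargs: Any):
    return timm.create_model(model_name, **kwargs)

def build_hf_maskedlm(model_name: str, **kwargs: Any):
    return AutoModelForMaskedLM.from_pretrained(model_name, **kwargs)

def build_hf_qa(model_name: str, **kwargs: Any):
    return AutoModelForQuestionAnswering.from_pretrained(model_name, **kwargs)

MODEL_REGISTRY: Dict[str, Callable[..., Any]] = {
    'timm': build_timm,
    'hf-maskedlm': build_hf_maskedlm,
    'hf-qa': build_hf_qa,
}

def build_model(model_cfg: Dict[str, Dict[str, Any]]):
    # model_cfg mirrors the yaml, e.g. {'hf-maskedlm': {'model_name': 'bert-base-uncased'}}
    (framework, kwargs), = model_cfg.items()
    return MODEL_REGISTRY[framework](**kwargs)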

hanlint (Contributor) commented Jan 21, 2022:

Does model_name always map to a unique model factory type in HF? If so, we could just require hf and model_name, and auto-infer the factory type.

A-Jacobson (Contributor, Author):

I don't think so. I'm seeing "Bert-base" show up in multiple examples. Though, I'm not sure of the differences between tasks. It may just be different weights or a different classifier head.

Landanjs (Contributor) left a comment:

LGTM! 2 comments on docstrings and a question on if we should include a **kwargs argument.

@mosaicml/research-engineering does anyone know if we could have a kwargs-like argument in yahp? Maybe defining a dict hparam kwargs: dict = hp.optional(default={}), then using it in a yaml like: kwargs.image_size: 512?

jbloxham (Contributor):

> LGTM! 2 comments on docstrings and a question on if we should include a **kwargs argument.
>
> @mosaicml/research-engineering does anyone know if we could have a kwargs-like argument in yahp? Maybe defining a dict hparam kwargs: dict = hp.optional(default={}), then using it in a yaml like: kwargs.image_size: 512?

YAHP supports JSON types, at least to some extent. I don't have any experience using it.
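
(For what it's worth, a rough sketch of the kwargs-style hparam under discussion; whether yahp accepts a dict/Optional field like this and round-trips a nested yaml mapping into it is exactly the open question, so treat this as illustrative, not as the class in this PR.)

from dataclasses import dataclass
from typing import Any, Dict, Optional

import yahp as hp


@dataclass
class TimmHparams(hp.Hparams):
    """Illustrative hparams class, not necessarily the one in this PR."""

    model_name: str = hp.required(doc='timm model name, e.g. resnet50')
    pretrained: bool = hp.optional(doc='load pretrained weights', default=False)
    # Catch-all forwarded to timm.create_model; in yaml this could be a nested
    # mapping under `kwargs:` rather than dotted keys like kwargs.image_size.
    kwargs: Optional[Dict[str, Any]] = hp.optional(doc='extra timm kwargs', default=None)

    def initialize_object(self):
        import timm
        return timm.create_model(self.model_name, pretrained=self.pretrained, **(self.kwargs or {}))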

moinnadeem (Contributor):

> I don't think so. I'm seeing "Bert-base" show up in multiple examples. Though, I'm not sure of the differences between tasks. It may just be different weights or a different classifier head.

Yeah, bert-base-uncased is a single set of weights, but doing AutoModelForSequenceClassification.from_pretrained("bert-base-uncased") will create some randomly initialized weights for the classifier head.

In other words, there is a bijection between model names and weights.
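
(For illustration, using the task-specific auto class; the label count is made up.)

from transformers import AutoModelForSequenceClassification

# Same checkpoint, different head: the encoder weights load from bert-base-uncased,
# while the classification head for num_labels classes is freshly (randomly) initialized.
model = AutoModelForSequenceClassification.from_pretrained('bert-base-uncased', num_labels=2)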

moinnadeem (Contributor):

> Timm is all image classification on ImageNet, so it was straightforward.
>
> Looking at HF, they have a model factory such as AutoModelForMaskedLM for each task. We can use the same pattern, but we'd probably need a wrapper for each factory. @moinnadeem
>
> So hf may look like:
>
> masked language modeling
>
> model:
>   hf-maskedlm:
>     model_name: '<name_here>'
>
> question answering
>
> model:
>   hf-qa:
>     model_name: '<name_here>'
>
> and so on.

Hm, we have something very similar to this at the moment. I would like to make our current version more into a factory soon (see how we have very thin wrappers around the current HF configs). However, we do need to insert some additional variables, such as config.num_labels in the wrapper.
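
(A hedged sketch of that kind of thin wrapper, injecting config.num_labels before construction; the helper name and its arguments are hypothetical.)

from transformers import AutoConfig, AutoModelForSequenceClassification

def create_hf_classifier(model_name: str, num_labels: int):
    # Build the config first so extra variables like num_labels can be inserted,
    # then hand it to the task-specific factory.
    config = AutoConfig.from_pretrained(model_name, num_labels=num_labels)
    return AutoModelForSequenceClassification.from_pretrained(model_name, config=config)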

A-Jacobson and others added 3 commits January 27, 2022 22:49
Co-authored-by: Landan Seguin <landanjs@gmail.com>
A-Jacobson requested a review from hanlint January 29, 2022 05:39
hanlint (Contributor) left a comment:

Looks good, a few suggested changes. Do you have a w&b run with the TIMM resnet50 to verify convergence?

hanlint added this to the v0.4 milestone Jan 30, 2022
A-Jacobson (Contributor, Author) commented Jan 31, 2022:

> Looks good, a few suggested changes. Do you have a w&b run with the TIMM resnet50 to verify convergence?

https://wandb.ai/mosaic-ml/timm-imagenet/reports/Shared-panel-22-01-30-20-01-86--VmlldzoxNTAzMTky

The resnet50 convergence run reached 76.89 accuracy. The note in the yaml said _quality = '76.51', so it seems we're within the margin of error.

hanlint (Contributor) left a comment:

LGTM

hanlint merged commit fa1b992 into dev Feb 1, 2022
hanlint deleted the timm-support branch February 1, 2022 18:17
A-Jacobson added a commit that referenced this pull request Feb 10, 2022
coryMosaicML pushed a commit to coryMosaicML/composer that referenced this pull request Feb 23, 2022