Updated the DataSpec for the timing abstraction (#146), parts 3 and 4 (#178)
Conversation
For the timing abstraction (#146), the `DataloaderSpec` needed two additional functions: `get_num_samples_in_batch` and `get_num_tokens_in_batch`. It was getting messy to pass function pointers around in a named tuple, so `DataloaderSpec` was converted from a NamedTuple into a regular class called `DataSpec`. Custom datasets can inherit the base `DataSpec` class and override functionality as needed.

Moved the `DataSpec` class to `composer.core`, since the `DataSpec` is now bound directly to the state. #120 will also need this change.

Renamed `train_dataloader` and `eval_dataloader` in the trainer and state to `train_data` and `eval_data`, since they encompass more than the dataloader.

This PR implements parts 3 and 4 of the timing abstraction (#146). The implementation differs from the GitHub issue by adding `num_tokens`, `num_samples`, `get_batch_size`, and `get_num_tokens` to the new `DataSpec` rather than to the PyTorch dataset class.
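As a rough illustration of the change (a minimal sketch based on the description above, not the exact composer API — the attribute names, defaults, and fallback logic here are assumptions), the converted class might look like:

```python
from typing import Any, Iterable, Optional


class DataSpec:
    """Sketch of the DataSpec described above: a regular class (rather than
    a NamedTuple) wrapping a dataloader plus sample/token accounting."""

    def __init__(self, dataloader: Iterable, num_samples: Optional[int] = None,
                 num_tokens: Optional[int] = None) -> None:
        self.dataloader = dataloader
        self.num_samples = num_samples  # total samples in the dataset, if known
        self.num_tokens = num_tokens    # total tokens in the dataset, if known

    def get_num_samples_in_batch(self, batch: Any) -> int:
        # Default heuristic: treat the first dimension of the batch (or of
        # its first element, for tuple batches) as the batch size.
        if hasattr(batch, "shape"):
            return batch.shape[0]
        return len(batch[0])

    def get_num_tokens_in_batch(self, batch: Any) -> int:
        # Non-NLP datasets have no meaningful token count; NLP datasets
        # would override this to return the actual token count per batch.
        return 0
```

Because `DataSpec` is a regular class rather than a NamedTuple, a custom dataset can simply subclass it — for example, a language-modeling dataset could override `get_num_tokens_in_batch` to count non-padding tokens.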
Made a few suggestions -- I know this PR was just to update the `DataSpec`, but I think we should take this opportunity to remove `DataSpec` completely for better usability.
As discussed offline, we will leave `DataSpec` for now and keep it attached to the state; a later PR will remove it. In the meantime, a `DataSpec` can be initialized from a dictionary, which serves as the kwargs for the class constructor.
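A rough sketch of the dictionary form, reusing the hypothetical `DataSpec` above (the trainer-side handling shown here is an assumption, not the actual trainer code):

```python
# Assumes the DataSpec sketch above; `train_dataloader` is any iterable of batches.
train_data = {"dataloader": train_dataloader, "num_samples": 50_000}

# Hypothetical trainer-side handling: a dict is unpacked as constructor
# kwargs, while a DataSpec instance is used as-is.
if isinstance(train_data, dict):
    train_data = DataSpec(**train_data)
```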
lgtm