[train] TransformersPredictor: Add support for custom pipeline class #36494

krfricke · 2023-06-16T09:16:29Z

Why are these changes needed?

Creating a TransformersPredictor with a custom pipeline class is currently broken: The model can't be automatically inferred from a path. This only works in the transformers pipeline. This PR adds support for this by adding additional parameters to TransformersPredictor.from_checkpoint() that will call TransformersCheckpoint.get_model() to retrieve the model, if specified.

Related issue number

Solves https://discuss.ray.io/t/bug-in-ray-transformerpredictor-from-checkpoint/11033/2

Checks

I've signed off every commit(by using the -s flag, i.e., git commit -s) in this PR.
I've run scripts/format.sh to lint the changes in this PR.
I've included any doc changes needed for https://docs.ray.io/en/master/.
- I've added any new APIs to the API Reference. For example, if I added a
  method in Tune, I've added it in doc/source/tune/api/ under the
  corresponding .rst file.
I've made sure the tests are passing. Note that there might be a few flaky tests, see the recent failures at https://flakey-tests.ray.io/
Testing Strategy
- Unit tests
- Release tests
- This PR is not tested :(

Signed-off-by: Kai Fricke <kai@anyscale.com>

Yard1

Thanks, this looks good to me.

krfricke · 2023-06-16T18:16:03Z

I will add more test cases for the new code paths.

Signed-off-by: Kai Fricke <kai@anyscale.com>

…e class (#36494)" This reverts commit 7ed5c6d.

…e class (#36494)" (#36701) This reverts commit 7ed5c6d.

… pipeline class (#36494)" (#36701)" This reverts commit 206d7e0.

… pipeline class"" (#36705) Reverts #36701 This re-activates the changes in #36494 which were generally working. The problem was that an import of `TFPreTrainedModel` on a GPU instance seems to initialize the GPU and make it unusable by Ray workers, so that CUDA memory allocations fail. Thus, imports of TF modules should be guarded behind the TYPE_CHECKING variable: ``` if TYPE_CHECKING: # ... from transformers.modeling_utils import PreTrainedModel from transformers.modeling_tf_utils import TFPreTrainedModel ``` Signed-off-by: Kai Fricke <kai@anyscale.com>

…ay-project#36494) Creating a `TransformersPredictor` with a custom pipeline class is currently broken: The model can't be automatically inferred from a path. This only works in the transformers pipeline. This PR adds support for this by adding additional parameters to `TransformersPredictor.from_checkpoint()` that will call `TransformersCheckpoint.get_model()` to retrieve the model, if specified. Signed-off-by: Kai Fricke <kai@anyscale.com> Signed-off-by: e428265 <arvind.chandramouli@lmco.com>

…e class (ray-project#36494)" (ray-project#36701) This reverts commit 7ed5c6d. Signed-off-by: e428265 <arvind.chandramouli@lmco.com>

… pipeline class"" (ray-project#36705) Reverts ray-project#36701 This re-activates the changes in ray-project#36494 which were generally working. The problem was that an import of `TFPreTrainedModel` on a GPU instance seems to initialize the GPU and make it unusable by Ray workers, so that CUDA memory allocations fail. Thus, imports of TF modules should be guarded behind the TYPE_CHECKING variable: ``` if TYPE_CHECKING: # ... from transformers.modeling_utils import PreTrainedModel from transformers.modeling_tf_utils import TFPreTrainedModel ``` Signed-off-by: Kai Fricke <kai@anyscale.com> Signed-off-by: e428265 <arvind.chandramouli@lmco.com>

Kai Fricke added 2 commits June 16, 2023 11:14

[train] TransformersPredictor: Add support for custom pipeline class

c8957ae

Signed-off-by: Kai Fricke <kai@anyscale.com>

test imports

3019b85

Signed-off-by: Kai Fricke <kai@anyscale.com>

krfricke marked this pull request as ready for review June 16, 2023 13:33

krfricke requested a review from Yard1 June 16, 2023 13:33

krfricke assigned Yard1 Jun 16, 2023

Yard1 approved these changes Jun 16, 2023

View reviewed changes

Kai Fricke added 4 commits June 20, 2023 09:07

Merge remote-tracking branch 'upstream/master' into train/hf-predictor

04b78a5

import

f9e77c4

Signed-off-by: Kai Fricke <kai@anyscale.com>

Merge branch 'master' into train/hf-predictor

4b9b57e

Add test

4b2a070

Signed-off-by: Kai Fricke <kai@anyscale.com>

krfricke merged commit 7ed5c6d into ray-project:master Jun 21, 2023

krfricke deleted the train/hf-predictor branch June 21, 2023 11:02

krfricke added a commit that referenced this pull request Jun 22, 2023

Revert "[train] TransformersPredictor: Add support for custom pipelin…

fb4f5ca

…e class (#36494)" This reverts commit 7ed5c6d.

krfricke mentioned this pull request Jun 22, 2023

Revert "[train] TransformersPredictor: Add support for custom pipeline class" #36701

Merged

krfricke added a commit that referenced this pull request Jun 22, 2023

Revert "[train] TransformersPredictor: Add support for custom pipelin…

206d7e0

…e class (#36494)" (#36701) This reverts commit 7ed5c6d.

krfricke added a commit that referenced this pull request Jun 22, 2023

Revert "Revert "[train] TransformersPredictor: Add support for custom…

f39829d

… pipeline class (#36494)" (#36701)" This reverts commit 206d7e0.

krfricke mentioned this pull request Jun 23, 2023

Revert "Revert "[train] TransformersPredictor: Add support for custom pipeline class"" #36705

Merged

akshay-anyscale mentioned this pull request Jul 21, 2023

Add service deployment instructions to stable diffusion template #37645

Closed

8 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[train] TransformersPredictor: Add support for custom pipeline class #36494

[train] TransformersPredictor: Add support for custom pipeline class #36494

krfricke commented Jun 16, 2023

Yard1 left a comment

krfricke commented Jun 16, 2023

[train] TransformersPredictor: Add support for custom pipeline class #36494

[train] TransformersPredictor: Add support for custom pipeline class #36494

Conversation

krfricke commented Jun 16, 2023

Why are these changes needed?

Related issue number

Checks

Yard1 left a comment

Choose a reason for hiding this comment

krfricke commented Jun 16, 2023