Revert "Revert "[train] TransformersPredictor: Add support for custom pipeline class"" #36705

krfricke · 2023-06-22T16:33:07Z

This re-activates the changes in #36494 which were generally working. The problem was that an import of TFPreTrainedModel on a GPU instance seems to initialize the GPU and make it unusable by Ray workers, so that CUDA memory allocations fail.

Thus, imports of TF modules should be guarded behind the TYPE_CHECKING variable:

if TYPE_CHECKING:
    # ...
    from transformers.modeling_utils import PreTrainedModel
    from transformers.modeling_tf_utils import TFPreTrainedModel

… pipeline class (#36494)" (#36701)" This reverts commit 206d7e0.

Signed-off-by: Kai Fricke <kai@anyscale.com>

… pipeline class"" (ray-project#36705) Reverts ray-project#36701 This re-activates the changes in ray-project#36494 which were generally working. The problem was that an import of `TFPreTrainedModel` on a GPU instance seems to initialize the GPU and make it unusable by Ray workers, so that CUDA memory allocations fail. Thus, imports of TF modules should be guarded behind the TYPE_CHECKING variable: ``` if TYPE_CHECKING: # ... from transformers.modeling_utils import PreTrainedModel from transformers.modeling_tf_utils import TFPreTrainedModel ``` Signed-off-by: Kai Fricke <kai@anyscale.com> Signed-off-by: e428265 <arvind.chandramouli@lmco.com>

krfricke and others added 3 commits June 22, 2023 17:33

Revert "Revert "[train] TransformersPredictor: Add support for custom…

f39829d

… pipeline class (#36494)" (#36701)" This reverts commit 206d7e0.

Merge branch 'master' into revert-36701-revert-36494-train/hf-predictor

cbcdce5

Fix tensorflow import

abb03b1

Signed-off-by: Kai Fricke <kai@anyscale.com>

krfricke requested a review from Yard1 June 23, 2023 10:51

krfricke assigned Yard1 Jun 23, 2023

krfricke added the tests-ok The tagger certifies test failures are unrelated and assumes personal liability. label Jun 23, 2023

Yard1 approved these changes Jun 23, 2023

View reviewed changes

krfricke merged commit 0d9cb92 into master Jun 23, 2023

krfricke deleted the revert-36701-revert-36494-train/hf-predictor branch June 23, 2023 19:17

akshay-anyscale mentioned this pull request Jul 21, 2023

Add service deployment instructions to stable diffusion template #37645

Closed

8 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Revert "Revert "[train] TransformersPredictor: Add support for custom pipeline class"" #36705

Revert "Revert "[train] TransformersPredictor: Add support for custom pipeline class"" #36705

krfricke commented Jun 22, 2023 •

edited

Loading

Revert "Revert "[train] TransformersPredictor: Add support for custom pipeline class"" #36705

Revert "Revert "[train] TransformersPredictor: Add support for custom pipeline class"" #36705

Conversation

krfricke commented Jun 22, 2023 • edited Loading

krfricke commented Jun 22, 2023 •

edited

Loading