Fix CogVideoX support #261
Conversation
    ],
    extras_require={
        "all": [
            "flash_attn>=2.6.3",
We should update the README. pip install xfuser[flash_attn]?
OK, I changed "all" to "flash_attn" and modified the README.
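For reference, a minimal sketch of what the renamed extra could look like in setup.py (hypothetical layout; the actual diff may differ):

extras_require={
    # Renamed from "all" so the intent is explicit; flash_attn stays optional.
    "flash_attn": ["flash_attn>=2.6.3"],
},

Users would then install the optional dependency with pip install "xfuser[flash_attn]" (the quotes keep some shells from globbing the brackets).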
@@ -12,40 +12,34 @@ def get_cuda_version():
    except Exception as e:
        return 'no_cuda'

def get_install_requires(cuda_version):
This function was introduced in #259; why remove it?
This package (xDiT) does not require a specific CUDA version to build or run, so the warning is meaningless.
Also, the commit in #259 is itself buggy: it intends to print a warning when the CUDA version is not 12.4, but it actually prints the warning when the version equals 12.4... 😢
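A minimal sketch of the inverted check (hypothetical names and message; the actual code in #259 may differ):

# Buggy: warns when the version IS 12.4.
if cuda_version == '12.4':
    print('Warning: detected CUDA version may be incompatible.')

# Intended: warn when the version is NOT 12.4.
if cuda_version != '12.4':
    print('Warning: detected CUDA version may be incompatible.')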
The way it checks the CUDA version is also incorrect: it checks the system CUDA rather than the CUDA bundled with the PyTorch installation. So the get_cuda_version function could be removed in a future commit.
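To illustrate the difference, a sketch assuming torch is installed:

import subprocess
import torch

# CUDA toolkit the installed PyTorch wheel was built against, e.g. '12.4'
# (None on CPU-only builds). This is the version that matters at runtime.
print(torch.version.cuda)

# System-wide toolkit reported by nvcc, which is what the setup.py check
# looks at; it can differ from the bundled one or be missing entirely.
print(subprocess.run(['nvcc', '--version'], capture_output=True, text=True).stdout)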
def torch_compile_disable_if_v100(func):
    if is_v100():
        return torch.compiler.disable(func)
    return func


def apply_rotary_emb(
This change results in the following error:
torchrun --nproc_per_node=2 ./examples/flux_example.py --model /cfs/dit/FLUX.1-dev --ulysses_degree 2 --prompt "A snowy mountain" --num_inference_steps 20
[rank0]: Traceback (most recent call last):
[rank0]: File "~/xDiT/./examples/flux_example.py", line 77, in <module>
[rank0]: main()
[rank0]: File "~/xDiT/./examples/flux_example.py", line 35, in main
[rank0]: pipe.prepare_run(input_config)
[rank0]: File "~/xDiT/xfuser/model_executor/pipelines/pipeline_flux.py", line 69, in prepare_run
[rank0]: self.__call__(
[rank0]: File "~/miniconda3/envs/long_ctx_attn/lib/python3.10/site-packages/torch/utils/_contextlib.py", line 116, in decorate_context
[rank0]: return func(*args, **kwargs)
[rank0]: File "~/xDiT/xfuser/model_executor/pipelines/base_pipeline.py", line 181, in wrapper
[rank0]: return func(*args, **kwargs)
[rank0]: File "~/xDiT/xfuser/model_executor/pipelines/base_pipeline.py", line 133, in data_parallel_fn
[rank0]: return func(self, *args, **kwargs)
[rank0]: File "~/xDiT/xfuser/model_executor/pipelines/base_pipeline.py", line 149, in check_naive_forward_fn
[rank0]: return func(self, *args, **kwargs)
[rank0]: File "~/xDiT/xfuser/model_executor/pipelines/pipeline_flux.py", line 297, in __call__
[rank0]: latents = self._sync_pipeline(
[rank0]: File "~/xDiT/xfuser/model_executor/pipelines/pipeline_flux.py", line 399, in _sync_pipeline
[rank0]: latents, encoder_hidden_states = self._backbone_forward(
[rank0]: File "~/xDiT/xfuser/model_executor/pipelines/pipeline_flux.py", line 484, in _backbone_forward
[rank0]: noise_pred, encoder_hidden_states = self.transformer(
[rank0]: File "~/miniconda3/envs/long_ctx_attn/lib/python3.10/site-packages/torch/nn/modules/module.py", line 1553, in _wrapped_call_impl
[rank0]: return self._call_impl(*args, **kwargs)
[rank0]: File "~/miniconda3/envs/long_ctx_attn/lib/python3.10/site-packages/torch/nn/modules/module.py", line 1562, in _call_impl
[rank0]: return forward_call(*args, **kwargs)
[rank0]: File "~/xDiT/xfuser/model_executor/models/transformers/transformer_flux.py", line 147, in forward
[rank0]: encoder_hidden_states, hidden_states = block(
[rank0]: File "~/miniconda3/envs/long_ctx_attn/lib/python3.10/site-packages/torch/nn/modules/module.py", line 1553, in _wrapped_call_impl
[rank0]: return self._call_impl(*args, **kwargs)
[rank0]: File "~/miniconda3/envs/long_ctx_attn/lib/python3.10/site-packages/torch/nn/modules/module.py", line 1562, in _call_impl
[rank0]: return forward_call(*args, **kwargs)
[rank0]: File "~/miniconda3/envs/long_ctx_attn/lib/python3.10/site-packages/diffusers/models/transformers/transformer_flux.py", line 200, in forward
[rank0]: attn_output, context_attn_output = self.attn(
[rank0]: File "~/miniconda3/envs/long_ctx_attn/lib/python3.10/site-packages/torch/nn/modules/module.py", line 1553, in _wrapped_call_impl
[rank0]: return self._call_impl(*args, **kwargs)
[rank0]: File "~/miniconda3/envs/long_ctx_attn/lib/python3.10/site-packages/torch/nn/modules/module.py", line 1562, in _call_impl
[rank0]: return forward_call(*args, **kwargs)
[rank0]: File "~/xDiT/xfuser/model_executor/layers/attention_processor.py", line 223, in forward
[rank0]: return self.processor(
[rank0]: File "~/xDiT/xfuser/model_executor/layers/attention_processor.py", line 705, in __call__
[rank0]: query = apply_rotary_emb(query, image_rotary_emb)
[rank0]: File "~/xDiT/xfuser/model_executor/layers/attention_processor.py", line 76, in apply_rotary_emb
[rank0]: cos, sin = freqs_cis # [S, D]
[rank0]: ValueError: not enough values to unpack (expected 2, got 1)
We can merge this PR again after the problem is fixed.
Oh, it looks like the latest diffusers has changed a lot and is no longer compatible with the Flux implementation in xDiT. I guess a refactor may be needed.
After modifying the attention processor and removing the reuse of encoder_hidden_states, it now works with diffusers==0.30.2. Anyway, this is still only for your reference. I guess the frequent changes in diffusers, combined with the lack of an automatic code-patching mechanism, could make this challenging to keep up with, so these changes need to be treated carefully. 🙂
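For reference, a minimal sketch (not the actual xDiT or diffusers code) of an apply_rotary_emb that accepts both a real (cos, sin) pair and a single complex frequency tensor, which would sidestep the unpack error above; shapes are assumed from the traceback comment:

import torch

def apply_rotary_emb(x: torch.Tensor, freqs_cis) -> torch.Tensor:
    # x: [B, H, S, D]; hypothetical helper, not a drop-in replacement.
    if isinstance(freqs_cis, (tuple, list)):
        cos, sin = freqs_cis  # each [S, D], real-valued
        x_real, x_imag = x.reshape(*x.shape[:-1], -1, 2).unbind(-1)
        x_rotated = torch.stack([-x_imag, x_real], dim=-1).flatten(-2)
        return (x.float() * cos + x_rotated.float() * sin).to(x.dtype)
    # Otherwise treat freqs_cis as a single complex tensor of shape [S, D/2].
    x_complex = torch.view_as_complex(x.float().reshape(*x.shape[:-1], -1, 2))
    return torch.view_as_real(x_complex * freqs_cis).flatten(-2).to(x.dtype)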
This reverts commit 9484590.
- Modify xFuserCogVideoXAttnProcessor2_0 to support CogVideoXAttnProcessor2_0 in newer versions of diffusers.
- Avoid depending on diffusers for rotary embeddings (use a bundled apply_rotary_emb).

Current status
torchrun --nproc_per_node=2 examples/cogvideox_example.py --ulysses_degree 2 \
    --model THUDM/CogVideoX-5b --height 480 --width 720 --num_frames 30 \
    --prompt "a panda playing piano"
Works perfectly now.