Timestep Embedding Tells: It's Time to Cache for Video Diffusion Model

Feng Liu¹^*, Shiwei Zhang², Xiaofeng Wang^1,3, Yujie Wei⁴, Haonan Qiu⁵
Yuzhong Zhao¹, Yingya Zhang², Qixiang Ye¹, Fang Wan¹^†

¹University of Chinese Academy of Sciences, ²Alibaba Group
³Institute of Automation, Chinese Academy of Sciences
⁴Fudan University, ⁵Nanyang Technological University

(* Work was done during internship at Alibaba Group. † Corresponding author.)

🫖 Introduction

We introduce Timestep Embedding Aware Cache (TeaCache), a training-free caching approach that estimates and leverages the fluctuating differences among model outputs across timesteps, thereby accelerating the inference. TeaCache works well for Video Diffusion Models, Image Diffusion models and Audio Diffusion Models. For more details and results, please visit our project page.

🔥 Latest News

If you like our project, please give us a star ⭐ on GitHub for the latest update.
[2025/01/24] 🔥 Support Cosmos for both T2V and I2V. Thanks @zishen-ucap.
[2025/01/20] 🔥 Support CogVideoX1.5-5B for both T2V and I2V. Thanks @zishen-ucap.
[2025/01/07] 🔥 Support TangoFlux. TeaCache works well for Audio Diffusion Models!
[2024/12/30] 🔥 Support Mochi and LTX-Video for Video Diffusion Models. Support Lumina-T2X for Image Diffusion Models.
[2024/12/27] 🔥 Support FLUX. TeaCache works well for Image Diffusion Models!
[2024/12/26] 🔥 Support ConsisID. Thanks @SHYuanBest.
[2024/12/24] 🔥 Support HunyuanVideo.
[2024/12/19] 🔥 Support CogVideoX.
[2024/12/06] 🎉 Release the code of TeaCache. Support Open-Sora, Open-Sora-Plan and Latte.
[2024/11/28] 🎉 Release the paper of TeaCache.

🧩 Community Contributions

If you develop/use TeaCache in your projects, welcome to let us know.

Model

ConsisID supports TeaCache. Thanks @SHYuanBest.
Ruyi-Models supports TeaCache. Thanks @cellzero.
EasyAnimate supports TeaCache. Thanks @hkunzhe and @bubbliiiing.

ComfyUI

ComfyUI-HunyuanVideoWrapper supports TeaCache4HunyuanVideo. Thanks @kijai, ctf05 and DarioFT.
ComfyUI-TeaCacheHunyuanVideo for TeaCache4HunyuanVideo. Thanks @facok.
ComfyUI-TeaCache for TeaCache. Thanks @YunjieYu.
Comfyui_TTP_Toolset supports TeaCache. Thanks @TTPlanetPig.
ComfyUI_Patches_ll supports TeaCache. Thanks @lldacing.
ComfyUI-TangoFlux supports TeaCache. Thanks @LucipherDev.

🎉 Supported Models

Text to Video

Image to Video

TeaCache4ConsisID
TeaCache4CogVideoX1.5
Ruyi-Models. See here.
EasyAnimate, see here.
TeaCache4Cosmos

Video to Video

EasyAnimate, see here.

Text to Image

Text to Audio

TeaCache4TangoFlux

🤖 Instructions for Supporting Other Models

Welcome for PRs to support other models.
If the custom model is based on or has similar model structure to the models we've supported, you can try to directly transfer TeaCache to the custom model. For example, rescaling coefficients for CogVideoX-5B can be directly applied to CogVideoX1.5, ConsisID and rescaling coefficients for FLUX can be directly applied to TangoFlux.
Otherwise, you can refer to these successful attempts, e.g., 1, 2.

💐 Acknowledgement

This repository is built based on VideoSys, Diffusers, Open-Sora, Open-Sora-Plan, Latte, CogVideoX, HunyuanVideo, ConsisID, FLUX, Mochi, LTX-Video, Lumina-T2X, TangoFlux and Cosmos. Thanks for their contributions!

🔒 License

The majority of this project is released under the Apache 2.0 license as found in the LICENSE file.
For VideoSys, Diffusers, Open-Sora, Open-Sora-Plan, Latte, CogVideoX, HunyuanVideo, ConsisID, FLUX, Mochi, LTX-Video, Lumina-T2X, TangoFlux and Cosmos, please follow their LICENSE.
The service is a research preview. Please contact us if you find any potential violations. (liufeng20@mails.ucas.ac.cn)

📖 Citation

If you find TeaCache is useful in your research or applications, please consider giving us a star ⭐ and citing it by the following BibTeX entry.

@article{liu2024timestep,
  title={Timestep Embedding Tells: It's Time to Cache for Video Diffusion Model},
  author={Liu, Feng and Zhang, Shiwei and Wang, Xiaofeng and Wei, Yujie and Qiu, Haonan and Zhao, Yuzhong and Zhang, Yingya and Ye, Qixiang and Wan, Fang},
  journal={arXiv preprint arXiv:2411.19108},
  year={2024}
}

Name		Name	Last commit message	Last commit date
Latest commit History 82 Commits
TeaCache4CogVideoX1.5		TeaCache4CogVideoX1.5
TeaCache4ConsisID		TeaCache4ConsisID
TeaCache4Cosmos		TeaCache4Cosmos
TeaCache4FLUX		TeaCache4FLUX
TeaCache4HunyuanVideo		TeaCache4HunyuanVideo
TeaCache4LTX-Video		TeaCache4LTX-Video
TeaCache4Lumina-T2X		TeaCache4Lumina-T2X
TeaCache4Mochi		TeaCache4Mochi
TeaCache4TangoFlux		TeaCache4TangoFlux
assets		assets
eval/teacache		eval/teacache
videosys		videosys
.gitignore		.gitignore
.isort.cfg		.isort.cfg
.pre-commit-config.yaml		.pre-commit-config.yaml
CONTRIBUTING.md		CONTRIBUTING.md
LICENSE		LICENSE
README.md		README.md
requirements.txt		requirements.txt
setup.py		setup.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Timestep Embedding Tells: It's Time to Cache for Video Diffusion Model

🫖 Introduction

🔥 Latest News

🧩 Community Contributions

🎉 Supported Models

🤖 Instructions for Supporting Other Models

💐 Acknowledgement

🔒 License

📖 Citation

About

Releases

Packages

Contributors 3

Languages

License

ali-vilab/TeaCache

Folders and files

Latest commit

History

Repository files navigation

Timestep Embedding Tells: It's Time to Cache for Video Diffusion Model

🫖 Introduction

🔥 Latest News

🧩 Community Contributions

🎉 Supported Models

🤖 Instructions for Supporting Other Models

💐 Acknowledgement

🔒 License

📖 Citation

About

Topics

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Contributors 3

Languages

Packages