You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
The devised video compressor is not a trainable component but a deterministic rule-based function applied to video input only. In the video compressor, we calculate the difference between the consecutive patches in the temporal dimension and discard those very similar (i.e., the patch difference is very small) to the previous ones.
A related question, how are these training stages integrated into the training scripts?
From what I understand, the training arguments for module-specific learning rates control which modules are trainable (train.py L608-622), but all scripts in scripts/train/ pass all 3 learning rates. I appreciate any clarification if I am misunderstanding the code!
Thanks for great work!
I have a question about trainable parameters in each stage.
Here's what I think, but is it right?
Thank you.
The text was updated successfully, but these errors were encountered: