Skip to content
Change the repository type filter

All

    Repositories list

    • [ICLR 2025] OpenVid-1M: A Large-Scale High-Quality Dataset for Text-to-video Generation
      Python
      8230160Updated Jan 27, 2025Jan 27, 2025
    • InstanceCap: Improving Text-to-Video Generation via Instance-aware Structured Caption 🔍
      Python
      22910Updated Jan 24, 2025Jan 24, 2025
    • STAR

      Public
      STAR: Spatial-Temporal Augmentation with Text-to-Video Models for Real-World Video Super-Resolution
      Python
      44823141Updated Jan 22, 2025Jan 22, 2025
    • AddSR

      Public
      Python
      Apache License 2.0
      4100110Updated Jan 8, 2025Jan 8, 2025
    • Region-Aware Text-to-Image Generation via Hard Binding and Soft Refinement 🔥
      Python
      MIT License
      2353370Updated Jan 7, 2025Jan 7, 2025
    • JavaScript
      0000Updated Jan 7, 2025Jan 7, 2025
    • STTrack

      Public
      [AAAI 2025] Exploiting Multimodal Spatial-temporal Patterns for Video Object Tracking
      Python
      MIT License
      08600Updated Dec 30, 2024Dec 30, 2024