Pinned

  1. understand-r1-zero Public

    Understanding R1-Zero-Like Training: A Critical Perspective

    Python · 677 stars · 28 forks

  2. zero-bubble-pipeline-parallelism Public

    Forked from NVIDIA/Megatron-LM

    Zero Bubble Pipeline Parallelism

    Python · 375 stars · 22 forks

  3. lorahub Public

    [COLM 2024] LoraHub: Efficient Cross-Task Generalization via Dynamic LoRA Composition

    Python · 622 stars · 38 forks

  4. envpool Public

    C++-based high-performance parallel environment execution engine (vectorized env) for general RL environments.

    C++ · 1.1k stars · 108 forks

  5. EditAnything Public

    Edit anything in images powered by segment-anything, ControlNet, StableDiffusion, etc. (ACM MM)

    Python · 3.4k stars · 196 forks

  6. Adan Public

    Adan: Adaptive Nesterov Momentum Algorithm for Faster Optimizing Deep Models

    Python · 783 stars · 67 forks

Repositories

Showing 10 of 83 repositories
  • understand-r1-zero Public

    Understanding R1-Zero-Like Training: A Critical Perspective

    Python · 677 stars · MIT · 28 forks · 3 open issues · 0 open pull requests · Updated Mar 27, 2025
  • jrystal Public

    A JAX-based Differentiable Density Functional Theory Framework for Materials

    Python · 9 stars · Apache-2.0 · 0 forks · 4 open issues · 1 open pull request · Updated Mar 27, 2025
  • jax_xc Public

    Exchange correlation functionals translated from libxc to jax

    Python · 45 stars · MPL-2.0 · 2 forks · 4 open issues · 0 open pull requests · Updated Mar 24, 2025
  • autofd Public

    Automatic Functional Differentiation in JAX

    Python · 68 stars · Apache-2.0 · 1 fork · 6 open issues · 0 open pull requests · Updated Mar 24, 2025
  • oat-zero Public

    A lightweight reproduction of DeepSeek-R1-Zero with in-depth analysis of self-reflection behavior.

    Python · 216 stars · MIT · 10 forks · 2 open issues · 0 open pull requests · Updated Mar 24, 2025
  • oat Public

    🌾 OAT: A research-friendly framework for LLM online alignment, including preference learning, reinforcement learning, etc.

    Python · 295 stars · Apache-2.0 · 15 forks · 4 open issues · 0 open pull requests · Updated Mar 23, 2025
  • CPO Public

    [NeurIPS 2024] The official implementation of the paper "Chain of Preference Optimization: Improving Chain-of-Thought Reasoning in LLMs".

    Python · 104 stars · 4 forks · 0 open issues · 0 open pull requests · Updated Mar 21, 2025
  • sailor2 Public

    🔱 Sailor2: Sailing in South-East Asia with Inclusive Multilingual LLMs

    54 stars · 3 forks · 0 open issues · 0 open pull requests · Updated Mar 21, 2025
  • Megatron-Sailor2 Public

    Megatron for Sailor2/Qwen2.5

    Python · 1 star · 0 forks · 0 open issues · 0 open pull requests · Updated Mar 21, 2025
  • SkyLadder Public Forked from jzhang38/TinyLlama

    The official repository for SkyLadder: Better and Faster Pretraining via Context Window Scheduling

    Python · 27 stars · Apache-2.0 · 525 forks · 0 open issues · 0 open pull requests · Updated Mar 20, 2025