Skip to content
Change the repository type filter

All

    Repositories list

    • LLM checkpointing for DeepSpeed/Megatron
      C++
      MIT License
      31300Updated Feb 10, 2025Feb 10, 2025
    • Compute differences between immutable data states
      C++
      MIT License
      1111Updated Jan 29, 2025Jan 29, 2025
    • Multi-Version Ordered KV
      C++
      1101Updated Jan 27, 2025Jan 27, 2025
    • Distributed AI model repository with versioning, lineage and incremental storage support
      C++
      MIT License
      1200Updated Jan 8, 2025Jan 8, 2025
    • DeepSpeed

      Public
      DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
      Python
      Apache License 2.0
      4.2k100Updated Dec 13, 2024Dec 13, 2024
    • artifacts

      Public
      Artifacts in support of reproducibility efforts
      Python
      MIT License
      0000Updated Oct 3, 2024Oct 3, 2024
    • Ongoing research training transformer language models at scale, including: BERT & GPT-2
      Python
      Other
      2.5k000Updated Feb 21, 2024Feb 21, 2024