Stars
DeepEP: an efficient expert-parallel communication library
A statically typed programming language for scientific computations with first-class support for physical dimensions and units
Cache-based autocomplete for Python using argcomplete
A generative world for general-purpose robotics & embodied AI learning.
Latent Consistency Models: Synthesizing High-Resolution Images with Few-Step Inference
Reinforcement learning theory book about foundations of deep RL algorithms with proofs.
NVIDIA Linux open GPU kernel module source
A simple yet powerful tool to turn traditional container/OS images into unprivileged sandboxes.
Collection of best practices, reference architectures, model training examples and utilities to train large models on AWS.
Python API for acquisition and control of OMRON G3PW Power Controller
Entropy Based Sampling and Parallel CoT Decoding
A plugin that lets EC2 developers use libfabric as a network provider while running NCCL applications.
Learn how to design large-scale systems. Prep for the system design interview. Includes Anki flashcards.
AWS ParallelCluster Module for Terraform
A terminal workspace with batteries included
Configure and deploy complete EKS clusters.
Module to automatically maximize the utilization of GPU resources in a Kubernetes cluster through real-time dynamic partitioning and elastic quotas. Effortless optimization at its finest!
Artificial Intelligence Infrastructure-as-Code Generator.
Amazon EC2 instance comparison site
🎨 Diagram as Code for prototyping cloud system architectures