Skip to content
View sooahleex's full-sized avatar
  • Samsung Electronics

Block or report sooahleex

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Awesome things about LLM-powered agents. Papers / Repos / Blogs / ...

1,877 151 Updated Feb 24, 2025

A lightweight data processing framework built on DuckDB and 3FS.

Python 3,765 303 Updated Mar 5, 2025

FlashInfer: Kernel Library for LLM Serving

Cuda 2,285 237 Updated Mar 6, 2025

A bidirectional pipeline parallelism algorithm for computation-communication overlap in V3/R1 training.

Python 2,497 237 Updated Mar 5, 2025

Cost-efficient and pluggable Infrastructure components for GenAI inference

Jupyter Notebook 2,981 263 Updated Mar 6, 2025

Ongoing research training transformer models at scale

Python 11,657 2,612 Updated Mar 6, 2025

Production-tested AI infrastructure tools for efficient AGI development and community-driven innovation

6,613 186 Updated Mar 4, 2025

Fast and memory-efficient exact attention

Python 16,108 1,525 Updated Mar 5, 2025

Clean, minimal, accessible reproduction of DeepSeek R1-Zero

Python 10,965 1,400 Updated Feb 1, 2025

Fully open reproduction of DeepSeek-R1

Python 22,190 1,988 Updated Mar 5, 2025

Cosmos is a world model development platform that consists of world foundation models, tokenizers and video processing pipeline to accelerate the development of Physical AI at Robotics & AV labs. C…

Python 7,618 489 Updated Feb 28, 2025

Dynamic Memory Management for Serving LLMs without PagedAttention

C 299 23 Updated Feb 20, 2025

Awesome-LLM-KV-Cache: A curated list of 📙Awesome LLM KV Cache Papers with Codes.

224 13 Updated Mar 3, 2025

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 40,379 6,060 Updated Mar 6, 2025

[NeurIPS 2023 D&B Track] Code and data for paper "Revisiting Out-of-distribution Robustness in NLP: Benchmarks, Analysis, and LLMs Evaluations".

Python 31 4 Updated Jun 8, 2023

Out-of-distribution detection, robustness, and generalization resources. The repository contains a curated list of papers, tutorials, books, videos, articles and open-source libraries etc

880 73 Updated Nov 21, 2024

A quick guide (especially) for trending instruction finetuning datasets

2,909 190 Updated Nov 28, 2023

Object Recognition as Next Token Prediction (CVPR 2024 Highlight)

Python 174 7 Updated Dec 24, 2024

Collection of awesome LLM apps with AI Agents and RAG using OpenAI, Anthropic, Gemini and opensource models.

Python 18,860 2,164 Updated Mar 4, 2025

Train, Evaluate, Optimize, Deploy Computer Vision Models via OpenVINO™

Python 1,161 447 Updated Mar 5, 2025

The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use th…

Jupyter Notebook 14,378 1,484 Updated Dec 25, 2024

A curated list of awesome open-source libraries for production LLM

458 46 Updated Dec 31, 2024

A collection of design patterns/idioms in Python

Python 41,014 6,956 Updated Sep 5, 2024

AIMET is a library that provides advanced quantization and compression techniques for trained neural network models.

Python 2,241 396 Updated Mar 6, 2025

Repository of Transformer based PyTorch Time Series Models

Jupyter Notebook 303 44 Updated Nov 8, 2024

AutoRAG: An Open-Source Framework for Retrieval-Augmented Generation (RAG) Evaluation & Optimization with AutoML-Style Automation

Python 3,653 286 Updated Mar 3, 2025

Code behind Arxiv Papers

Python 508 60 Updated Apr 2, 2024
Next
Showing results