-
AI Frameworks Engineer @intel
- SH
-
18:43
(UTC +08:00)
Pinned Loading
-
-
neural-compressor
neural-compressor PublicForked from intel/neural-compressor
Intel® Neural Compressor (formerly known as Intel® Low Precision Optimization Tool), targeting to provide unified APIs for network compression technologies, such as low precision quantization, spar…
Python
-
pytorch-fork
pytorch-fork PublicForked from pytorch/pytorch
Tensors and Dynamic neural networks in Python with strong GPU acceleration
Python
-
torchao-fork
torchao-fork PublicForked from pytorch/ao
The torchao repository contains api's and workflows for quantization and pruning gpu models.
Python
-
-
vllm-fork
vllm-fork PublicForked from vllm-project/vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
Python
If the problem persists, check the GitHub status page or contact support.