shark-ai: SHARK Modeling and Serving Libraries

SHARK Users

If you're looking to use SHARK, check out our User Guide. Developers, read on.

Sub-projects

shortfin

The shortfin sub-project is SHARK's high-performance inference library and serving engine.

  • API documentation for shortfin is available on readthedocs.

sharktank

The SHARK Tank sub-project contains a collection of model recipes and conversion tools to produce inference-optimized programs.

Warning

SHARK Tank is still under development. Experienced users may want to try it out, but we currently recommend most users download pre-exported or pre-compiled model files for serving with shortfin.

Tuner

The Tuner sub-project assists with tuning program performance by searching for optimal parameter configurations to use during model compilation.

Warning

SHARK Tuner is still in early development. Interested users may want to try it out, but the tuner is not ready for general use yet. Check out the readme for more details.

Support matrix

Models

| Model name | Model recipes | Serving apps | Guide |
| ---------- | ------------- | ------------ | ----- |
| SDXL | sharktank/sharktank/models/punet/ | shortfin/python/shortfin_apps/sd/ | shortfin/python/shortfin_apps/sd/README.md |
| llama | sharktank/sharktank/models/llama/ | shortfin/python/shortfin_apps/llm/ | docs/shortfin/llm/user/llama_serving.md |

SHARK Developers

If you're looking to develop SHARK, check out our Developer Guide.
