This repository has been archived by the owner on Feb 15, 2025. It is now read-only.
v0.6.0
The first release of the LeapfrogAI monorepo.
🚧 TLDR Release notes incoming... 🚧
Published Container Images
Image | URL |
---|---|
leapfrogai-api | ghcr.io/defenseunicorns/leapfrogai/leapfrogai-api:0.6.0 |
llama-cpp-python | ghcr.io/defenseunicorns/leapfrogai/llama-cpp-python:0.6.0 |
vLLM | ghcr.io/defenseunicorns/leapfrogai/vllm:0.6.0 |
whisper | ghcr.io/defenseunicorns/leapfrogai/whisper:0.6.0 |
text-embeddings | ghcr.io/defenseunicorns/leapfrogai/text-embeddings:0.6.0 |
Published Zarf Packages
Package | URL |
---|---|
leapfrogai-api | ghcr.io/defenseunicorns/packages/leapfrogai/leapfrogai-api:0.6.0 |
llama-cpp-python | ghcr.io/defenseunicorns/packages/leapfrogai/llama-cpp-python:0.6.0 |
vLLM | ghcr.io/defenseunicorns/packages/leapfrogai/vllm:0.6.0 |
whisper | ghcr.io/defenseunicorns/packages/leapfrogai/whisper:0.6.0 |
text-embeddings | ghcr.io/defenseunicorns/packages/leapfrogai/text-embeddings:0.6.0 |
What's Changed
- Bump redis from 4.5.1 to 4.5.4 in /notebooks by @dependabot in #2
- Configure Renovate by @renovate in #1
- Update Weaviate OPENAI_BASE_URL by @gerred in #17
- Change text2vec image to all-miniLM-L6-v2 by @gerred in #18
- Added StableLM-7b and fixed model cache by @runyontr in #21
- Remove non Apache2 model by @runyontr in #20
- Updates for stablelm deployment by @runyontr in #22
- Update README.md to fix links by @Racer159 in #42
- Helm chart by @runyontr in #50
- Update dependency numpy to v1.24.3 by @renovate in #4
- Update dependency openai to v0.27.7 by @renovate in #5
- Update dependency protobuf to v4.23.2 by @renovate in #8
- Update dependency fastapi to v0.96.0 by @renovate in #9
- Update dependency grpcio to v1.54.2 by @renovate in #10
- Update dependency streamlit to v1.23.1 by @renovate in #12
- Update dependency termcolor to v2.3.0 by @renovate in #13
- Update dependency uvicorn to v0.22.0 by @renovate in #14
- Update dependency starlette to v0.27.0 [SECURITY] by @renovate in #40
- Bump starlette from 0.26.1 to 0.27.0 in /api by @dependabot in #41
- Bump requests from 2.28.2 to 2.31.0 in /notebooks by @dependabot in #47
- Update dependency pydantic to v1.10.8 by @renovate in #53
- Update dependency redis to v4.5.5 by @renovate in #54
- Create renovate.config by @runyontr in #56
- Add Whisper FastAPI backend by @gerred in #59
- added observibility to chart by @runyontr in #51
- add Dockerfile and model caching by @gerred in #62
- Add the whisper frontend and chart configuration by @gerred in #70
- updates for deployment by @runyontr in #69
- Change API container base to alpine by @gerred in #79
- Update README.md w/ Mermaid diagrams by @runyontr in #83
- Update cgr.dev/chainguard/python Docker tag to v3.11 by @renovate in #80
- Spell fix in README.md by @ansilh in #86
- Add python-multipart by @gerred in #85
- first proto example by @runyontr in #92
- Deployment by @runyontr in #90
- First golang generation by @runyontr in #94
- Change protobuf structure, adjust API and LLMs to match by @gerred in #93
- build images by @runyontr in #106
- Add user logos by @gerred in #111
- Update dependency numpy to v1.25.0 by @renovate in #110
- Update dependency grpcio to v1.56.0 by @renovate in #108
- Update dependency redis to v4.6.0 by @renovate in #113
- Update cgr.dev/chainguard/go Docker tag to v1.21 by @renovate in #105
- Update dependency datasets to ~=2.13.1 by @renovate in #107
- Update module google.golang.org/grpc to v1.56.1 by @renovate in #104
- Update dependency openai to v0.27.8 by @renovate in #63
- Update nvidia/cuda Docker tag to v11.8.0 by @renovate in #15
- Update dependency pandas to v2 by @renovate in #60
- Update nvidia/cuda Docker tag to v12 by @renovate in #61
- Update dependency transformers to ~=4.30.2 by @renovate in #72
- Update dependency streamlit to v1.24.0 by @renovate in #114
- move Go to 1.20 by @gerred in #119
- pass base image tag in makefile by @runyontr in #109
- Add PyPi build by @gerred in #121
- Dynamic Discovery of models by @runyontr in #124
- Refactor Config to net/http DefaultHandler pattern, move OpenAI backend into separate package by @gerred in #127
- Add OpenAI Streaming Support, CompletionStream protobuf, and ctransformers model backend by @gerred in #129
- feat: renamed generate endpoint to completion by @DayOfThePenguin in #132
- Add huggingface by @DayOfThePenguin in #136
- Chat completion by @DayOfThePenguin in #137
- feat: add working embedding endpoint for all-minilm-l6-v2 by @DayOfThePenguin in #141
- Weaviate fixes by @runyontr in #144
- GitHub artifact build by @runyontr in #146
- --confirm on publish by @runyontr in #148
- Mount OpenAI under /openai/v1 by @gerred in #147
- Draft getting-started.md by @gphorvath in #143
- dependency between jobs by @runyontr in #149
- Adding Pulze.ai Logo by @fabianbaier in #152
- Update README.md by @gerred in #162
- Modify dockerfiles by @YrrepNoj in #168
- Bug Fix: Stablelm and ctransformers by @YrrepNoj in #169
- add health services and reflections to our server gRPC endpoints by @YrrepNoj in #170
- update renovate config to batch all non minor updates for Go and Python by @YrrepNoj in #171
- Update ghcr.io/defenseunicorns/leapfrogai/base Docker tag to v0.3.2 by @renovate in #174
- Update zarf.yaml by @mcamick in #177
- move to asyncio grpc by @gerred in #179
- Adds initial vector db ADR by @CollectiveUnicorn in #204
- Updates Whisper dockerfile, readme, and adds async calls by @CollectiveUnicorn in #206
- Adds parallelization and testing, reorganizes files by @CollectiveUnicorn in #208
- fix zarf constants and reference new lfai-model skeleton package by @YrrepNoj in #216
- Updates Dockerfile, readme, and modifies model to use latest leapfrogai by @CollectiveUnicorn in #218
- [NA] Ops(Fixes): Manifests, README, and .gitignore by @justinthelaw in #217
- Updates README.md files with docker instructions for easy deployments and updates DOCKERFILE's by @CollectiveUnicorn in #219
- change publish workflow to only trigger on tags by @YrrepNoj in #220
- make included model examples use asyncio when running model by @YrrepNoj in #221
- Enables istio-injection to allow cluster communication by @CollectiveUnicorn in #225
- Changes all the references from whisper-1 to whisper by @CollectiveUnicorn in #230
- Bumps zarf package version to 0.0.6 by @CollectiveUnicorn in #231
- Adds docker-compose file and readme by @CollectiveUnicorn in #232
- update models to use model-skeleton 0.4.0 by @YrrepNoj in #228
- Updated VectorDB ADR to reflect ChromaDB as a provisional decision by @gphorvath in #234
- 223 - Expose metallb lb outside the network by @CollectiveUnicorn in #241
- Add configs, pydantic, confz by @gerred in #240
- Spike at new CLI and SDK implementation by @gerred in #243
- add graceful shutdown by @gerred in #245
- Split up monorepo and update README by @gphorvath in #247
- Updated readme to reflect current LeapfrogAI components by @gphorvath in #252
- added a section routing to deployment instructions by @gphorvath in #253
- techdebt: contributing guide by @fingermustache in #257
- Migrate Hugo Docs into main repo by @gphorvath in #259
- ADR for vLLM as a default inferencing backend by @gerred in #261
- Updates status of the backends in the readme by @CollectiveUnicorn in #263
- Adds rag and others to website by @CollectiveUnicorn in #264
- Changed Status from Accepted to Pending by @barronstone in #265
- feat: LLM Backend ADR and ADR Template by @gphorvath in #262
- chore: add CODEOWNERS to repository by @YrrepNoj in #269
- chore(ci): Add pre-commit linting and scorecard workflows by @YrrepNoj in #270
- chore: issue templates by @gphorvath in #301
- chore: Consolidate our repos into a monorepo by @YrrepNoj in #268
- refactor: consolidate model backends into the monorepo by @YrrepNoj in #275
- UDS Documentation Updates by @jalling97 in #328
- Updates the ruff version to latest (0.3.4) by @CollectiveUnicorn in #333
- New UI Feature Request Template by @gregclark-defenseunicorns in #348
- test: add workflows for e2e tests, pytest, and release publishing by @YrrepNoj in #315
- Delete .github/ISSUE_TEMPLATE/feature_request_ui.md by @gregclark-defenseunicorns in #351
- Update feature_request.md by @gregclark-defenseunicorns in #364
- Monorepo local UDS deployment and README Updates by @jalling97 in #322
- chore: add zarf.yaml schema and set useful component names by @YrrepNoj in #373
- chore: remove redundant optional dependencies within our pyproject.toml by @YrrepNoj in #374
- API - Fixed typing for ChatFunction by @jmho in #366
- chore: cleanup todo comments throughout the codebase by @YrrepNoj in #375
- fix: correct typos in e2e workflow path triggers by @YrrepNoj in #376
- chore: create shim e2e workflow by @YrrepNoj in #377
- chore(ci): always run the pytest workflow by @YrrepNoj in #379
- chore: update vllm to use gptq quanitzed model by @YrrepNoj in #378
- fix: release workflow calling the wrong make target by @YrrepNoj in #383
- chore: Conform version numbers prior to release by @YrrepNoj in #382
New Contributors
- @dependabot made their first contribution in #2
- @renovate made their first contribution in #1
- @gerred made their first contribution in #17
- @runyontr made their first contribution in #21
- @Racer159 made their first contribution in #42
- @ansilh made their first contribution in #86
- @DayOfThePenguin made their first contribution in #132
- @gphorvath made their first contribution in #143
- @fabianbaier made their first contribution in #152
- @mcamick made their first contribution in #177
- @CollectiveUnicorn made their first contribution in #204
- @fingermustache made their first contribution in #257
- @barronstone made their first contribution in #265
- @jalling97 made their first contribution in #328
- @gregclark-defenseunicorns made their first contribution in #348
- @jmho made their first contribution in #366
Full Changelog: https://github.com/defenseunicorns/leapfrogai/commits/v0.6.0