Release v0.6.0 · defenseunicorns/leapfrogai

The first release of the LeapfrogAI monorepo.

🚧 TLDR Release notes incoming... 🚧

Published Container Images

Image	URL
leapfrogai-api	ghcr.io/defenseunicorns/leapfrogai/leapfrogai-api:0.6.0
llama-cpp-python	ghcr.io/defenseunicorns/leapfrogai/llama-cpp-python:0.6.0
vLLM	ghcr.io/defenseunicorns/leapfrogai/vllm:0.6.0
whisper	ghcr.io/defenseunicorns/leapfrogai/whisper:0.6.0
text-embeddings	ghcr.io/defenseunicorns/leapfrogai/text-embeddings:0.6.0

Published Zarf Packages

Package	URL
leapfrogai-api	ghcr.io/defenseunicorns/packages/leapfrogai/leapfrogai-api:0.6.0
llama-cpp-python	ghcr.io/defenseunicorns/packages/leapfrogai/llama-cpp-python:0.6.0
vLLM	ghcr.io/defenseunicorns/packages/leapfrogai/vllm:0.6.0
whisper	ghcr.io/defenseunicorns/packages/leapfrogai/whisper:0.6.0
text-embeddings	ghcr.io/defenseunicorns/packages/leapfrogai/text-embeddings:0.6.0

What's Changed

Bump redis from 4.5.1 to 4.5.4 in /notebooks by @dependabot in #2
Configure Renovate by @renovate in #1
Update Weaviate OPENAI_BASE_URL by @gerred in #17
Change text2vec image to all-miniLM-L6-v2 by @gerred in #18
Added StableLM-7b and fixed model cache by @runyontr in #21
Remove non Apache2 model by @runyontr in #20
Updates for stablelm deployment by @runyontr in #22
Update README.md to fix links by @Racer159 in #42
Helm chart by @runyontr in #50
Update dependency numpy to v1.24.3 by @renovate in #4
Update dependency openai to v0.27.7 by @renovate in #5
Update dependency protobuf to v4.23.2 by @renovate in #8
Update dependency fastapi to v0.96.0 by @renovate in #9
Update dependency grpcio to v1.54.2 by @renovate in #10
Update dependency streamlit to v1.23.1 by @renovate in #12
Update dependency termcolor to v2.3.0 by @renovate in #13
Update dependency uvicorn to v0.22.0 by @renovate in #14
Update dependency starlette to v0.27.0 [SECURITY] by @renovate in #40
Bump starlette from 0.26.1 to 0.27.0 in /api by @dependabot in #41
Bump requests from 2.28.2 to 2.31.0 in /notebooks by @dependabot in #47
Update dependency pydantic to v1.10.8 by @renovate in #53
Update dependency redis to v4.5.5 by @renovate in #54
Create renovate.config by @runyontr in #56
Add Whisper FastAPI backend by @gerred in #59
added observibility to chart by @runyontr in #51
add Dockerfile and model caching by @gerred in #62
Add the whisper frontend and chart configuration by @gerred in #70
updates for deployment by @runyontr in #69
Change API container base to alpine by @gerred in #79
Update README.md w/ Mermaid diagrams by @runyontr in #83
Update cgr.dev/chainguard/python Docker tag to v3.11 by @renovate in #80
Spell fix in README.md by @ansilh in #86
Add python-multipart by @gerred in #85
first proto example by @runyontr in #92
Deployment by @runyontr in #90
First golang generation by @runyontr in #94
Change protobuf structure, adjust API and LLMs to match by @gerred in #93
build images by @runyontr in #106
Add user logos by @gerred in #111
Update dependency numpy to v1.25.0 by @renovate in #110
Update dependency grpcio to v1.56.0 by @renovate in #108
Update dependency redis to v4.6.0 by @renovate in #113
Update cgr.dev/chainguard/go Docker tag to v1.21 by @renovate in #105
Update dependency datasets to ~=2.13.1 by @renovate in #107
Update module google.golang.org/grpc to v1.56.1 by @renovate in #104
Update dependency openai to v0.27.8 by @renovate in #63
Update nvidia/cuda Docker tag to v11.8.0 by @renovate in #15
Update dependency pandas to v2 by @renovate in #60
Update nvidia/cuda Docker tag to v12 by @renovate in #61
Update dependency transformers to ~=4.30.2 by @renovate in #72
Update dependency streamlit to v1.24.0 by @renovate in #114
move Go to 1.20 by @gerred in #119
pass base image tag in makefile by @runyontr in #109
Add PyPi build by @gerred in #121
Dynamic Discovery of models by @runyontr in #124
Refactor Config to net/http DefaultHandler pattern, move OpenAI backend into separate package by @gerred in #127
Add OpenAI Streaming Support, CompletionStream protobuf, and ctransformers model backend by @gerred in #129
feat: renamed generate endpoint to completion by @DayOfThePenguin in #132
Add huggingface by @DayOfThePenguin in #136
Chat completion by @DayOfThePenguin in #137
feat: add working embedding endpoint for all-minilm-l6-v2 by @DayOfThePenguin in #141
Weaviate fixes by @runyontr in #144
GitHub artifact build by @runyontr in #146
--confirm on publish by @runyontr in #148
Mount OpenAI under /openai/v1 by @gerred in #147
Draft getting-started.md by @gphorvath in #143
dependency between jobs by @runyontr in #149
Adding Pulze.ai Logo by @fabianbaier in #152
Update README.md by @gerred in #162
Modify dockerfiles by @YrrepNoj in #168
Bug Fix: Stablelm and ctransformers by @YrrepNoj in #169
add health services and reflections to our server gRPC endpoints by @YrrepNoj in #170
update renovate config to batch all non minor updates for Go and Python by @YrrepNoj in #171
Update ghcr.io/defenseunicorns/leapfrogai/base Docker tag to v0.3.2 by @renovate in #174
Update zarf.yaml by @mcamick in #177
move to asyncio grpc by @gerred in #179
Adds initial vector db ADR by @CollectiveUnicorn in #204
Updates Whisper dockerfile, readme, and adds async calls by @CollectiveUnicorn in #206
Adds parallelization and testing, reorganizes files by @CollectiveUnicorn in #208
fix zarf constants and reference new lfai-model skeleton package by @YrrepNoj in #216
Updates Dockerfile, readme, and modifies model to use latest leapfrogai by @CollectiveUnicorn in #218
[NA] Ops(Fixes): Manifests, README, and .gitignore by @justinthelaw in #217
Updates README.md files with docker instructions for easy deployments and updates DOCKERFILE's by @CollectiveUnicorn in #219
change publish workflow to only trigger on tags by @YrrepNoj in #220
make included model examples use asyncio when running model by @YrrepNoj in #221
Enables istio-injection to allow cluster communication by @CollectiveUnicorn in #225
Changes all the references from whisper-1 to whisper by @CollectiveUnicorn in #230
Bumps zarf package version to 0.0.6 by @CollectiveUnicorn in #231
Adds docker-compose file and readme by @CollectiveUnicorn in #232
update models to use model-skeleton 0.4.0 by @YrrepNoj in #228
Updated VectorDB ADR to reflect ChromaDB as a provisional decision by @gphorvath in #234
223 - Expose metallb lb outside the network by @CollectiveUnicorn in #241
Add configs, pydantic, confz by @gerred in #240
Spike at new CLI and SDK implementation by @gerred in #243
add graceful shutdown by @gerred in #245
Split up monorepo and update README by @gphorvath in #247
Updated readme to reflect current LeapfrogAI components by @gphorvath in #252
added a section routing to deployment instructions by @gphorvath in #253
techdebt: contributing guide by @fingermustache in #257
Migrate Hugo Docs into main repo by @gphorvath in #259
ADR for vLLM as a default inferencing backend by @gerred in #261
Updates status of the backends in the readme by @CollectiveUnicorn in #263
Adds rag and others to website by @CollectiveUnicorn in #264
Changed Status from Accepted to Pending by @barronstone in #265
feat: LLM Backend ADR and ADR Template by @gphorvath in #262
chore: add CODEOWNERS to repository by @YrrepNoj in #269
chore(ci): Add pre-commit linting and scorecard workflows by @YrrepNoj in #270
chore: issue templates by @gphorvath in #301
chore: Consolidate our repos into a monorepo by @YrrepNoj in #268
refactor: consolidate model backends into the monorepo by @YrrepNoj in #275
UDS Documentation Updates by @jalling97 in #328
Updates the ruff version to latest (0.3.4) by @CollectiveUnicorn in #333
New UI Feature Request Template by @gregclark-defenseunicorns in #348
test: add workflows for e2e tests, pytest, and release publishing by @YrrepNoj in #315
Delete .github/ISSUE_TEMPLATE/feature_request_ui.md by @gregclark-defenseunicorns in #351
Update feature_request.md by @gregclark-defenseunicorns in #364
Monorepo local UDS deployment and README Updates by @jalling97 in #322
chore: add zarf.yaml schema and set useful component names by @YrrepNoj in #373
chore: remove redundant optional dependencies within our pyproject.toml by @YrrepNoj in #374
API - Fixed typing for ChatFunction by @jmho in #366
chore: cleanup todo comments throughout the codebase by @YrrepNoj in #375
fix: correct typos in e2e workflow path triggers by @YrrepNoj in #376
chore: create shim e2e workflow by @YrrepNoj in #377
chore(ci): always run the pytest workflow by @YrrepNoj in #379
chore: update vllm to use gptq quanitzed model by @YrrepNoj in #378
fix: release workflow calling the wrong make target by @YrrepNoj in #383
chore: Conform version numbers prior to release by @YrrepNoj in #382

New Contributors

@dependabot made their first contribution in #2
@renovate made their first contribution in #1
@gerred made their first contribution in #17
@runyontr made their first contribution in #21
@Racer159 made their first contribution in #42
@ansilh made their first contribution in #86
@DayOfThePenguin made their first contribution in #132
@gphorvath made their first contribution in #143
@fabianbaier made their first contribution in #152
@mcamick made their first contribution in #177
@CollectiveUnicorn made their first contribution in #204
@fingermustache made their first contribution in #257
@barronstone made their first contribution in #265
@jalling97 made their first contribution in #328
@gregclark-defenseunicorns made their first contribution in #348
@jmho made their first contribution in #366

Full Changelog: https://github.com/defenseunicorns/leapfrogai/commits/v0.6.0

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

v0.6.0

Published Container Images

Published Zarf Packages

What's Changed

New Contributors

Contributors