Releases: kubeagi/arcadia
Releases · kubeagi/arcadia
kuberay-operator-1.0.0
A Helm chart for Kubernetes
arcadia-0.2.9
A Helm chart(KubeBB Component) for KubeAGI Arcadia
arcadia-0.2.8
A Helm chart(KubeBB Component) for KubeAGI Arcadia
arcadia-0.2.7
A Helm chart(KubeBB Component) for KubeAGI Arcadia
arcadia-0.2.6
A Helm chart(KubeBB Component) for KubeAGI Arcadia
arcadia-0.2.4
A Helm chart(KubeBB Component) for KubeAGI Arcadia
arcadia-0.2.3
A Helm chart(KubeBB Component) for KubeAGI Arcadia
arcadia-0.2.2
A Helm chart(KubeBB Component) for KubeAGI Arcadia
arcadia-0.2.1
A Helm chart(KubeBB Component) for KubeAGI Arcadia
v0.1.0
arcadia-v0.1.0
Welcome to this new release! Our first release towards one-stop LLMOps!
Images built for this release:
kubeagi/arcadia:v0.1.0
kubeagi/data-processing:v0.1.0
Breaking Changes:
None
Feature summary 🚀 🚀 🚀
- Dataset management
- Manage data by integrating with object storage(s3), view excel file and add label to different data types
- Versioned dataset management with default datasource
ObjectStorageService
- Comprehensive data processing capabilities: data cleaning, text splitting (e.g., text segmentation, QA splitting using LLM)
- RDMA as an optional storage service that can speed up model/data download by about 10 times
- AI Knowledgebase
- Auto QA Embedding generation and indexing
- Chromadb as the vector store by default
- AI Model and Inference Service
- Manage the lifecycle of model and Inference Service
- Able to host llm and embedding models in Kubernetes via our
Worker
protocol: qwen, baichuan, vicuna, chatglm, bge-large-zh-v1.5, etc... - Able to integrate with powerful 3rd_party providers, like zhipuai, openai, etc...
- Model loading accelerations with rdma network protocols
- Support CPU & GPU Model Serving
- LLM Applications
- A powerful and flexible Application Runtime
- GPTs - initial implementation of LLM application orchestration capabilities. Manage and orchestrate Prompt, LLM/Retriever Chain nodes, and provide relevant example applications (based on streamlit)
- Provide LLMChain and RetrivalQAChain for common LLM applications and RAG applications
- Create/debug typical GPT like application using web console easily
- Support blocking and SSE mode chat
-
A all-in-one deployment helm chart
-
Documentation online doc link
Changelog
New Features
- b1bda3c: feat: Develop CRD Datasource (@zqq454224016)
- 8de4d5a: feat: a kubeagi yaml file to install arcadia with kubebb (@bjwswang)
- 82a1f43: feat: able to configure base url for openai kind of embedders/llms (@bjwswang)
- 933416c: feat: able to configure text splitter in local dataset management (@bjwswang)
- 5df65ad: feat: able to configure worker type when create and update (@bjwswang)
- 6ec48be: feat: able to get model list provided by Embedder and LLM (@bjwswang)
- 1113151: feat: able to load multiple files into vector store with arctl (@bjwswang)
- 04f4d1a: feat: able to offline worker instead of remove (@bjwswang)
- 0fc7121: feat: able to user worker's llm/embedding service as 3rd party provider (@bjwswang)
- 295710b: feat: able to view system pre-configured models (@bjwswang)
- 450f989: feat: add CRD Embeddings (@Lanture1064)
- 9a210ca: feat: add CRD LLM controller (@Lanture1064)
- 75e437a: feat: add CRD: KnowledgeBase and VectorStore (@Abirdcfly)
- 1823608: feat: add CRDs for App (@Abirdcfly)
- d966c36: feat: add LLM read & list api (@Lanture1064)
- fccd825: feat: add Model.gql (@dayuy)
- 8ec9adc: feat: add RDMA definition to datasource (@0xff-dev)
- c2f8d75: feat: add SDK to call the API (@)
- 73f20da: feat: add SSE invoke support for API server sample (@Lanture1064)
- 51f1416: feat: add a cli to load documents and chat with similarity search (@bjwswang)
- eb6100a: feat: add a new intermediate status OfflineInProgress (@bjwswang)
- 64bd42b: feat: add auto graphql sdk generator action (@Abirdcfly)
- a04cf90: feat: add baichuan-7b chatglm-6b in dashscope as llm (@Abirdcfly)
- 04b90a6: feat: add beijing_gjj_bot example (@Abirdcfly)
- d2c5b52: feat: add chart postgresql as arcadia dependency (@bjwswang)
- 9e90dea: feat: add checkModelService API & fix (@Lanture1064)
- b148b4e: feat: add chromadb (@0xff-dev)
- 2a8dbcd: feat: add dashscope as llm (@Abirdcfly)
- d4bdbd3: feat: add dataprocess helm chart (@Lanture1064)
- 5036518: feat: add dataset.gql and versioned_dataset.gql (@dayuy)
- 9c30084: feat: add datasource comments and update fields (@zqq454224016)
- ac7c619: feat: add document Q&A chat API server example (@Lanture1064)
- 536db67: feat: add embedder API (@zqq454224016)
- 8af5ab7: feat: add embedder and knowledgebase gql (@zqq454224016)
- 9e6d4fb: feat: add example for CRD Datasource test (@zqq454224016)
- 957057f: feat: add fastchat API server & controller helm chart (@Lanture1064)
- 99f70c5: feat: add internal url for endpoint (@bjwswang)
- bdf8a51: feat: add knowledgebase API and update comments (@zqq454224016)
- 9fd129b: feat: add knowledgebase application (@Abirdcfly)
- a821d92: feat: add kubeenv for data processing and lint with isort (@bjwswang)
- 7261fb5: feat: add llama2 in dashscope as llm (@Abirdcfly)
- f3b4b32: feat: add llm_config in data process config (@bjwswang)
- 6e98953: feat: add local worker as llm in app (@Abirdcfly)
- 7458335: feat: add make prepare-push (@Abirdcfly)
- b5d6307: feat: add model service CUD API (@Lanture1064)
- bc1df20: feat: add more chat api (@Abirdcfly)
- a25474a: feat: add more config for app (@Abirdcfly)
- 8c4fd2c: feat: add more detail log in chat api (@Abirdcfly)
- d41c1b3: feat: add parameter whether to enable oidc (@0xff-dev)
- 1166d27: feat: add prompt controller (@bjwswang)
- 9351150: feat: add rbac rating CLI as an example (@bjwswang)
- 850d2d1: feat: add spell test and fix existed misspellings (@Abirdcfly)
- 161a89f: feat: add swagger api doc (@Abirdcfly)
- e482d76: feat: add text-embedding in dashscope (@Abirdcfly)
- c3462f9: feat: add worker chart (@Lanture1064)
- 66f213a: feat: add worker graphql apis and optimize other graphql apis along with tiny changes to operator (@bjwswang)
- f4d1495: feat: add zhipuai embedding api & examples (@Lanture1064)
- 589f7eb: feat: breakpoint upload via MinIO (@dayuy)
- e886f54: feat: chat with llm now support sse (@Abirdcfly)
- a39191a: feat: check datasource bucket and object (@zqq454224016)
- 1139307: feat: create and delete buckets based on namespace changes (@0xff-dev)
- 35f464c: feat: create embedder&llm for worker based on model type (@bjwswang)
- 6bc9314: feat: dataset and versioneddataset impl (@0xff-dev)
- 89baee6: feat: dataset graphql api (@0xff-dev)
- ca63b19: feat: define CRD Dataset and VersionedDataset (@bjwswang)
- c694d5f: feat: define CRD LLM and update helm charts (@bjwswang)
- 40b3899: feat: deploy graphql-server (@zqq454224016)
- 009ac72: feat: develop model management API (@zqq454224016)
- 1404c53: feat: enable AI evaluator for Rating (@bjwswang)
- ba1dafa: feat: enable multiple gpus in worker (@bjwswang)
- 8b74fa1: feat: enable multiple gpus(sinle node) in runner fastchat (@bjwswang)
- 746251b: feat: enable sse-invoke in zhipuai and standarlize response body (@bjwswang)
- 4281700: feat: extract common funs in graphql server to pkg/common (@bjwswang)
- 0e489ea: feat: graphql base framework for bff server (@0xff-dev)
- c748a5c: feat: graphql-server supplement (@zqq454224016)
- ed7e262: feat: impl dataprocessing list api (@bjwswang)
- 1c491d6: feat: implement crd Model and define crd Worker (@bjwswang)
- 9c86805: feat: implement CRD Worker (@bjwswang)
- 2290b8f: feat: implement datasource management for arctl (@bjwswang)
- d064751: feat: init default controller con...