Discussion: Cortex.cpp Model and model.yaml #1090

dan-menlo · 2024-09-04T03:44:15Z

Overview

Linked: Discussion: Cortex.cpp Data Structures #1040
Where and how do we store models?
How do we structure model folders?
How do we detect models?
What is the format of our model.yaml?

Docs

The text was updated successfully, but these errors were encountered:

freelerobot · 2024-09-04T06:50:29Z

Data Folder Questions

What is the data structure of ~/.cortexcpp/models?
Where do the following go

model yaml
model binaries, especially of multiple binaries (model_1_of_5.bin)
versions of the same model, e.g. llama3.1, llama3.2
presets (if any, can also defer for later discussion)

Is our preference on a more flat folder structure? rather than super nested?
Previously, we had many bugs resulting from expecting folder & file names to be a certain way.
e.g. we expected unique_model_id (used by backend) to be the same as the model folder name
Something to be aware of in this iteration 🙏

Model Downloading

What happens when model download fails halfway (e.g. internet disconnected)?
How do we detect models? e.g. if users "import models locally" would it still work
How do we version models?

If we update our model.yaml, or remote model binary in the HF branch, will download still work?
Or will "redownloading/updating" fail due to "model exists"

Letting users do cortex models update is currently out of scope right?

Model importing

Can users import existing models?
Is it hard copy or symlink

Model YAML

We are auto populating the model.yaml if user downloads a new GGUF file (not from our HF repo)?
What happens when user deletes the YAML accidentally?
What happens when YAML is there but Binary is deleted? (should we spec little unit tests like this)
Is this up to date? https://cortex.so/docs/model-yaml/
If users update YAMLs, when do changes take effect?

vansangpfiev · 2024-09-05T01:58:53Z

Data Folder Questions

Q 1. What is the data structure of ~/.cortexcpp/models?

~/.cortexcpp/
|___ models
       |__ tinyllama.yaml
       |__ tinyllama
       |     |__ model_01.gguf
       |     |__ model_02.gguf
       |     |__ model.yml
       |__ llama3.1
       |__ llama3.2

After downloading model (from cortexso or other HF repository), cortex generates tinyllama.yaml file which is used for model management.

Q 2. Is our preference on a more flat folder structure? rather than super nested?
Can you give an example of the flat folder structure?
Q 3. Can you elaborate more about the issue? Do we have any ticket to track that issue yet?

Model importing
Q 1. Can users import existing models?

We don't support it yet. TBD when we will support it.
Q 2. Is it hard copy or symlink
It will be a symlink

cc: @0xSage

nguyenhoangthuan99 · 2024-09-05T02:00:24Z

Model Yaml

Folder structure:

~/.cortexcpp/
|___ models
       |__ tinyllama.yaml
       |__ tinyllama
       |     |__ model_01.gguf
       |     |__ model_02.gguf
       |     |__ model.yml
       |__ llama3.1
       |__ llama3.2

With model not from cortexso:

When model is downloaded, it will parse and save information to <model_id>/model.yml and <model_id>.yaml, this 2 files is the same.
When user delete 1 of 2 .yaml file. We can provide a command to recover it like cortex-cpp models recover, to check and resolve yml error.

when YAML is there but binary is deleted, will raise No such file or directory error when load models.
The doc from https://cortex.so/docs/model-yaml/ is up to date.
To apply update user need to stop running models and re run chat to load new configuration

namchuai · 2024-09-05T02:01:17Z

Model Downloading

What happens when model download fails halfway (e.g. internet disconnected)?

We don't support resume failed/pause download.
If model download is failed, its <model_id>.yaml (inside <data_folder>/models) won't be created and won't display in our model list.

How do we detect models? e.g. if users "import models locally" would it still work

Scan for <model_id>.yaml file inside <data_folder>/models.

How do we version models?

Currently, we using a field version inside yaml file to store the model's version. Please not that we don't have logic to support model update at the moment. For now, Version is just for display purpose.

3.1. If we update our model.yaml, or remote model binary in the HF branch, will download still work? Or will "redownloading/updating" fail due to "model exists"

It will display model exists.

Letting users do cortex models update is currently out of scope right?

Yes, it is. We haven't work on this.

dan-menlo added this to Jan & Cortex Sep 4, 2024

dan-menlo converted this from a draft issue Sep 4, 2024

freelerobot mentioned this issue Sep 4, 2024

Discussion: Cortex.cpp Data Structures #1040

Closed

freelerobot assigned vansangpfiev, namchuai and nguyenhoangthuan99 Sep 4, 2024

freelerobot added the P1: important Important feature / fix label Sep 4, 2024

janhq locked and limited conversation to collaborators Sep 5, 2024

dan-menlo converted this issue into discussion #1113 Sep 5, 2024

github-project-automation bot moved this from Need Investigation to Completed in Jan & Cortex Sep 5, 2024

dan-menlo moved this from Completed to Discontinued in Jan & Cortex Sep 6, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

This issue was moved to a discussion.

Discussion: Cortex.cpp Model and model.yaml #1090

Discussion: Cortex.cpp Model and model.yaml #1090

dan-menlo commented Sep 4, 2024 •

edited by freelerobot

Loading

freelerobot commented Sep 4, 2024 •

edited

Loading

vansangpfiev commented Sep 5, 2024

nguyenhoangthuan99 commented Sep 5, 2024 •

edited

Loading

namchuai commented Sep 5, 2024

This issue was moved to a discussion.

This issue was moved to a discussion.

Discussion: Cortex.cpp Model and model.yaml #1090

Discussion: Cortex.cpp Model and model.yaml #1090

Comments

dan-menlo commented Sep 4, 2024 • edited by freelerobot Loading

Overview

Docs

freelerobot commented Sep 4, 2024 • edited Loading

Data Folder Questions

Model Downloading

Model importing

Model YAML

vansangpfiev commented Sep 5, 2024

nguyenhoangthuan99 commented Sep 5, 2024 • edited Loading

namchuai commented Sep 5, 2024

Model Downloading

This issue was moved to a discussion.

dan-menlo commented Sep 4, 2024 •

edited by freelerobot

Loading

freelerobot commented Sep 4, 2024 •

edited

Loading

nguyenhoangthuan99 commented Sep 5, 2024 •

edited

Loading