[Tracking] [Docs] Model Support Streamlining #1001

CharlieFRuan · 2023-10-02T17:00:30Z

Overview

This task focuses on streamlining the process of adding new models and keeping track of the existing prebuilt model libraries and model weights we offer. Specifically, there are three main goals:

Make the prebuilt model page more intuitive and well-structured
Create an end-to-end instruction for adding a new model, (as currently, “supporting a model” is relatively vague)
A better way to track the community’s model requests and contributions

For more on the design/layout, please see the doc here. Feel free to offer your suggestions, any insights would be greatly appreciated.

Co-author: @rickzx

Action Items

Revamp the current prebuilt model page
- [Docs] Model prebuilts tracking page revamp #1000
- [Docs] Iterate model prebuilts docs #1043
~~Create an end-to-end instruction/checklist on how to support a model, and where to update the documentation accordingly.~~ (Edit: Perhaps not needed for now; include some pointers in the model prebuilt page to direct how to contribute to the tables)
- ~~Draft preview in pdf; corresponding to this branch. Need to wait for the model revamp PR to merge first. Also need to add concrete examples to make the page easier to follow.~~
- Contribution log preview: so that contributors have a centralized place to record the models they compiled. We sync to the prebuilt model page once in a while (avoid and batchify cumbersome documentation updates).
Create a dashboard for new model requests (perhaps use Phi or Falcon for trial to see how viable the workflow is)
- https://github.com/orgs/mlc-ai/projects/2

Edit: move the below TODOs to the future when the new compile workflow lands

Add some form of automation for adding model libraries and prebuilt weights (suggested by @junrushao)
Add some form of automation that checks whether all the prebuilts we support are functioning (run weekly perhaps)

Links to Related Issues and PRs

There are various past efforts to improve this aspect of MLC-LLM's workflow, e.g.:

We would hope this attempt to be more future-proof and long-lasting.

junrushao · 2023-10-02T17:21:56Z

Hey @CharlieFRuan I love the work you've been leading on model streamlining! One additional question I'd love to discuss with you is that we will need not only a streamlined process, but also a transparent and reproducible workflow, preferably, say, a bash script or so, to release new models on all platforms, pushing them to GitHub and weights to HuggingFace.

So far, @MasterJH5574 has been in charge of new model releasing process, while knowing it's theoretically feasible, non-CMU contributors like me usually don't have sufficient incentive to push through in case of potential breaking change. Having a transparent bash script will help me personally to add new models without having to bother Ruihang all the time :)

CharlieFRuan · 2023-10-02T18:57:02Z

Hi @junrushao, thanks for bringing this up! We agree that some automation would make the process a lot less cumbersome, especially given that we have so many degrees of freedom (mode size, quantization scheme, and most importantly, platforms). I added this point to the task; we will follow up on this! cc @rickzx

junrushao · 2023-10-05T16:27:19Z

There are three aspects of releasing a model:

A1. The model weights, which we usually update to HuggingFace; A model architecture could correspond to multiple model weights, for example, Llama2 for Llama2 and WizardLM;
A2. The model lib, or the compiled execution of the model; A model architecture usually corresponds to a single model lib, unless there's some minor tweaks in the architecture;
A3. The default mlc-chat-configuration.

Likely A1 and A3 could be bundled together, but I was curious if we have a plan displaying (A1 + A3) and A2 in a table?

CharlieFRuan · 2023-10-05T16:43:15Z

The PR here #1000 displays A1 and A2 into a set of tables. In the comment, there is a pdf for easier review of the page. There are three levels from high to low:

An all-in-one table, displaying all architectures and variants we support
Model lib table (A2 here), one for each architecture
Model weights (A1 here), one for each model variant

A3 is indeed missing, and we will add it to the level 3 tables! cc @rickzx

junrushao · 2023-10-05T16:46:22Z

Thanks! Let's move the discussion to that PR

CharlieFRuan added the status: tracking Tracking work in progress label Oct 2, 2023

github-project-automation bot added this to MLC LLM Tracking Oct 2, 2023

CharlieFRuan moved this to In Progress in MLC LLM Tracking Oct 2, 2023

CharlieFRuan assigned CharlieFRuan and unassigned CharlieFRuan Oct 2, 2023

tqchen assigned CharlieFRuan Oct 2, 2023

CharlieFRuan moved this from In Progress to Done in MLC LLM Tracking Nov 6, 2023

CharlieFRuan closed this as completed Feb 15, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Tracking] [Docs] Model Support Streamlining #1001

[Tracking] [Docs] Model Support Streamlining #1001

CharlieFRuan commented Oct 2, 2023 •

edited

Loading

junrushao commented Oct 2, 2023

CharlieFRuan commented Oct 2, 2023

junrushao commented Oct 5, 2023

CharlieFRuan commented Oct 5, 2023 •

edited

Loading

junrushao commented Oct 5, 2023

[Tracking] [Docs] Model Support Streamlining #1001

[Tracking] [Docs] Model Support Streamlining #1001

Comments

CharlieFRuan commented Oct 2, 2023 • edited Loading

Overview

Action Items

Links to Related Issues and PRs

junrushao commented Oct 2, 2023

CharlieFRuan commented Oct 2, 2023

junrushao commented Oct 5, 2023

CharlieFRuan commented Oct 5, 2023 • edited Loading

junrushao commented Oct 5, 2023

CharlieFRuan commented Oct 2, 2023 •

edited

Loading

CharlieFRuan commented Oct 5, 2023 •

edited

Loading