
Add token limit attribute to providers, derive chunk size for indexing docs from this #225

Open
3coins opened this issue Jun 16, 2023 · 0 comments
Labels
enhancement, @jupyter-ai/core


3coins commented Jun 16, 2023

Problem

Each LLM provider and model has a limit on the input size of prompts. When a user asks questions about indexed docs with /ask, the chunk size used to split docs during indexing determines how much context is passed to the LLM in the prompt. Especially for smaller models running locally, an augmented prompt can exceed the prompt size limit.
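For example (illustrative numbers): with a local model limited to 2,048 prompt tokens, a retriever that injects four 512-token chunks already fills the entire window before the question and the prompt template are added, so the request fails or gets truncated.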

Proposed Solution

  • Add a token limit attribute to the provider class, and implement it for the different providers/models.
  • Use the token limit to derive the chunk size, the overlap, or a different chain that optimizes indexing for that particular model (a rough sketch follows this list).
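
Here is a rough sketch of how the derivation could look, assuming a hypothetical `token_limit` class attribute and LangChain's `RecursiveCharacterTextSplitter`; the provider class, the halving heuristic, and the chars-per-token ratio are all illustrative assumptions, not the actual jupyter-ai implementation:

```python
# Sketch only: `ExampleProvider` and `token_limit` are hypothetical,
# not part of the current jupyter-ai provider API.
from langchain.text_splitter import RecursiveCharacterTextSplitter


class ExampleProvider:
    """Stand-in for a jupyter-ai provider; `token_limit` is the proposed attribute."""

    id = "example"
    model_id = "example-small"
    token_limit = 2048  # maximum prompt size, in tokens, for this model


def make_splitter(provider, chunks_per_prompt=4, chars_per_token=4):
    """Derive chunk size and overlap from the provider's token limit.

    Reserve roughly half the context window for the question and the
    prompt template, then split the remainder across the retrieved
    chunks. The chars-per-token ratio is a crude heuristic; a real
    implementation could use a model-specific tokenizer instead.
    """
    context_tokens = provider.token_limit // 2
    chunk_tokens = context_tokens // chunks_per_prompt
    chunk_size = chunk_tokens * chars_per_token
    return RecursiveCharacterTextSplitter(
        chunk_size=chunk_size,
        chunk_overlap=chunk_size // 10,  # 10% overlap between adjacent chunks
    )


splitter = make_splitter(ExampleProvider())
```

Reserving part of the window for the question and the template keeps the augmented prompt under the model's limit even when several chunks are retrieved.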
3coins added the enhancement label on Jun 16, 2023
3coins added this to the 0.9.0 Release milestone on Jun 16, 2023
dlqqq modified the milestones: 0.9.0 Release, 0.10.0 Release on Jun 23, 2023
JasonWeill changed the title from "Add token limit attribute to providers, derive chuck size for indexing docs from this" to "Add token limit attribute to providers, derive chunk size for indexing docs from this" on Jul 27, 2023
JasonWeill removed this from the 2.3.0 Release milestone on Jan 25, 2024