Close #59
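Description
After Tiktoken was introduced, an incorrect call caused the tokens of every conversation in the session history to be counted whenever a service/assistant was loaded, which resulted in significant latency.
A restriction has now been added: tokens are counted only for the current conversation when it is loaded, and no other conversations are counted.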
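The behavior described above can be illustrated with a minimal sketch. It is not the code from this PR: the `Conversation` class, the `load_conversation` function, and the whitespace-based `count_tokens` stand-in are all hypothetical names chosen for illustration, and a real implementation would call the project's Tiktoken-based counter instead.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, List, Optional


@dataclass
class Conversation:
    """Hypothetical conversation record; not the project's actual type."""
    id: str
    messages: List[str] = field(default_factory=list)
    token_count: Optional[int] = None  # None means "not counted yet"


def count_tokens(text: str) -> int:
    """Stand-in tokenizer (whitespace split), used here instead of Tiktoken."""
    return len(text.split())


def load_conversation(
    conversations: Dict[str, Conversation],
    current_id: str,
    counter: Callable[[str], int] = count_tokens,
) -> Conversation:
    """Count tokens for the current conversation only.

    Other conversations in the history are left untouched, so the cost of
    loading scales with one conversation rather than the whole history.
    """
    current = conversations[current_id]
    if current.token_count is None:
        current.token_count = sum(counter(m) for m in current.messages)
    return current


history = {
    "a": Conversation("a", ["hello world", "how are you"]),
    "b": Conversation("b", ["an older conversation that is not open"]),
}
active = load_conversation(history, "a")
```

After the call, only conversation `a` has a token count; `b` stays uncounted until it is opened.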
PR type
What is the purpose of this PR?
PR checklist
Please check that your PR meets the following requirements: