Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Feat/llm responses #376

Merged
merged 234 commits into from
Feb 18, 2025
Merged
Show file tree
Hide file tree
Changes from all commits
Commits
Show all changes
234 commits
Select commit Hold shift + click to select a range
f15be68
Started working on llm_responses
NotBioWaste905 Jul 19, 2024
56b7789
Created class, created 1st tutorial
NotBioWaste Jul 22, 2024
af60115
Added dependecies for langchain
NotBioWaste Jul 22, 2024
b3b79a5
Fixed adding custom prompt for each node
NotBioWaste Jul 22, 2024
6eb910d
Added image processing, updated tutorial
NotBioWaste Jul 22, 2024
1f8cddc
Added typehint
NotBioWaste Jul 22, 2024
74cd954
Added llm_response, LLM_API, history management
NotBioWaste Jul 22, 2024
1fd31a2
Fixed image reading
NotBioWaste Jul 22, 2024
2c48490
Started llm condition
NotBioWaste Jul 24, 2024
a1884e5
Added message_to_langchain
NotBioWaste Jul 24, 2024
61f302e
Implementing deepeval integration
NotBioWaste Jul 29, 2024
38a8f8f
Figured out how to implement DeepEval functions
NotBioWaste905 Jul 30, 2024
592267f
Adding conditions
NotBioWaste Jul 31, 2024
baccc47
Implemented simple conditions call, added BaseMethod class, renaming,…
NotBioWaste Aug 1, 2024
8e84ba1
Fixed history extraction
NotBioWaste Aug 2, 2024
2b2847b
Delete test_bot.py
NotBioWaste905 Aug 2, 2024
7e336ac
Fixed prompt handling, switched to AIMessage in LLM response
NotBioWaste Aug 5, 2024
71babbf
Merge branch 'feat/llm_responses' of https://github.com/deeppavlov/di…
NotBioWaste Aug 5, 2024
351ae06
Fixed conditions call
NotBioWaste Aug 5, 2024
e3d0d15
Working on autotesting
NotBioWaste Aug 5, 2024
0405998
Added tests
NotBioWaste Aug 7, 2024
3dbfd0c
Removed unused method
NotBioWaste Aug 7, 2024
5c876ba
Added annotations
NotBioWaste Aug 7, 2024
8f1932c
Added structured output support, tweaked tests
NotBioWaste Aug 7, 2024
aedf47e
Reworking tutorials
NotBioWaste Aug 7, 2024
adadb05
Reworked prompt usage and hierarchy, reworked filters and methods
NotBioWaste Aug 12, 2024
0288896
No idea how to make script smaller in tutorials
NotBioWaste Aug 12, 2024
67e2758
Small fixes in tutorials and structured generation
NotBioWaste Aug 13, 2024
428a9f0
Working on user guide
NotBioWaste Aug 14, 2024
5e26b4b
Fixed some tutorials, finished user guide
NotBioWaste Aug 14, 2024
5dbb6cd
Bugfixes in docs
NotBioWaste Aug 14, 2024
db63d1a
Lint
NotBioWaste Aug 14, 2024
2b9080f
Removed type annotation that broke docs building
NotBioWaste Aug 14, 2024
2bcda71
Tests and bugfixes
NotBioWaste Aug 15, 2024
d2f28ed
Deleted DeepEval references
NotBioWaste Aug 15, 2024
7318c91
Numpy versions trouble
NotBioWaste Aug 15, 2024
27eae27
Fixed dependecies
NotBioWaste Aug 16, 2024
3fed1fc
Made everything asynchronous
NotBioWaste Aug 16, 2024
30862ca
Added and unified docstring
NotBioWaste Aug 16, 2024
06ab5bc
Added 4th tutorial, fixed message_schema parameter passing
NotBioWaste Aug 16, 2024
798a77b
Bugfix, added max_size to the message_to_langchain function
NotBioWaste Aug 20, 2024
3343159
Made even more everything asynchronous
NotBioWaste Aug 21, 2024
014ff7e
Remade condition, added logprob check
NotBioWaste Aug 21, 2024
761bd81
Async bugfix, added model_result_to_text, working on message_schema f…
NotBioWaste Aug 22, 2024
90a811e
Minor fixes, tinkering tests
NotBioWaste Aug 23, 2024
5bff191
Merge branch 'refs/heads/dev' into feat/llm_responses
RLKRo Aug 23, 2024
8b88ba6
update lock file
RLKRo Aug 23, 2024
20c4afd
Merge remote-tracking branch 'origin/feat/llm_responses' into feat/ll…
RLKRo Aug 23, 2024
0139421
Merge remote-tracking branch 'origin/master' into feat/llm_responses
NotBioWaste905 Sep 18, 2024
9bb0cba
Updating to v1.0
NotBioWaste905 Sep 23, 2024
f2d6b68
Finished tests, finished update
NotBioWaste905 Sep 26, 2024
6fddaea
lint
NotBioWaste905 Sep 26, 2024
e06bc2b
Started working on llm slots
NotBioWaste905 Sep 26, 2024
22d8efc
Resolving pydantic errors
NotBioWaste905 Sep 27, 2024
aa735b5
Delete llmslot_test.py
NotBioWaste905 Sep 27, 2024
cc91133
Finished LLMSlot, working on LLMGroupSlot
NotBioWaste905 Sep 27, 2024
8756838
Merge remote-tracking branch 'origin/feat/llm_responses' into feat/ll…
NotBioWaste905 Sep 27, 2024
f1857f6
Added flag to
NotBioWaste905 Oct 1, 2024
c334ff5
First test attempts
NotBioWaste905 Oct 1, 2024
8306bbb
linting
NotBioWaste905 Oct 1, 2024
f842776
Merge branch 'feat/slots_extraction_update' into feat/llm_responses
NotBioWaste905 Oct 1, 2024
ada17ca
Merge remote-tracking branch 'origin/feat/llm_responses' into feat/ll…
NotBioWaste905 Oct 1, 2024
a45f653
File structure fixed
NotBioWaste905 Oct 3, 2024
3838d30
Fixed naming
NotBioWaste905 Oct 3, 2024
0e650f8
Create LLMCondition and LLMResponse classes
NotBioWaste905 Oct 3, 2024
015cb4f
Debugging flattening
NotBioWaste905 Oct 23, 2024
b6e5eeb
Bugfix
NotBioWaste905 Oct 23, 2024
b20137e
Added return_type property for LLMSlot
NotBioWaste905 Oct 23, 2024
25f5b04
Changed return_type from Any to type
NotBioWaste905 Oct 23, 2024
b651087
lint
NotBioWaste905 Oct 23, 2024
1b5a77b
removed deprecated from_script from tutorials
NotBioWaste905 Nov 2, 2024
c18d375
Fixed LLMCondition class
NotBioWaste905 Nov 2, 2024
459f7fc
Fixed missing 'models' field in Pipeline, updated tutorials
NotBioWaste905 Nov 6, 2024
24300e8
create __get_llm_response method in LLM_API, refactoring LLM Conditio…
NotBioWaste905 Nov 7, 2024
03b02be
Merge branch 'refs/heads/dev' into feat/llm_responses
RLKRo Nov 7, 2024
e6663b3
update lock file
RLKRo Nov 7, 2024
2e1c190
remove outdated entries from conf.py
RLKRo Nov 7, 2024
859c57a
small fixes to user guide
RLKRo Nov 7, 2024
fb3142b
minor tutorial changes
RLKRo Nov 7, 2024
ff81267
Moved docstring, removed pipeline parameter
NotBioWaste905 Nov 13, 2024
7518259
Fixed type annotation for models field in Pipeline
NotBioWaste905 Nov 13, 2024
ac28d78
removed unused imports from llm/__init__.py
NotBioWaste905 Nov 13, 2024
2d4998c
Fix redundancy in chatsky/slots/llm.py
NotBioWaste905 Nov 13, 2024
23d6a31
Fixed circular LLM_API<=>Pipeline import
NotBioWaste905 Nov 13, 2024
ef9baa3
Merge remote-tracking branch 'origin/feat/llm_responses' into feat/ll…
NotBioWaste905 Nov 13, 2024
4bf5bba
Update import order chatsky/llm/filters.py
NotBioWaste905 Nov 13, 2024
9188b89
Fixes in filters
NotBioWaste905 Nov 14, 2024
02894f0
Fixes of LLM_API annotations and docs
NotBioWaste905 Nov 14, 2024
8e839a1
Removed __get_llm_response, lint
NotBioWaste905 Nov 14, 2024
210b10a
Added context_to_history util, some tweaks in responses
NotBioWaste905 Nov 14, 2024
784f323
remove llm_response object initialization from tutorials
RLKRo Nov 14, 2024
042d256
fix imports in __init__ files:
RLKRo Nov 14, 2024
10533ed
fix: rename llm_response to LLMResponse, rename llm_condition to LLMC…
RLKRo Nov 14, 2024
8f21069
fix codeblocks in user guide
RLKRo Nov 14, 2024
95e2418
fix: message_to_langchain accepts context instead of pipeline
RLKRo Nov 15, 2024
934a0b8
remove defaults from filter definitions
RLKRo Nov 15, 2024
1be58a0
check field not none in filters
RLKRo Nov 15, 2024
4d68a29
remove model_name from LLM_API.respond
RLKRo Nov 15, 2024
fa0ae70
make LLMResponse prompt AnyResponse, remove __prompt_to_message
RLKRo Nov 15, 2024
8778637
fix return style in LLM_API.respond
RLKRo Nov 15, 2024
d4b67a1
fix LLM_API.condition signature
RLKRo Nov 15, 2024
4a29687
some doc fixes
RLKRo Nov 15, 2024
37aafb3
fix message schema json dumping
RLKRo Nov 15, 2024
54a7376
remove unused imports
RLKRo Nov 15, 2024
86da03e
fix circular import
RLKRo Nov 15, 2024
eac43e0
fix tests
RLKRo Nov 15, 2024
51c66a8
remove cnd.true()
RLKRo Nov 15, 2024
33242ca
Fixed empty prompt popping up
NotBioWaste905 Nov 15, 2024
65f7c8f
Format
NotBioWaste905 Nov 15, 2024
dc92132
Switched model from 3.5-turbo to 4o-mini
NotBioWaste905 Nov 15, 2024
020a7ef
Updated all of the models
NotBioWaste905 Nov 15, 2024
c9891f6
Fixes and logging
NotBioWaste905 Nov 15, 2024
c678f89
Codestyle
NotBioWaste905 Nov 15, 2024
f2df441
update lock file
RLKRo Nov 15, 2024
f20d463
simplify history text
RLKRo Nov 15, 2024
44e5571
fix codestyle
RLKRo Nov 15, 2024
9f97ce2
fix doc building
RLKRo Nov 15, 2024
b9e738a
Merge branch 'refs/heads/dev' into feat/llm_responses
RLKRo Nov 15, 2024
39750ba
update lock file
RLKRo Nov 15, 2024
6603f7d
remove unnecessary langchain extras
RLKRo Nov 15, 2024
3827462
update lock file
RLKRo Nov 15, 2024
f7e7684
protect langchain imports & sort imports in modules
RLKRo Nov 15, 2024
a4e0462
skip llm tests on missing langchain
RLKRo Nov 15, 2024
13923ab
Added docstrings in llm/methods.py
NotBioWaste905 Nov 20, 2024
537d8cc
Docstring fixes
NotBioWaste905 Nov 20, 2024
35d9d7d
Fixes in message_to_langchain
NotBioWaste905 Nov 20, 2024
e5c83fb
lint
NotBioWaste905 Nov 20, 2024
5a7313f
Fixed overseen raise condition
NotBioWaste905 Nov 20, 2024
0000414
Signature fixes
NotBioWaste905 Nov 20, 2024
36a9f54
Responses related fixes
NotBioWaste905 Nov 20, 2024
ba95767
Slot related fixes + lint
NotBioWaste905 Nov 20, 2024
3d79cec
Fixed abstract call
NotBioWaste905 Nov 20, 2024
8e22b97
Adding tests
NotBioWaste905 Nov 20, 2024
b8de244
Bunch of documentation fixes, removed attachment_to_content
NotBioWaste905 Nov 25, 2024
bfba582
Added tests, need fix
NotBioWaste905 Nov 25, 2024
2b3c02b
Renamed FromTheModel to FromModel
NotBioWaste905 Nov 25, 2024
47f3855
Changes in BaseFilter class
NotBioWaste905 Nov 25, 2024
248d77f
Switched to localhost models in tutorials
NotBioWaste905 Nov 26, 2024
b5ecc1a
Renamed BaseFilter into BaseHistoryFilter, added API reference
NotBioWaste905 Nov 26, 2024
34e5536
Lint
NotBioWaste905 Nov 26, 2024
60c7c97
Slots and tutorials update
NotBioWaste905 Nov 27, 2024
3cf1df7
Tutorials and structured output update
NotBioWaste905 Nov 28, 2024
7f00028
More clear instructions in tutorial
NotBioWaste905 Nov 28, 2024
513eb19
Fixes in llm slots and tutorial
NotBioWaste905 Nov 28, 2024
2cd5d41
lint
NotBioWaste905 Nov 28, 2024
6a0845d
Finalizing tweaks
NotBioWaste905 Nov 29, 2024
81a86e9
Lint
NotBioWaste905 Nov 29, 2024
24e65c5
Removed import test
NotBioWaste905 Nov 29, 2024
b6af8f5
Removed dotenv, fixed Union
NotBioWaste905 Nov 29, 2024
ee5f643
Conditions cleanup
NotBioWaste905 Dec 4, 2024
1ff7020
Switched to the '|' operator, IsImportant and FromModel are now inher…
NotBioWaste905 Dec 4, 2024
2f65265
Added partial extraction to the tutorial
NotBioWaste905 Dec 4, 2024
04c5b54
Moved history flag annotation to another tutorial
NotBioWaste905 Dec 4, 2024
0d56e75
Fixed docstrings
NotBioWaste905 Dec 4, 2024
74c6d5e
Quickfix for message_to_langchain
NotBioWaste905 Dec 4, 2024
7e2da91
Fixed signatures in filters, lint
NotBioWaste905 Dec 4, 2024
7a313d1
Fixed tutorial link
NotBioWaste905 Dec 4, 2024
9b31ac9
Actually fixed tutorial link
NotBioWaste905 Dec 4, 2024
1c4aa24
Fixed splitted lines in tutorials, reworked system prompt handling af…
NotBioWaste905 Dec 4, 2024
419ab8d
Added missing docstrings for LLM_API
NotBioWaste905 Dec 9, 2024
e723334
Small docstring fix
NotBioWaste905 Dec 9, 2024
6b1ffed
Added test for conditions + fixed some bugs
NotBioWaste905 Dec 11, 2024
2a7bd4f
Removed return_schema from condition due to not using it for now
NotBioWaste905 Dec 12, 2024
e25e2f8
Experiencing issues with slot testing
NotBioWaste905 Dec 12, 2024
8e553bd
lint
NotBioWaste905 Dec 12, 2024
fea185c
Fixes in LLM Slot testing
NotBioWaste905 Dec 12, 2024
968fe75
Refactor context_to_history function to streamline filtering of dialo…
NotBioWaste905 Dec 12, 2024
8bc71ce
Working on Prompt rework
NotBioWaste905 Dec 23, 2024
e27d85f
Returned test case
NotBioWaste905 Dec 23, 2024
13e6a31
Started working on get_langchain_context
NotBioWaste905 Jan 13, 2025
93412e8
Working on prompt processing
NotBioWaste905 Jan 20, 2025
3b6f941
Resolved typeching issues in Pipeline
NotBioWaste905 Jan 22, 2025
24237fb
Added some logging, WIP
NotBioWaste905 Jan 22, 2025
f4d1852
Renamed `model_name` parameter into `llm_model_name`
NotBioWaste905 Jan 22, 2025
f0f0e2d
Update LLMResponse
NotBioWaste905 Jan 24, 2025
8b8085f
Update LLM_API to work with LLM Response
NotBioWaste905 Jan 24, 2025
09b0487
Renamed DesaultPositionConfig to PositionConfig
NotBioWaste905 Jan 24, 2025
6eb50e7
Reworked context related functions
NotBioWaste905 Jan 24, 2025
bba5178
Added buch on TODOs
NotBioWaste905 Jan 24, 2025
d1063b9
Made request and response optional for history filters, renamed field…
NotBioWaste905 Jan 29, 2025
cee86a9
Removed deprecated TODO
NotBioWaste905 Jan 29, 2025
3c0fe22
Updated conditions.llm to use get_langchain_context
NotBioWaste905 Jan 29, 2025
44935ff
Added docstring for get_langchain_context, lint
NotBioWaste905 Jan 29, 2025
2b37d59
Fixed renaming issue
NotBioWaste905 Jan 31, 2025
32eae7d
Fixing tests
NotBioWaste905 Jan 31, 2025
6440eaf
Fixed appending empty strings + wrong prompt positions in tests
NotBioWaste905 Jan 31, 2025
b9f3925
Added missing PositionConfig
NotBioWaste905 Jan 31, 2025
d43b468
Added de-flattening func to slots.llm
NotBioWaste905 Feb 3, 2025
4968907
Update prompt handling in LLM conditions and tests
NotBioWaste905 Feb 3, 2025
42b6ced
Refactor Prompt model to use float for position attribute, not BasePr…
NotBioWaste905 Feb 4, 2025
b11f44b
Added tests for get_langchain_context
NotBioWaste905 Feb 4, 2025
5bedd3f
lint
NotBioWaste905 Feb 4, 2025
c792f93
Modified tutorial to include prompt positioning
NotBioWaste905 Feb 4, 2025
c8dc417
Added mock OPENAI_API_KEY for tutorials to be testes
NotBioWaste905 Feb 5, 2025
1f31292
removed pipe symbol from union
NotBioWaste905 Feb 5, 2025
bf8f7cf
lint
NotBioWaste905 Feb 5, 2025
6204b85
Added actual Union
NotBioWaste905 Feb 5, 2025
b7f1cd7
Fixed wrong method override
NotBioWaste905 Feb 6, 2025
65e24f9
Updated tutorial
NotBioWaste905 Feb 6, 2025
8f0587f
lint
NotBioWaste905 Feb 6, 2025
94aa660
Added missing mock ANTHROPIC_API_KEY
NotBioWaste905 Feb 6, 2025
2a21f5a
Trying to fix escape sequence
NotBioWaste905 Feb 6, 2025
be76bf1
Okay this breaks everything
NotBioWaste905 Feb 6, 2025
5b120c6
Fixed typo
NotBioWaste905 Feb 6, 2025
37ae4ae
Updated userguide
NotBioWaste905 Feb 6, 2025
db4f8e1
readability improvements
NotBioWaste905 Feb 10, 2025
5d9681b
tests are grouped into classes
NotBioWaste905 Feb 12, 2025
ed7ba23
Fixed formatting
NotBioWaste905 Feb 12, 2025
aed1af8
Updated tutorials
NotBioWaste905 Feb 12, 2025
fc706f9
Updated tutorial via llm
NotBioWaste905 Feb 12, 2025
0bd0e15
Formating fixes and docstrings
NotBioWaste905 Feb 12, 2025
0ee3a8c
Reformatted and improved readability for tutorials
NotBioWaste905 Feb 13, 2025
e25921e
Fixed some hallucinations
NotBioWaste905 Feb 13, 2025
62f0bdd
And once more
NotBioWaste905 Feb 13, 2025
5dd6fc0
Deleted new line
NotBioWaste905 Feb 13, 2025
6570738
Trying to fix doc building
NotBioWaste905 Feb 13, 2025
0d2d5a1
Updated user guide
NotBioWaste905 Feb 14, 2025
6f64e24
Merge branch 'refs/heads/dev' into feat/llm_responses
RLKRo Feb 17, 2025
cb3cd70
fix llm tests with after #93
RLKRo Feb 17, 2025
41978a4
update history extraction after #93
RLKRo Feb 17, 2025
994b32c
change last_request in context history to last_turn
RLKRo Feb 17, 2025
98bb5d8
raise max size to 5000 and update its docstring
RLKRo Feb 17, 2025
f844f0f
improve first llm tutorial
RLKRo Feb 17, 2025
873e3ef
Merge branch 'refs/heads/dev' into feat/llm_responses
RLKRo Feb 17, 2025
4a54f53
lint first tutorial
RLKRo Feb 17, 2025
35cddbd
add loggers
RLKRo Feb 17, 2025
cfdc261
improve docs for condition and response
RLKRo Feb 17, 2025
3666b9a
move check_langchain_available from response to langchain_context
RLKRo Feb 17, 2025
54e3ba3
improve pipeline docs
RLKRo Feb 17, 2025
997e0be
add more entries to llm init
RLKRo Feb 17, 2025
315e697
remove unnecessary logs
RLKRo Feb 18, 2025
d043e7f
fix filters (they did not work at all) & improve documentation
RLKRo Feb 18, 2025
93abd3b
documentation improvements & a few code improvements
RLKRo Feb 18, 2025
6be72b9
move slots to the status of experimental feature
RLKRo Feb 18, 2025
File filter

Filter by extension

Filter by extension


Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
2 changes: 2 additions & 0 deletions .env_file
Original file line number Diff line number Diff line change
@@ -1,4 +1,6 @@
TG_BOT_TOKEN=token
OPENAI_API_KEY=api_key
ANTHROPIC_API_KEY=api_key
MYSQL_USERNAME=root
MYSQL_PASSWORD=pass
MYSQL_ROOT_PASSWORD=pass
Expand Down
1 change: 1 addition & 0 deletions chatsky/__rebuild_pydantic_models__.py
Original file line number Diff line number Diff line change
Expand Up @@ -9,6 +9,7 @@
from chatsky.core.ctx_dict import ContextDict
from chatsky.core.ctx_utils import ServiceState, FrameworkData, ContextMainInfo
from chatsky.core.service import PipelineComponent
from chatsky.llm import LLM_API

ContextMainInfo.model_rebuild()
ContextDict.model_rebuild()
Expand Down
1 change: 1 addition & 0 deletions chatsky/conditions/__init__.py
Original file line number Diff line number Diff line change
Expand Up @@ -11,3 +11,4 @@
)
from chatsky.conditions.slots import SlotsExtracted
from chatsky.conditions.service import ServiceFinished
from chatsky.conditions.llm import LLMCondition
77 changes: 77 additions & 0 deletions chatsky/conditions/llm.py
Original file line number Diff line number Diff line change
@@ -0,0 +1,77 @@
"""
LLM Conditions
--------------
This module provides LLM-based conditions.
"""

from pydantic import Field
from typing import Optional

from chatsky.core import BaseCondition, Context
from chatsky.core.script_function import AnyResponse
from chatsky.llm.methods import BaseMethod
from chatsky.llm.langchain_context import get_langchain_context
from chatsky.llm.filters import BaseHistoryFilter, DefaultFilter
from chatsky.llm.prompt import PositionConfig, Prompt


class LLMCondition(BaseCondition):
"""
LLM-based condition.
Uses prompt to produce result from model and evaluates the result using given method.
"""

llm_model_name: str
"""
Key of the model in the :py:attr:`~chatsky.core.pipeline.Pipeline.models` dictionary.
"""
prompt: AnyResponse = Field(default="", validate_default=True)
"""
Condition prompt.
"""
history: int = 1
"""
Number of dialogue turns aside from the current one to keep in history. `-1` for full history.
"""
filter_func: BaseHistoryFilter = Field(default_factory=DefaultFilter)
"""
Filter function to filter messages in history.
"""
prompt_misc_filter: str = Field(default=r"prompt")
"""
Regular expression to find prompts by key names in MISC dictionary.
"""
position_config: Optional[PositionConfig] = None
"""
Config for positions of prompts and messages in history.
"""
max_size: int = 5000
"""
Maximum size of any message in chat in symbols.
If a message exceeds the limit it will not be sent to the LLM and a warning
will be produced.
"""
method: BaseMethod
"""
Method that takes model's output and returns boolean.
"""

async def call(self, ctx: Context) -> bool:
model = ctx.pipeline.models[self.llm_model_name]

history_messages = []
history_messages.extend(
await get_langchain_context(
system_prompt=await model.system_prompt(ctx),
ctx=ctx,
call_prompt=Prompt(message=self.prompt),
prompt_misc_filter=self.prompt_misc_filter,
position_config=self.position_config or model.position_config,
length=self.history,
filter_func=self.filter_func,
llm_model_name=self.llm_model_name,
max_size=self.max_size,
)
)

return await model.condition(history_messages, self.method)
12 changes: 11 additions & 1 deletion chatsky/core/pipeline.py
Original file line number Diff line number Diff line change
Expand Up @@ -8,10 +8,11 @@
including :py:class:`.Actor`.
"""

from __future__ import annotations
import asyncio
import logging
from functools import cached_property
from typing import Union, List, Optional
from typing import Union, List, Dict, Optional, TYPE_CHECKING
from pydantic import BaseModel, Field, model_validator, computed_field

from chatsky.core.script import Script
Expand All @@ -30,6 +31,9 @@
from chatsky.core.node_label import AbsoluteNodeLabel, AbsoluteNodeLabelInitTypes
from chatsky.core.script_parsing import JSONImporter, Path

if TYPE_CHECKING:
from chatsky.llm.llm_api import LLM_API

logger = logging.getLogger(__name__)


Expand Down Expand Up @@ -78,6 +82,10 @@ class Pipeline(BaseModel, extra="forbid", arbitrary_types_allowed=True):
"""
Slots configuration.
"""
models: Dict[str, LLM_API] = Field(default_factory=dict)
"""
LLM models to be made available in custom functions.
"""
messenger_interface: MessengerInterface = Field(default_factory=CLIMessengerInterface)
"""
A `MessengerInterface` instance for this pipeline.
Expand Down Expand Up @@ -116,6 +124,7 @@ def __init__(
*,
default_priority: float = None,
slots: GroupSlot = None,
models: dict = None,
messenger_interface: MessengerInterface = None,
context_storage: DBContextStorage = None,
pre_services: ServiceGroupInitTypes = None,
Expand All @@ -133,6 +142,7 @@ def __init__(
"fallback_label": fallback_label,
"default_priority": default_priority,
"slots": slots,
"models": models,
"messenger_interface": messenger_interface,
"context_storage": context_storage,
"pre_services": pre_services,
Expand Down
4 changes: 4 additions & 0 deletions chatsky/llm/__init__.py
Original file line number Diff line number Diff line change
@@ -0,0 +1,4 @@
from chatsky.llm.filters import BaseHistoryFilter, FromModel, IsImportant, MessageFilter, Return
from chatsky.llm.methods import BaseMethod, LogProb, Contains
from chatsky.llm.llm_api import LLM_API
from chatsky.llm.prompt import Prompt, PositionConfig
25 changes: 25 additions & 0 deletions chatsky/llm/_langchain_imports.py
Original file line number Diff line number Diff line change
@@ -0,0 +1,25 @@
from typing import Any

try:
from langchain_core.output_parsers import StrOutputParser
from langchain_core.language_models.chat_models import BaseChatModel
from langchain_core.messages.base import BaseMessage
from langchain_core.messages import HumanMessage, SystemMessage, AIMessage
from langchain_core.outputs.llm_result import LLMResult

langchain_available = True
except ImportError: # pragma: no cover
StrOutputParser = Any
BaseChatModel = Any
BaseMessage = Any
HumanMessage = Any
SystemMessage = Any
AIMessage = Any
LLMResult = Any

langchain_available = False


def check_langchain_available(): # pragma: no cover
if not langchain_available:
raise ImportError("Langchain is not available. Please install it with `pip install chatsky[llm]`.")
164 changes: 164 additions & 0 deletions chatsky/llm/filters.py
Original file line number Diff line number Diff line change
@@ -0,0 +1,164 @@
"""
Filters
---------
This module contains a collection of basic functions for history filtering to avoid cluttering LLMs context window.
"""

import abc
from enum import Enum
from logging import Logger
from typing import Union, Optional

from pydantic import BaseModel

from chatsky.core.message import Message
from chatsky.core.context import Context


logger = Logger(name=__name__)


class Return(Enum):
"""
Enum that defines options for filtering turns.
"""

NoReturn = 0
"""
Do not include the turn.
"""
Request = 1
"""
Include request only.
"""
Response = 2
"""
Include response only.
"""
Turn = 3
"""
Include the entire turn (both request and response).
"""


class BaseHistoryFilter(BaseModel, abc.ABC):
"""
Base class for all message history filters.
"""

@abc.abstractmethod
def call(
self, ctx: Context, request: Optional[Message], response: Optional[Message], llm_model_name: str
) -> Union[Return, int]:
"""
Decide whether to include request or response or both in the context history from
a single turn.

The filter function is called repeatedly over all turns in context (up to history length limit in
:py:func:`~chatsky.llm.langchain_context.context_to_history`) to determine which parts of the turn
to include.

Both request and response may be ``None``. Even if such messages are not filtered out by this filter,
they won't be included in history.

:param ctx: Context object.
:param request: Request message.
:param response: Response message.
:param llm_model_name: Name of the model that calls this filter in the Pipeline.models.

:return: Instance of Return enum or a corresponding int value.
"""
raise NotImplementedError()

def __call__(
self, ctx: Context, request: Optional[Message], response: Optional[Message], llm_model_name: str
) -> Return:
"""
Wrapper for call that catches exceptions and does not return any turn items if an exception occurs.

:param ctx: Context object.
:param request: Request message.
:param response: Response message.
:param llm_model_name: Name of the model that calls this filter in the Pipeline.models.

:return: Instance of Return enum.
"""
try:
result = self.call(ctx, request, response, llm_model_name)

if isinstance(result, int):
result = Return(result)

return result
except Exception as exc:
logger.warning(exc)
return Return.NoReturn


class MessageFilter(BaseHistoryFilter):
"""
Variant of history filter that allows to define simple filters that do not
differentiate between requests and responses.
"""

@abc.abstractmethod
def single_message_filter_call(self, ctx: Context, message: Optional[Message], llm_model_name: str) -> bool:
"""
Determine based on a single message (which may be either request or response)
whether to include the message in history.

:param ctx: Context object.
:param message: Either request or response message.
:param llm_model_name: Name of the model that calls this filter in the Pipeline.models.

:return: Whether the `message` should be included in history.
"""
raise NotImplementedError()

def call(
self, ctx: Context, request: Optional[Message], response: Optional[Message], llm_model_name: str
) -> Union[Return, int]:
return (
int(self.single_message_filter_call(ctx, request, llm_model_name)) * Return.Request.value
| int(self.single_message_filter_call(ctx, response, llm_model_name)) * Return.Response.value
)


class DefaultFilter(BaseHistoryFilter):
"""
Filter used by default.
Never filters out messages.
"""

def call(
self, ctx: Context, request: Optional[Message], response: Optional[Message], llm_model_name: str
) -> Union[Return, int]:
return Return.Turn


class IsImportant(MessageFilter):
"""
Filter that checks if the "important" field in a Message.misc is True.
"""

def single_message_filter_call(self, ctx: Context, message: Optional[Message], llm_model_name: str) -> bool:
if message is not None and message.misc is not None and message.misc.get("important", None):
return True
return False


class FromModel(BaseHistoryFilter):
"""
Filter that checks if the response of the turn is generated by the currently
"""

def call(
self, ctx: Context, request: Optional[Message], response: Optional[Message], llm_model_name: str
) -> Union[Return, int]:
if (
response is not None
and response.annotations is not None
and response.annotations.get("__generated_by_model__") == llm_model_name
):
return Return.Turn
return Return.NoReturn
Loading
Loading