feat: create a text editor tool with multiple commands #186

salman1993 · 2024-10-23T17:28:01Z

Changes:

support optional params, literals in docstring -> jsonschema: feat: support optional params jsonschema conversion in exchange #188
create a tool that calls out to different commands

Anthropic is using this text editor tool for Computer Use. It might be useful to try this in goose - docs, code. It seems like they prefer to have fewer tools, but each tool has more optional args which are used for specific actions. For example, in goose we have separate tools for read_file, write_file, patch_file but in this case, its a single EditorTool with command as an Enum that routes to the editor action (view, create, insert).

Test:

tool descriptions are correctly passed

lamchau · 2024-10-23T22:08:44Z

packages/exchange/src/exchange/utils.py

@@ -1,7 +1,7 @@
 import inspect
 import uuid
 from importlib.metadata import entry_points
-from typing import get_args, get_origin
+from typing import Literal, get_args, get_origin, Any, List, Tuple, Dict, Union


:nit: we can use list, tuple, and dict directly now

michaelneale · 2024-10-23T22:59:54Z

interesting, will try this out, could be promising. I did notice that tool calling seemed to work well with parameters in the past, and even with richer CLIs - ie able to use the one "tool" different ways seems to suit it more so than choosing between tools (intuitively that makes sense, as a human, to some limited extent, you don't have the cognitive overload of choosing the tool)

michaelneale · 2024-10-24T00:49:16Z

@salman1993 any tips on how to try it out to A/B test?

michaelneale · 2024-10-24T07:03:52Z

tried it out with editing just one spot in a yaml file... just FYI:

The phrase "Found packages with disallowed licenses" occurs multiple times in the .github/workflows/license-check.yml file, making it necessary to be more      
specific in the string replace operation. I'll view more of the file to identify the different locations and modify them accordingly. Let's check the           
occurrences separately.                                                                                                                                         

─── .github/workflows/license-check.yml ────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────
                                                                                                                                                                
         uses: actions/setup-python@v5                                                                                                                          
         with:                                                                                                                                                  
           python-version: '3.10'                                                                                                                               
                                                                                                                                                                
       - name: Install dependencies                                                                                                                             
         run: |                                                                                                                                                 
           python -m pip install --upgrade pip                                                                                                                  
           pip install tomli requests urllib3                                                                                                                   
                                                                                                                                                                
       - name: Check licenses                                                                                                                                   
         run: |                                                                                                                                                 
           python .github/workflows/scripts/check_licenses.py \                                                                                                 
             pyproject.toml || exit_code=$?                                                                                                                     
           if [ "${exit_code:-0}" -ne 0 ]; then                                                                                                                 
             echo "::error::Found packages with disallowed licenses"                                                                                            
             exit 1                                                                                                                                             
           fi                                                                                                                                                   
                                                                                                                                                                
       - name: Check Exchange licenses                                                                                                                          
         run: |                                                                                                                                                 
           python .github/workflows/scripts/check_licenses.py \                                                                                                 
             packages/exchange/pyproject.toml || exit_code=$?                                                                                                   
           if [ "${exit_code:-0}" -ne 0 ]; then                                                                                                                 
             echo "::error::Found packages with disallowed licenses in exchange"                                                                                
             exit 1                                                                                                                                             
           fi                                                                                                                                                   
                                                                                                                                                                


─── .github/workflows/license-check.yml ────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────
                                                                                                                                                                
                                                                                                                                                                
                                                                                                                                                                


─── .github/workflows/license-check.yml ────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────
                                                                                                                                                                
                                                                                                                                                                
                                                                                                                                                                

Let's specify the contexts where we succeed the operations, starting with the first one.                                                                        

─── .github/workflows/license-check.yml ────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────
                                                                                                                                                                
 echo "::error::Found packages with disallowed licenses"                                                                                                        
                                                                                                                                                                

->                                                                                                                                                              

                                                                                                                                                                
 echo "::error::Found packages with disallowed licenses, it was in the goose package."                                                                          
                                                                                                                                                                


─── .github/workflows/license-check.yml ────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────
                                                                                                                                                                
 echo "::error::Found packages with disallowed licenses in exchange"                                                                                            
                                                                                                                                                                

->                                                                                                                                                              

                                                                                                                                                                
 echo "::error::Found packages with disallowed licenses, it was in the goose package."

some odd artifacts like:

michaelneale · 2024-10-24T07:05:03Z

so far seems to do a good job - but not pushed too hard today

lamchau · 2024-10-24T10:59:13Z

src/goose/synopsis/toolkit.py

        patho = system.to_patho(path)

        if patho.exists() and not system.is_active(path):
            print(f"We are warning the LLM to view before write in write_file, with path={path} and patho={str(patho)}")
            raise ValueError(f"You must view {path} using read_file before you overwrite it")

+        self._save_file_history(patho)  # Save current content to history


:nit: i see a lot of erroneous comments like these in this pr and most of the time they're already self documenting. maybe remove?

lamchau · 2024-10-24T11:07:02Z

src/goose/synopsis/toolkit.py

+        return f"Successfully undid the last edit on {path}"
+
+    @tool
+    def text_editor(


there's a lot of conflated concerns/responsibilities in a single method. it's a little tough to read and would even harder to test. how about trying out a dispatch pattern?

command_dispatch: dict[str, callable] = { "create": self._create_file, "insert": self._insert_string, "undo_edit": self._undo_edit, ... }

we should possibly consider creating a text editor abstraction given the complex interactions we have here. maybe these things would play well with the editor integrations people have made plugins for?

like the suggestion for dispatcher pattern. i opened another PR cause i also batched up the changes for bash and process manager: #191

@lamchau I think this is the approach that claude took and it seems to work well - a smaller number of tools I think is cognitively more pleasant vs LLM needing to decide which tool to use (also lets us put more smarts in the tool vs relying on LLM always to choose the right way) - so I think this general approach feels right?

@michaelneale not sure i follow? i think the logic itself is sound just tidying it up

salman1993 · 2024-10-25T01:30:59Z

changes from this PR are in this one: #191

salman1993 added 3 commits October 23, 2024 12:44

Parse optional params in tools from docstring

1b1b8f9

Add docstring support for Literal -> jsonschema enum

da4f54c

Create a text editor tool collection, with multiple commands

adff4a6

salman1993 requested a review from baxen October 23, 2024 17:28

salman1993 changed the title ~~Create a text editor tool with multiple commands~~ feat: create a text editor tool with multiple commands Oct 23, 2024

salman1993 added 2 commits October 23, 2024 13:44

Fix ruff checks - docstring lines were too long

8e22654

Use text_editor toolkit for tests

adaaf16

lamchau reviewed Oct 23, 2024

View reviewed changes

michaelneale approved these changes Oct 24, 2024

View reviewed changes

lamchau reviewed Oct 24, 2024

View reviewed changes

Merge changes for jsonschema conversion

128d097

salman1993 mentioned this pull request Oct 24, 2024

feat: reduce tool entrypoints in synopsis for text editor, bash, process manager #191

Merged

salman1993 closed this Oct 25, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: create a text editor tool with multiple commands #186

feat: create a text editor tool with multiple commands #186

salman1993 commented Oct 23, 2024 •

edited

Loading

lamchau Oct 23, 2024 •

edited

Loading

michaelneale commented Oct 23, 2024

michaelneale commented Oct 24, 2024

michaelneale commented Oct 24, 2024

michaelneale commented Oct 24, 2024

lamchau Oct 24, 2024 •

edited

Loading

lamchau Oct 24, 2024 •

edited

Loading

lamchau Oct 24, 2024

salman1993 Oct 24, 2024

michaelneale Oct 24, 2024

lamchau Oct 24, 2024

salman1993 commented Oct 25, 2024

feat: create a text editor tool with multiple commands #186

feat: create a text editor tool with multiple commands #186

Conversation

salman1993 commented Oct 23, 2024 • edited Loading

lamchau Oct 23, 2024 • edited Loading

Choose a reason for hiding this comment

michaelneale commented Oct 23, 2024

michaelneale commented Oct 24, 2024

michaelneale commented Oct 24, 2024

michaelneale commented Oct 24, 2024

lamchau Oct 24, 2024 • edited Loading

Choose a reason for hiding this comment

lamchau Oct 24, 2024 • edited Loading

Choose a reason for hiding this comment

lamchau Oct 24, 2024

Choose a reason for hiding this comment

salman1993 Oct 24, 2024

Choose a reason for hiding this comment

michaelneale Oct 24, 2024

Choose a reason for hiding this comment

lamchau Oct 24, 2024

Choose a reason for hiding this comment

salman1993 commented Oct 25, 2024

salman1993 commented Oct 23, 2024 •

edited

Loading

lamchau Oct 23, 2024 •

edited

Loading

lamchau Oct 24, 2024 •

edited

Loading

lamchau Oct 24, 2024 •

edited

Loading