Update Auto-GPT score #106

waynehamadi · 2023-07-15T16:24:23Z

Auto-GPT Benchmark

A repo for benchmarking the performance of agents, regardless of how they are set up and how they work.

Scores:

Radar chart for each agent coming soon!

Detailed results

⚠️ These results are constantly evolving at the moment. We will publish an official benchmark result very soon.

Interface

Task	Auto-GPT	gpt-engineer	mini-agi	smol-developer
Write File	❌	✅	tbd	✅
Read File	❌	❌	tbd	❌
Search File	❌	❌	tbd	❌

Code

Task	Auto-GPT	gpt-engineer	mini-agi	smol-developer
Debug Simple Typo With Guidance	❌	❌	tbd	❌
Debug Simple Typo Without Guidance	❌	❌	tbd	❌
Basic Code Generation	❌	✅	tbd	✅
Create Simple Web Server	❌	❌	tbd	❌

Memory

Task	Auto-GPT
Basic Memory	❌
Remember Multiple Ids	❌
Remember Multiple Ids With Noise	❌
Remember Multiple Phrases With Noise	❌

Signed-off-by: Merwane Hamadi <merwanehamadi@gmail.com>

ai-maintainer

AI-Maintainer Review for PR - Update Auto-GPT score

Title and Description 😐

Needs more context

The title of the pull request is clear and indicates that the changes are related to updating the Auto-GPT score. However, the description lacks detail and context. It would be beneficial for the author to provide more information about the rationale behind these changes, any related discussions or issues, and how these changes align with the project's overall direction.

Scope of Changes 👍

Narrowly focused

The changes in this pull request are narrowly focused. The diff shows that the changes are limited to updating the status of various tasks for different agents in a table in the README.md file. There are no unrelated or "extra" changes, indicating that the author is focused on resolving a specific issue.

Testing 😐

Testing details missing

The description does not provide information about how the author tested the changes. While the code quality checklist indicates that linters and code formatting tools were run, there is no mention of any specific tests or test cases that were executed to verify the correctness and functionality of the changes. It would be helpful for the author to include details about the testing approach and any relevant test results.

Suggested Changes

Please provide more context in the description about the rationale behind these changes, any related discussions or issues, and how these changes align with the project's overall direction.
Include details about how you tested the changes. This could include specific tests or test cases that were executed, the testing approach used, and any relevant test results.

Reviewed with AI Maintainer

Update Auto-GPT score

87889cc

Signed-off-by: Merwane Hamadi <merwanehamadi@gmail.com>

ai-maintainer bot reviewed Jul 15, 2023


Merge branch 'master' into update-auto-gpt-score

c4f1835

waynehamadi merged commit dab4e90 into Significant-Gravitas:master Jul 15, 2023
