Skip to content
This repository has been archived by the owner on Jun 9, 2024. It is now read-only.

Update Auto-GPT score #106

Merged

Conversation

waynehamadi
Copy link
Contributor

@waynehamadi waynehamadi commented Jul 15, 2023

Auto-GPT Benchmark

A repo built for the purpose of benchmarking the performance of agents far and wide, regardless of how they are set up and how they work

Scores:

Radio chart for each agent coming soon !

Detailed results

⚠️ These results are constantly evolving at the moment. We will publish an official benchmark result very soon.

Interface

Task Auto-GPT gpt-engineer mini-agi smol-developer
Write File tbd
Read File tbd
Search File tbd

Code

Task Auto-GPT gpt-engineer mini-agi smol-developer
Debug Simple Typo With Guidance tbd
Debug Simple Typo Without Guidance tbd
Basic Code Generation tbd
Create Simple Web Server tbd

Memory

Task Auto-GPT
Basic Memory
Remember Multiple Ids
Remember Multiple Ids With Noise
Remember Multiple Phrases With Noise

Signed-off-by: Merwane Hamadi <merwanehamadi@gmail.com>
Copy link

@ai-maintainer ai-maintainer bot left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

AI-Maintainer Review for PR - Update Auto-GPT score

Title and Description 😐

Needs more context The title of the pull request is clear and indicates that the changes are related to updating the Auto-GPT score. However, the description lacks detail and context. It would be beneficial for the author to provide more information about the rationale behind these changes, any related discussions or issues, and how these changes align with the project's overall direction.

Scope of Changes 👍

Narrowly focused The changes in this pull request are narrowly focused. The diff shows that the changes are limited to updating the status of various tasks for different agents in a table in the README.md file. There are no unrelated or "extra" changes, indicating that the author is focused on resolving a specific issue.

Testing 😐

Testing details missing The description does not provide information about how the author tested the changes. While the code quality checklist indicates that linters and code formatting tools were run, there is no mention of any specific tests or test cases that were executed to verify the correctness and functionality of the changes. It would be helpful for the author to include details about the testing approach and any relevant test results.

Suggested Changes

  • Please provide more context in the description about the rationale behind these changes, any related discussions or issues, and how these changes align with the project's overall direction.
  • Include details about how you tested the changes. This could include specific tests or test cases that were executed, the testing approach used, and any relevant test results.

Reviewed with AI Maintainer

@waynehamadi waynehamadi merged commit dab4e90 into Significant-Gravitas:master Jul 15, 2023
Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

1 participant