mirror of https://github.com/aljazceru/Auto-GPT.git synced 2025-12-17 22:14:28 +01:00

Files

Krzysztof Czerwinski a74548d3cd feat(agent): Component-based Agents (#7054 )

This incremental re-architecture unifies Agent code and plugins, so everything is component-based.

## Breaking changes

- Removed command categories and `DISABLED_COMMAND_CATEGORIES` environment variable. Use `DISABLED_COMMANDS` environment variable to disable individual commands.
- Changed `command` decorator; old-style commands are no longer supported. Implement `CommandProvider` on components instead.
- Removed `CommandRegistry`, now all commands are provided by components implementing `CommandProvider`.
- Removed `prompt_config` from `AgentSettings`.
- Removed plugin support: old plugins will no longer be loaded and executed.
- Removed `PromptScratchpad`, it was used by plugins and is no longer needed.
- Changed `ThoughtProcessOutput` from tuple to pydantic `BaseModel`.

## Other changes

- Created `AgentComponent`, protocols and logic to execute them.
- `BaseAgent` and `Agent` is now composed of components.
- Moved some logic from `BaseAgent` to `Agent`.
- Moved agent features and commands to components.
- Removed check if the same operation is about to be executed twice in a row.
- Removed file logging from `FileManagerComponent` (formerly `AgentFileManagerMixin`)
- Updated tests
- Added docs

See [Introduction](https://github.com/kcze/AutoGPT/blob/kpczerwinski/open-440-modular-agents/docs/content/AutoGPT/component%20agent/introduction.md) for more information.

2024-04-22 19:20:01 +02:00

3.1 KiB

Raw Blame History

Built-in Components

This page lists all 🧩 Components and ⚙️ Protocols they implement that are natively provided. They are used by the AutoGPT agent.

`SystemComponent`

Essential component to allow an agent to finish.

DirectiveProvider

Constraints about API budget

MessageProvider

Current time and date
Remaining API budget and warnings if budget is low

CommandProvider

finish used when task is completed

`UserInteractionComponent`

Adds ability to interact with user in CLI.

CommandProvider

ask_user used to ask user for input

`FileManagerComponent`

Adds ability to read and write persistent files to local storage, Google Cloud Storage or Amazon's S3. Necessary for saving and loading agent's state (preserving session).

DirectiveProvider

Resource information that it's possible to read and write files

CommandProvider

read_file used to read file
write_file used to write file
list_folder lists all files in a folder

`CodeExecutorComponent`

Lets the agent execute non-interactive Shell commands and Python code. Python execution works only if Docker is available.

CommandProvider

execute_shell execute shell command
execute_shell_popen execute shell command with popen
execute_python_code execute Python code
execute_python_file execute Python file

`EventHistoryComponent`

Keeps track of agent's actions and their outcomes. Provides their summary to the prompt.

MessageProvider

Agent's progress summary

AfterParse

ExecutionFailuer

Rewinds the agent's action, so it isn't saved

AfterExecute

Saves the agent's action result in the history

`GitOperationsComponent`

CommandProvider

clone_repository used to clone a git repository

`ImageGeneratorComponent`

Adds ability to generate images using various providers, see Image Generation configuration to learn more.

CommandProvider

generate_image used to generate an image given a prompt

`WebSearchComponent`

Allows agent to search the web.

DirectiveProvider

Resource information that it's possible to search the web

CommandProvider

search_web used to search the web using DuckDuckGo
google used to search the web using Google, requires API key

`WebSeleniumComponent`

Allows agent to read websites using Selenium.

DirectiveProvider

Resource information that it's possible to read websites

CommandProvider

read_website used to read a specific url and look for specific topics or answer a question

`ContextComponent`

Adds ability to keep up-to-date file and folder content in the prompt.

MessageProvider

Content of elements in the context

CommandProvider

open_file used to open a file into context
open_folder used to open a folder into context
close_context_item remove an item from the context

`WatchdogComponent`

Watches if agent is looping and switches to smart mode if necessary.

AfterParse

Investigates what happened and switches to smart mode if necessary

3.1 KiB Raw Blame History

Built-in Components

SystemComponent

UserInteractionComponent

FileManagerComponent

CodeExecutorComponent

EventHistoryComponent

GitOperationsComponent

ImageGeneratorComponent

WebSearchComponent

WebSeleniumComponent

ContextComponent

WatchdogComponent

3.1 KiB

Raw Blame History

`SystemComponent`

`UserInteractionComponent`

`FileManagerComponent`

`CodeExecutorComponent`

`EventHistoryComponent`

`GitOperationsComponent`

`ImageGeneratorComponent`

`WebSearchComponent`

`WebSeleniumComponent`

`ContextComponent`

`WatchdogComponent`