Make Auto-GPT aware of its running cost (#762)

* Implemented running cost counter for chat completions This data is known to the AI as additional system context, and is printed out to the user * Added comments to api_manager.py * Added user-defined API budget. The user is now prompted if they want to give the AI a budget for API calls. If they enter nothing, there is no monetary limit, but if they define a budget then the AI will be told to shut down gracefully once it has come within 1 cent of its limit, and to shut down immediately once it has exceeded its limit. If a budget is defined, Auto-GPT is always aware of how much it was given and how much remains to be spent. * Chat completion calls are now done through api_manager. Total running cost is printed. * Implemented api budget setting and tracking User can now configure a maximum api budget, and the AI is aware of that and its remaining budget. The AI is instructed to shut down when exceeding the budget. * Update autogpt/api_manager.py Change "per token" to "per 1000 tokens" in a comment on the api cost Co-authored-by: Rob Luke <code@robertluke.net> * Fixed lint errors * Include embedding costs * Add embedding completion cost * lint * Added 'requires_api_key' decorator to test_commands.py, switched to a valid chat completions model * Refactor API manager, add debug mode, and add tests - Extract model costs to to avoid duplication - Add debug mode parameter to ApiManager class - Move debug mode configuration to - Log AI response and budget messages in debug mode - Implement 'test_api_manager.py' * Fixed test_setup failing. An extra user input is needed for api budget * Linting --------- Co-authored-by: Rob Luke <code@robertluke.net> Co-authored-by: Nicholas Tindle <nick@ntindle.com>
2025-12-22 00:14:23 +01:00 · 2023-04-23 14:04:31 -07:00
parent bf895eb656
commit d6ef9d1b5d
11 changed files with 401 additions and 30 deletions
--- a/autogpt/llm_utils.py
+++ b/autogpt/llm_utils.py
@@ -7,6 +7,7 @@ import openai
 from colorama import Fore, Style
 from openai.error import APIError, RateLimitError

+from autogpt.api_manager import api_manager
 from autogpt.config import Config
 from autogpt.logs import logger
 from autogpt.types.openai import Message
@@ -96,7 +97,7 @@ def create_chat_completion(
        backoff = 2 ** (attempt + 2)
        try:
            if CFG.use_azure:
-                response = openai.ChatCompletion.create(
+                response = api_manager.create_chat_completion(
                    deployment_id=CFG.get_azure_deployment_id_for_model(model),
                    model=model,
                    messages=messages,
@@ -104,7 +105,7 @@ def create_chat_completion(
                    max_tokens=max_tokens,
                )
            else:
-                response = openai.ChatCompletion.create(
+                response = api_manager.create_chat_completion(
                    model=model,
                    messages=messages,
                    temperature=temperature,
@@ -159,17 +160,9 @@ def create_embedding_with_ada(text) -> list:
    for attempt in range(num_retries):
        backoff = 2 ** (attempt + 2)
        try:
-            if CFG.use_azure:
-                return openai.Embedding.create(
-                    input=[text],
-                    engine=CFG.get_azure_deployment_id_for_model(
-                        "text-embedding-ada-002"
-                    ),
-                )["data"][0]["embedding"]
-            else:
-                return openai.Embedding.create(
-                    input=[text], model="text-embedding-ada-002"
-                )["data"][0]["embedding"]
+            return api_manager.embedding_create(
+                text_list=[text], model="text-embedding-ada-002"
+            )
        except RateLimitError:
            pass
        except APIError as e: