diff --git a/README.md b/README.md index f7adc42..0c065e0 100644 --- a/README.md +++ b/README.md @@ -110,7 +110,7 @@ Check out the [Budget Manual](https://github.com/n3d1117/chatgpt-telegram-bot/di | `IMAGE_QUALITY` | Quality of DALL·E images, only available for `dall-e-3`-model. Possible options: `standard` or `hd`, beware of [pricing differences](https://openai.com/pricing#image-models). | `standard` | | `IMAGE_STYLE` | Style for DALL·E image generation, only available for `dall-e-3`-model. Possible options: `vivid` or `natural`. Check availbe styles [here](https://platform.openai.com/docs/api-reference/images/create). | `vivid` | | `IMAGE_SIZE` | The DALL·E generated image size. Must be `256x256`, `512x512`, or `1024x1024` for dall-e-2. Must be `1024x1024` for dall-e-3 models. | `512x512` | -| `VISION_DETAIL` | The detail parameter for vision models, explained [Vision Guide](https://platform.openai.com/docs/guides/vision). Allowed values: `low` or `high` | `low` | +| `VISION_DETAIL` | The detail parameter for vision models, explained [Vision Guide](https://platform.openai.com/docs/guides/vision). Allowed values: `low` or `high` | `auto` | | `GROUP_TRIGGER_KEYWORD` | If set, the bot in group chats will only respond to messages that start with this keyword | - | | `IGNORE_GROUP_TRANSCRIPTIONS` | If set to true, the bot will not process transcriptions in group chats | `true` | | `IGNORE_GROUP_VISION` | If set to true, the bot will not process vision queries in group chats | `true` | diff --git a/bot/main.py b/bot/main.py index 75c86fb..8c87925 100644 --- a/bot/main.py +++ b/bot/main.py @@ -54,7 +54,7 @@ def main(): 'show_plugins_used': os.environ.get('SHOW_PLUGINS_USED', 'false').lower() == 'true', 'whisper_prompt': os.environ.get('WHISPER_PROMPT', ''), 'vision_prompt': os.environ.get('VISION_PROMPT', 'What is in this image'), - 'vision_detail': os.environ.get('VISION_DETAIL', 'low'), + 'vision_detail': os.environ.get('VISION_DETAIL', 'auto'), 'vision_max_tokens': int(os.environ.get('VISION_MAX_TOKENS', '300')), 'tts_model': os.environ.get('TTS_MODEL', 'tts-1'), 'tts_voice': os.environ.get('TTS_VOICE', 'alloy'),