TRAE SOLO不支持Minimax3的多模态,无法识别图片中的文字

你的运行环境(版本号「点击帮助-关于-复制」):

TRAE SOLO CN
版本: 0.1.15
提交: 18a1ac5cc57753e4f248b4095f404def974ec17b
日期: 2026-05-28T12:48:15.990Z
Electron: 39.2.7-release.1.37.3 (aha)
Node.js: 22.21.1
V8: 14.2.231.26-electron.0
OS: Windows_NT x64 10.0.26200
构建版本: 2.3.33252
设备ID: 7ab40327ae5c7088c9c76717f68fc81ad66cb27c4a2861db30a653a3d4023bfd
SOLO SDK Version: 1.3.39
Device Id: 3226424300813328

问题描述(你遇到了什么问题?):

服务商Minimax Global
自定义模型(因为目前还没法选择Minimax-M3)
模型ID:MiniMax-M3
高级配置中,选择【多模态】

Minimax-M3官方文档原生支持多模态。
但是在TRAE SOLO,work模式,给他一张图片(无论是粘贴给他,还是从附件选择目录下的文件),都无法识别图片。图片可以是很简单的一张包含文字的图片,识别一下图片中的文字即可。

这是我让他识别的图片。在TRAE SOLO,work模式,指定自定义的模型Minimax M3,无法识别。

使用Auto的模型是可以正确识别的。

使用Claude Code,选择Minimax M3,也是可以识别的。

你好呀!看到你这么详细的反馈,连设备信息和对比测试都准备全了,真的太专业啦,握个手!:handshake:

你反馈的 TRAE SOLO(桌面版)自定义 MiniMax-M3 模型无法识别图片 的情况我已经收到。既然 Auto 模型能识别,说明 SOLO 的图片上传链路是通的,问题大概率出在自定义模型对多模态协议的适配上。

为了帮你快速定位,咱们可以先做这几个小确认:

  1. 功能标识确认:在对话框的模型选择器里,把鼠标悬停在你配置的这个自定义模型名称上,看浮窗里的可用功能列表里有没有显示 “图片理解”?(如果有,说明 SOLO 识别到了你的多模态配置)。
  2. 配置路径确认:你在添加模型时,是选择了预设的 “Minimax Global” 服务商后输入的模型 ID,还是完全通过 “自定义配置” 手动填写的 API 地址?(MiniMax 的多模态接口格式有时与标准 OpenAI 略有差异,这可能影响图片的传递)。
  3. 报错现象:当你发图片给它时,AI 是直接报错(比如弹出错误码),还是回答说“我看不到图片”或“不支持图片”?

:clipboard: 为了让技术同学能进场“抓虫”,麻烦帮我提供一下 AI Trace 信息:

在出问题的那个“无法识别图片”的 AI 对话中,双击 AI 的头像,会自动复制一段 trace 信息,然后粘贴回复在帖子里就好。

有了这个 Trace ID,我们就能后台查看到底是请求格式没对上,还是服务商返回了特定的错误。

期待你的补充,我们一起把这个新模型“驯服”!:blush:

1 个赞

您好,AI Trace ID为:
3528750132365016:202fef5db7e1ecc258cfa4229a13db29_6a1cf5598654671abf671f6a.6a1cf5598654671abf671f6d.6a1cf5598654671abf671f6b:TRAE SOLO CN.0.1.15.no_sid.no_ppe.T(2026/6/1 10:58:33)

1 个赞

感谢您的反馈!我们的智能助手暂时无法处理您的请求,团队成员会尽快回复您。

1 个赞

配置截图可以发一下吗

1 个赞

1 个赞

不识别的表现是什么呢?

1 个赞

表现就是无法识别图片中的文字。:joy:

这是之前 @TRAE宝 让我提供的TRACE_ID里面的一部分内容。

The user is asking me to recognize text in an image. Let me read the uploaded image to see what’s in it.

已读取 1 个文件

Let me try to view the image content properly. The read tool only confirmed the file exists but didn’t show the content. Let me try a different approach - using a tool to read the image binary or use a vision capability. Let me try reading it again with a different approach, or use the browser to view it.

Actually, looking back at the tool description, the Read tool can read image files (PNG, JPG, JPEG, GIF, WEBP). The result just says “An image file has been read” - this is the standard response but I should still be able to see the content. Let me check the actual content again or use another method.

Wait, the result is just “An image file has been read” without showing content. This is unusual. Let me try using a Python script to view the image, or try opening it in the browser.

Actually, looking more carefully, the Read tool returned only that the file has been read but didn’t show the image content. This might mean I need to use a different approach. Let me try using Python with PIL to analyze the image, or use OCR.

Let me try to use Python with an OCR library to read the text from the image.

已执行 1 条命令

The image is 415x162 RGBA. Let me view it using the Read tool again - I should be able to see the image content. Let me try using the browser to view the image, or use OCR.

Actually, the Read tool should be able to show image content. Let me try again - maybe I need to display it. Let me use Python to convert it to base64 and try to view it, or use an OCR tool like pytesseract.

已执行 1 条命令

Tesseract OCR is not installed. Let me try to view the image directly using the Read tool. The previous Read tool call said “An image file has been read” which is the standard response for image files. Let me check the image by trying to display it. The image is small (415x162) so it should be readable.

Let me try to use the browser to view the image, or save it somewhere accessible.

已操作浏览器

Let me try copying the image to a different location and use a local server. Or let me try to use Python to convert the image to base64 and view it.

Actually, the Read tool says it can read image files. The output was just “An image file has been read” - maybe that’s because the result was truncated. Let me try to get the image content again or use a different method.

Let me try installing easyocr or use a different approach.

已执行 1 条命令

Let me try a different approach to install tesseract or use a different OCR method.

已执行 1 条命令

Tesseract is not available. Let me try to read the image using the Read tool which should support image content. Maybe I need to provide an offset or specific parameters. Let me try again with the Read tool. The previous response was just confirming the file was read but not showing content. Let me try with a different file path approach.

Wait, I should be able to view the image content using the Read tool. The “An image file has been read” response is the standard text response, but the actual image content should be visible to me. Let me try copying the file to my workspace and then read it again.

已执行 1 条命令

Let me try to read the image again from the new location.

已读取 1 个文件

The Read tool keeps returning “An image file has been read” without showing the content. This is unusual. Let me try to encode the image to base64 and then look at it, or use a different approach.

Let me try installing paddleocr or use another OCR method, or try to start a local server to view the image.

1 个赞

是不是要适配?应该是程序的问题

1 个赞

是的,我理解也是程序问题。

不知道客服是不是机器人。

如上描述,
先是有个客服找我要了TRACE_ID;然后有个客服问我,不能识别的表现是什么?(关键还打了个错字——“不是别的表现是什么呢?”)

晕啊,这就好比问我,你手机没电的表现是什么?:joy:

1 个赞

不识别的表现可能是“乱回答”,也可能是直接回复“无法识别”,需要确认下具体现象,“乱回答”并不是唯一值。

1 个赞

另外已经在另一个帖子跟进了,这个帖子就不赘述了哈。

1 个赞

对,另外一个帖子和这个是重复的。不过那个帖子没有做任何处理,并且也已经关闭了。
这个帖子问题没有解决,也不赘述了,是就不解决了吗?

那个帖子说反馈到bug区。这里不就是bug反馈吗?你们内部不能去协调吗?还要重新再发一次吗?

1 个赞

如果不解决了。

或者无法协调。

或者解决不了,您给我留言就行。

谢谢你们团队了,辛苦。我去抖音上看看能不能解决一下。

1 个赞

已经修复完毕,可以在重试下。看看还有问题吗

1 个赞

已解决,谢谢

2 个赞