understand-image-minimax
understand-image-minimax (verified): a file and media tool with Minimax Coding Plan VLM API support.
Install
Claude Code
Add to .claude/skills/
About This Skill
# Understand Image
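Analyzes image content using the Minimax Coding Plan VLM API. The API key is read from the environment variable `MINIMAX_API_KEY`.
Always use this skill when the user sends an image or asks about the content of an image.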
Usage
```bash
node {baseDir}/scripts/understand.cjs "your question" "image URL or local path"
```
Examples
```bash
# Analyze a remote image
node {baseDir}/scripts/understand.cjs "Describe this image" "https://example.com/photo.jpg"

# Analyze a local image
node {baseDir}/scripts/understand.cjs "What is in this image?" "/Users/xbos1314/Downloads/image.png"

# Ask a specific question
node {baseDir}/scripts/understand.cjs "How many people are in the image?" "https://example.com/group.jpg"
```
Data Sources
- The API key is read from an environment variable:
  - Environment variable: `MINIMAX_API_KEY`
- API host: fixed to `https://api.minimaxi.com`
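As a rough illustration of the configuration described above, the sketch below reads the key from `MINIMAX_API_KEY` and pairs it with the fixed host. The function name `loadConfig` and the error message are hypothetical. The actual `scripts/understand.cjs` may handle this differently.

```javascript
// Hypothetical sketch: not taken from scripts/understand.cjs.
// The host is fixed, as stated in the data sources list.
const API_HOST = "https://api.minimaxi.com";

// Reads the API key from the given environment object (defaults to process.env).
// Throws early with a readable message when the key is missing or blank.
function loadConfig(env = process.env) {
  const apiKey = (env.MINIMAX_API_KEY || "").trim();
  if (!apiKey) {
    throw new Error("MINIMAX_API_KEY is not set; export it before running the script");
  }
  return { apiKey, apiHost: API_HOST };
}
```

Failing before any request is made keeps a missing key from surfacing later as a less obvious authentication error.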
Supported Image Formats
- JPEG (.jpg, .jpeg)
- PNG (.png)
- GIF (.gif)
- WebP (.webp)
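The format list above can be checked before a file is sent. The following is a minimal sketch that maps a file name's extension to a MIME type, assuming the skill distinguishes formats by extension (the source does not say how it detects them). The names `MIME_BY_EXTENSION` and `mimeTypeFor` are hypothetical, and URLs with query strings are not handled.

```javascript
// Hypothetical sketch: extension-to-MIME lookup for the four listed formats.
const MIME_BY_EXTENSION = {
  ".jpg": "image/jpeg",
  ".jpeg": "image/jpeg",
  ".png": "image/png",
  ".gif": "image/gif",
  ".webp": "image/webp",
};

// Returns the MIME type for a supported file name, or null otherwise.
// The comparison is case-insensitive, so "PHOTO.JPG" is accepted.
function mimeTypeFor(fileName) {
  const dot = fileName.lastIndexOf(".");
  if (dot === -1) return null; // no extension at all
  const ext = fileName.slice(dot).toLowerCase();
  return MIME_BY_EXTENSION[ext] ?? null;
}
```

For example, `mimeTypeFor("image.png")` yields `"image/png"`, while an unlisted extension such as `.bmp` yields `null`.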
Supported Inputs
- Remote images: pass an HTTP/HTTPS URL directly
- Local images: pass an absolute or relative path
- Base64: `data:` URLs are supported
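The three input kinds above can be told apart by their prefix. The sketch below shows one way to do so, assuming anything that is neither a `data:` URL nor an HTTP/HTTPS URL is treated as a local path. The function name `classifyImageInput` and its return labels are hypothetical and do not come from the skill's script.

```javascript
// Hypothetical sketch: classify the second CLI argument by its prefix.
function classifyImageInput(input) {
  if (/^data:/i.test(input)) return "data-url"; // Base64 data: URL
  if (/^https?:\/\//i.test(input)) return "remote-url"; // HTTP or HTTPS URL
  return "local-path"; // absolute or relative file path
}
```

For example, `classifyImageInput("https://example.com/photo.jpg")` returns `"remote-url"`, and `classifyImageInput("./image.png")` returns `"local-path"`.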
Use Cases
- Analyze images using the Minimax VLM (Vision Language Model) API
- Extract text, objects, and scene descriptions from images programmatically
- Build image understanding pipelines with the Minimax Coding Plan
- Answer questions about image content using multimodal AI capabilities
- Integrate visual AI analysis into automated content processing workflows
Pros & Cons
Pros
- Provides access to Minimax's VLM API for image understanding tasks
- Useful for automated image analysis in content moderation or cataloging
- MCP-compatible integration for AI agent workflows
Cons
- Requires Minimax API access and credentials
- Minimal documentation on API limits (supported image formats are listed, but size and rate limits are not)
Frequently Asked Questions
What does understand-image-minimax do?
understand-image-minimax is a file and media tool that analyzes images through the Minimax Coding Plan VLM API.
What platforms support understand-image-minimax?
understand-image-minimax is available on Claude Code and OpenClaw.
What are the use cases for understand-image-minimax?
Analyze images using the Minimax VLM (Vision Language Model) API. Extract text, objects, and scene descriptions from images programmatically. Build image understanding pipelines with the Minimax Coding Plan.