v1.1.0
New Features
🔍 look_at Tool & multimodal-looker Agent
- Add
look_attool for analyzing media files (PDFs, images, diagrams) - Add
multimodal-lookeragent using Gemini 2.5 Flash model - Analyze visual content in separate context window, saving main conversation tokens
Inspired by Sourcegraph Ampcode's look_at tool design.
Usage
look_at({
file_path: "/path/to/diagram.png",
goal: "Describe the architecture components"
})Supported File Types
- PDF documents
- Images (PNG, JPG)
- Screenshots
- Architecture diagrams
- Charts/Graphs