Visual Analysis
Visual analysis enables the AI to examine and describe images in your Zettelkasten.
Setup
Requirements
- Visual model configured - A model capable of image analysis
- Images in your vault - PNG, JPEG, GIF, WebP, or BMP (see Supported Formats)
Configuring a Visual Model
During setup, when asked whether to configure a visual model, type y and select a vision-capable model:
For Ollama (2025 Models):
High Quality (36-48GB+ RAM):
- gemma3:27b - Recommended - Excellent vision, efficient resource use
- qwen3-vl:32b - Most powerful Qwen vision model
- llama3.2-vision:90b - Alternative (requires 64GB+ RAM)
Balanced (16-32GB RAM):
- gemma3:12b - Recommended - Good quality, reasonable resources
- qwen3-vl:8b - Fast and capable
- llama3.2-vision:11b - Alternative balanced option
Lightweight (8-16GB RAM):
- gemma3:4b - Fast inference with vision
- llava:7b - Fallback for very limited resources
- llava-phi3 - Minimal resource requirements
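The RAM tiers above can be expressed as a small lookup. This is a minimal sketch, not part of the tool: `pick_tier` is a hypothetical helper, and the thresholds are the lower bounds from the tier headings. The headings leave 32-36 GB unassigned, so the sketch assumes that range falls under Balanced.

```python
from typing import Optional

# Lower RAM bound (in GB) for each tier, taken from the tier headings above.
# Ordered from most to least demanding.
TIERS = [
    (36, "High Quality"),
    (16, "Balanced"),
    (8, "Lightweight"),
]


def pick_tier(ram_gb: float) -> Optional[str]:
    """Return the most capable tier whose lower bound fits in ram_gb."""
    for min_gb, name in TIERS:
        if ram_gb >= min_gb:
            return name
    return None  # below 8 GB: none of the listed tiers applies


print(pick_tier(24))  # prints "Balanced"
```

Within a tier, the entry marked Recommended is the natural first choice; the others are alternatives if it does not suit your hardware.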
For OpenAI:
- gpt-4o - Latest multimodal GPT-4
- gpt-5 - State-of-the-art multimodal (if available)
Using Visual Analysis
Analyzing Images
Simply ask about an image:
You: What's in the image at diagrams/architecture.png?
AI: Using tool: analyze_image
Path: diagrams/architecture.png
The image shows a system architecture diagram depicting three main layers:
1. Frontend - React-based web application
2. Backend API - Node.js services
3. Database - PostgreSQL with Redis cache
...
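How `analyze_image` is implemented is not documented here. As a rough illustration of what a tool like it might do with an Ollama backend, the sketch below sends an image and a question straight to a local Ollama server through its `/api/generate` endpoint, which accepts base64-encoded images in an `images` list. The helper names `build_payload` and `ask_about_image` are hypothetical, and the URL assumes Ollama's default local port.

```python
import base64
import json
import urllib.request

# Default local Ollama endpoint (assumption: adjust if your server runs elsewhere).
OLLAMA_URL = "http://localhost:11434/api/generate"


def build_payload(image_bytes: bytes, question: str, model: str) -> dict:
    """Build a non-streaming request body with one base64-encoded image."""
    return {
        "model": model,
        "prompt": question,
        "images": [base64.b64encode(image_bytes).decode("ascii")],
        "stream": False,
    }


def ask_about_image(path: str, question: str, model: str) -> str:
    """Send the image and question to Ollama and return the text reply."""
    with open(path, "rb") as f:
        payload = build_payload(f.read(), question, model)
    request = urllib.request.Request(
        OLLAMA_URL,
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
    )
    # Supplying data makes this a POST request.
    with urllib.request.urlopen(request) as response:
        return json.loads(response.read())["response"]
```

With a server running and a vision model pulled, a call such as `ask_about_image("diagrams/architecture.png", "What's in this image?", "llava:7b")` would return the model's description as a string.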
Common Use Cases
Diagram Analysis
Photo Description
Chart Interpretation
Text Extraction
Supported Formats
- PNG (.png)
- JPEG (.jpg, .jpeg)
- GIF (.gif)
- WebP (.webp)
- BMP (.bmp)
Image Paths
Images can be referenced by a path relative to the vault root, including paths into subdirectories (for example, diagrams/architecture.png).
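To make vault-relative paths concrete, here is a minimal sketch of how such a path could be resolved to a location on disk. It is an illustration only, not the tool's actual logic: `resolve_in_vault` is a hypothetical helper, and rejecting paths that leave the vault is an assumption about sensible behavior.

```python
from pathlib import Path


def resolve_in_vault(vault_root: str, image_path: str) -> Path:
    """Resolve image_path against vault_root, rejecting paths that escape it."""
    root = Path(vault_root).resolve()
    candidate = (root / image_path).resolve()
    if not candidate.is_relative_to(root):  # requires Python 3.9+
        raise ValueError(f"{image_path!r} is outside the vault")
    return candidate
```

For a vault at `/home/me/vault`, `resolve_in_vault("/home/me/vault", "diagrams/architecture.png")` yields `/home/me/vault/diagrams/architecture.png`, while `"../secrets.png"` raises `ValueError`.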
Best Practices
- Be specific - Ask clear questions about what you want to know
- Good organization - Keep images in logical folders
- Descriptive names - Use meaningful filenames
- Ask follow-ups - Request more details if needed
Example Workflow
You: What diagrams do I have in my vault?
AI: [Lists image files]
Found 5 diagrams:
- architecture.png
- dataflow.png
- ui-mockup.png
...
You: Describe the architecture diagram
AI: Using tool: analyze_image
[Provides detailed description]
You: How does this relate to my implementation notes?
AI: Using tool: find_documents
Query: architecture implementation
[Finds related documents and explains connections]
Limitations
- Model dependent - Quality varies by model
- Complex images - May struggle with very detailed diagrams
- Text readability - Small text may not be readable
- File size - Large images may take longer to process
Troubleshooting
No Visual Model Available
Error: "Visual analysis not configured"
Solution: Configure a vision-capable model as described in Configuring a Visual Model.
Image Not Found
Error: "Image not found at path"
Check:
- File exists in vault
- Path is correct (relative to vault root)
- File extension is supported
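The three checks above can also be run outside the tool. The following is a minimal sketch under stated assumptions: `diagnose` is a hypothetical helper, and the extension set is copied from the Supported Formats list rather than read from the tool itself.

```python
from pathlib import Path

# Extensions copied from the Supported Formats list (assumed to be complete).
SUPPORTED = {".png", ".jpg", ".jpeg", ".gif", ".webp", ".bmp"}


def diagnose(vault_root: str, image_path: str) -> list[str]:
    """Return human-readable problems; an empty list means all checks pass."""
    problems = []
    root = Path(vault_root).resolve()
    full = (root / image_path).resolve()
    if not full.is_relative_to(root):  # requires Python 3.9+
        problems.append("path points outside the vault root")
    elif not full.is_file():
        problems.append("file does not exist in the vault")
    if full.suffix.lower() not in SUPPORTED:
        problems.append(f"unsupported extension: {full.suffix or '(none)'}")
    return problems
```

An empty result suggests the path itself is fine, so the problem more likely lies in how the path was typed in the conversation or in the tool's configuration.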
Poor Results
Try:
- Use a different visual model
- Ensure image quality is good
- Ask more specific questions
- Describe what you're looking for
See Also
- Available Tools - All tool capabilities
- Model Selection - Choosing models