An interactive multimodal application that allows you to upload or capture screenshots of images and ask intelligent questions about them using Gemini 2.0 Flash.
- Upload or capture screenshots of any window
- Intelligent image analysis using Google’s Gemini 2.0 Flash
- Streamlit UI for easy interaction
- Supports custom queries and AI-generated insights
Upload a screenshot of a chart or window
Ask questions like:
"What does this chart represent?" "Summarize the image contents" "Identify key trends or values"
🧠Powered By
Google Gemini 2.0 Flash via google-genai Streamlit Pillow, PyAutoGUI, PyGetWindow