AI-Powered Object Detection with Google Gemini: Visually Identify & Highlight Items
detail.loadingPreview
This workflow harnesses the power of Google's Gemini AI to automatically detect and highlight specific objects within an image. It retrieves an image, sends it to Gemini for object detection, and then visually draws bounding boxes around the identified items, providing instant visual insights.
About This Workflow
Unlock advanced computer vision capabilities effortlessly with this n8n workflow. It integrates directly with Google's cutting-edge Gemini 2.0 Flash API to perform sophisticated object detection. Simply provide an image and specify what you want to find, and Gemini will return precise bounding box coordinates. The workflow then intelligently scales these coordinates to the image's original dimensions and uses n8n's built-in image editing tools to draw clear, visible bounding boxes directly onto the image. This automation streamlines visual analysis, making complex AI tasks accessible without writing a single line of machine learning code.
Key Features
- Google Gemini Integration: Leverage the powerful Gemini 2.0 Flash API for state-of-the-art object detection.
- Automated Image Analysis: Automatically fetch images and process them through AI for specific object identification.
- Dynamic Bounding Box Visualization: Precisely scale AI-detected coordinates and draw visual bounding boxes on the original image.
- Customizable Object Prompts: Easily modify the AI prompt to detect any desired objects within your images.
- No-Code Computer Vision: Implement complex AI vision tasks using n8n's intuitive visual builder.
How To Use
- Initiate Workflow: Use the
When clicking ‘Test workflow’node for manual testing, or replace it with your desired trigger (e.g., webhook for image uploads, scheduled polling). - Provide Image: Configure the
Get Test ImageHTTP Request node to fetch your target image. This can be an external URL, a file from a cloud storage, or an input from a previous node. - Configure Gemini API: Ensure the
Gemini 2.0 Object Detectionnode is set up with valid Google Gemini (PaLM) API credentials. - Define Detection Prompt: Edit the
jsonBodyin theGemini 2.0 Object Detectionnode to specify the objects you want to detect (e.g., "rabbits," "cars," "faces"). - Run & Review: Execute the workflow to see the processed image with bounding boxes drawn around the detected objects. The output will contain the original image with the visual highlights.
Apps Used
Workflow JSON
{
"id": "c42b0c70-6af9-4fd0-9fb1-a9c87eeede22",
"name": "AI-Powered Object Detection with Google Gemini: Visually Identify & Highlight Items",
"nodes": 20,
"category": "Operations",
"status": "active",
"version": "1.0.0"
}Note: This is a sample preview. The full workflow JSON contains node configurations, credentials placeholders, and execution logic.
Get This Workflow
ID: c42b0c70-6af9...
About the Author
N8N_Community_Pick
Curator
Hand-picked high quality workflows from the global community.
Statistics
Related Workflows
Discover more workflows you might like
Instant WooCommerce Order Notifications via Telegram
When a new order is placed on your WooCommerce store, instantly receive detailed notifications directly to your Telegram chat. Stay on top of your e-commerce operations with real-time alerts, including order specifics and a direct link to view the order.
On-Demand Microsoft SQL Query Execution
This workflow allows you to manually trigger and execute any SQL query against your Microsoft SQL Server database. Perfect for ad-hoc data lookups, administrative tasks, or quick tests, giving you direct control over your database operations.
Automate Getty Images Editorial Search & CMS Integration
This n8n workflow automates searching for editorial images on Getty Images, extracts key details and embed codes, and prepares them for seamless integration into your Content Management System (CMS), streamlining your content creation process.