Extract Text from PDF and Image using Vertex AI (Gemini) into CSV
detail.loadingPreview
Extracts text from PDF and image files in Google Drive and converts it into a CSV format using Vertex AI (Gemini).
🚀Ready to Deploy This Workflow?
About This Workflow
Overview
This n8n workflow automates the extraction of text content from PDF documents and image files stored in Google Drive. It utilizes Google's Vertex AI (specifically Gemini) to process the files and extract relevant text. The extracted text is then converted into a structured CSV format and uploaded back to Google Drive.
Key Features
- Triggers on new file creation in a specified Google Drive folder.
- Routes files based on MIME type (PDF or image).
- Downloads PDF or image files from Google Drive.
- Extracts text from downloaded PDF files.
- Utilizes Vertex AI (Gemini) for text extraction and potentially structuring from images (though the provided snippet focuses on PDF extraction for AI processing).
- Converts extracted text data into CSV format.
- Uploads the generated CSV files to a specified Google Drive folder.
- Supports different processing paths for PDFs and images.
How To Use
- Google Drive Setup: Create a dedicated folder in Google Drive for uploading your PDF and image files. Share this folder with the provided Google Service Account email address.
- Google Credentials: Configure your Google Service Account credentials in n8n.
- Vertex AI Credentials: Set up your Google Gemini (PaLM) API credentials in n8n.
- Environment Variables: Ensure necessary environment variables like
API_BASE_URLare configured. - Workflow Trigger: Upload PDF or image files to the designated Google Drive folder to initiate the workflow.
- AI Prompting: Customize the
jsonBodyin the HTTP Request node to refine the AI's understanding and desired output format.
Apps Used
Workflow JSON
{
"id": "09df4006-7c38-4cc1-851b-e0cf0f7d67ec",
"name": "Extract Text from PDF and Image using Vertex AI (Gemini) into CSV",
"nodes": 0,
"category": "Extractfromfile",
"status": "active",
"version": "1.0.0"
}Note: This is a sample preview. The full workflow JSON contains node configurations, credentials placeholders, and execution logic.
Get This Workflow
ID: 09df4006-7c38...
About the Author
SaaS_Connector
Integration Guru
Connecting CRM, Notion, and Slack to automate your life.
Statistics
Verification Info
Related Workflows
Discover more workflows you might like
Convert Any Webpage to PDF and Save Locally
This n8n workflow provides a simple way to convert any public webpage URL into a PDF document. Once converted, the PDF file is automatically saved to your local disk, making it easy to archive or share web content offline.
Automate Instagram Reel Analysis with Gemini and Apify
Unlock deeper insights into your Instagram Reels by automating analysis. This workflow leverages Apify to fetch reel data and Gemini AI to dissect key elements like background, pose, text, and context, enabling better content replication.
Automate Event Attendee Data Extraction with AI-Powered Scraping
This n8n workflow leverages AI and Bright Data's MCP to automate the scraping of event attendee, venue, and feedback data from 10times.com. Schedule the extraction to run daily and automatically save the organized data to Google Sheets.
Automate DOCX to PDF Conversion and Save Locally
This workflow automates the process of converting a Microsoft Word DOCX file to a PDF document. It downloads a source DOCX file, uses ConvertAPI to perform the conversion, and then saves the resulting PDF file directly to your local disk.
Telegram Profanity & Toxicity Filter
This n8n workflow automatically monitors incoming Telegram messages for profanity and toxic language. It leverages Google's Perspective API to analyze message content, and if a message is deemed inappropriate, the workflow sends an automated warning response back to the sender.
Automate Competitor Tracking with Crunchbase & ClickUp
Streamline competitor analysis by automatically fetching data from Crunchbase and creating review tasks in ClickUp. Stay informed about market changes without manual effort.