Extract PDF and Image Text to CSV Using Vertex AI (Gemini)

Name: Extract PDF and Image Text to CSV Using Vertex AI (Gemini)
Rating: 5 (7 reviews)
Author: Free N8N

Community Verified

Beginner

0 nodes connected

detail.loadingPreview

Free N8N Temples

145 views

0 downloads

PDF and Document ProcessingAutomationCSVData ProcessingGeminiGoogle DriveImage to textOCRPDF extractionVertex AI

Automate text extraction from PDFs and images stored in Google Drive into a CSV format using Vertex AI (Gemini). This workflow streamlines data processing and reduces manual entry.

🚀Ready to Deploy This Workflow?

⚡Deploy on Zeabur 🎁Get $200 Credit on DigitalOcean

About This Workflow

Overview

This n8n workflow is designed to automatically extract text content from PDF documents and images stored in Google Drive, transforming them into a structured CSV file. It begins by monitoring a specified Google Drive folder for new files. Based on the file type (PDF or image), it downloads the content and then utilizes Google's Vertex AI (specifically the Gemini model) for robust text extraction. The extracted data is then converted into CSV format and uploaded back to Google Drive. This solution is ideal for automating the processing of invoices, receipts, scanned documents, or any visual data that needs to be converted into a tabular format for further analysis or storage.

Key Features

Monitors a Google Drive folder for new PDF or image files.
Supports both PDF and common image file types.
Leverages Google Vertex AI (Gemini) for accurate text extraction from both document and image sources.
Automatically converts extracted text into a CSV file.
Uploads the generated CSV file to a designated Google Drive folder.
Reduces manual data entry and improves processing speed.

How To Use

Set up Google Drive Integration: Ensure your n8n instance is connected to your Google Drive account, preferably using a service account for enhanced security and permissions.
Configure Google Drive Trigger: Set up the 'Google Drive Trigger' node to watch a specific folder for new file creations. This folder will be where you upload your PDFs and images.
Configure Vertex AI: Enable Vertex AI in your Google Cloud project and ensure your Google Cloud credentials in n8n have the necessary permissions to access and use the Vertex AI API, particularly for image and text analysis with Gemini.
Process Files: The workflow will route PDF files to the PDF extraction node and image files to the image text extraction process.
AI Text Extraction: The 'Google Gemini Chat Model' (or an equivalent Vertex AI node) will process the downloaded content to extract text.
Convert to CSV: Use the 'Convert to CSV' node to structure the extracted text into a CSV format.
Upload CSV: Configure the 'Upload to Google Drive' node to save the generated CSV file to a desired destination folder in your Google Drive.

Apps Used

Automation

CSV

Data Processing

Gemini

Google Drive

Image to text

OCR

PDF extraction

Vertex AI

Workflow JSON

{
  "id": "bbc74a39-ecb6-4b28-939b-da2456c545c5",
  "name": "Extract PDF and Image Text to CSV Using Vertex AI (Gemini)",
  "nodes": 0,
  "category": "PDF and Document Processing",
  "status": "active",
  "version": "1.0.0"
}

Note: This is a sample preview. The full workflow JSON contains node configurations, credentials placeholders, and execution logic.

Get This Workflow

ID: bbc74a39-ecb6...

About the Author

SaaS_Connector

Integration Guru

Connecting CRM, Notion, and Slack to automate your life.

Statistics

Downloads0

Rating

7/5

Verification Info

Community Verified

This workflow has been verified by the community

📄

Source

awesome-n8n-templates

Get Custom Workflow

Need a specific automation? Our experts can build it for you.

Trusted by top companies
7+ years experience

Related Workflows

Discover more workflows you might like

Browse All n8n Workflows

Beginner✓ Verified

PDF and Document ProcessingaudiotranscriptionOpenAI

Automated Audio Transcription and Summarization from Google Drive to Notion

Automatically transcribe audio files from Google Drive using OpenAI Whisper, then summarize and send structured data to Notion.

0 nodes

150

View Workflow

Beginner✓ Verified

PDF and Document Processingresume analysisHR automationvision language model

Automated Resume Analysis Using PDF to Image Conversion and Vision Language Model

This workflow automates candidate resume analysis by converting PDFs to images, then using a Vision Language Model (VLM) to assess fit for a role, bypassing potential AI detection bypasses in resumes.

0 nodes

157

View Workflow

Beginner✓ Verified

PDF and Document Processinglangchainopenaipinecone

Chat with Documents Using LangChain and Pinecone

Ingest documents from Google Drive, vectorize them with OpenAI, store in Pinecone, and enable chat interactions with LangChain nodes. This workflow automates the process of creating a searchable knowledge base.

0 nodes

143

View Workflow

Browse All n8n Workflows

Overview

Set up Google Drive Integration: Ensure your n8n instance is connected to your Google Drive account, preferably using a service account for enhanced security and permissions.
Configure Google Drive Trigger: Set up the 'Google Drive Trigger' node to watch a specific folder for new file creations. This folder will be where you upload your PDFs and images.
Configure Vertex AI: Enable Vertex AI in your Google Cloud project and ensure your Google Cloud credentials in n8n have the necessary permissions to access and use the Vertex AI API, particularly for image and text analysis with Gemini.
Process Files: The workflow will route PDF files to the PDF extraction node and image files to the image text extraction process.
AI Text Extraction: The 'Google Gemini Chat Model' (or an equivalent Vertex AI node) will process the downloaded content to extract text.
Convert to CSV: Use the 'Convert to CSV' node to structure the extracted text into a CSV format.
Upload CSV: Configure the 'Upload to Google Drive' node to save the generated CSV file to a desired destination folder in your Google Drive.

{ "id": "bbc74a39-ecb6-4b28-939b-da2456c545c5", "name": "Extract PDF and Image Text to CSV Using Vertex AI (Gemini)", "nodes": 0, "category": "PDF and Document Processing", "status": "active", "version": "1.0.0" }