Automated PII Removal from CSV Files on Google Drive using OpenAI
detail.loadingPreview
This workflow automatically detects new CSV files in a Google Drive folder, uses OpenAI to identify and remove Personally Identifiable Information (PII) columns, and uploads the cleaned file back to Google Drive. It leverages Google Drive Trigger, Google Drive, OpenAI, and code nodes for robust data sanitization.
🚀Ready to Deploy This Workflow?
About This Workflow
Overview
This n8n workflow is designed to automatically process CSV files uploaded to a specific Google Drive folder. It monitors the folder for new files, downloads them, and then utilizes the power of OpenAI to identify and remove columns that contain Personally Identifiable Information (PII). The sanitized data is then converted back into a CSV format and uploaded to a designated folder on Google Drive. This automation is crucial for maintaining data privacy and compliance when handling sensitive information stored in cloud storage.
Key Features
- Real-time monitoring of Google Drive folders for new CSV files.
- Automatic download of newly added files.
- Intelligent PII column identification using OpenAI.
- Removal of identified PII columns from the data.
- Generation of a new CSV file with sensitive data redacted.
- Automatic upload of the sanitized CSV back to Google Drive.
How To Use
- Configure Google Drive Trigger: Set up the
Google Drive Triggernode to watch a specific folder for new CSV files. Ensure your Google Drive credentials are correctly configured. - Download File: The
Google Drivenode downloads the detected file. - Extract Filename: Use the
Get filenamenode to extract the original filename for later use. - Extract Data: The
Extract from Filenode processes the downloaded file, preparing it for analysis. - Identify PII with OpenAI: The
OpenAInode analyzes the data (column headers and sample rows) and returns a comma-separated list of PII column names. TheGet resultnode extracts this content. - Merge Data: The
Mergenode combines the PII column list, the original filename, and the extracted file data for processing. - Remove PII Columns: The
Remove PII columns(Code node) takes the merged data, identifies PII columns using the output from OpenAI, removes them from each row, converts the remaining data into a CSV format, and generates a new filename. - Upload Sanitized File: The
Upload to Drivenode uploads the newly created CSV file (with PII removed) to a specified folder in your Google Drive.
Apps Used
Workflow JSON
{
"id": "4fc704c4-45a2-4073-a306-4d6362d73e0a",
"name": "Automated PII Removal from CSV Files on Google Drive using OpenAI",
"nodes": 0,
"category": "PDF and Document Processing",
"status": "active",
"version": "1.0.0"
}Note: This is a sample preview. The full workflow JSON contains node configurations, credentials placeholders, and execution logic.
Get This Workflow
ID: 4fc704c4-45a2...
About the Author
Free n8n Workflows Official
System Admin
The official repository for verified enterprise-grade workflows.
Statistics
Verification Info
Related Workflows
Discover more workflows you might like
Chat with Documents Using LangChain and Pinecone
Ingest documents from Google Drive, vectorize them with OpenAI, store in Pinecone, and enable chat interactions with LangChain nodes. This workflow automates the process of creating a searchable knowledge base.
Automated Audio Transcription and Summarization from Google Drive to Notion
Automatically transcribe audio files from Google Drive using OpenAI Whisper, then summarize and send structured data to Notion.
Automated Document Processing and Q&A with Langchain and Mistral AI
This workflow automates the processing of local documents, chunking them for efficient analysis, and then uses Mistral AI for question answering via a Qdrant vector store. It allows for intelligent retrieval and summarization of information from various document types.