Automate PII Removal from CSV Files in Google Drive
detail.loadingPreview
Effortlessly secure your sensitive data by automating the removal of Personally Identifiable Information (PII) from CSV files stored in Google Drive. This workflow leverages AI to identify and redact PII, then saves the clean data back to a designated Google Drive folder.
About This Workflow
This n8n workflow provides a robust solution for safeguarding sensitive data within your organization. It's designed to automatically detect new CSV files uploaded to a specified Google Drive folder. Upon detection, the workflow triggers an intelligent analysis using OpenAI's GPT-4o-mini to pinpoint columns containing Personally Identifiable Information (PII). Once identified, these PII columns are systematically removed from the CSV data. The resulting sanitized file, now free of sensitive information, is then uploaded to a separate, secure Google Drive folder, ensuring compliance and data privacy without manual intervention.
Key Features
- Automated PII Detection: Utilizes OpenAI to accurately identify columns containing sensitive PII.
- Seamless Google Drive Integration: Monitors a specific Google Drive folder for new CSV files and uploads processed files to another.
- Intelligent Data Sanitization: Automatically removes identified PII columns, ensuring data privacy.
- Customizable Workflow: Easily adaptable to different Google Drive folder structures and AI models.
- Time-Saving Automation: Frees up valuable resources by automating a critical data processing task.
How To Use
- Configure Google Drive Trigger: Connect your Google Drive account and set the 'Google Drive Trigger' node to watch a specific folder for new CSV files ('fileCreated' event).
- Download File Content: The 'Google Drive' node downloads the content of the newly created file.
- Extract Filename: Use the 'Get filename' node to capture the original filename for later use.
- Analyze for PII: Connect the 'Google Drive' download node to the 'OpenAI' node. Configure the OpenAI node with a prompt to identify PII columns from tabular data. Ensure the 'Extract from File' node is placed between the download and OpenAI nodes to properly format the input for analysis.
- Process and Remove PII: Route the output of the OpenAI node (containing PII column names) and the original file data to the 'Merge' node. Then, feed the merged data into the 'Remove PII columns' code node. This node uses JavaScript to parse the PII column names, remove them from the original data, and format the output as a CSV string with a new filename.
- Upload Sanitized File: Connect the 'Remove PII columns' node to the final 'Upload to Drive' node. Configure this node to upload the sanitized CSV content to your desired output folder in Google Drive.
Apps Used
Workflow JSON
{
"id": "00d3da5a-e050-4556-8127-45562af22f57",
"name": "Automate PII Removal from CSV Files in Google Drive",
"nodes": 22,
"category": "Operations",
"status": "active",
"version": "1.0.0"
}Note: This is a sample preview. The full workflow JSON contains node configurations, credentials placeholders, and execution logic.
Get This Workflow
ID: 00d3da5a-e050...
About the Author
N8N_Community_Pick
Curator
Hand-picked high quality workflows from the global community.
Statistics
Related Workflows
Discover more workflows you might like
Universal CSV to JSON API Converter
Effortlessly transform CSV data into structured JSON with this versatile n8n workflow. Integrate it into any application as a custom API endpoint, supporting various input methods including file uploads and raw text.
Instant WooCommerce Order Notifications via Telegram
When a new order is placed on your WooCommerce store, instantly receive detailed notifications directly to your Telegram chat. Stay on top of your e-commerce operations with real-time alerts, including order specifics and a direct link to view the order.
On-Demand Microsoft SQL Query Execution
This workflow allows you to manually trigger and execute any SQL query against your Microsoft SQL Server database. Perfect for ad-hoc data lookups, administrative tasks, or quick tests, giving you direct control over your database operations.