Automate Web Scraping & Data Distribution with Ease
Scrape data from a website, convert it to CSV, and distribute it automatically via email, Google Sheets, and Microsoft Excel, so that data collection and sharing run as one process.
About This Workflow
This n8n workflow automates web scraping end to end, from fetching a page to distributing the extracted data. It fetches HTML from a specified URL, then extracts structured data such as book titles and prices using CSS selectors. It cleans and organizes the data, sorts it by price in descending order, and converts it to a CSV file. Finally, it sends the CSV as an email attachment via Gmail and appends the same data to a Google Sheet and a Microsoft Excel worksheet, so the results stay current in all three destinations.
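The sort and CSV steps described above can be sketched outside n8n in a few lines of Python. This is only an illustration of the idea, not the workflow's own node configuration: the sample records, the price format with a leading currency symbol, and the helper names price_value and to_sorted_csv are all assumptions.

```python
import csv
import io

# Hypothetical sample records standing in for the scraped items.
books = [
    {"title": "Book A", "price": "$12.50"},
    {"title": "Book B", "price": "$45.00"},
    {"title": "Book C", "price": "$7.99"},
]


def price_value(record):
    """Parse a price string such as '$12.50' into a float for sorting.

    Assumes the price holds a single decimal number plus a currency symbol.
    """
    digits = "".join(ch for ch in record["price"] if ch.isdigit() or ch == ".")
    return float(digits)


def to_sorted_csv(records):
    """Sort records by price, highest first, and render them as CSV text."""
    ordered = sorted(records, key=price_value, reverse=True)
    buffer = io.StringIO()
    writer = csv.DictWriter(
        buffer, fieldnames=["title", "price"], lineterminator="\n"
    )
    writer.writeheader()
    writer.writerows(ordered)
    return buffer.getvalue()


csv_text = to_sorted_csv(books)
print(csv_text)
```

Sorting on the parsed number rather than the raw string matters here: compared as text, "$7.99" would rank above "$45.00".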
Key Features
- Automated Web Scraping: Extract specific data points from any web page using intuitive CSS selectors.
- Data Transformation: Convert raw scraped data into a structured CSV format.
- Multi-Platform Distribution: Automatically send CSVs via email, save to Google Sheets, and append to Microsoft Excel.
- Data Sorting: Organize your scraped data with customizable sorting options, like by price.
- Flexible Configuration: Easily adjust URLs, selectors, and destination details.
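To make the email leg of the distribution concrete, here is a minimal Python sketch that attaches CSV text to a message using the standard library. It only builds the message and does not send it. The function name build_csv_email, the addresses, the subject line, and the file name are illustrative assumptions, not values taken from the workflow.

```python
from email.message import EmailMessage


def build_csv_email(csv_text, sender, recipient, filename="books.csv"):
    """Build an email that carries the CSV text as a file attachment."""
    message = EmailMessage()
    message["From"] = sender
    message["To"] = recipient
    message["Subject"] = "Scraped data export"  # hypothetical subject line
    message.set_content("The scraped data is attached as a CSV file.")
    # Attach the CSV as bytes with an explicit text/csv content type.
    message.add_attachment(
        csv_text.encode("utf-8"),
        maintype="text",
        subtype="csv",
        filename=filename,
    )
    return message


# Placeholder addresses and data for illustration only.
email_message = build_csv_email(
    "title,price\nBook A,$12.50\n",
    "sender@example.com",
    "recipient@example.com",
)
```

In the workflow itself this step is handled by the gmail node with your Gmail credentials; the sketch only shows what "CSV as an attachment" amounts to.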
How To Use
- Trigger Workflow: Initiate the scraping process by clicking "Test workflow" or by setting up a different trigger.
- Fetch Website Content: Configure the httpRequest node with the URL of the website you want to scrape.
- Extract HTML Content: Use the html node to define CSS selectors that pinpoint the data you need (e.g., ".row > li" to find all book items).
- Split Data: If your initial extraction yields multiple items, use the splitOut node to process each item individually.
- Extract Individual Data Points: For each item, use another html node to extract specific details like titles and prices using precise CSS selectors.
- Sort Data: Employ the sort node to order your extracted data based on fields like price.
- Convert to CSV: Use the convertToFile node to transform your structured data into a CSV file.
- Save to Google Sheets: Configure the googleSheets node with your Google Cloud credentials and the target spreadsheet/sheet ID to append the data.
- Save to Microsoft Excel: Set up the microsoftExcel node with your Microsoft 365 credentials and target workbook/worksheet to append the data.
- Send via Email: Use the gmail node, providing your Gmail credentials and recipient details, to send the generated CSV as an attachment.
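The two extraction steps above (select every item, then pull a title and price out of each one) can be approximated with Python's standard html.parser module. This is a rough sketch under stated assumptions, not the html node's implementation: the sample markup, the "price" class, and reading the title from the link's title attribute are hypothetical, and the parser expects well-formed HTML.

```python
from html.parser import HTMLParser

# Hypothetical markup; a real page's structure and class names will differ.
SAMPLE_HTML = """
<ol class="row">
  <li><h3><a title="Book A">Book A</a></h3><p class="price">$12.50</p></li>
  <li><h3><a title="Book B">Book B</a></h3><p class="price">$45.00</p></li>
</ol>
"""

# Tags that never have a closing tag, so they must not go on the stack.
VOID_TAGS = {"br", "hr", "img", "input", "link", "meta"}


class BookParser(HTMLParser):
    """Collect title and price from each <li> directly under a 'row' element."""

    def __init__(self):
        super().__init__()
        self.stack = []        # (tag, classes) for every open element
        self.books = []        # finished records
        self.current = None    # record for the <li> being read
        self.item_depth = 0    # stack depth at which the current <li> opened
        self.in_price = False

    def handle_starttag(self, tag, attrs):
        attrs = dict(attrs)
        classes = (attrs.get("class") or "").split()
        parent_classes = self.stack[-1][1] if self.stack else []
        if self.current is None and tag == "li" and "row" in parent_classes:
            # Rough equivalent of the CSS selector ".row > li".
            self.current = {"title": "", "price": ""}
            self.item_depth = len(self.stack)
        elif self.current is not None:
            if tag == "a" and attrs.get("title"):
                self.current["title"] = attrs["title"]
            elif tag == "p" and "price" in classes:
                self.in_price = True
        if tag not in VOID_TAGS:
            self.stack.append((tag, classes))

    def handle_endtag(self, tag):
        if tag in VOID_TAGS or not self.stack:
            return
        self.stack.pop()
        if tag == "p":
            self.in_price = False
        closes_item = tag == "li" and len(self.stack) == self.item_depth
        if self.current is not None and closes_item:
            self.books.append(self.current)
            self.current = None

    def handle_data(self, data):
        if self.in_price and self.current is not None:
            self.current["price"] += data.strip()


def extract_books(html):
    """Return one {'title', 'price'} record per matched list item."""
    parser = BookParser()
    parser.feed(html)
    return parser.books


records = extract_books(SAMPLE_HTML)
```

Each returned record corresponds to one item that the splitOut node would pass along individually in the workflow.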
Workflow JSON
{
  "id": "c8908a14-994f-420b-84ec-22bb96f7c249",
  "name": "Automate Web Scraping & Data Distribution with Ease",
  "nodes": 20,
  "category": "Operations",
  "status": "active",
  "version": "1.0.0"
}
Note: This is a sample preview. The full workflow JSON contains node configurations, credentials placeholders, and execution logic.
About the Author
Free n8n Workflows Official
System Admin
The official repository for verified enterprise-grade workflows.