Scrape Books from URL with Dumpling AI, Clean HTML, Save to Sheets, Email as CSV
detail.loadingPreview
This workflow scrapes book data from a website, cleans the HTML, sorts it by price, and emails it as a CSV.
🚀Ready to Deploy This Workflow?
About This Workflow
Overview
This workflow automates the process of scraping book information from a given URL. It utilizes Dumpling AI to fetch and clean the HTML content, then extracts book titles and prices. The extracted data is sorted by price in descending order and finally converted into a CSV file, which is then emailed to a specified recipient. This is a production-ready, optimized workflow designed for efficiency.
Key Features
- Scrapes book data from a provided URL using Dumpling AI.
- Cleans and extracts HTML content efficiently.
- Isolates book titles and prices using CSS selectors.
- Sorts the scraped book data by price in descending order.
- Converts the organized data into a CSV file.
- Automatically emails the generated CSV file as an attachment.
How To Use
- Configure Google Sheets Trigger: Set up a Google Sheets trigger to monitor a sheet for new row entries. Add the URLs you want to scrape in the specified sheet and column.
- Set up Dumpling AI Request: Ensure your Dumpling AI API key and the base URL are correctly configured in the environment variables.
- Configure HTML Extraction: Adjust the CSS selectors in the 'Extract all books' and 'Extract individual book price' nodes to match the structure of the target website.
- Set up Gmail Credentials: Authenticate your Gmail account for sending emails.
- Specify Email Recipient: Update the 'Send CSV via e-mail' node with the desired recipient email address.
- Enable Workflow: Activate the workflow to start the automated scraping and emailing process.
Apps Used
Workflow JSON
{
"id": "52f1554e-8848-4202-a04f-af7e1be56e73",
"name": "Scrape Books from URL with Dumpling AI, Clean HTML, Save to Sheets, Email as CSV",
"nodes": 0,
"category": "Web Scraping",
"status": "active",
"version": "1.0.0"
}Note: This is a sample preview. The full workflow JSON contains node configurations, credentials placeholders, and execution logic.
Get This Workflow
ID: 52f1554e-8848...
About the Author
SaaS_Connector
Integration Guru
Connecting CRM, Notion, and Slack to automate your life.
Statistics
Verification Info
Related Workflows
Discover more workflows you might like
Scrappey Web Scraper
Scrapes websites using Scrappey's API to bypass anti-bot measures.
Indeed Job Listing Scraper with Bright Data and LLMs
Scrape Indeed job listings for hiring signals using Bright Data and LLMs.
Community Webpage Crawler
Crawls a given URL and returns its content in Markdown format.
Web Scraper and Data Extractor for Products
Scrapes product data from web pages and saves it to Google Sheets.
Vision-Based AI Agent Scraper with Google Sheets, ScrapingBee, and Gemini
Scrape websites using visual AI and Gemini, storing results in Google Sheets.
HN Who Is Hiring Scraper
Scrapes 'Ask HN: Who is hiring?' posts to extract job details.