Community Contributed News Extraction (Unverified)
Extracts news articles from a website without an RSS feed, filters by date, and generates summaries and keywords.
About This Workflow
Overview
This workflow is designed to scrape news articles from a website that does not provide an RSS feed. It extracts article links and dates, filters for recent posts, and then uses AI to generate summaries and identify key technical keywords for each article. This is a "Community Contributed" template and is currently unverified.
Key Features
- Scrapes news articles from websites lacking RSS feeds.
- Filters articles to include only those from the last 7 days (configurable).
- Extracts article dates using CSS selectors.
- Generates concise summaries for each article using OpenAI.
- Identifies up to 3 key technical keywords per article using OpenAI.
- Merges extracted data (dates, links, content) for unified processing.
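The merge step in the list above can be sketched roughly as follows. This is a minimal illustration, not the workflow's actual node configuration: the helper name mergeByIndex, the field names date and link, and the assumption that dates and links are paired by their position on the page are all hypothetical. Resolving relative links against the base URL is likewise an assumption about how the scraped site is structured.

```javascript
// Hypothetical helper: pairs the i-th extracted date with the i-th extracted link.
// Assumes both lists come from the same page in the same order.
function mergeByIndex(dates, links, baseUrl) {
  // Only pair entries that exist in both lists.
  const count = Math.min(dates.length, links.length);
  const posts = [];
  for (let i = 0; i < count; i++) {
    posts.push({
      date: dates[i],
      // Resolve relative links (e.g. "/news/a") against the base URL;
      // absolute links are returned unchanged.
      link: new URL(links[i], baseUrl).href,
    });
  }
  return posts;
}
```

Pairing by position is fragile if the page lists a date without a link (or the reverse), so it is worth checking that both selectors return the same number of matches before relying on it.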
How To Use
- Import Workflow: Import this JSON into your n8n instance.
- Configure Credentials: Set up your OpenAI API credentials.
- Set Base URL: Configure the BASE_URL environment variable with the base URL of the website you want to scrape.
- Adjust CSS Selectors: Inspect the target website and update the CSS selectors in the 'Extract the HTML with the right css class', 'Extract date', and 'Extract individual posts' nodes to accurately target the desired data.
- Modify Date Filter: Adjust the number of days in the 'Select posts of last 7 days' node's jsCode to change the date range for filtering articles.
- Run Workflow: Execute the workflow to begin scraping and processing news.
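The date filter described in the steps above might look something like the sketch below. The actual jsCode in the 'Select posts of last 7 days' node is not shown on this page, so the function name selectRecentPosts, the date field, and the days parameter are assumptions made for illustration. The sketch also assumes the extracted dates are in a format Date.parse can read, such as ISO 8601.

```javascript
const MS_PER_DAY = 24 * 60 * 60 * 1000;

// Hypothetical helper: keeps posts whose date is within the last `days` days.
// `now` is injectable so the function can be checked against a fixed date.
function selectRecentPosts(posts, days = 7, now = new Date()) {
  const cutoff = now.getTime() - days * MS_PER_DAY;
  return posts.filter((post) => {
    const time = Date.parse(post.date);
    // Drop posts whose date cannot be parsed, then apply the cutoff.
    return !Number.isNaN(time) && time >= cutoff;
  });
}
```

Changing the default of 7 to another value widens or narrows the window. Inside an n8n Code node, the input would typically come from $input.all() and each result would be returned wrapped as { json: post }; the plain function here leaves that wiring out so it can be run on its own.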
Workflow JSON
{
"id": "20a0ed58-be55-4350-8475-b0eef44adba3",
"name": "Community Contributed News Extraction (Unverified)",
"nodes": 0,
"category": "Web Scraping",
"status": "active",
"version": "1.0.0"
}
Note: This is a sample preview. The full workflow JSON contains node configurations, credentials placeholders, and execution logic.
About the Author
DevOps_Master_X
Infrastructure Expert
Specializing in CI/CD pipelines, Docker, and Kubernetes automations.
Related Workflows
Discover more workflows you might like
Scrape Trustpilot Reviews to Google Sheets
Scrapes reviews from Trustpilot for a specified company and saves them to a Google Sheet.
Selenium Ultimate Scraper Workflow
A comprehensive workflow for scraping web content using Selenium, including advanced features like cookie handling and driver cleanup.
Vision-Based AI Agent Scraper with Google Sheets, ScrapingBee, and Gemini
Scrape websites using visual AI and Gemini, storing results in Google Sheets.
Jina.ai Multipage Website Scraper
Scrape entire websites without an API key using Jina.ai.
Paul Graham Essay Scraper and Text Extractor
Scrapes the list of Paul Graham's essays, fetches the first few, and extracts their plain text content.
Selenium Ultimate Scraper Workflow
A comprehensive workflow to scrape websites using Selenium and process the extracted data.