Automated News Extraction, Summarization, and Keyword Identification
detail.loadingPreview
This workflow automates the extraction of news articles from a website, filters them by date, and uses AI to generate summaries and identify key technical keywords, making research more efficient.
🚀Ready to Deploy This Workflow?
About This Workflow
Overview
This n8n workflow is designed to automate the process of extracting news articles from a website that lacks an RSS feed. It begins by scraping the news listing page to extract article links and publication dates. A code node then filters these articles to include only those published within a specified recent period (e.g., last 7 days). For each selected article, the workflow navigates to its individual page, extracts its content, and then leverages OpenAI's GPT-4 model to generate a concise summary and identify the three most important technical keywords. Finally, all extracted information, including the original link, date, summary, and keywords, is merged for further processing or storage.
Key Features
- Scrapes news article links and dates from a web page.
- Filters articles based on a specified date range.
- Utilizes AI (OpenAI) to summarize article content.
- Extracts key technical keywords from articles using AI.
- Merges extracted data for comprehensive reporting.
How To Use
- Configure HTML Extraction Nodes: Update the CSS selectors in the 'Extract the HTML with the right css class' and 'Extract date' nodes to match the structure of the target news website.
- Set Date Filter: Adjust the
setDateparameter in the 'Select posts of last 7 days' node to define the desired lookback period for news articles. - Configure OpenAI Nodes: Ensure your OpenAI API credentials are set up and verify the prompts in the 'Summary' and 'Keywords' nodes are appropriate for your needs.
- Define Output: Connect the final merge node to your desired output destination (e.g., database, file, another service).
Apps Used
Workflow JSON
{
"id": "8554b165-e0d4-4d04-9fd2-64086a74a0bf",
"name": "Automated News Extraction, Summarization, and Keyword Identification",
"nodes": 0,
"category": "Web Scraping & AI",
"status": "active",
"version": "1.0.0"
}Note: This is a sample preview. The full workflow JSON contains node configurations, credentials placeholders, and execution logic.
Get This Workflow
ID: 8554b165-e0d4...
About the Author
N8N_Community_Pick
Curator
Hand-picked high quality workflows from the global community.
Statistics
Verification Info
Related Workflows
Discover more workflows you might like
Universal CSV to JSON API Converter
Effortlessly transform CSV data into structured JSON with this versatile n8n workflow. Integrate it into any application as a custom API endpoint, supporting various input methods including file uploads and raw text.
Automate Local Business Outreach with AI-Powered Yelp Scraper
This workflow automates the process of scraping local business details from Yelp using AI, then leverages that data to send personalized partnership proposals via Gmail. It's perfect for sales and marketing teams looking to streamline lead generation and outreach campaigns.
Automate Getty Images Editorial Search & CMS Integration
This n8n workflow automates searching for editorial images on Getty Images, extracts key details and embed codes, and prepares them for seamless integration into your Content Management System (CMS), streamlining your content creation process.