HackerNoon Article Title And URL Extractor
detail.loadingPreview
This workflow efficiently scrapes the HackerNoon homepage to automatically extract article titles and their corresponding URLs. It's perfect for keeping up with the latest tech stories or building a custom content feed.
About This Workflow
This n8n workflow provides a powerful, no-code solution for extracting valuable information from websites, specifically demonstrating how to pull article titles and URLs from HackerNoon. Starting with a manual trigger, it performs an HTTP request to fetch the HackerNoon homepage HTML. Subsequent HTML Extract nodes precisely target and parse the <h2> elements, which typically contain article links. Finally, it refines the data to isolate each article's title and its direct URL, making it ready for further processing or storage. This versatile pattern can be adapted for various web scraping needs.
Key Features
- Extracts article titles and URLs from web pages.
- Utilizes robust HTML parsing to target specific elements.
- Modular design allowing for easy adaptation to other websites.
- No-code web scraping solution.
How To Use
- Manual Trigger: This node starts the workflow. Simply click 'Execute Workflow' in the n8n editor to run it.
- HTTP Request: This node is pre-configured to fetch the content from
https://hackernoon.com/. You can change theURLparameter to scrape a different website if needed. - HTML Extract: This node identifies and extracts all
<h2>tags from the fetched HTML, returning their inner HTML as 'item' properties. This step narrows down the content to article blocks. - HTML Extract1: This final extraction node processes each 'item' from the previous step. It specifically looks for
<a>tags within each<h2>block to extract the articletitle(text content) and itsurl(thehrefattribute). No further configuration is required for HackerNoon, but you might adjustcssSelectorif scraping another site.
Apps Used
Workflow JSON
{
"id": "3bcd6772-3361-4124-8e47-58c47bf28502",
"name": "HackerNoon Article Title And URL Extractor",
"nodes": 16,
"category": "Operations",
"status": "active",
"version": "1.0.0"
}Note: This is a sample preview. The full workflow JSON contains node configurations, credentials placeholders, and execution logic.
Get This Workflow
ID: 3bcd6772-3361...
About the Author
Free n8n Workflows Official
System Admin
The official repository for verified enterprise-grade workflows.
Statistics
Related Workflows
Discover more workflows you might like
Instant WooCommerce Order Notifications via Telegram
When a new order is placed on your WooCommerce store, instantly receive detailed notifications directly to your Telegram chat. Stay on top of your e-commerce operations with real-time alerts, including order specifics and a direct link to view the order.
On-Demand Microsoft SQL Query Execution
This workflow allows you to manually trigger and execute any SQL query against your Microsoft SQL Server database. Perfect for ad-hoc data lookups, administrative tasks, or quick tests, giving you direct control over your database operations.
Automate Getty Images Editorial Search & CMS Integration
This n8n workflow automates searching for editorial images on Getty Images, extracts key details and embed codes, and prepares them for seamless integration into your Content Management System (CMS), streamlining your content creation process.