URL Metadata Scraper
Scrapes metadata (title, description, image) from a given URL.
About This Workflow
Overview
This workflow uses HTTP requests and HTML parsing to extract metadata from a provided URL: the page title, meta description, OpenGraph image, and favicon.
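The extraction step can be illustrated outside the workflow with a small standalone script. What follows is a minimal sketch using only Python's standard library, not the workflow's actual node configuration; the names MetadataParser and extract_metadata are hypothetical, and the tag and attribute choices (meta name="description", og:image, twitter:image, link rel="icon") are assumptions about where a typical page keeps this metadata.

```python
from html.parser import HTMLParser


class MetadataParser(HTMLParser):
    """Collects title, meta description, OpenGraph/Twitter image, and favicon."""

    def __init__(self):
        super().__init__()
        self.meta = {"title": None, "description": None, "image": None, "favicon": None}
        self._in_title = False

    def handle_starttag(self, tag, attrs):
        # Attribute names arrive lowercased; values may be None for bare attributes.
        a = {name: (value or "") for name, value in attrs}
        if tag == "title":
            self._in_title = True
        elif tag == "meta":
            key = (a.get("property") or a.get("name") or "").lower()
            content = a.get("content", "")
            if key == "description" and not self.meta["description"]:
                self.meta["description"] = content
            elif key in ("og:image", "twitter:image") and not self.meta["image"]:
                self.meta["image"] = content
        elif tag == "link":
            # rel can hold several tokens, e.g. "shortcut icon".
            rel_tokens = a.get("rel", "").lower().split()
            if "icon" in rel_tokens and not self.meta["favicon"]:
                self.meta["favicon"] = a.get("href", "")

    def handle_endtag(self, tag):
        if tag == "title":
            self._in_title = False

    def handle_data(self, data):
        # Title text can arrive in several chunks, so accumulate it.
        if self._in_title:
            self.meta["title"] = (self.meta["title"] or "") + data


def extract_metadata(html: str) -> dict:
    """Parse an HTML string and return the first match for each metadata field."""
    parser = MetadataParser()
    parser.feed(html)
    parser.close()
    meta = dict(parser.meta)
    if meta["title"]:
        meta["title"] = meta["title"].strip()
    return meta
```

Calling extract_metadata on a page's HTML returns a dictionary with the keys title, description, image, and favicon; any field the page does not declare stays None. Image and favicon values are returned as written in the page, so relative paths would still need resolving against the page URL.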
Key Features
- Extracts page title.
- Retrieves meta description.
- Fetches OpenGraph and Twitter image URLs.
- Extracts favicon URLs.
- Handles potential API errors.
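One way to picture the error handling listed above is a fetch step that never raises and instead reports failure in its result, so a later conditional step can branch on it. This is a sketch under assumptions, not the workflow's own logic: fetch_html and safe_fetch are hypothetical names, and the result shape (ok, html, error) is invented for illustration.

```python
import urllib.error
import urllib.request


def fetch_html(url: str, timeout: float = 10.0) -> str:
    """Download a page and decode it using the charset from the response headers."""
    req = urllib.request.Request(url, headers={"User-Agent": "metadata-scraper-sketch"})
    with urllib.request.urlopen(req, timeout=timeout) as resp:
        charset = resp.headers.get_content_charset() or "utf-8"
        return resp.read().decode(charset, errors="replace")


def safe_fetch(url: str, fetch=fetch_html) -> dict:
    """Wrap a fetch call so failures become data instead of exceptions."""
    try:
        return {"ok": True, "html": fetch(url), "error": None}
    except urllib.error.HTTPError as exc:
        # HTTPError is a subclass of URLError, so it must be caught first.
        return {"ok": False, "html": None, "error": f"HTTP {exc.code}"}
    except (urllib.error.URLError, TimeoutError, ValueError) as exc:
        # Covers DNS/connection failures, timeouts, and malformed URLs.
        return {"ok": False, "html": None, "error": str(exc)}
```

The fetch parameter exists so the wrapper can be exercised without network access by passing a stub function.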
How To Use
- Configure Environment Variables: Ensure BASE_URL is set correctly for the "Get Headers" and "OpenGraph API" nodes. Set API_BASE_URL for the "Meta tags Scraper - dub.sh" node.
- Set Long URL: Provide the target URL to be scraped, likely through an upstream node or by modifying the "API Auth" node's parameters if it's used to determine the URL.
- Adjust CSS Selectors (if necessary): The "Parse headers" node uses CSS selectors to find metadata. These might need adjustment based on the specific website structure.
- Handle Conditional Logic: The "IF OpenGraph invaild" node and the "IF - Enable ScreenShots" node provide examples of how to handle errors or conditional logic based on API responses.
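The configuration and branching steps above can be sketched in a few lines. Only the variable names BASE_URL and API_BASE_URL come from this page; the functions load_config and choose_result are hypothetical, and the validity rule (a dictionary with a title and no error field) is an assumption, since the page does not show what the IF nodes actually test.

```python
import os


def load_config(env=None) -> dict:
    """Read the two base URLs from the environment and fail early if one is unset."""
    env = os.environ if env is None else env
    names = ("BASE_URL", "API_BASE_URL")
    missing = [name for name in names if not env.get(name)]
    if missing:
        raise RuntimeError("Missing environment variables: " + ", ".join(missing))
    # Strip trailing slashes so paths can be appended consistently.
    return {name: env[name].rstrip("/") for name in names}


def choose_result(opengraph, scraped):
    """Prefer the OpenGraph response; fall back to scraped meta tags if it looks invalid.

    Assumed validity rule: a dict with a non-empty "title" and no "error" value.
    """
    if isinstance(opengraph, dict) and not opengraph.get("error") and opengraph.get("title"):
        return opengraph
    return scraped
```

Passing a plain dictionary as env makes the configuration check easy to try without touching real environment variables.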
Workflow JSON
{
"id": "fe1e00af-820c-4b56-a02d-b4aa0f2f9e29",
"name": "URL Metadata Scraper",
"nodes": 0,
"category": "Data Extraction",
"status": "active",
"version": "1.0.0"
}
Note: This is a sample preview. The full workflow JSON contains node configurations, credentials placeholders, and execution logic.
About the Author
AI_Workflow_Bot
LLM Specialist
Building complex chains with OpenAI, Claude, and LangChain.