Extract and Summarize Wikipedia Data with Bright Data and Gemini AI
detail.loadingPreview
This workflow extracts data from Wikipedia using Bright Data and summarizes it using Google Gemini AI.
🚀Ready to Deploy This Workflow?
About This Workflow
Overview
This workflow demonstrates how to extract content from a Wikipedia page using Bright Data's web scraping capabilities and then summarize the extracted content using Google's Gemini AI. It's designed to provide a concise overview of the information found on a given Wikipedia page.
Key Features
- Extracts raw HTML content from Wikipedia pages via Bright Data.
- Processes and formats the extracted HTML into human-readable text.
- Utilizes Google Gemini AI (Flash and Pro models) for data extraction and summarization.
- Sends the final summary to a configured webhook URL.
How To Use
- Configure environment variables for
API_BASE_URL(Bright Data endpoint) andWEBHOOK_URL. - Set up the 'Header Auth account' credential for the Bright Data API.
- Set up the 'Google Gemini(PaLM) Api account' credential for Google Gemini AI.
- Update the
Set Wikipedia URL with Bright Data Zonenode with the desired Wikipedia URL and Bright Data zone. - Trigger the workflow by clicking 'Test workflow'.
Apps Used
Workflow JSON
{
"id": "5ca3588f-a5ae-4a87-9286-47da05ba41a9",
"name": "Extract and Summarize Wikipedia Data with Bright Data and Gemini AI",
"nodes": 0,
"category": "Data Extraction & AI Summarization",
"status": "active",
"version": "1.0.0"
}Note: This is a sample preview. The full workflow JSON contains node configurations, credentials placeholders, and execution logic.
Get This Workflow
ID: 5ca3588f-a5ae...
About the Author
N8N_Community_Pick
Curator
Hand-picked high quality workflows from the global community.
Statistics
Verification Info
Related Workflows
Discover more workflows you might like
Automate Event Attendee Data Extraction with AI-Powered Scraping
This n8n workflow leverages AI and Bright Data's MCP to automate the scraping of event attendee, venue, and feedback data from 10times.com. Schedule the extraction to run daily and automatically save the organized data to Google Sheets.
Automate Competitor Tracking with Crunchbase & ClickUp
Streamline competitor analysis by automatically fetching data from Crunchbase and creating review tasks in ClickUp. Stay informed about market changes without manual effort.
Automated Multi-Platform Social Media Publisher
Streamline your social media content creation and publishing with this n8n workflow. Simply fill out a web form with your caption, media (image or video), and target platforms, and let n8n automate the posting process across multiple social networks.
Telegram Profanity & Toxicity Filter
This n8n workflow automatically monitors incoming Telegram messages for profanity and toxic language. It leverages Google's Perspective API to analyze message content, and if a message is deemed inappropriate, the workflow sends an automated warning response back to the sender.
Automate Shipment Tracking with Telegram and n8n
Streamline your logistics operations by automating shipment tracking with Telegram and n8n. This workflow allows drivers to easily share updates, and ensures timely notifications to your team.
Universal CSV to JSON API Converter
Effortlessly transform CSV data into structured JSON with this versatile n8n workflow. Integrate it into any application as a custom API endpoint, supporting various input methods including file uploads and raw text.