Scrape Trustpilot Reviews, Extract Structured Metadata with DeepSeek, Analyze Sentiment with OpenAI, and Store in Google Sheets
Automate the scraping of Trustpilot reviews, extract detailed structured metadata using DeepSeek, analyze sentiment with OpenAI, and store the results in Google Sheets.
About This Workflow
This workflow automates gathering customer feedback from Trustpilot. It fetches the review pages for a specified company, uses a DeepSeek chat model to extract structured metadata from each review, and then uses an OpenAI chat model to classify each review's sentiment as Positive, Neutral, or Negative. The results are stored in a Google Sheet for further analysis and reporting. Before processing a review, the workflow checks the sheet for an existing entry so that duplicates are skipped.
Key Features
- Automated Review Scraping: Fetches reviews from Trustpilot for a given company and number of pages.
- Structured Metadata Extraction: Utilizes DeepSeek (via Langchain) to extract specific fields like author, rating, date, title, text, number of reviews, and country.
- Sentiment Analysis: Employs OpenAI (via Langchain) to determine the sentiment (Positive, Neutral, Negative) of each review.
- Duplicate Checking: Integrates with Google Sheets to check if a review has already been processed.
- Data Storage: Appends or updates processed reviews, along with their extracted metadata and sentiment, into a Google Sheet.
- Configurable Parameters: Allows setting the company ID and maximum number of pages to scrape.
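As a rough illustration of the record that the features above produce, the extracted fields and the sentiment label could be modeled as follows. This is only a sketch: the class name, the field names, and the 1 to 5 rating check are assumptions made for illustration, not the column names or validation used by the workflow itself.

```python
from dataclasses import dataclass, asdict

# Sentiment labels named in the feature list.
SENTIMENTS = ("Positive", "Neutral", "Negative")


@dataclass
class ReviewRecord:
    # Hypothetical field names; the real sheet columns may differ.
    review_id: str
    author: str
    rating: int
    date: str
    title: str
    text: str
    number_of_reviews: int
    country: str
    sentiment: str

    def __post_init__(self):
        # Reject labels outside the three categories the workflow uses.
        if self.sentiment not in SENTIMENTS:
            raise ValueError(f"unknown sentiment: {self.sentiment!r}")
        # Assumption: ratings are whole stars from 1 to 5.
        if not 1 <= self.rating <= 5:
            raise ValueError(f"rating out of range: {self.rating}")


def to_row(record: ReviewRecord) -> dict:
    """Convert a record into a plain dict, one key per sheet column."""
    return asdict(record)
```

Validating the label at construction time keeps malformed model output from reaching the sheet.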
How To Use
- Configure Company and Pages: Update the Set Parameters node (company_id and max_page) with the target company's Trustpilot ID and the desired number of pages to scrape.
- Set Up Google Sheets Credentials: Ensure your Google Sheets OAuth2 credentials are correctly configured in n8n and are linked to the Get rows, Get Google Sheets, and Update sheet nodes.
- Specify Google Sheet: In the Get rows, Get Google Sheets, and Update sheet nodes, ensure the documentId and sheetName correctly point to your target Google Sheet.
- Configure LLM Credentials: Ensure your DeepSeek and OpenAI API keys are set up correctly in n8n credentials and linked to the respective DeepSeek Chat Model and OpenAI Chat Model nodes.
- Run the Workflow: Execute the workflow manually or set up a schedule.
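To make the role of company_id and max_page concrete, the sketch below shows how the two parameters might translate into a list of page URLs to fetch. The URL pattern and the function name are assumptions for illustration; check the Get reviews node for the URL the workflow actually requests.

```python
from typing import List
from urllib.parse import urlencode

# Assumed URL pattern for a company's review listing.
BASE_URL = "https://www.trustpilot.com/review"


def build_page_urls(company_id: str, max_page: int) -> List[str]:
    """Return one listing URL per page, from page 1 up to max_page."""
    if max_page < 1:
        raise ValueError("max_page must be at least 1")
    return [
        f"{BASE_URL}/{company_id}?{urlencode({'page': page})}"
        for page in range(1, max_page + 1)
    ]
```

For example, build_page_urls("example.com", 2) yields the URLs for pages 1 and 2 of the hypothetical company "example.com".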
Node Explanations:
- When clicking ‘Test workflow’: Manual trigger to start the workflow.
- Set Parameters: Defines the company to scrape and the maximum number of pages.
- Get reviews (HTTP Request): Fetches the HTML content of the Trustpilot review page.
- Extract (HTML): Extracts review links from the fetched HTML.
- Split Out: Splits the array of review links into individual items.
- Limit1: Limits the number of reviews processed to a maximum of 3 (can be adjusted).
- Get Google Sheets: Checks if a review already exists in the Google Sheet based on its ID.
- If: Conditional node to proceed only if the review is not found in Google Sheets.
- Get Single review (HTTP Request): Fetches the individual review page HTML.
- Extract review (HTML): Extracts the main article content of a single review.
- Information Extractor (Langchain): Uses DeepSeek to parse the HTML and extract structured metadata from the review text.
- Sentiment Analysis (Langchain): Uses OpenAI to analyze the sentiment of the extracted review text.
- Update sheet (Google Sheets): Appends or updates the processed review data, including extracted metadata and sentiment, to the Google Sheet.
- Sticky Notes: Provide guidance and hints for configuration.
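The middle of the node chain above (Extract, Split Out, Limit1, Get Google Sheets, If, Update sheet) can be approximated outside n8n as follows. This is a simplified sketch rather than the workflow's implementation: the "/reviews/" link prefix, the review_id column name, and the in-memory list standing in for the sheet are all assumptions.

```python
from html.parser import HTMLParser


class ReviewLinkParser(HTMLParser):
    """Collects hrefs that look like links to individual reviews."""

    def __init__(self):
        super().__init__()
        self.links = []

    def handle_starttag(self, tag, attrs):
        if tag != "a":
            return
        href = dict(attrs).get("href") or ""
        # Assumption: individual review links start with /reviews/.
        if href.startswith("/reviews/") and href not in self.links:
            self.links.append(href)


def extract_review_ids(html, limit=3):
    """Extract + Split Out + Limit1: review IDs from a listing page."""
    parser = ReviewLinkParser()
    parser.feed(html)
    ids = [href.rstrip("/").rsplit("/", 1)[-1] for href in parser.links]
    return ids[:limit]


def filter_new(review_ids, sheet_rows):
    """Get Google Sheets + If: keep only IDs not yet in the sheet."""
    existing = {row["review_id"] for row in sheet_rows}
    return [rid for rid in review_ids if rid not in existing]


def upsert(sheet_rows, record):
    """Update sheet: replace the row with the same ID, else append."""
    for i, row in enumerate(sheet_rows):
        if row["review_id"] == record["review_id"]:
            sheet_rows[i] = record
            return
    sheet_rows.append(record)
```

Checking the sheet before fetching each review page means already-processed reviews cost no extra HTTP requests or model calls.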
Workflow JSON
{
"id": "8f43efee-c2db-4f60-85c3-5ef62f17f5a5",
"name": "Scrape Trustpilot Reviews, Extract Structured Metadata with DeepSeek, Analyze Sentiment with OpenAI, and Store in Google Sheets",
"nodes": 9,
"category": "Marketing",
"status": "active",
"version": "1.0.0"
}
Note: This is a sample preview. The full workflow JSON contains node configurations, credentials placeholders, and execution logic.
About the Author
N8N_Community_Pick
Curator
Hand-picked high quality workflows from the global community.