Effortless LLM Comparison: OpenAI, OpenRouter & Google Sheets
detail.loadingPreview
This n8n workflow allows you to easily evaluate and compare the outputs of multiple Large Language Models (LLMs) side-by-side within a chat interface. It automates logging responses to Google Sheets, providing a structured way to analyze and select the best-performing LLM for your specific application.
About This Workflow
Accelerate your AI development by gaining clear insights into LLM performance with this powerful n8n workflow. It enables you to simultaneously test multiple language models, like those from OpenAI or via OpenRouter, against the same user input. Each model processes the prompt independently, maintaining its own conversational memory for a fair comparison. All responses, user inputs, and contexts are meticulously logged into a Google Sheet, providing a central hub for manual or automated evaluation. This setup is ideal for determining which LLM best meets your specific use case before deployment, saving development time and ensuring optimal agent behavior.
Key Features
- Side-by-Side LLM Comparison: Evaluate the outputs of two (extendable) LLMs directly in your chat interface for immediate feedback.
- Independent Conversational Memory: Each LLM maintains its own distinct chat history, ensuring accurate and isolated comparisons.
- Automated Data Logging to Google Sheets: Capture all prompts, model responses, and session data in a structured Google Sheet for comprehensive analysis and collaboration.
- Flexible Model Integration: Easily switch between different LLM providers like OpenAI, OpenRouter, or Google Vertex AI, or compare various models from the same provider.
- Customizable AI Agent Logic: Configure system prompts and tools within the AI Agent node to tailor model behavior to your exact application needs.
How To Use
- Duplicate Google Sheet Template: Access the provided Google Sheets template and make a copy (File > Make a Copy).
- Configure LLM Providers: In the "OpenRouter Chat Model" node (or similar LLM nodes if you've swapped them), set up your credentials for the desired LLM provider (e.g., OpenRouter API key).
- Define Models for Comparison: Locate the "Define Models to Compare" sticky note and update the
modelsarray within the associated data preparation node with the full IDs of the LLMs you wish to compare (e.g.,["openai/gpt-4.1", "mistralai/mistral-large"]). - Customize AI Agent Settings: Open the "AI Agent" node to define your System Prompt and Tools to suit the specific use case and context of your AI application.
- Activate and Chat: Activate the workflow. Send messages to the chat interface to trigger both models and observe their responses side-by-side, while their interactions are automatically logged to your Google Sheet.
Apps Used
Workflow JSON
{
"id": "c035397d-ef56-43a3-90f9-8c55f0a70af3",
"name": "Effortless LLM Comparison: OpenAI, OpenRouter & Google Sheets",
"nodes": 10,
"category": "Operations",
"status": "active",
"version": "1.0.0"
}Note: This is a sample preview. The full workflow JSON contains node configurations, credentials placeholders, and execution logic.
Get This Workflow
ID: c035397d-ef56...
About the Author
AI_Workflow_Bot
LLM Specialist
Building complex chains with OpenAI, Claude, and LangChain.
Statistics
Related Workflows
Discover more workflows you might like
Google Sheets to Icypeas: Automated Bulk Domain Scanning
This workflow streamlines the process of performing bulk domain scans by integrating your Google Sheets data directly with the Icypeas platform. Automate the submission of company names from your spreadsheet to Icypeas for comprehensive domain information, saving valuable time and effort.
Instant WooCommerce Order Notifications via Telegram
When a new order is placed on your WooCommerce store, instantly receive detailed notifications directly to your Telegram chat. Stay on top of your e-commerce operations with real-time alerts, including order specifics and a direct link to view the order.
On-Demand Microsoft SQL Query Execution
This workflow allows you to manually trigger and execute any SQL query against your Microsoft SQL Server database. Perfect for ad-hoc data lookups, administrative tasks, or quick tests, giving you direct control over your database operations.