AI Voice Chat Workflow with Webhook, OpenAI, Gemini, and ElevenLabs
detail.loadingPreview
Automate voice conversations using a webhook. This workflow transcribes speech with OpenAI, processes it with Google Gemini, and synthesizes a response with ElevenLabs, maintaining chat context.
🚀Ready to Deploy This Workflow?
About This Workflow
Overview
This n8n workflow enables real-time AI voice chat functionality. It leverages a webhook to receive audio input, converts speech to text using OpenAI's Whisper model, and then processes the text through a Google Gemini language model to generate a response. To ensure conversational coherence, memory management nodes (Get Chat, Insert Chat, Window Buffer Memory) are used to maintain chat history. Finally, the AI's text response is converted back into speech using ElevenLabs.io and sent back to the user via the webhook response.
This workflow solves the problem of integrating voice interaction with AI language models within an automation platform, offering a hands-free way to communicate with an AI.
Key Features
- Real-time voice input via webhook.
- Speech-to-text transcription using OpenAI.
- Conversational AI response generation with Google Gemini.
- Contextual memory management for natural dialogue.
- Text-to-speech synthesis with ElevenLabs.
- Seamless integration of multiple AI services.
How To Use
- Set up the
Webhooknode to receive audio data. - Configure the
OpenAI - Speech to Textnode with your OpenAI API credentials to transcribe the incoming audio. - Connect the
Get ChatandInsert Chatnodes with theWindow Buffer Memorynode to manage conversation context. - Integrate the
Google Gemini Chat Modelnode for AI response generation, ensuring theBasic LLM Chainnode includes the conversation history. - Set up the
ElevenLabs - Generate Audionode with your ElevenLabs API key and voice ID to convert the AI's text response into speech. - Use the
Respond to Webhooknode to send the synthesized audio back as a response.
Apps Used
Workflow JSON
{
"id": "d986b75a-4609-44a7-91c5-8b20113bbf42",
"name": "AI Voice Chat Workflow with Webhook, OpenAI, Gemini, and ElevenLabs",
"nodes": 0,
"category": "AI & Machine Learning",
"status": "active",
"version": "1.0.0"
}Note: This is a sample preview. The full workflow JSON contains node configurations, credentials placeholders, and execution logic.
Get This Workflow
ID: d986b75a-4609...
About the Author
Free n8n Workflows Official
System Admin
The official repository for verified enterprise-grade workflows.
Statistics
Verification Info
Related Workflows
Discover more workflows you might like
Visa Requirement Checker
A workflow to check visa requirements based on user input, leveraging Langchain, Cohere embeddings, Weaviate vector store, and Anthropic LLM.
AI Assistant for Structured Metadata Generation
Automates the generation of structured metadata in English and Chinese using AI, leveraging communication platforms and various data sources.
OpenAI Text-to-Speech Workflow
Generate audio from text using OpenAI's TTS API.