Voice AI Agent Architecture: Streaming Deepgram → OpenAI → ElevenLabs in Production | Bogotá .

Members-Only

Recent Talks & Demos are for members only

Exclusive feed

You must be an AI Tinkerers active member to view these talks and demos.

November 27, 2025 · Bogotá

Deepgram OpenAI ElevenLabs Production

A live technical walkthrough of building a production voice AI agent, detailing orchestration of Deepgram, OpenAI, and ElevenLabs with n8n and Supabase.

Overview
Tech stack
  • GPT-4
    GPT-4 is OpenAI’s large multimodal model: it processes both text and image inputs, delivering human-level performance on complex professional and academic benchmarks.
    This is OpenAI’s latest milestone in scaling deep learning: a large multimodal model accepting both text and image inputs. It demonstrates a significant capability leap over its predecessor, scoring in the top 10% on a simulated bar exam (GPT-3.5 scored in the bottom 10%). The model handles nuanced instructions and long-form content, supporting context windows up to 32,768 tokens (32K model). This capacity allows processing up to 25,000 words in a single, complex prompt. GPT-4 is engineered for enhanced reliability, steerability, and advanced reasoning across diverse tasks.
  • Deepgram
    Deepgram is the end-to-end Voice AI platform: delivering real-time, highly accurate Speech-to-Text (STT), Text-to-Speech (TTS), and conversational Voice Agents via a developer-first API.
    Deepgram is your enterprise-grade Voice AI platform, built on a proprietary end-to-end deep learning network for unmatched speed and accuracy. Our core APIs (STT, TTS, and the unified Voice Agent API) handle everything from real-time transcription to synthesizing natural speech. Specifically, models like Nova-3 and Flux deliver 2-4x better accuracy on alphanumeric data than competitors, and batch transcription processes one hour of audio in less than 30 seconds (120x real-time). We support flexible deployment (public cloud, private cloud, or self-hosted) and offer advanced features like diarization (up to 10 speakers) and custom model training, ensuring your voice applications—from contact centers to conversational AI—are fast, precise, and scalable.
  • ElevenLabs
    ElevenLabs delivers emotionally rich, human-like AI voice synthesis: text-to-speech, professional voice cloning, and AI dubbing across 30+ languages.
    ElevenLabs is the premier AI voice platform (founded 2022 by Piotr Dąbkowski and Mati Staniszewski), leveraging deep learning for superior audio. Key offerings include the expressive Eleven v3 text-to-speech model, professional Voice Cloning from minimal audio, and AI Dubbing for translating content into 30+ languages while preserving the original voice. The low-latency API (e.g., Flash v2.5 at 75ms) powers diverse applications: audiobooks, video voiceovers, and conversational AI agents for over a million users.
  • Supabase
    The open-source PostgreSQL development platform: a Firebase alternative for rapid backend deployment.
    Supabase is the Postgres development platform, providing a complete, open-source backend-as-a-service solution. It packages enterprise-grade tools like a full PostgreSQL database, a RESTful API (via PostgREST), and a GraphQL API (via pg_graphql) that are auto-generated from your schema. The platform includes a comprehensive suite of services: Auth (for user sign-ups and SSO), Storage (for files with S3 integration), Realtime (for database change subscriptions), and Edge Functions (for serverless logic). The core value proposition is clear: build fast with a powerful, scalable SQL database that offers 100% portability and robust security features like Row Level Security (RLS).
  • n8n
    n8n (pronounced 'n-eight-n') is the fair-code workflow automation platform: it gives technical teams the power of custom code (JavaScript/Python) via a visual, node-based editor.
    This is n8n: a powerful, self-hosted workflow automation platform designed for developers and technical teams. It uniquely blends the speed of a no-code UI with the flexibility of custom code, allowing users to build complex, multi-step automations that other tools cannot handle. With 400+ integrations and native AI capabilities, n8n ensures you maintain full control over your data and deployment (on-prem or cloud). For example, you can build a workflow to automatically pull data from a private API, transform it with a custom Python script, and push the results to both Salesforce and a Slack channel, all from a single visual canvas.

Related projects