Members-Only
Recent Talks & Demos are for members only
You must be an AI Tinkerers active member to view these talks and demos.
Deepgram OpenAI ElevenLabs Production
A live technical walkthrough of building a production voice AI agent, detailing orchestration of Deepgram, OpenAI, and ElevenLabs with n8n and Supabase.
Live technical walkthrough of building RENATA, a production voice AI agent handling real property management queries.
I’ll show the actual implementation: how we orchestrate Retell AI webhooks with n8n workflows to process incoming calls, stream audio through Deepgram for real-time transcription, query Supabase vector stores for property-specific context, inject that context into OpenAI function calls, and stream responses back through ElevenLabs for natural voice output.
Code walkthrough includes:
n8n JavaScript nodes for parsing conversation state and building dynamic prompts
Supabase queries that retrieve property data based on conversation context
OpenAI function calling schema for actions (book reservation, escalate to human, check availability)
Retell AI custom LLM integration for handling interruptions and natural conversation flow
WhatsApp Business API webhook handling for message routing
Error handling and fallback logic when systems don’t respond
I’ll show the messy parts: how we handle race conditions in real-time voice, the hacky way we maintain conversation state across n8n executions, and the specific prompt engineering tricks that make the agent actually useful instead of generically polite.
Demo includes live code, actual n8n workflows, database schemas, and API calls.
- GPT-4GPT-4 is OpenAI’s large multimodal model: it processes both text and image inputs, delivering human-level performance on complex professional and academic benchmarks.This is OpenAI’s latest milestone in scaling deep learning: a large multimodal model accepting both text and image inputs. It demonstrates a significant capability leap over its predecessor, scoring in the top 10% on a simulated bar exam (GPT-3.5 scored in the bottom 10%). The model handles nuanced instructions and long-form content, supporting context windows up to 32,768 tokens (32K model). This capacity allows processing up to 25,000 words in a single, complex prompt. GPT-4 is engineered for enhanced reliability, steerability, and advanced reasoning across diverse tasks.
- DeepgramDeepgram is the end-to-end Voice AI platform: delivering real-time, highly accurate Speech-to-Text (STT), Text-to-Speech (TTS), and conversational Voice Agents via a developer-first API.Deepgram is your enterprise-grade Voice AI platform, built on a proprietary end-to-end deep learning network for unmatched speed and accuracy. Our core APIs (STT, TTS, and the unified Voice Agent API) handle everything from real-time transcription to synthesizing natural speech. Specifically, models like Nova-3 and Flux deliver 2-4x better accuracy on alphanumeric data than competitors, and batch transcription processes one hour of audio in less than 30 seconds (120x real-time). We support flexible deployment (public cloud, private cloud, or self-hosted) and offer advanced features like diarization (up to 10 speakers) and custom model training, ensuring your voice applications—from contact centers to conversational AI—are fast, precise, and scalable.
- ElevenLabsElevenLabs delivers emotionally rich, human-like AI voice synthesis: text-to-speech, professional voice cloning, and AI dubbing across 30+ languages.ElevenLabs is the premier AI voice platform (founded 2022 by Piotr Dąbkowski and Mati Staniszewski), leveraging deep learning for superior audio. Key offerings include the expressive Eleven v3 text-to-speech model, professional Voice Cloning from minimal audio, and AI Dubbing for translating content into 30+ languages while preserving the original voice. The low-latency API (e.g., Flash v2.5 at 75ms) powers diverse applications: audiobooks, video voiceovers, and conversational AI agents for over a million users.
- SupabaseThe open-source PostgreSQL development platform: a Firebase alternative for rapid backend deployment.Supabase is the Postgres development platform, providing a complete, open-source backend-as-a-service solution. It packages enterprise-grade tools like a full PostgreSQL database, a RESTful API (via PostgREST), and a GraphQL API (via pg_graphql) that are auto-generated from your schema. The platform includes a comprehensive suite of services: Auth (for user sign-ups and SSO), Storage (for files with S3 integration), Realtime (for database change subscriptions), and Edge Functions (for serverless logic). The core value proposition is clear: build fast with a powerful, scalable SQL database that offers 100% portability and robust security features like Row Level Security (RLS).
- n8nn8n (pronounced 'n-eight-n') is the fair-code workflow automation platform: it gives technical teams the power of custom code (JavaScript/Python) via a visual, node-based editor.This is n8n: a powerful, self-hosted workflow automation platform designed for developers and technical teams. It uniquely blends the speed of a no-code UI with the flexibility of custom code, allowing users to build complex, multi-step automations that other tools cannot handle. With 400+ integrations and native AI capabilities, n8n ensures you maintain full control over your data and deployment (on-prem or cloud). For example, you can build a workflow to automatically pull data from a private API, transform it with a custom Python script, and push the results to both Salesforce and a Slack channel, all from a single visual canvas.
Related projects
It's Not What You Say, Its How you Build It To Say It - Prosody For AI Voices
Bogotá
This talk examines how prosody—rhythm, stress, and intonation—improves AI voice naturalness, with practical techniques and demonstrations using meditation…
Engineering PerciBot: DDD + TDD for Agentic Voice AI
Bogotá
Learn how to build a payment‑negotiation voice assistant using LangGraph agents, ElevenLabs, domain‑driven design, and test‑driven development for…
Agentes de voz que modifican tu interface en tiempo real
Medellín
Build a real‑time, voice‑driven multi‑agent system using Google ADK and Gemini Live API, with live function calls that…
Agentes AI - Puesta en marcha
Bogotá
Learn how AI agents work and how to enhance language model functionality using open‑source tools and Google Vertex…
De los datos a la conversación: agentes de IA por voz para potenciar la interacción con el cliente
Manizales
La charla presenta agentes de IA por voz que integran PostgreSQL, Power BI, ChatGPT y otras tecnologías para…
Talking in Real Time: Voice Agents for Live Conversations
Miami
A walkthrough of building a low‑latency, customizable voice agent for real‑time meetings and call‑center use, including integration demos…