Members-Only
Recent Talks & Demos are for members only
You must be an AI Tinkerers active member to view these talks and demos.
AI Ops Circuit Breaker Fallback
See a live demo of a multi-agent system using a Circuit Breaker pattern to maintain uptime during AI model failures via automatic fallback.
A live 5-minute demonstration of production AI reliability patterns in action.
I’ll show how a multi-agent system (Router Agent, Health Monitor, and Fallback Agent) works together to achieve 99%+ uptime even when AI models hallucinate or fail. The demo features:
• Real-time hallucination detection using confidence scoring and consistency checks
• Circuit Breaker pattern implementation with three states (CLOSED, OPEN, HALF_OPEN)
• Automatic failover to a lighter, faster backup model (Gemini Flash) when the primary model (Gemini Pro) becomes unreliable
• Live metrics dashboard showing agent communication, system health, and graceful degradation
Attendees will watch the circuit breaker open after detecting 3 hallucinations, see the system automatically switch to fallback mode, and observe how the application continues serving requests without downtime. All code is Python + FastAPI with real-time WebSocket updates, and runs 100% locally using Gemini’s free API tier.
- FastAPIFastAPI is a modern, high-performance Python web framework for building APIs with automatic OpenAPI documentation.FastAPI is a robust, high-speed Python web framework: it is built on Starlette (for async capabilities) and Pydantic (for data validation and serialization). Leveraging standard Python 3.8+ type hints, the framework automatically generates interactive API documentation (Swagger UI/ReDoc) and enforces data validation, effectively reducing developer-induced errors by an estimated 40%. This architecture delivers performance on par with Node.js and Go, significantly increasing feature development speed (up to 300% faster). It is production-ready, fully supporting OpenAPI and JSON Schema standards for all API specifications.
- PythonPython: The high-level, general-purpose language built for readability, powering everything from web backends to advanced machine learning models.Python is the high-level, general-purpose language prioritizing clear, readable syntax (via significant indentation), ensuring rapid development for any team . Its ecosystem is massive: use it for robust web development with frameworks like Django and Flask, or leverage its power in data science with libraries such as Pandas and NumPy . The Python Package Index (PyPI) provides thousands of community-contributed modules, offering immediate solutions for tasks from network programming to GUI creation . The language is actively maintained by the Python Software Foundation (PSF), with the stable release currently at Python 3.14.0 (as of November 2025) .
- GeminiGoogle's natively multimodal AI model: understands and operates across text, code, audio, image, and video.Gemini is Google's most capable and general AI model, engineered from the ground up to be natively multimodal: it seamlessly understands and combines information across text, code, audio, image, and video inputs. The technology is optimized for flexibility, running efficiently on everything from data centers to mobile devices. It is deployed in three key sizes: Ultra (for highly complex tasks), Pro (for broad scaling), and Nano (for efficient on-device tasks). Developers access this power via the Gemini API to build next-generation applications.
- google-generativeaiThe unified Google Gen AI SDK: Access Gemini, Imagen, and Veo models through a single, stable API for multimodal application development.This technology provides the official Google Gen AI SDK, a unified interface for integrating Google’s advanced foundation models into applications. Developers leverage the SDK—available for Python, JavaScript, Go, and Java—to access the Gemini API (including models like Gemini 2.5 Pro and Flash) and specialized services like Imagen for image generation and Veo for video creation. It enables powerful multimodal capabilities, supporting use cases from complex reasoning and function calling to high-quality content generation. The platform offers a free tier and seamless migration paths between the core Gemini Developer API and the enterprise-ready Vertex AI platform.
- WebSocketsSecures a persistent, full-duplex TCP connection via a single HTTP handshake, delivering low-latency, bidirectional data streaming for real-time applications.WebSockets establishes a continuous, two-way communication channel: a critical shift from the request/response cycle of HTTP/1.1. The connection initiates with an HTTP `Upgrade` handshake, switching the protocol to `wss` (secure) on port 443. This persistent link minimizes network overhead, eliminating the need for inefficient HTTP polling. The protocol, standardized as IETF RFC 6455 in 2011, ensures near-instant data transfer: vital for high-performance use cases (e.g., live chat, collaborative editing, financial market data).
Related projects
MonoMorph: An Agentic tool for refactoring monoliths into microservices
Montreal
See how an agentic system refactors monoliths to microservices using structured code generation, template prompts, and an autonomous…
Nandayo - AI Driven Support Agent
Montreal
Learn how Nandayo, an AI-driven support agent, integrates with any infrastructure to automatically monitor, triage, and resolve routine…
Demo: Manus — The AI Agent That Thinks Before It Acts
Cincinnati
A live demonstration of Manis, an AI agent that visualizes its reasoning steps, showing how it plans and…
No Data for the Witch’s Cauldron: Using Safe, Open-Source AI Chats systems
Montreal
Learn how to assemble an open‑source chat system with OpenWeb UI, LiteLLM, Azure/Bedrock models, internal RAG, voice, image,…
Building Agentic AI for Intelligent Facility Operations
Toronto
Demonstration of an autonomous AI agent that ingests live building sensor data, identifies energy usage patterns, and automatically…
Vital AI Agent Ecosystem + Chat.ai
New York City
Demonstration of an open standard and open-source Vital AI Agent Ecosystem for deploying agents, enabling inter-agent communication, collaboration,…