AI All the Way Down: Building a Production Fuel Pricing System with 5 AI Agents | St Louis .

Members-Only

Recent Talks & Demos are for members only

Exclusive feed

You must be an AI Tinkerers active member to view these talks and demos.

February 04, 2026 · St Louis

AI Fuel Pricing Production System

Learn how to build a production fuel pricing system using five AI agents. This talk covers real-world implementation, multi-LLM strategies, self-healing architecture, and cost optimization for AI in production.

Overview
Links
Tech stack
  • Grok3
    xAI's frontier model trained on the 100,000-GPU Colossus cluster to dominate reasoning and real-time data synthesis.
    Grok3 is the latest flagship LLM from Elon Musk's xAI (developed at the Memphis Supercluster using over 100,000 Nvidia H100s). It targets top-tier performance on the HumanEval coding benchmark and MATH datasets to surpass current industry leaders like GPT-4o. By utilizing live data streams from the X platform, Grok3 provides immediate context on global events while maintaining a high-efficiency inference architecture for rapid-fire responses.
  • Claude-3
    Claude-3 is Anthropic's state-of-the-art multimodal model family (Opus, Sonnet, Haiku), setting new industry benchmarks for intelligence, speed, and vision capabilities.
    Claude-3, developed by Anthropic, is a powerful family of three generative AI models: Opus, Sonnet, and Haiku. Opus, the flagship, excels in complex reasoning, outperforming peers on key benchmarks (MMLU, GPQA) and supporting a 200,000-token context window. Sonnet offers an optimal balance for enterprise workloads, delivering performance that is 2x faster than its predecessor, Claude 2.1. Haiku is the fastest and most cost-effective option, capable of processing a 10,000-token research paper (including charts) in under three seconds. All three models are multimodal, featuring strong vision capabilities for analyzing charts, diagrams, and PDFs alongside text, enabling advanced data extraction and analysis.
  • Sonnet
    Sonnet is Anthropic's powerful, mid-tier AI model, balancing frontier intelligence with high-speed, cost-efficient performance for production-scale deployments.
    Sonnet (currently Claude Sonnet 4.5) is Anthropic’s versatile model, optimized for complex agentic workflows and coding tasks. It delivers state-of-the-art performance, achieving 77.2% on the SWE-bench Verified coding benchmark (cite: 2.2, 2.4). The model is engineered for high-volume, real-time applications like customer support automation and financial analysis, supporting a 200K token context window (cite: 2.8). Pricing is set for efficiency: $3 per million input tokens (cite: 2.8). This makes Sonnet the recommended choice for developers needing top-tier reasoning and coding capability at a practical, scalable cost.
  • Anthropic
    Anthropic is a frontier AI safety and research company, developing the Claude family of large language models (LLMs) via its Constitutional AI framework.
    Anthropic is an AI safety and research company, founded in 2021 by former OpenAI executives Dario and Daniela Amodei, and structured as a Public Benefit Corporation (PBC) . The core mission is building reliable, steerable AI systems, with a focus on interpretability and long-term alignment . Its flagship product is the Claude family of LLMs, which are highly capable models designed for complex reasoning and coding tasks . A key technical innovation is Constitutional AI (CAI), a training method that aligns the models with a set of ethical principles to ensure helpful, harmless, and honest outputs . The company has secured significant backing, including up to $4 billion from Amazon and a $2 billion commitment from Google .
  • Claude Code
    Anthropic's agentic coding tool: Unleash Claude's raw power directly in your terminal or IDE to turn complex, hours-long workflows into a single command.
    Claude Code is Anthropic’s powerful agentic coding assistant, designed for high-velocity development. It operates natively within your terminal, IDE (VS Code, JetBrains), or via a web interface, allowing you to delegate complex tasks like feature building, bug fixing, and codebase navigation. The agent plans, edits files, executes commands, and creates commits, maintaining awareness of your entire project structure. Internally, Anthropic engineers using Claude Code reported a 67% increase in productivity, demonstrating its capacity to deliver significant gains for Pro and Max plan users.
  • n8n
    n8n (pronounced 'n-eight-n') is the fair-code workflow automation platform: it gives technical teams the power of custom code (JavaScript/Python) via a visual, node-based editor.
    This is n8n: a powerful, self-hosted workflow automation platform designed for developers and technical teams. It uniquely blends the speed of a no-code UI with the flexibility of custom code, allowing users to build complex, multi-step automations that other tools cannot handle. With 400+ integrations and native AI capabilities, n8n ensures you maintain full control over your data and deployment (on-prem or cloud). For example, you can build a workflow to automatically pull data from a private API, transform it with a custom Python script, and push the results to both Salesforce and a Slack channel, all from a single visual canvas.
  • Workflow Automation
    Workflow Automation uses rule-based software to orchestrate and execute repetitive, multi-step business processes: it eliminates manual handoffs and human error.
    This technology streamlines operations across all departments (HR, IT, Sales) by transforming manual, document-heavy tasks into intelligent, automated flows. It routes approvals, syncs data between disparate systems, and manages process dependencies without human intervention. The result is a significant boost to efficiency and compliance: Deloitte reports a 22% reduction in task completion time for businesses leveraging automation. This frees up your team to focus on high-value, strategic work, ensuring faster, more accurate outcomes every time.
  • AI agent
    AI agents are autonomous software systems (driven by LLMs) that plan, reason, and execute complex, multi-step tasks without continuous human oversight.
    This technology represents the next major leap in AI: autonomous execution. Unlike basic chatbots, agents use a Perceive-Decide-Act-Learn cycle to achieve high-level goals. They integrate tools (APIs, databases, web search) to perform complex workflows like IT automation, software design, and end-to-end lead generation. Platforms like AutoGPT and LangChain enable this development, moving AI from an assistant role to a proactive, digital worker. The market reflects this impact: it hit $7.6 billion in 2025 and is projected for 49.6% annual growth through 2033.
  • Grok 3
    Grok-3 is xAI's flagship large language model (LLM), delivering 10x the computational power of its predecessor for superior real-time reasoning and problem-solving (e.g., scoring 93.3% on the 2025 AIME math competition).
    This is Grok-3: xAI's third-generation LLM, engineered for frontier-level performance. The model leverages the Colossus Supercomputer (200,000 NVIDIA H100 GPUs) to achieve 1.5 petaflops of processing power and a 10x boost in computational efficiency over Grok-2. Key features include DeepSearch for real-time web analysis and 'Think Mode' for advanced, multi-step reasoning on complex queries. Grok-3 is integrated into the X platform for Premium+ subscribers and is designed to lead in technical benchmarks, offering a powerful, context-aware, and multimodal AI solution.

Related projects