Evolving Contexts: Applying Agentic Context Engineering for Self-Improving LLM Agents | Bogotá .

Members-Only

Recent Talks & Demos are for members only

Exclusive feed

You must be an AI Tinkerers active member to view these talks and demos.

October 31, 2025 · Bogotá

ACE: Self-Improving LLM Agents

A walkthrough of Agentic Context Engineering, showing how Generator, Reflector, and Curator iteratively refine context to counter brevity bias and boost LLM performance without retraining.

Overview
Tech stack
  • GPT-4o
    GPT-4o (omni) is OpenAI's flagship multimodal model: it delivers GPT-4 intelligence with native, real-time processing across text, audio, and vision.
    This is GPT-4o, OpenAI’s 'omni' model: a single neural network natively handling text, audio, and image inputs and outputs. It matches GPT-4 performance on English text and code, but surpasses it on non-English language, vision, and audio benchmarks. The speed is a major upgrade: it achieves human-level responsiveness in voice, with an average response time of 0.32 seconds (a significant jump from GPT-4’s 5.4 seconds). Developers get a 128K token context window and a model that is more cost-efficient than its predecessor, making high-intelligence, real-time applications viable.
  • Ada
    A high-level, statically typed language designed for real-time systems where safety and reliability are non-negotiable.
    Ada remains the gold standard for high-integrity software in aerospace, defense, and rail. Originally commissioned by the U.S. Department of Defense (MIL-STD-1815), it provides robust compile-time checks and strong typing to eliminate common runtime errors. Modern standards like Ada 2022 integrate seamlessly with the SPARK toolset for formal verification. Whether managing flight control systems for Boeing or securing communication protocols, Ada delivers predictable performance through its native support for tasking and deterministic memory management.
  • LangChain
    The open-source framework for building and deploying reliable, data-aware Large Language Model (LLM) applications.
    LangChain is the essential framework for engineering LLM-powered applications: it simplifies connecting models (like GPT-4 or Claude) to external data, computation, and APIs. The platform provides a modular set of components—Chains, Agents, Tools, and Memory—allowing developers to quickly build complex workflows like Retrieval-Augmented Generation (RAG) pipelines and sophisticated conversational agents. Its Python and JavaScript libraries, combined with LangChain Expression Language (LCEL), offer a standardized interface for rapid prototyping and moving applications to production with confidence.
  • LangGraph
    A low-level orchestration framework for building long-running, stateful, and cyclic multi-agent systems using a graph-based architecture.
    LangGraph is the specialized, low-level runtime for developing complex AI agents, extending the LangChain ecosystem to handle intricate, stateful workflows. It models the agent's logic as a directed graph: nodes represent actions (LLM calls, tool use), and conditional edges dictate the flow, enabling critical features like cycles (loops) for iterative reasoning. This graph-based approach ensures durable execution, allowing agents to persist through failures and resume operations. Key capabilities include comprehensive memory management via a shared state object and built-in human-in-the-loop functionality (interrupts) for external oversight. This robust framework is trusted by production teams at companies like Klarna and Replit for deploying scalable, resilient agent architectures.
  • Python
    Python: The high-level, general-purpose language built for readability, powering everything from web backends to advanced machine learning models.
    Python is the high-level, general-purpose language prioritizing clear, readable syntax (via significant indentation), ensuring rapid development for any team . Its ecosystem is massive: use it for robust web development with frameworks like Django and Flask, or leverage its power in data science with libraries such as Pandas and NumPy . The Python Package Index (PyPI) provides thousands of community-contributed modules, offering immediate solutions for tasks from network programming to GUI creation . The language is actively maintained by the Python Software Foundation (PSF), with the stable release currently at Python 3.14.0 (as of November 2025) .

Related projects