Coding agents and data science: building hill-climbing environments for LLMs | London

November 25, 2025 · London

Agents Hill-Climbing with Data Science

Dozens of coding agents attempt to reverse engineer a Spotify color assignment algorithm within a custom environment, showcasing data science thinking for agent improvement.

Tech stack
  • Claude Code
    Anthropic's agentic coding tool: run Claude directly in your terminal or IDE to turn complex, hours-long workflows into a single command.
    Claude Code is Anthropic's agentic coding assistant, designed for high-velocity development. It operates natively within your terminal, IDE (VS Code, JetBrains), or via a web interface, allowing you to delegate complex tasks like feature building, bug fixing, and codebase navigation. The agent plans, edits files, executes commands, and creates commits, maintaining awareness of your entire project structure. Anthropic engineers using Claude Code internally reported a 67% increase in productivity. It is available to Pro and Max plan users.
  • OpenCode
    OpenCode is the open-source AI coding agent (CLI tool), integrating LLMs like GPT-5 and Claude Sonnet 4 directly into the terminal for fast, context-aware development.
    OpenCode is the open-source AI coding agent, built for terminal-first developers who demand speed and privacy. It connects your local files, Git history, and a choice of LLMs (e.g., OpenAI's GPT-5 Nano, Anthropic's Claude Sonnet 4) to execute complex tasks directly from the command line. The tool bypasses IDE and browser dependencies, allowing developers to triage issues, fix errors, or implement features with commands like `opencode fix error in main.go`. With over 26,000 GitHub stars by October 2025, OpenCode delivers a secure, context-aware coding partner that keeps your code local and your workflow efficient.
  • Vercel AI Gateway
    The Vercel AI Gateway provides a unified API endpoint for accessing over 100 large language models (LLMs), including top providers like OpenAI and Anthropic.
    The gateway's centralized interface eliminates the overhead of managing multiple API keys and provider accounts, streamlining production AI workloads. Key features include intelligent failover for increased uptime, built-in observability for detailed usage and cost tracking, and automatic prompt caching. With the Bring Your Own Key (BYOK) option, tokens are billed at a 0% markup.
  • Cursor
    The AI-native code editor designed for high-velocity development through deep LLM integration.
    Cursor is a fork of VS Code that embeds AI directly into the development workflow while maintaining full extension compatibility. It leverages models like Claude 3.5 Sonnet and GPT-4o to power features such as Cmd+K for inline edits and Cmd+L for codebase-wide chat. By indexing local files, Cursor provides precise context for its predictive 'Tab' completions and multi-file 'Composer' mode. This setup allows engineers to move from high-level intent to functional code without leaving the editor or losing context.
  • TypeScript
    TypeScript is an open-source superset of JavaScript that adds static typing and compiles to clean, standards-based JavaScript.
    TypeScript is a high-level, open-source language developed by Microsoft. It is a superset of JavaScript that adds a static type system, enabling compile-time type checking that catches many errors before runtime (a critical benefit for large-scale applications). The TypeScript compiler (tsc) transpiles code into standards-based JavaScript for a configurable ECMAScript target, so the output runs in browsers and in hosts such as Node.js.
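
As a rough illustration of the compile-time checking described for TypeScript, here is a minimal sketch of a typed scoring helper of the kind a color-matching environment might use. The `Rgb` type, the `colorDistance` function, and the sample values are hypothetical names chosen for this example; they are not taken from the talk or from any Spotify API.

```typescript
// Hypothetical type for illustration only: an RGB color with numeric channels.
type Rgb = { r: number; g: number; b: number };

// Squared Euclidean distance between two colors; lower means a closer match.
function colorDistance(a: Rgb, b: Rgb): number {
  const dr = a.r - b.r;
  const dg = a.g - b.g;
  const db = a.b - b.b;
  return dr * dr + dg * dg + db * db;
}

// Made-up sample values: a reference color and one candidate guess.
const target: Rgb = { r: 30, g: 215, b: 96 };
const guess: Rgb = { r: 30, g: 200, b: 100 };

console.log(colorDistance(target, guess)); // prints 241

// A call such as colorDistance(target, "#1ed760") would be rejected by the
// compiler, because a string is not assignable to Rgb. The mistake is caught
// before the code ever runs.
```

Because the parameter types are declared, passing a hex string or an object with a missing channel fails type checking instead of silently producing `NaN` at runtime.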

Related projects