Radiology Lab - Medical Imaging Segmentation Lab | Raleigh .

Members-Only

Recent Talks & Demos are for members only

Exclusive feed

You must be an AI Tinkerers active member to view these talks and demos.

February 11, 2026 · Raleigh

Radiology Lab: Medical Imaging Segmentation

Explore creating medical imaging segmentation and 3D models with SAM3 and Gaussian Splatting. Learn about iterative ML model refinement and deployment in this mainstage presentation.

Video
Overview
Tech stack
  • Turborepo
    High-performance build system for JavaScript and TypeScript monorepos that scales with your codebase.
    Turborepo (maintained by Vercel) accelerates monorepo workflows by skipping work that is already finished. It uses a remote caching system to store build artifacts (locally and in the cloud) and executes tasks in parallel based on a directed acyclic graph (DAG). The result: build times drop from minutes to seconds. Whether you are managing five packages or 500, Turborepo handles the orchestration of scripts like build, test, and lint with zero-config telemetry. It works seamlessly with tools like pnpm and Yarn to ensure your CI/CD pipelines stay fast.
  • Segment Anything Model
    SAM is Meta AI's foundation model for promptable image segmentation, delivering robust, zero-shot object mask generation in real-time.
    Segment Anything Model (SAM) is a breakthrough foundation model from Meta AI, designed to democratize image segmentation. It operates on a promptable task: input a point, box, or text cue, and the model rapidly outputs a high-quality segmentation mask (amortized real-time speed). The architecture utilizes a powerful image encoder and a fast, lightweight prompt encoder/mask decoder. SAM was trained on the massive SA-1B dataset, featuring over 1.1 billion masks across 11 million licensed images, enabling strong zero-shot generalization across diverse visual domains and downstream tasks without requiring further fine-tuning.
  • Meta
    A global technology conglomerate building the metaverse: connecting billions across Facebook, Instagram, WhatsApp, and Quest VR hardware.
    Meta Platforms, Inc. (formerly Facebook) operates the world's largest social media ecosystem: this includes key products like Facebook, Instagram, and WhatsApp. The company’s core technology strategy centers on developing the metaverse: an interconnected digital ecosystem blending virtual and augmented reality (VR/AR). This future is driven by Reality Labs, which develops hardware (Meta Quest headsets) and software. The Family of Apps segment, however, continues to connect billions of users globally and generates the majority of Meta's substantial advertising revenue.
  • text
    Text technology (NLP) processes and interprets human language data, enabling applications like sentiment analysis, machine translation, and advanced content classification.
    We handle Natural Language Processing (NLP): the core technology for making computers understand text. This is a critical AI subfield, transforming raw text into actionable data through specific steps (tokenization, parsing, classification). Modern systems, leveraging large language models (LLMs) like GPT and BERT, now perform complex tasks: extracting key entities from thousands of legal documents, classifying customer feedback with 90%+ accuracy, and powering sophisticated conversational agents. We are building the engine that turns unstructured language into structured, high-value intelligence for any enterprise.
  • React
    React is an open-source JavaScript library for building dynamic user interfaces (UIs).
    React is a component-based JavaScript library, developed by Meta (Facebook), engineered for building fast, declarative UIs. It mandates a one-way data flow and utilizes a Virtual DOM mechanism to ensure efficient, predictable updates to the user interface. Developers construct complex UIs by composing small, encapsulated components; this architecture promotes code reusability and simplifies state management across large applications. The library employs JSX (a syntax extension) to integrate HTML-like markup directly within JavaScript logic, supporting development for both web (React DOM) and native mobile platforms (React Native).
  • Vite
    Vite is the next-generation frontend build tool: instant server start with native ES modules and lightning-fast Hot Module Replacement (HMR).
    Vite (French for 'quick') is a modern build tool created by Evan You (Vue.js's creator) that delivers a dramatically faster development experience. It operates in two major parts: a dev server that serves source code over native ES modules, eliminating the initial bundling step for instant startup; and a production build command that leverages Rollup and esbuild for highly optimized static assets. This architecture provides consistently fast HMR, even for large applications, and offers out-of-the-box support for TypeScript, JSX, and CSS via a universal plugin API.
  • Web app
    A Web app is server-hosted software accessed via a standard browser (Chrome, Firefox), utilizing core technologies like HTML5, CSS3, and JavaScript to deliver dynamic, cross-platform functionality.
    Web applications are dynamic, server-side programs delivered over secure HTTP/HTTPS, eliminating the need for local installation. This client-server model uses the browser for the presentation layer (HTML/CSS/JavaScript) and a backend (like Node.js or Python) for application logic and database interaction. This architecture ensures cross-platform compatibility (Windows, macOS, Android), which is why over 85% of modern businesses rely on them for operational efficiency (Source: W3C, 2020). They power critical Software as a Service (SaaS) platforms: Gmail, Salesforce, and Netflix are prime examples of this robust, scalable technology.
  • PyTorch
    PyTorch is the open-source machine learning framework: it provides a Python-first tensor library with strong GPU acceleration and a dynamic computation graph for building deep neural networks.
    PyTorch, developed by Meta AI, is a premier open-source deep learning framework favored in both research and production environments. Its core is a powerful tensor library (like NumPy) optimized for GPU acceleration, delivering 50x or greater speedups for complex computations. The key differentiator is its 'Pythonic' design and dynamic computation graph (eager execution), which allows for rapid prototyping and simplified debugging compared to static-graph frameworks. Leveraging its Autograd system for automatic differentiation, practitioners build and train models for computer vision and NLP; major companies like Tesla (Autopilot) and Microsoft utilize PyTorch for critical AI applications.
  • Docker
    Docker is the open-source platform that packages applications and dependencies into standardized, portable containers for consistent execution across any environment.
    Docker is the industry-standard containerization platform, enabling developers to build, ship, and run applications efficiently. It uses the Docker Engine (the core runtime) to create lightweight, isolated environments called containers: these units bundle an application’s code, libraries, and configuration. This self-contained approach guarantees consistency, eliminating the 'it works on my machine' problem across development, testing, and production environments (local workstations, cloud, or on-premises). Docker debuted in 2013 and now serves over 20 million developers monthly, simplifying complex workflows like CI/CD and microservices architecture by leveraging tools like Docker Hub for image sharing and Docker Compose for multi-container applications.
  • Three
    Three.js is the industry-standard JavaScript library for rendering hardware-accelerated 3D graphics in web browsers via WebGL.
    Created by Ricardo Cabello (Mr.doob) and maintained by a global contributor base, Three.js abstracts the complexities of WebGL into a manageable scene graph API. It enables developers to deploy immersive 3D environments (like NASA's Eyes on the Solar System) using standard components: cameras, lights, and meshes. The library supports glTF 2.0 for efficient asset loading and integrates with physics engines (Cannon.js) for real-time interaction. By handling the heavy lifting of GLSL shaders and matrix math, it delivers high-performance rendering on any device with a modern browser.
  • SAM3
    Meta’s unified foundation model for Promptable Concept Segmentation (PCS) that detects, masks, and tracks every instance of a visual concept across images and videos.
    SAM 3 unifies detection, segmentation, and tracking into a single 848M-parameter architecture for Promptable Concept Segmentation (PCS). This model moves beyond the single-object clicks of its predecessors to handle open-vocabulary tasks: you can now isolate every instance of a concept (like "yellow school bus" or "person with backpack") across images and video frames simultaneously. It cuts through the complexity of multi-model pipelines by integrating a DETR-style detector with SAM 2’s temporal memory. For teams working in spatial AI, the companion SAM 3D model provides a direct path from single RGB images to high-fidelity 3D reconstructions.
  • Konva
    Konva is a high-performance HTML5 Canvas framework that enables a declarative scene graph for complex 2D graphics.
    Konva bridges the gap between low-level canvas commands and modern application state. It treats shapes (Rects, Circles, and Paths) as discrete objects: you can attach event listeners, animate properties, and manage z-index without manual pixel manipulation. The engine optimizes rendering by redrawing only updated layers, maintaining 60 FPS even with thousands of objects. It is the go-to choice for building web-based image editors, interactive floor plans, and data visualizations. Official integrations for React and Vue make it a natural fit for component-driven architectures.
  • GLSL
    GLSL is a high-level, C-style shading language designed to execute code directly on the GPU for real-time hardware acceleration.
    GLSL (OpenGL Shading Language) provides developers with direct control over the graphics pipeline through specialized stages: vertex, fragment, and compute shaders. It utilizes a C-style syntax optimized for vector and matrix operations (such as vec4 and mat4) to calculate lighting, shadows, and 3D geometry transformations. Current standards like version 4.60 ensure cross-platform compatibility via the Khronos Group: maintaining high performance across diverse hardware. By offloading intensive mathematical computations to the GPU's parallel architecture, GLSL powers everything from AAA game engines to complex scientific visualizations.

Related projects