Members-Only
Recent Talks & Demos are for members only
You must be an AI Tinkerers active member to view these talks and demos.
Apple SHARP: Web and VR
Learn to convert Apple's SHARP ML for 2D to 3D scenes into browser and VR applications, with added OpenGL interactivity for virtual hands.
Apple’s SHARP project uses gaussian splatting to convert regular 2D images into full 3D scenes much faster than previous models. They released the model as pytorch, but i will show how to convert it to run in a browser or in VR on a VisionPro, and use opengl techniques to extend interactivity
VisionOS CoreML Gaussian Splatting adds shader jiggle physics to 3D photo reconstructions.
- VisionOSVisionOS is Apple's spatial computing operating system: it seamlessly blends digital content with the physical world, controlled by a revolutionary three-input system (eyes, hands, and voice).VisionOS is the foundational operating system for the Apple Vision Pro headset, establishing Apple’s first platform for spatial computing. Unveiled at WWDC23 and released on February 2, 2024, it provides an infinite canvas for applications, breaking free from traditional display constraints. Users navigate the 3D interface using natural, hands-free input: simply look at an element to select it, pinch two fingers to click, and use voice commands with Siri. The platform is built on familiar frameworks—like SwiftUI, RealityKit, and ARKit—allowing developers to create both 2D windows and fully immersive, volumetric experiences that respond to the user’s surroundings and lighting.
- ARKitARKit is Apple's augmented reality framework: it integrates device motion tracking and scene understanding to build immersive AR experiences on iOS, iPadOS, and visionOS.ARKit is the core framework for developing augmented reality applications across Apple's platforms, leveraging the device's camera and motion sensors for precise world tracking (6 degrees of freedom). It provides essential capabilities: plane detection (identifying horizontal and vertical surfaces), light estimation for realistic rendering, and advanced features like Location Anchors and Motion Capture (tracking human body movement). Since its debut with iOS 11, developers have used ARKit to power high-fidelity applications, from virtual furniture placement (e.g., IKEA Place) to interactive AR games, reaching hundreds of millions of users globally.
- WebGLWebGL is the JavaScript API for rendering high-performance, interactive 2D and 3D graphics directly in any compatible web browser: no plugins required.WebGL is a cross-platform, royalty-free open web standard managed by the Khronos Group. It serves as a low-level 3D graphics API, exposing the functionality of OpenGL ES to ECMAScript (JavaScript) via the HTML5 canvas element. This design enables direct GPU-accelerated rendering, delivering complex visuals like games and data visualizations with native speed. Specifically, WebGL 1.0 aligns with the OpenGL ES 2.0 feature set, while WebGL 2.0 implements the OpenGL ES 3.0 API, ensuring broad support across major browser vendors (Apple, Google, Microsoft, Mozilla).
- PyTorchPyTorch is the open-source machine learning framework: it provides a Python-first tensor library with strong GPU acceleration and a dynamic computation graph for building deep neural networks.PyTorch, developed by Meta AI, is a premier open-source deep learning framework favored in both research and production environments. Its core is a powerful tensor library (like NumPy) optimized for GPU acceleration, delivering 50x or greater speedups for complex computations. The key differentiator is its 'Pythonic' design and dynamic computation graph (eager execution), which allows for rapid prototyping and simplified debugging compared to static-graph frameworks. Leveraging its Autograd system for automatic differentiation, practitioners build and train models for computer vision and NLP; major companies like Tesla (Autopilot) and Microsoft utilize PyTorch for critical AI applications.
- ONNXONNX (Open Neural Network Exchange) is an open-source format: it standardizes machine learning models, ensuring interoperability across all major frameworks and deployment hardware.ONNX delivers critical model portability. It defines a standardized computation graph and operator set, allowing developers to train a model in one framework (e.g., PyTorch or TensorFlow) and deploy it seamlessly using a different runtime. This eliminates framework lock-in and optimizes production performance. Founded in 2017 by key industry players (AWS, Microsoft, Facebook), ONNX now boasts contributions from companies like NVIDIA, Intel, and Qualcomm. The high-performance ONNX Runtime, for example, powers AI inference across major Microsoft products, including Windows, Office, and Azure Cognitive Services, demonstrating its enterprise-grade efficiency and cross-platform capability (cloud, edge, mobile).
- PythonPython: The high-level, general-purpose language built for readability, powering everything from web backends to advanced machine learning models.Python is the high-level, general-purpose language prioritizing clear, readable syntax (via significant indentation), ensuring rapid development for any team . Its ecosystem is massive: use it for robust web development with frameworks like Django and Flask, or leverage its power in data science with libraries such as Pandas and NumPy . The Python Package Index (PyPI) provides thousands of community-contributed modules, offering immediate solutions for tasks from network programming to GUI creation . The language is actively maintained by the Python Software Foundation (PSF), with the stable release currently at Python 3.14.0 (as of November 2025) .
- metal shadersMetal Shading Language (MSL) is a C++14-based language designed to program high-performance graphics and data-parallel compute kernels on Apple GPUs.MSL provides developers with low-overhead access to the Apple Silicon GPU architecture. It leverages a unified memory model to eliminate redundant data copies between the CPU and GPU. By using C++14 syntax, it supports advanced features like operator overloading, templates, and constants for fine-grained control over vertex, fragment, and compute pipelines. This tight integration with the Metal framework allows for sub-millisecond command submission and peak hardware utilization in applications ranging from Final Cut Pro rendering to AAA gaming titles like Resident Evil Village.
Related projects
Think Outside the "Blocks": You Don’t Need a PhD to Solve Novel Problems in AI
Raleigh
Learn how a non-expert used LLMs to build a novel tool for generating blocky 3D models for games,…
Learning AI with AI
Denver
Learn how to use AI tools to study programming concepts, develop code, and launch a functional iOS application…
AI gives you wings
Seattle
Learn how to build fully functional prototypes using V0, Lovable, and Cursor without coding, covering app architecture, cost,…
From Local Prototyping to Distributed Clusters: An Open Source Platform for ML Research Teams
Toronto
See a demo scaling ML training from a local notebook to a GPU cluster, covering checkpoint recovery, hyperparameter…
Transformer Lab: Training LLMs and Diffusion Models Locally
Calgary
Live demo of Transformer Lab: fine‑tune large language models and run diffusion evaluations locally on a MacBook Air,…
Transparent Trust: What If AI Showed Its Work?
Raleigh
See how AI works with a prompt-first system. Outputs are inspectable and debuggable, building trust through transparency and…