Violex: Learning Violin with AI
Demoing Violex, a project using AI/ML to map music notation onto the violin, providing feedback for easier, engaging learning.
The violin is a very hard instrument to play. I’ll present my ongoing project, Violex, which is intended to help with reading sheet music and mapping it onto the instrument in a fun, intuitive, and engaging way. The system also records your playing and generates AI/ML-based feedback on how well you are performing. I’ll showcase my progress, the technical hurdles, and the feedback mechanisms needed to build AI-powered hardware like this.
Violex analyzes violin frequencies and durations using smart audio analysis for instant visual feedback.
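The talk description does not say how Violex performs its audio analysis, so the following is only a minimal sketch of one common way to turn a detected frequency into feedback. It assumes equal temperament with A4 = 440 Hz, standard violin tuning (G3, D4, A4, E5), and a first-position reach of up to 7 semitones above each open string. The names `analyze_pitch` and `string_options` are hypothetical and not taken from the project.

```python
import math

A4_HZ = 440.0  # assumption: concert-pitch reference
NOTE_NAMES = ["C", "C#", "D", "D#", "E", "F", "F#", "G", "G#", "A", "A#", "B"]

# Open strings of a standard-tuned violin as MIDI note numbers (G3, D4, A4, E5).
OPEN_STRINGS = {"G": 55, "D": 62, "A": 69, "E": 76}

# Assumption: first position reaches at most a perfect fifth above the open string.
MAX_SEMITONES_ABOVE_OPEN = 7


def analyze_pitch(freq_hz):
    """Return (note name, MIDI number, cents deviation) for a frequency in Hz."""
    if freq_hz <= 0:
        raise ValueError("frequency must be positive")
    # Fractional MIDI number: 12 semitones per octave, anchored at A4 = 69.
    midi_float = 69 + 12 * math.log2(freq_hz / A4_HZ)
    midi = round(midi_float)
    # Positive cents means the played pitch is sharp, negative means flat.
    cents = (midi_float - midi) * 100
    name = f"{NOTE_NAMES[midi % 12]}{midi // 12 - 1}"
    return name, midi, cents


def string_options(midi):
    """List (string, semitones above the open string) pairs that can reach the note."""
    return [
        (string, midi - open_midi)
        for string, open_midi in OPEN_STRINGS.items()
        if 0 <= midi - open_midi <= MAX_SEMITONES_ABOVE_OPEN
    ]


if __name__ == "__main__":
    name, midi, cents = analyze_pitch(446.0)  # a slightly sharp A4
    print(f"{name}: {cents:+.1f} cents, playable on {string_options(midi)}")
```

A visual feedback layer could then color a note by the size of the cents deviation. Estimating the frequency itself from recorded audio (for example by autocorrelation or an FFT peak) is a separate step that this sketch leaves out.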
- GPT-4: GPT-4 is OpenAI’s large multimodal model: it processes both text and image inputs, delivering human-level performance on complex professional and academic benchmarks. This is OpenAI’s latest milestone in scaling deep learning: a large multimodal model accepting both text and image inputs. It demonstrates a significant capability leap over its predecessor, scoring in the top 10% on a simulated bar exam (GPT-3.5 scored in the bottom 10%). The model handles nuanced instructions and long-form content, supporting context windows up to 32,768 tokens (32K model). This capacity allows processing up to 25,000 words in a single, complex prompt. GPT-4 is engineered for enhanced reliability, steerability, and advanced reasoning across diverse tasks.
- Claude-3: Claude-3 is Anthropic's state-of-the-art multimodal model family (Opus, Sonnet, Haiku), setting new industry benchmarks for intelligence, speed, and vision capabilities. Claude-3, developed by Anthropic, is a powerful family of three generative AI models: Opus, Sonnet, and Haiku. Opus, the flagship, excels in complex reasoning, outperforming peers on key benchmarks (MMLU, GPQA) and supporting a 200,000-token context window. Sonnet offers an optimal balance for enterprise workloads, running 2x faster than its predecessor, Claude 2.1. Haiku is the fastest and most cost-effective option, capable of processing a 10,000-token research paper (including charts) in under three seconds. All three models are multimodal, featuring strong vision capabilities for analyzing charts, diagrams, and PDFs alongside text, enabling advanced data extraction and analysis.
- Llama-2: Llama 2 is Meta AI's powerful, openly accessible family of large language models (LLMs), featuring models from 7B to 70B parameters for research and commercial applications. Llama 2 is Meta AI's next-generation LLM family, released for free research and commercial use. The collection includes both pre-trained foundation models and instruction-tuned 'Chat' variants, scaling from 7 billion (7B) up to 70 billion (70B) parameters. Key technical upgrades over Llama 1 involve training on 2 trillion tokens (40% more data) and doubling the context length to 4096 tokens. The Llama-2-chat models were rigorously aligned using Reinforcement Learning from Human Feedback (RLHF), positioning them as a top-tier, openly available option for developers building advanced generative AI solutions.
- LangChain: The open-source framework for building and deploying reliable, data-aware Large Language Model (LLM) applications. LangChain is the essential framework for engineering LLM-powered applications: it simplifies connecting models (like GPT-4 or Claude) to external data, computation, and APIs. The platform provides a modular set of components (Chains, Agents, Tools, and Memory), allowing developers to quickly build complex workflows like Retrieval-Augmented Generation (RAG) pipelines and sophisticated conversational agents. Its Python and JavaScript libraries, combined with LangChain Expression Language (LCEL), offer a standardized interface for rapid prototyping and moving applications to production with confidence.
- PyTorch: PyTorch is the open-source machine learning framework: it provides a Python-first tensor library with strong GPU acceleration and a dynamic computation graph for building deep neural networks. PyTorch, developed by Meta AI, is a premier open-source deep learning framework favored in both research and production environments. Its core is a powerful tensor library (like NumPy) optimized for GPU acceleration, delivering 50x or greater speedups for complex computations. The key differentiator is its 'Pythonic' design and dynamic computation graph (eager execution), which allows for rapid prototyping and simplified debugging compared to static-graph frameworks. Leveraging its Autograd system for automatic differentiation, practitioners build and train models for computer vision and NLP; major companies like Tesla (Autopilot) and Microsoft utilize PyTorch for critical AI applications.