Members-Only
Recent Talks & Demos are for members only
You must be an AI Tinkerers active member to view these talks and demos.
Strix Halo Unified Memory AI
Live benchmarking of the Strix Halo Ryzen AI Max 395, demonstrating PyTorch AOTriton FA and vLLM builds, performance metrics, and unified‑memory AI feasibility.
One of my hobbies is poking around on AI/ML RDNA3 and I had early access to a Framework Desktop (which I can bring to show off) and I will show what it can do - how fast it runs, what software it supports. Besides testing and improving performance, I also created the first build scripts for building PyTorch w/ AOTriton FA and vLLM.
Benchmarks PyTorch/vLLM LLM inference performance using ROCm and Flash Attention.
Strix Halo LLM inference optimization uses Vulkan/ROCm with LPDDR5x memory tuning.