Multi-Modal Video Understanding for Hyperlocal Discovery: Building AI That Sees, Hears, and Understands NYC | New York City .

Members-Only

Recent Talks & Demos are for members only

Exclusive feed

You must be an AI Tinkerers active member to view these talks and demos.

November 17, 2025 · New York City

CityPulse: Multi-Modal Video Understanding

Building CityPulse: integrating LLaVA, Whisper, and Llama models with pgvector for hyperlocal video understanding and semantic search on local AI infrastructure.

Overview
Links
Tech stack