Culturally Aligned AI: Building Dlab-852-Mini for Hong Kong Cultural Nuances

December 18, 2025 · Hong Kong

Dlab-852-Mini: Hong Kong Cultural AI

Showcasing Dlab-852-Mini, a Phi-3 fine-tune for Hong Kong culture using the CultureKit eval, detailing training, evaluation, and case studies.

Tech stack
  • Python
    Python: The high-level, general-purpose language built for readability, powering everything from web backends to advanced machine learning models.
    Python is a high-level, general-purpose language that prioritizes clear, readable syntax (via significant indentation), which supports rapid development. Its ecosystem is massive: use it for web development with frameworks like Django and Flask, or for data science with libraries such as Pandas and NumPy. The Python Package Index (PyPI) hosts hundreds of thousands of community-contributed packages, covering tasks from network programming to GUI creation. The language is actively developed under the stewardship of the Python Software Foundation (PSF), with the stable release at Python 3.14.0 (as of November 2025).
  • datasets
    Datasets is the core ML library for accessing, sharing, and processing thousands of AI-ready datasets (NLP, CV, Audio) with a single, efficient line of code.
    Datasets provides a unified API for data access and preprocessing in machine learning workflows. The library lets engineers load over 350,000 datasets (SQuAD, Common Crawl, etc.) directly from the Hugging Face Hub. Its Apache Arrow backend supports zero-copy reads from memory-mapped files, so datasets larger than available RAM can still be processed efficiently. This architecture keeps data preparation fast and scalable when training models across domains.
  • Phi-3
    Microsoft's family of small language models (SLMs) delivering high-reasoning performance on local devices and edge hardware.
    Phi-3-mini packs 3.8 billion parameters into a footprint small enough for local deployment on an iPhone 14. Trained on a 3.3 trillion token dataset of high-quality synthetic data and filtered web content, it rivals much larger models (like Mixtral 8x7B) on benchmarks for coding and logic. The family also includes 7B (small) and 14B (medium) variants, giving developers low-latency options for complex tasks without the compute requirements of much larger LLMs.
  • MLX
    MLX is Apple's high-performance array framework for machine learning on Apple silicon, leveraging unified memory for zero-copy efficiency.
    MLX is an open-source array framework from Apple machine learning research, purpose-built for efficient ML on Apple silicon (M-series chips). Its core strength is the unified memory model: arrays live in memory shared by the CPU and GPU, so operations can run on either device without copying data, avoiding a common performance bottleneck in other frameworks. The API is deliberately familiar, closely mirroring NumPy for array operations and PyTorch for higher-level packages like `mlx.nn` and `mlx.optimizers`. It offers Python, C++, C, and Swift APIs. Researchers use MLX to train and deploy models quickly, with examples including large-scale text generation with LLaMA and image creation via Stable Diffusion.
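
As a rough sanity check on the Phi-3-mini footprint claim in the tech stack above, the following sketch estimates how much memory the weights alone would need at a few precisions. The precisions listed (fp16, int8, 4-bit) are illustrative assumptions and are not stated in the talk description; the estimate also ignores activations, the KV cache, and runtime overhead, so real usage would be higher.

```python
# Back-of-the-envelope weight footprint for a 3.8B-parameter model.
# Assumption: weights only; activations, KV cache, and runtime overhead are ignored.

PHI3_MINI_PARAMS = 3.8e9  # parameter count quoted in the tech stack description


def weight_footprint_gb(num_params: float, bits_per_param: int) -> float:
    """Return the size of the weights in gigabytes (1 GB = 1e9 bytes)."""
    return num_params * bits_per_param / 8 / 1e9


if __name__ == "__main__":
    # The precisions below are illustrative choices, not taken from the talk.
    for label, bits in [("fp16", 16), ("int8", 8), ("4-bit", 4)]:
        size = weight_footprint_gb(PHI3_MINI_PARAMS, bits)
        print(f"{label:>5}: {size:.1f} GB")
```

Under these assumptions the weights come to roughly 7.6 GB at 16 bits per parameter and about 1.9 GB at 4 bits, which suggests why a quantized model of this size is plausible on a phone or a small Apple silicon machine.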
