Finetuning with Claude Synthetic Data

This talk details three new SKILLs for end-to-end model finetuning on arbitrary domains using Claude-generated data, covering extraction, synthesis, and evaluation.

Overview

Inspired by Hugging Face’s recent blog post on using Claude Code to fine-tune a model on an existing dataset (via their new Claude Code skill), I wanted to see how much more of the end-to-end finetuning process could be captured in SKILLs.

I had a few days over the Christmas break, and 150 commits later I have three new SKILLs to share with the community. They help (1) extract domain knowledge, (2) iteratively generate and filter synthetic data, and (3) run the finetune and evaluate it.

I have plenty of challenges and lessons to share about how I did it. I chose therapeutic coaching as my domain (though the SKILLs apply generally), and the resulting 14B finetuned model competes with human text-based therapy.

Links

Tech stack