Members-Only
Recent Talks & Demos are for members only
You must be an AI Tinkerers active member to view these talks and demos.
Finetuning with Claude Synthetic Data
This talk details three new SKILLs for end-to-end model finetuning on arbitrary domains using Claude-generated data, covering extraction, synthesis, and evaluation.
Inspired by huggingface’s recent blog post on using Claude code to fine tune a model with an existing dataset (via their new CC skill), I wanted to see how much more of the e2e finetune process could be captured in SKILLs.
Had a few days over Xmas break and 150 commits later I have 3 new SKILLs to share with the community to help (1) extract domain knowledge (2) iteratively generate and filter synthetic data and (3) run the finetune and eval it
Have lots of challenges and learnings to share how I did it. I chose therapeutic coaching as my domain (but the SKILLs apply generally) and the resulting 14b finetuned model competes with human text based therapy