We Got Claude to Fine-Tune an Open Source LLM

12/05/2025

We gave Claude the ability to fine-tune language models using a new tool called Hugging Face Skills. Not just write training scripts, but to actually submit jobs to cloud GPUs, monitor progress, and push finished models to the Hugging Face Hub. With this skill, you can tell Claude things like:

Validate your dataset format
Select appropriate hardware (t4-small for a 0.6B model)
Use and update a training script with Trackio monitoring
Submit the job to Hugging Face Jobs
Report the job ID and estimated cost
Check on progress when you ask
Help you debug if something goes wrong

The model trains on Hugging Face GPUs while you do other things. When it’s done, your fine-tuned model appears on the Hub, ready to use. This isn’t a toy demo. The skill supports the same training methods used in production: supervised fine-tuning, direct preference optimization, and reinforcement learning with verifiable rewards. You can train models from 0.5B to 70B parameters, convert them to GGUF for local deployment, and run multi-stage pipelines that combine different techniques.

Full report : Hugging Face details how it used its new tool, Skills, to fine tune LLMs using Claude, including for writing scripts, submitting jobs to cloud GPUs, and more.

Tagged: AI Tools Hugging Face Large Language Models Open-source AI

Subscribe Sign In

Related Posts