Fine-Tuning | Notion

When fine-tuning comes up in an RRK answer, touch:

Did we exhaust cheaper interventions first? (prompt eng, few-shot, RAG)
What's the right method? (LoRA for most cases; QLoRA if budget-constrained; DPO if it's an alignment problem)
What's the data story? (volume, quality, diversity, held-out split)
What's the eval story? (baseline before, target metric, general-capability regression check, safety check)
What's the serving story? (adapter swapping, endpoint management, rollback)
What's the maintenance story? (when do we retune? what triggers it?)