3a. Full Fine-Tuning

Update every weight in the model. The "old way."

3b. PEFT — Parameter-Efficient Fine-Tuning (the umbrella)

The insight that changed everything: empirically, fine-tuning updates have low intrinsic rank. You don't need to touch every weight to specialize a model. Update a tiny subset, or add small new trainable parameters. Freeze the base.

Methods under PEFT:

3c. LoRA — Low-Rank Adaptation (deep dive)

The mechanism:

Instead of learning a new weight matrix W', learn a low-rank update to the frozen W:

W_effective = W_frozen + ΔW
where ΔW = B × A

W : d × d        (frozen, e.g., 4096 × 4096)
B : d × r        (trained, e.g., 4096 × 8)
A : r × d        (trained, e.g., 8 × 4096)
r : the rank     (typically 8, 16, 32, 64)

If d = 4096 and r = 8: