Epsilon AI Academy
العربية
Enroll

Calculus

Gradient descent

Training is just rolling downhill. At each step the model moves against the slope of the loss. The learning rate sets the step size — too big and it overshoots, too small and it crawls.

Steps0
Current loss
0.15
-1.1

A moderate learning rate slides smoothly to the minimum.