The Gradient Question

Backpropagation asks for the loss derivative with respect to each parameter. This lesson reuses the exact forward values and sets up the reverse chain rule.

highlighted = computed this step

Backprop asks for dL/d(each weight): how the loss changes with each parameter. The chain rule computes those quantities exactly from the already-shown graph.

\frac{dL}{d(\text{parameter})}\quad\text{by the chain rule}

Forward values we reuse

The forward pass ended with yhat=2 and L=1. Those exact node values are the inputs to the backward pass.

\hat y=2,\quad L=1

Summary

The backward pass starts at the loss and moves in reverse. The next step computes the first gradient, dL/dyhat.

L\rightarrow\hat y\rightarrow h\rightarrow z\rightarrow w,b