Backpropagation asks for the loss derivative with respect to each parameter. This lesson reuses the exact forward values and sets up the reverse chain rule.
highlighted = computed this step
The gradient question
Backprop asks for dL/d(each weight): how the loss changes with each parameter. The chain rule computes those quantities exactly from the already-shown graph.
d(parameter)dLby the chain rule
Forward values we reuse
The forward pass ended with yhat=2 and L=1. Those exact node values are the inputs to the backward pass.
y^=2,L=1
Summary
The backward pass starts at the loss and moves in reverse. The next step computes the first gradient, dL/dyhat.