Attention uses the row weights to form a weighted sum of value vectors. The contribution pattern is exact, while the multi-entry softmax weights remain named.
highlighted = computed this step
Weighting the values
After softmax, the attention output is a weighted sum of value vectors. Which positions contribute is fixed by the exact mask.
outputi=j≤i∑softmax(Si)jVj
Structurally named output
Because the multi-entry weights are named softmax symbols, the output row is also structurally named. The structure is exact; the numeric softmax weights are not pinned as decimals.
exact structure+named weights
Summary
This keeps the honesty boundary visible: exact score and mask mechanics feed a named weighted sum.