Attention is exact until a multi-entry softmax appears. The one-entry row pins to one; the two-entry row is named.
Attention, exactly
The Q, K, and V maps are identity matrices, so the displayed vectors pass through unchanged. Position zero can attend only to itself.
One-entry softmax
Position zero has score 1 and a one-entry softmax, so the attention weight is exactly 1. Therefore attn zero is (1,0).
Szero,zero=1,softmax=[1]
The named boundary
Position one has two unmasked scores, 1 and 2. Its softmax is named with exponentials, not decimalized.
softmax([1,2])=named