Top-k

Top-k keeps a short list of candidate branches, chosen by exact ordering of the logits. This lesson uses k = 2 on the first displayed step.


Top-k keeps the k tokens with the largest logits instead of only the single highest-scoring token. The ordering is still exact integer comparison.

\text{top-}k=\text{the }k\text{ largest logits by exact order}

Top two at the first step

For k = 2, the first step keeps token a (logit 3) and token c (logit 2).

\text{top-}2=[a(3),c(2)]
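
The selection above can be sketched in a few lines of Python. This is a minimal illustration, not the lesson's own implementation: the function name `top_k`, the dictionary layout, and the third token b with logit 1 are assumptions added for the example (the lesson only states a with logit 3 and c with logit 2). Breaking ties by token name is also an assumption, chosen so that equal logits still yield one fixed order.

```python
def top_k(logits, k):
    """Return the k (token, logit) pairs with the largest logits, largest first."""
    # Sort by descending integer logit. Ties are broken by token name
    # (an assumption) so the same input always gives the same list.
    ranked = sorted(logits.items(), key=lambda item: (-item[1], item[0]))
    return ranked[:k]


# Hypothetical first-step logits: a=3 and c=2 come from the lesson,
# b=1 is assumed here only so there is something to discard.
first_step = {"a": 3, "b": 1, "c": 2}

print(top_k(first_step, 2))  # [('a', 3), ('c', 2)]
```

With k = 1 the same function returns only a, which is the single-token case that top-k generalizes.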

Top-k broadens the displayed branch list, but it still uses exact ordering of the same integer logits.

\text{top-}k\text{ is deterministic selection}