Token overlap can be counted with exact integer counts. Honesty note: simplified toy corpus; jurisdictions vary; the pinned first step states the as-of date; not legal advice; retrieval points to sources and does not answer the legal question.
highlighted = computed this step
Retrieval honesty note
Honesty note: simplified toy corpus; jurisdictions vary; as of June 24, 2026; not legal advice; retrieval points to sources and does not answer the legal question.
toy corpus as of June24,2026
Match by token overlap
The stated retrieval model uses exact token overlap. It counts shared tokens between the query and each corpus row.
overlap count=matching tokens
Example query and source row
The example query is diversity amount citizenship. The matching row is the record for 28 U.S.C. sec. 1332.
query tokens are derived
The count governs rank order
Record id diversity has 3 of 3 matching tokens and rank 1 in this toy table.
rank=1,match=3 of 3
Diagram note
The diagram displays integer overlap and deterministic rank. It does not claim legal relevance or source weight.
rank is a toy retrieval order
Jurisdiction: US; as of 2026-06-24; not legal advice; Code encodes the stated-rule interpretation.
Summary
Lexical matching is useful because each displayed number can be recomputed from the query and corpus text.