Context window
Also known as: sequence length, max context
Maximum tokens the model can attend to in one forward pass; larger windows help long documents but increase memory and latency.
Also known as: sequence length, max context
Maximum tokens the model can attend to in one forward pass; larger windows help long documents but increase memory and latency.
Contact if you need a term added for a security or procurement review.