Freedom5093•5mo ago
> all of the prompt tokens are available in advance and do not require decoding
> The other technique is called batching: this technique aggregates multiple prompts into a single decode operation.
So do prompts get decoded or not? Are there two decode steps? It's unclear.