Time to First Token (TTFT)
Performance
The latency between submitting a prompt to an LLM and receiving the first output token. A lower TTFT means the response starts appearing sooner, which matters most in interactive settings such as chatbots, where users judge responsiveness by how quickly output begins.
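As an illustration of the definition, the following minimal sketch times the gap between submitting a prompt and receiving the first token from a streaming response. The names `measure_ttft` and `fake_llm_stream` are hypothetical and not part of any real library. The generator only simulates a model with `time.sleep`, so the delays are made-up values rather than measurements of an actual LLM.

```python
import time
from typing import Callable, Iterable, Iterator, Tuple


def measure_ttft(start_stream: Callable[[], Iterable[str]]) -> Tuple[float, str]:
    """Return (seconds until the first token arrived, the first token)."""
    t0 = time.perf_counter()           # clock starts when the prompt is submitted
    stream = iter(start_stream())      # submit the prompt and open the stream
    first = next(stream, None)         # block until the first token arrives
    elapsed = time.perf_counter() - t0
    if first is None:
        raise ValueError("stream ended before producing any token")
    return elapsed, first


def fake_llm_stream(prompt: str,
                    prefill_s: float = 0.05,
                    per_token_s: float = 0.01) -> Iterator[str]:
    """Hypothetical stand-in for a streaming LLM client (assumption)."""
    time.sleep(prefill_s)              # simulated delay before the first token
    for token in ("Hello", ",", " world"):
        yield token
        time.sleep(per_token_s)        # simulated delay between later tokens


ttft, token = measure_ttft(lambda: fake_llm_stream("Say hello"))
print(f"TTFT: {ttft * 1000:.1f} ms, first token: {token!r}")
```

To measure a real service, one would pass a callable that opens that service's streaming response in place of `fake_llm_stream`. The timer should start before the request is sent so that network and queueing delay are included in the result.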