measurement
Time to First Token (TTFT) is a performance metric for LLMs, often used in service level agreements (SLAs) where organizations target a 95th percentile response time of under 2 seconds for user queries.

Authors

Sources

Referenced by nodes (1)