measurement
Tokens per second throughput is a metric used to measure the performance and response speed of LLMs.

Authors

Sources

Referenced by nodes (1)