Optimizing latency with OpenAI API models

Learn about factors that influence response times

Updated: 8 days ago

The latency of a completion request depends mainly on two factors: the model used and the number of tokens generated. See our updated documentation for guidance on reducing latency.
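As a rough illustration of how those two factors interact, the sketch below models latency as a fixed per-request cost plus a per-token generation cost. This linear model is an assumption made for illustration and is not taken from the article. The `ModelProfile` class, the `estimate_latency` function, the model names, and all timing values are hypothetical placeholders, not measurements of any real model.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ModelProfile:
    """Hypothetical timing profile. Values are placeholders, not measurements."""
    name: str
    time_to_first_token_s: float  # assumed fixed cost paid once per request
    seconds_per_token: float      # assumed cost of each generated token


def estimate_latency(profile: ModelProfile, generated_tokens: int) -> float:
    """Estimate request latency in seconds under the assumed linear model."""
    if generated_tokens < 0:
        raise ValueError("generated_tokens must be non-negative")
    return profile.time_to_first_token_s + profile.seconds_per_token * generated_tokens


# Two made-up profiles: a faster, smaller model and a slower, larger one.
small = ModelProfile("small-model", time_to_first_token_s=0.25, seconds_per_token=0.01)
large = ModelProfile("large-model", time_to_first_token_s=0.5, seconds_per_token=0.04)

for profile in (small, large):
    for tokens in (50, 500):
        seconds = estimate_latency(profile, tokens)
        print(f"{profile.name}: {tokens} tokens -> about {seconds:.2f} s")
```

Under these assumptions, both choosing a faster model and capping the number of generated tokens lower the estimate. To get real numbers, time actual requests and fit the two parameters to those measurements.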

