The latency of a completion request depends mainly on two factors: the model used and the number of tokens generated. For guidance on reducing latency, please read our updated documentation.
Optimizing latency with OpenAI API models
Learn about factors that influence response times
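
As a rough illustration of how those two factors interact, the sketch below treats latency as a fixed overhead plus a per-token generation cost. This linear model, the function name `estimate_latency_seconds`, and every number in it are assumptions made for illustration; they are not measurements of any real model, and actual response times also vary with load and network conditions.

```python
def estimate_latency_seconds(
    output_tokens: int,
    seconds_per_token: float,
    fixed_overhead_seconds: float = 0.0,
) -> float:
    """Rough linear estimate: fixed overhead plus per-token generation time.

    seconds_per_token stands in for the model-dependent factor;
    output_tokens is the number of tokens the request generates.
    """
    if output_tokens < 0:
        raise ValueError("output_tokens must be non-negative")
    if seconds_per_token < 0 or fixed_overhead_seconds < 0:
        raise ValueError("timing parameters must be non-negative")
    return fixed_overhead_seconds + output_tokens * seconds_per_token


# Hypothetical per-token speeds for two imaginary models (not real figures).
HYPOTHETICAL_SECONDS_PER_TOKEN = {
    "larger-model": 0.05,
    "smaller-model": 0.01,
}

if __name__ == "__main__":
    for name, per_token in HYPOTHETICAL_SECONDS_PER_TOKEN.items():
        for tokens in (50, 500):
            estimate = estimate_latency_seconds(tokens, per_token, 0.3)
            print(f"{name}: {tokens} tokens -> about {estimate:.2f} s")
```

Under these assumptions, the estimate shrinks either by choosing a model with a lower per-token cost or by generating fewer tokens, for example by requesting a shorter completion.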
Updated: 16 days ago
