Maximizing Performance: Leveraging PTUs with Client Retry Mechanisms in LLM Applications