[Suggestion] Cost Optimization + Timeout Reliability for Copilot

Hello Microsoft. I have a suggestion for Microsoft Copilot, that would reduce the ammount of tokens that would be used and reduce costs significantly. MS Copilot can use GPT-3.5 for Quick mode for new users, as its faster and uses less tokens. However, if the conversation gets longer than 7 days, I would suggest you to use GPT-4, as its better for complex tasks and it gives better and more accurate answers. This way, Microsoft could reduce the costs for Microsoft Copilot and OpenAI APIs. Other than that, I also have another suggestion to reduce more costs against Timeout errors.

When API requests face a timeout, users may experience empty or failed responses.
By implementing a structured retry mechanism with exponential backoff, Copilot could:

- Automatically retry requests after a timeout instead of failing immediately,
- Use exponential backoff to avoid overwhelming the system during peak usage,
- Add jitter (randomized delay) to prevent synchronized retries from multiple users,
- Log timeout events with request identifiers for easier debugging and monitoring.

This approach would improve reliability for all users, especially free‑tier users who are more likely to encounter timeouts during high demand. It would also reduce wasted API calls and optimize token usage.

I hope this suggestion helps improve Copilot’s reliability and efficiency for all users.

Microsoft Copilot for Service

Forum Discussion

[Suggestion] Cost Optimization + Timeout Reliability for Copilot

Resources