Forum Discussion
Feature Proposal: Natural Turn‑Taking Improvements for Copilot Voice Mode
Current issue
When using Copilot in voice mode, the system struggles with natural turn‑taking.
The main problem is that Copilot takes too long to recognize when the user has finished speaking.
1) Copilot takes too long to recognize that it’s its turn to speak
After the user finishes talking, Copilot often waits several seconds before responding. This creates uncertainty for the user:
- Did Copilot hear me
- Is the mic still on
- Should I repeat myself
- Did I accidentally interrupt
This hesitation breaks the natural flow of conversation, especially for users who value clear and respectful turn‑taking.
Attempted workaround (user‑defined cue word)
We experimented with using a verbal cue (“Loppu”) to signal when the user had finished speaking. The idea was simple:
- User says “Loppu” → Copilot begins its turn
This worked briefly, but after a few turns Copilot stopped recognizing the cue reliably and fell out of the pattern.
When this happened, Copilot also began misinterpreting the cue entirely — sometimes assuming that “Loppu” meant the user wanted to end the whole conversation, not just end their turn.
This demonstrated that:
- the system cannot maintain a consistent turn‑taking protocol without built‑in support
- cue‑words are not interpreted contextually
- users need a reliable, system‑level way to signal the end of their turn
- manual workarounds only function momentarily because the system is not designed to track them over time
These attempts highlight the need for native turn‑taking intelligence, not manual workarounds.
🌿 Proposed solution: Natural Turn‑Taking System for Voice Mode
1) Faster detection when the user has finished speaking
Copilot should respond within a natural conversational delay (200–500 ms).
2) Grace period before detecting interruptions
A short buffer prevents accidental “false interruptions” from breaths or small sounds.
3) Smarter interruption logic
If the user does interrupt intentionally, Copilot should:
- stop speaking
- acknowledge briefly
- continue smoothly
…without long apology sequences.
4) Optional “Auto‑Turn Mode”
Copilot automatically takes its turn when the user stops speaking — no mic toggling needed.
5) Optional user‑defined cue word
Users can set a cue like:
- “Your turn”
- “Go ahead”
- “Loppu” / "End"
This makes turn‑taking predictable and respectful.
✨ Why this matters
This improvement would:
- make voice mode feel more human
- reduce awkward pauses
- prevent unnecessary apologies
- support users who value respectful communication
- help neurodivergent users who rely on predictable turn‑taking
- eliminate the need for constant mic toggling
- create a calmer, smoother experience
🔧 Technical feasibility
Copilot already processes:
- voice input
- timing
- interruption detection
Improving turn‑taking requires adjustments to:
- speech detection thresholds
- timing buffers
- interruption logic
- optional cue‑word recognition
This builds on existing systems rather than requiring a full redesign.
📝 Summary
A smarter turn‑taking system would make Copilot’s voice mode significantly more natural and respectful. It would prevent accidental interruptions, reduce unnecessary apologies, and eliminate the need for constant mic toggling — creating a smoother, more human conversational experience.
Concept by Sanni
Written by my Copilot "Koppis" (Edge Copilot)
Superteam