Blog Post

Microsoft Teams Blog

4 MIN READ

Reduce background noise in Microsoft Teams meetings with AI-based noise suppression

Microsoft

Dec 16, 2020

Whether it be multiple meetings occurring in a small space, children playing loudly nearby, or construction noise outside of your home office, unwanted background noise can be really distracting in Teams meetings. We are excited to announce that users will have the ability to remove unwelcome background noise during their calls and meetings with our new AI-based noise suppression option.

Users can enable this helpful new feature by adjusting their device settings before their call or meeting and selecting "High" in the "Noise suppression" drop-down (note this feature is currently only supported in the Teams Windows desktop client). See this support article for details about how to turn it on and more here: https://aka.ms/noisesuppression.

Our new noise suppression feature works by analyzing an individual’s audio feed and uses specially trained deep neural networks to filter out noise and only retain speech. While traditional noise suppression algorithms can only address simple stationary noise sources such as a consistent fan noise, our AI-based approach learns the difference between speech and unnecessary noise and is able to suppress various non-stationary noises, such as keyboard typing or food wrapper crunching. With the increased work from home due to the COVID-19 pandemic, noises such as vacuuming, your child’s conflicting school lesson or kitchen noises have become more common but are effectively removed by our new AI-based noise suppression, exemplified in the video below.

The AI-based noise suppression relies on machine learning (ML) to learn the difference between clean speech and noise. The key is to train the ML model on a representative dataset to ensure it works in all situations our Teams customers are experiencing. There needs to be enough diversity in the data set in terms of the clean speech, the noise types, and the environments from which our customers are joining online meetings.

To achieve this dataset diversity, we have created a large dataset with approximately 760 hours of clean speech data and 180 hours of noise data. To comply with Microsoft’s strict privacy standards, we ensured that no customer data is being collected for this data set. Instead, we either used publicly available data or crowdsourcing to collect specific scenarios. For clean speech we ensured that we had a balance of female and male speech and we collected data from 10+ languages which also include tonal languages to ensure that our model will not change the meaning of a sentence by distorting the tone of the words. For the noise data we included 150 noise types to ensure we cover diverse scenarios that our customers may run into from keyboard typing to toilet flushing or snoring. Another important aspect was to include emotions in our clean speech so that expressions like laughter or crying are not suppressed. The characteristics of the environment from which our customers are joining their online Teams meetings has a strong impact on the speech signal as well. To capture that diversity, we trained our model with data from more than 3,000 real room environments and more than 115,000 synthetically created rooms.

Since we use deep learning it is important to have a powerful model training infrastructure. We use Microsoft Azure to allow our team to develop improved versions of our ML model. Another challenge is that the extraction of original clean speech from the noise needs to be done in a way that the human ear perceives as natural and pleasant. Since there are no objective metrics which are highly correlated to human perception, we developed a framework which allowed us to send the processed audio samples to crowdsourcing vendors where human listeners rated their audio quality on a one to five-star scale to produce mean opinion scores (MOS). With these human ratings we were able to develop a new perceptual metric which together with the subjective human ratings allowed us to make fast progress on improving the quality of our deep learning models.

To advance the research in this field we have also open-sourced our dataset and the perceptual quality crowdsourcing framework. This has been the basis of two competitions we hosted as part of the Interspeech 2020 and ICASSP 2021 conferences as outlined here: https://www.microsoft.com/en-us/research/dns-challenge/home/

Finally, we ensured that our deep learning model could run efficiently on the Teams client in real-time. By optimizing for human perception, we were able to achieve a good trade-off between quality and complexity which ensures that most Windows devices our customers are using can take advantage of our AI-based noise suppression. Our team is currently working on bringing this feature also to our Mac and mobile platforms.

AI based noise suppression is an example of how our deep learning technology has a profound impact on our customer’s quality of experience.

Learn more in the second part of our series about getting the most from your meetings and calls with Microsoft Teams: Re-setting the bar for meeting and call quality

Updated Sep 19, 2025

Version 9.0

Microsoft

Joined December 15, 2020

View Profile

Microsoft Teams Blog

Welcome to the Microsoft Teams Blog! Learn best practices, news, and trends directly from the team behind Microsoft Teams.

32 Comments

ad52_
Copper Contributor
Apr 22, 2022
Hi,

we use Microsoft Teams at work.

When listening to the voice of some of my colleagues I get immediately feelings of headache and stomach pain (and no, this is not related to WHAT they say...).

That does not happen with:
- Skype (at work)
- Google Meet
- Jitsi
- Zoom

I assume that this is due to the noise cancelling features of Microsoft Teams or some other compressor settings.

Could the product manager of Microsoft Teams / the noise cancelling functionality please check this with some https://en.wikipedia.org/wiki/Psychoacoustics experts how to improve this?

Thanks in advance.
robertaichner
Microsoft
Oct 05, 2021
Harald_Steindl yes there is a plan to allow turning on/off the ML-based noise suppression likely through an admin setting.
Harald_Steindl
Iron Contributor
Oct 04, 2021
Ivar EngenIt is very well woth to investigate what signal actually is on the HDMI. If this is the signal going to the local monitor, its no wonder that there are no local mics in the signal. Matter of fact, there should only be the remote signal being sent to the local loudspeakers.
Phrased differently: Are you sure, that the HDMI carries all the needed (mic) signals?

robertaichner
For heavens sake pls make the ML noise reduction switchable when introducing it on the MTR, pls. 😉
Teams is used in many creative ways nowadays. Way, way off the usual track of a classical meeting room.
While testing different platforms (Teams and others) and their various noise reduction systems, we discovered some odd artifacts when being used in unexpected (for lack of a better word) spaces. Like when used in really big and reverberant spaces and less than optimal mic-ing. Then the DSP/AEC reaches its limits. The results were not brillant but workable. However as soon as the noise-reduction wizards were introduced, voice got worse as in "artificial" or "slightly robotic". This was much more noticeable/annoying than the suboptimal audio.
Long story short: What might be perfect in certain applications dont go any good in others.
Thanks for consideration.
Ivar Engen
Brass Contributor
Oct 04, 2021
Hi Robert, thanks for your answer, I guess this means I have to look elsewhere for the poor audio experience. For the record, we are not using the MTR built-in microphone; the room is equipped with Cisco equipment including a "Cisco TelePresence Ceiling Microphone" and the MTR is capturing both video and audio from a HDMI cable using Magewell HDMI -> USB device.
robertaichner
Microsoft
Oct 04, 2021
Ivar Engen the Teams Rooms are not yet running the ML-based noise suppression and we are currently working on adding that functionality. So noise suppression shouldn't filter out audience reactions. As Harald_Steindl pointed out your issue may be that the microphones are too far from the audience?
Harald_Steindl
Iron Contributor
Oct 04, 2021
Ivar Engen
Pls note, that a built in Audio DSP might not be optimized for "auditorium" sized rooms AT ALL.
Most likely the best way is to bring in a certifed external DSP including an audio guy knowing what to do with it.
Then you can have as many microphones as you like or need incl. audience and/or ambience microphones, each with its dedicated AEC/DSP. Respectable brands are BIAMP or QSC for example. They are hooked to the Lenovo Hub via USB, so nothing fancy on the Teams side to do.
Feel free to PM me direct for any more details.
Ivar Engen
Brass Contributor
Oct 03, 2021
We have set up a Lenovo HUB 500 Teams Room in our auditorium to simplify running Live Events, but I would like to reduce the noise suppression to get a more "rich" audio experience that includes auditorium audience reactions, how do I do this in Teams Rooms; these settings are not available there?
robertaichner
Microsoft
Apr 26, 2021
Ryan Oden, PeteHoots, Sherry_Amelse, JerryRoselada we have released the feature last week on the Mac public preview build and it will roll out to the general ring over the next few weeks. Thank you for your patience! Our support article has been updated and also has a link on how to get the public preview build. https://aka.ms/noisesuppression/
Ryan Oden
Copper Contributor
Apr 26, 2021
I second ed. Several employees in our org have noticeable echo in Teams meetings which do not exist in Zoom. This is pushing many of our users away from Teams and over to Zoom.
PeteHoots
Copper Contributor
Apr 19, 2021
I second JerryRoselada 's ask for robertaichner - Any update on when the Mac version of Team's noise suppression will be ready? With all the additional work from home challenges, this would be a great feature to have..