Wondering if anyone else is seeing regular VM hangs with the Windows 10 Enterprise for Virtual Desktops image, or has any advice on troubleshooting the issue we're experiencing?
In our tenant we have 14 session hosts (each has 16 vCPU, 64GB RAM, 256GB Premium SSD) in a single host pool. FSLogix Apps is used for profiles and they're stored on a Premium Azure Files Storage Account (5TB Quota, 5000 allowed IO/s, 15000 burst IO/s) in the same region as the hosts. There are 225 users that use WVD for a full desktop environment (no RemoteApps).
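For context on the sizing, here is a quick back-of-the-envelope calculation from the figures above. It assumes users are spread evenly across the hosts and are all active at once, which is a simplification and not something we've measured; the variable names are just for illustration.

```python
# Rough sizing check using the figures quoted in the post.
# Assumption: users are spread evenly across hosts and all are active at once.
import math

session_hosts = 14
vcpus_per_host = 16
ram_gb_per_host = 64
users = 225
baseline_iops = 5000   # allowed IO/s on the Azure Files share
burst_iops = 15000     # burst IO/s on the Azure Files share

# Average load per host, and the busiest host under an even spread
users_per_host = users / session_hosts
max_users_on_one_host = math.ceil(users_per_host)

# Per-user share of host and storage resources
ram_gb_per_user = ram_gb_per_host / max_users_on_one_host
users_per_vcpu = max_users_on_one_host / vcpus_per_host
baseline_iops_per_user = baseline_iops / users
burst_iops_per_user = burst_iops / users

print(f"Average users per host:   {users_per_host:.1f}")
print(f"Busiest host (even spread): {max_users_on_one_host} users")
print(f"RAM per user on that host: {ram_gb_per_user:.1f} GB")
print(f"Users per vCPU:            {users_per_vcpu:.2f}")
print(f"Baseline IO/s per user:    {baseline_iops_per_user:.1f}")
print(f"Burst IO/s per user:       {burst_iops_per_user:.1f}")
```

Under those assumptions that works out to roughly 16 users per host and about 22 baseline IO/s per user on the profile share.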
Average CPU and RAM usage during peak time is less than 50% per VM.
Almost every day, usually during peak hours, at least one of the VMs hangs and needs to be restarted from the Azure portal. Users connected to the affected VM report that none of their open applications respond and that they can’t launch or close any applications, even using Task Manager. The Start menu also becomes unresponsive.
Any new connections (via Remote Desktop client or directly via RDP) fail.
If we try to log off users using the "Invoke-RdsUserSessionLogoff" cmdlet, their session hangs at the "Signing you out" screen indefinitely.
If we try to kill any of their processes using Task Manager (as an admin), we get an Access Denied error message or the process simply isn’t killed.
Typically in the event logs, about 30 minutes prior to the VM hanging, we start to see a few of the following events, but there is no commonality between applications, servers or users in the event descriptions.
1002 – Application Hang
7036 – Services entering a stop state
7011 – A timeout was reached while waiting for a response from a service
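In case it helps anyone compare notes, this is a rough sketch of how the events could be tallied for the 30 minutes before a hang once they've been exported from the event logs as (timestamp, event ID) pairs. The function name and the sample timestamps are made up for illustration; they are not real log data from our hosts.

```python
from datetime import datetime, timedelta

# Event IDs mentioned in the post
WATCHED_IDS = {1002, 7036, 7011}
WINDOW = timedelta(minutes=30)


def events_before_hang(events, hang_time, window=WINDOW):
    """Count watched event IDs logged in the window leading up to hang_time.

    events: iterable of (timestamp, event_id) tuples (hypothetical export format).
    """
    counts = {event_id: 0 for event_id in sorted(WATCHED_IDS)}
    start = hang_time - window
    for timestamp, event_id in events:
        if event_id in WATCHED_IDS and start <= timestamp <= hang_time:
            counts[event_id] += 1
    return counts


# Hypothetical sample data, not taken from a real host
hang = datetime(2020, 1, 1, 11, 0)
sample = [
    (datetime(2020, 1, 1, 10, 15), 7036),  # before the 30-minute window
    (datetime(2020, 1, 1, 10, 35), 1002),
    (datetime(2020, 1, 1, 10, 40), 7011),
    (datetime(2020, 1, 1, 10, 50), 1002),
    (datetime(2020, 1, 1, 10, 55), 4624),  # not one of the watched IDs
]

result = events_before_hang(sample, hang)
print(result)
```

Running the same tally across several hang incidents would show whether one of the three IDs consistently appears first, which might narrow down where to look.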