[Resolved] Lab Services Outage: Cannot perform actions on a lab

Microsoft

[Final Update] 3/31 20:00 UTC

Hello everyone,

We would like to share a final update on the outage in Azure Lab Services. The outage has now been resolved. Thank you so much for your patience throughout the outage. Please read about the details of what happened and what we’re doing as preventative steps for the future in our blog post.

 

The labs that were affected by the outage are now in following two states:

  • The lab was recovered

All labs that were recovered should now operate as normal and will no longer be marked as affected by the outage. Please note that if you need to increase the lab capacity (i.e. increase the number of virtual machines in the VM pool) of these labs, you will need to republish the labs first. If you can’t republish but still need to increase lab capacity, please contact us at AzLabsCOVIDSupport@microsoft.com for help.

 

  • The lab was not recoverable, and needs to be deleted and recreated

Unfortunately, some labs affected by the outage were not recoverable. These labs will be marked as not recoverable (see screenshot below), and your users will no longer see the labs’ virtual machines in their 'My virtual machines' view. As much as it pains us to say this, it means loss of data for the users.

 

Pic1.png

 

To recreate these labs, please follow the steps below:

  1. Click on a lab that could not be recovered
  2. Go to the Users tab of the lab and click the “Export CSV” button to download a CSV file of the users list.
  3. The other tabs (Template, Schedule, Settings, etc.) will be read-only in case you need to copy over the info to the new lab
  4. In the My labs view, click on the “+ New lab” button, and set up the lab as before
  5. Once the lab is created, go to the Users tab. Click on the “Add users” > “Upload CSV” to import the CSV file of users into the new lab. Please note that you need to update the user quota (of hours) in this new lab.
  6. Click on the “Invite all” button to send out an email to your lab users with the registration link to your lab.
  7. Users will be able to click on the registration link to get access to virtual machines in your new lab.  
  8. Delete the old lab.

pic2.png

We understand that this can potentially be a lot of effort, and we are deeply sorry for the inconvenience this has caused. Please do not hesitate to contact us at AzLabsCOVIDSupport@microsoft.com if there’s anything we can do on our end to help.

 

-Labs Team

------------------------------------------------------------------------------------------------

[Update] 3/26 17:00 UTC

We would like to inform you that we are actively working on resolving the issue. 

 

How do I know if my lab is impacted in the outage?

One or many of your labs would be impacted if you see the following banner when you navigate to labs.azure.com

MicrosoftTeams-image.png

 

To identify specific impacted labs, navigate to a lab; if you see the following banner on your lab, that means it is impacted. 

MicrosoftTeams-image (1).png

 

The fastest way to mitigate the issue is to create a new lab. 

 

Please feel free to add a comment below if you have problems creating a new lab and/or any other questions. 

 

We apologize for the inconvenience and thank you for your patience. 

 

------------------------------------------------------------------------------------------------

[Update] 3/26 02:11 UTC

We would like to inform you that we are actively working on resolving the issue. 

The current mitigation is to create a new lab. 

 

Please feel free to add a comment below if you have problems creating a new lab and/or any other questions. 

 

We apologize for the inconvenience and thank you for your patience. 

------------------------------------------------------------------------------------------------

[Update] 3/25 21:00 UTC

As we investigate the issue further, we would like to share that the current mitigation to resolve these errors is to create a new lab. 

 

Here is a link to learn more on creating a new lab in lab services.  

 

We apologize for the inconvenience and thank you for your patience. 

------------------------------------------------------------------------------------------------

Hey everyone,

We are currently experiencing issues with certain labs where users are not able to perform specific actions such as start/stop lab virtual machines and/or publish a lab. 

 

We would like to inform you that we are actively looking into the issue and will continue providing updates on this post until the problem is resolved. 

 

We apologize for the inconvenience and thank you for your patience. 

 

- Lab Services team 

12 Replies

Creating a new lab does not work for my application. The "small GPU"  is an unavailable VM size and it should be an available size in the West US 2 region according to the support documents.

@TanmayeeKamath 

Thanks for reaching out.

Can you please send an email to me at takamath@microsoft.com with your subscription ID and lab account name?

Tanmayee

@TanmayeeKamath 

I created a new lab, now I have the starting issue on the new lab too!

This is unacceptable!

@enludovi , Could you please share details of your subscription and lab account with me sacalla at Microsoft.com

@TanmayeeKamath 
Will I be able to recover the Template in my lab that was affected by the outage?
It is a virtual domain environment that took many hours to put together, containing 3rd party product licenses that need to be deactivated online. 
Creating a new lab is not a viable option.

 

Nick 

@NickW-ATI ,  We are working on restoring as much as we can for all the labs. Please reach out to me with the subscription, lab account and lab name at sacalla at Microsoft.com

@TanmayeeKamath Hi, I cannot create a new lab as my existing lab is completely "greyed out" so I cannot even export my template to start over again. Additionally, creating a new lab is not the ideal situation as I have users partially through course work where they will lose everything if they have to start over. Add starting afresh building a new template will be days of work. Could we have an update on progress please to address this issue? Thanks.

Hi @awhite1962
We are continuing to work on the issue.
Please share your subscription ID, lab account name and lab name with me at takamath@microsoft.com
We will work on restoring your template machine, so that you can publish again without having to create the template machine.
Looking forward to hearing from you on email .

Tanmayee

I am unable to remove old, non-working labs, so I can not re-create them with the same names. Will this issue likely affect new labs created? Creating new labs is time consuming and with so many labs going down and having issues looks bad for our department recommending Azure Lab Services.  Additionally will there be compensation for students using up time when they couldn't shutdown the VMs?@TanmayeeKamath 

Hello @Ianpham,
Can you please share your sub ID, lab names with me on email? (takamath@microsoft.com)
We can check if your template VM is recoverable, if not unfortunately, the mitigation is to create a new lab. For new labs, please create them with a new name.
For a root cause analysis and next steps we will be sharing next steps soon.

We apologize for the inconvenience. Looking forward to hearing from you on email to help unblock you.

Tanmayee

How do I delete a Lab that has been impacted? Am I being charged? @TanmayeeKamath 

@larmet , you should not be charged for the labs.  You can check the charges for the labs in azure portal. Link  https://docs.microsoft.com/en-us/azure/cost-management-billing/costs/cost-analysis-common-uses has the details. If you have any questions, please reach out via email. Please share your lab url and subscription ID.