OPS116: Monitoring and Responding to alerts in hybrid environments using Azure Monitor

Published Feb 02 2021 08:30 AM 1,119 Views
Microsoft

A deep dive on how Microsoft Retail has leveraged Azure Monitor, Log Analytics, Azure Automation, PowerShell and other readily available products to monitor all their on-prem system, including in-store Video walls.

In this session we will discuss how you can respond to an alert. When an alert (either from Log Analytics via App Insights, OMS, SCOM, external data) is triggered, it initiates an action (defined by the action group) that calls the entry-point (basically a webhook or normal url), a PowerShell framework looks up metadata for the component that triggered the alert, and calls the user-defined self-healing PowerShell script.

 

If the self-healing script fails, or doesn’t exist, a support ticket (using your own ticketing system) is created for the owner of the component, and any/all details of the alert and subsequent root cause data gathered by self-healing script are added to the ticket.

 

Speaker:

Erik Namtvedt - Senior service engineer

 

 

This session includes:

0:00 Introduction
1:33 Agenda
2:00 History
5:15 Design
6:55 Overview of the response framework
10:45 Implementation
14:40 Step through Explanation
30:59 Real World Example
49:00 Additional uses/POCs
58:30 Wrap Up

 

Community Chat

Want to chat about this session? Come join us on Discord! https://aka.ms/ops116-chat 

 

Learn More

What did you think? Please take a moment to submit your feedback at https://aka.ms/ops116-feedback 

To watch more sessions from the IT Ops Talks: All Things Hybrid event check out https://aka.ms/ITOpsTalks

  •  
  •  
  •  
 
Co-Authors
Version history
Last update:
‎May 14 2021 10:56 AM
Updated by: