Forum Discussion
Alert emails for Azure Stack HCI health issues?
Microsoft has released the capability to monitor Azure Stack HCI clusters using "Azure Stack HCI Insights". Read more here:
Monitor HCI clusters with Azure Stack HCI Insights (preview) - Microsoft Community Hub
And read this to understand how to setup alerting:
Set up alerts for Azure Stack HCI systems - Azure Stack HCI | Microsoft Learn
--
Darryl van der Peijl
Splitbrain
I've looked at this before and it does not provide email alerts for new health faults.
Azure Stack HCI Insights - your first link - is one more dashboard full of endless information, which I am sure is good for some. It does not do email alerts.
Your second link requires me to learn a strange new query language in order to be able to get the alerts I want. First of all, I do not want to learn that language. Second, I should not need to make decisions on my own about which alerts to get. Microsoft has already decided what constitutes a Health Fault, and that is what I want alerts for. Nothing else.
So I just want an email whenever Get-HealthFault returns a NEW fault. See this: https://learn.microsoft.com/en-us/azure-stack/hci/manage/health-service-faults.
- Jun 07, 2023
ThomasI1
The "Azure Stack HCI Insights" is using 'Log Analytics' in the background which is a logging store and query product. Azure Stack HCI makes sure that all relevant information is send to 'Log Analytics' so that you can query that data and add an action to it. It also sends the 'Health Service Faults'.
The "Azure Stack HCI Insights" is nothing more than a dashboard with defined queries against Log Analytics that show up visually in graphs or counters. With the data in 'Log Analytics' you can do anything you want, if you are willing to extend on it.
If you want an turnkey monitoring solution, this is indeed not it (yet). Pre-defined alerting rules with pre-defined queries is not something that is offered today.
You can either get someone to help you write the right queries or look at alternative products such as SCOM with the right management packs or 3rd party monitoring solutions with Azure Stack HCI / Storage Spaces Direct support.
Hope it helped!
--
Darryl van der Peijl
Splitbrain- ThomasI1Jun 11, 2023Copper Contributor
I do not want a "turn key monitoring solution" and my original post does not ask for that.
A "turn key monitoring solution" sounds like I am asking for Microsoft to build some big new software solution. I am not. I am just asking for a tiny extension for - for example - Windows Admin Center.
The Hci OS already knows when health faults occur. And it already has a Powershell and a C# API. And Microsoft already uses that API in Windows Admin center to DISPLAY any CURRENT health faults at the dashboard page. Like this:
All it needs to do is send an email when it is about to display a new message here.
To me, it is beyond belief that Azure Stack HCI was launched without this essential functionaliy built in.
And no thanks: I do not want to setup a complele crazy animal like SCOM and manage it just for this. I did that with our old S2D cluster, but SCOM feels like something from the early 2000's that was designed for giant corporations with a full time admin just for running it.