Monitor Windows Storage Spaces Direct 2016/2019 with SCOM Management Pack
Published Dec 27 2019 12:59 AM 16.6K Views
Microsoft

SCOM have revamped the old S2D 2016 MP and have added a whole list of new features to enable you with monitoring of entire S2D stack all the way from Physical disks to CSVs/File shares.

Download the preview version today https://www.microsoft.com/en-us/download/details.aspx?id=100782 and tell us in comments what you like/dislike about this release. To help you get started, there is a MP guide along with the MP. Please see detail instructions on how to install this MP which is very easy to follow.

 

For our readers who don't want to go through the entire post, the highlight of this release are captured:

1. Storage Pool health and performance monitoring

2. Storage Node health and performance monitoring

3. Physical disks health and performance monitoring

4. Volumes health, performance, deduplication status, resiliency type.

 

Note: The CTP guide have all these details captured. Please check it out before installing the MP.

-----------------------------------------------------------------------------------------------------------------

Windows Server Storage Spaces Direct 2019 has enabled performance counters for various components which are documented here.

The latest S2D MP now supports all these performance counters to make it useful for the admins and operators to see the overall health and performance of clusters.

 

For our readers, who are interested to know more, let's see in bit detail what all this MP offers.

 

1. Storage Pool Monitoring: When multiple clusters are deployed, you want to see the health of the storage pool which these clusters are using. If the storage pool health is in critical/warning state, you would like to receive a SCOM alert with the description to root cause the underlying issue. This entire experience is possible now by S2D MP. The storage pool health and performance metrics gives you an overview of  storage pool. As we monitor multiple cluster, the cluster name is associated with storage pool. For each storage pool we can check the performance to view the storage capacity and may take actions as per the situation.

The severity and the health of the S2D components, that is shown by S2D MP is defined by the health faults which are well documented  here. This ensures that health shown by the cluster manager or scom stays in sync.

 

StoPool.PNG

 

 2. Storage Node Monitoring: The other important piece of a cluster is the storage node that forms your cluster. For monitoring we want to see what all nodes form our cluster, and the health and performance of these nodes. We may want to increase the number of nodes in the cluster, if we see the current nodes are being exhausted. 

We can see all the nodes and corresponding cluster under Storage Node. There will be a rule which monitors any error related to Node and fires an alert if an error occurs. With alert description, you take an action immediately and make your cluster healthy and happy again. How COOL is that !! :)

 

Below is the screenshot that shows that we have total 5 servers. 3 servers form cluster and 2 server forms other cluster. 

Node.PNG

Besides health of node, the Physical disk column shows the health of the disks attached to the nodes for a quick overview. If you want to see the performance of any node, just right click and go to the performance view. Select the counters and as always SCOM will draw a beautiful graph for it.

 

PerfNode.PNG

 

Are you still with me?? Great, let's see the other cool feature that we have added to S2D MP for YOU !!

 

3. Physical Disks monitoring: Now that our compute is healthy and taken care for our cluster, let's dig a little to the storage side. Let's see what all type(HDD. SSD..) of physical disks are part of our cluster,  and how much storage we are getting from these. All this data enables us to take preventive measure in the scenarios where we might have to increase the cache/capacity drive based on the usage. Again there are rules running, which will fire an alert when an error occurs for disks.

For the performance metrics, just right click the disk and all the performance counters are displayed.

diskperf.PNG

 

4. CSV Monitoring: Now that we have checked the health and performance of all the different components of our cluster starting from physical disk to storage pool. Next let;s see what all CSVs we have, what is the resiliency type and deduplication status of various volumes across our clusters.

A consolidated view of the health of all the CSV's with media type, size, resiliency type and dedup status helps admin to easily manage and monitor the cluster. Alerts will help if the volume is filling up or if there is any error. 

csv.PNG

 

5. Network Monitoring: Network monitoring is crucial when we are deploying a cluster using multiple components. A rule is checking for any network error and will fire an alert for you as soon as some network related error occurs.

 

Hope this helps and you like this new version that we have released for you. Please share this post if this can help others.

DLC link with MP guide is available at https://www.microsoft.com/en-us/download/details.aspx?id=100782

Please do share with us in comments your experience with this MP. You can directly send me a message if you face any difficulty with this MP.

 

Thanks,

Neha

10 Comments
Copper Contributor

Hi,  i have error, i can't add group in Dashboard :( 

  • SCOM  2019 (Clean install)
  • SQL 2017 (last CU - 14.0.3257.3)
  • OS - Windows Server 2016 (1607 Build 14393.3443)

MP:

 

Get-SCOMManagementPack | Select-String -Pattern "Storage"

[Microsoft.Windows.Server.10.0.Storage.StorageSpacesDirect.Presentation, 31bf3856ad364e35, 1.0.47.0] [Microsoft.Windows.Server.Storage.Visualization.Library, 31bf3856ad364e35, 1.0.47.0] [Microsoft.Windows.Server.10.0.Storage.StorageSpacesDirect, 31bf3856ad364e35, 1.0.47.0] [Microsoft.Storage.Library, 31bf3856ad364e35, 1.0.47.0]

Error:

 

 

An error occurred executing the command: [Microsoft.Windows.Server.Storage.Visualization.Library.DataProviders!DatacenterDashboardDataProvider/GetDatacenterGroupList] in provider: [Microsoft.Windows.Server.Storage.Visualization.Library.DataProviders.DatacenterDashboardDataProvider, Microsoft.Windows.Server.Storage.Visualization.Library.DataProviders, Version=7.0.5000.0, Culture=neutral, PublicKeyToken=31bf3856ad364e35]. : Incorrect syntax near ','.
Incorrect syntax near the keyword 'CONVERT'.
Incorrect syntax near the keyword 'AS'.

at Microsoft.EnterpriseManagement.Presentation.DataAccess.DataProviderCommandMethod.Invoke(CoreDataGateway gateWay, DataCommand command)
at Microsoft.EnterpriseManagement.Presentation.DataAccess.CoreDataGateway.ExecuteScalarInternal[TResult](DataCommand command)
at Microsoft.EnterpriseManagement.Presentation.DataAccess.CoreDataGateway.<ExecuteScalarAsync>b__2[TResult](<>f__AnonymousType0`1 data)

 

SCOM_S2D.png

Microsoft

hi @BETEP , please share your email address with me. I will share the fix of this issue with you. This is a bug in 2019 RTM, the fix will be released with upcoming 2019 UR1.

Copper Contributor

Hi @NehaGarg thank you very much, I will wait.

romanmirus@gmail.com

Microsoft

For SCOM 2019 RTM customers please fix the dashboard error with this hotfix.

https://support.microsoft.com/en-us/help/4506518/system-center-operations-manager-hotfix-for-scoped-...

Copper Contributor

Hello @NehaGarg ,

We are very interested in using this management pack. We are pretty new to SCOM (running 2019 here). We have a an Alert Rule that keeps on being repeated : S2D Close Alert script was failed.

Is it behaviour familiar ? where can I find some help.

 

PhS

Microsoft

@phsimons , I am starting an email thread with you to understand the issue and help you. Can you please share your email address with me? Thanks.

Copper Contributor

@NehaGarg thanks, I did send you a private message

Copper Contributor

Thanks Neha for this great MP.

I found this MP has more collection rule then alert generating rule.

 

I am more interested for alert generating rule for some of the performance counter (say free disk availability,related to latency and for some others as well)

 

Do you know by chance if you are looking to add more alert generating rule for some important performance counter in next version.

 

Thanks again...

 

Copper Contributor

Hello @NehaGarg ,
I've imported S2D MP everything looks good from management console, but in Web Console there is no dashborad view.
Is this by design?
SCOM Management Server 2019 UR2 (Windows Server 2016)
SCOM Web Console 2019 UR2 (Windows Server 2019)
My account assigned  a SCOM Admin rights
Capture1.PNGCapture2.PNG

Copper Contributor

Hello Neha,

 

The download page shows release date as Feb 12th so it seems like the CTP has been around for quite some time - I assume it's being updated. Can you share when the final version will be released?

We currently use the 1.0.47.0 version but that is creating massive numbers of alerts, just from the 3 ongoing sync flows. And I mean massive... for some clusters, I'm talking >1000.

 

Look forward to news on this!

 

Kind regards,

 

Gerald Versluis

Version history
Last update:
‎Dec 27 2019 12:59 AM
Updated by: