This blog discusses an overall approach for monitoring very commonly encountered uscases e.g. performance monitoring for CPU, disk, network, port connections, service status for hybrid VMs that are connected to Azure Arc.
Some of the recommendations around planning before onboarding servers are as follows:
Create a Server Inventory List:
Resource Group Planning:
Resource Tagging for Subdivision:
Monitoring Requirements:
Health Indicators with Color Coding:
Choose the Right Visualization Tool:
RBAC Strategy for Dashboard Users:
Log Analytics Workspace Strategy:
Large scale resource tagging in Azure can be achieved using a combination of CSV file and a script. A script can be used to discover and dump resources in CSV file. Next step is to modify and clean the CSV file for resources where tags need to be enforced. You may chose to remove the resources where tagging is not needed to be enforced. After modifying the CSV use a script to tag the resources in the CSV file.
Configure filters, tabs and groups in Azure Workbooks for data extraction and visualization
In the workbook example below, subscriptions, resource groups, workspaces, time range and tags have been used as filtering criteria.
Filters are defined using parameters in workbooks.
Create workbook parameters - Azure Monitor | Microsoft Learn
TopTrends, ServerMonitoring, Inventoy, ServiceMonitoring, AlertSummary, CapacityPlanning are tabs which will contain the visualised data. This will further contain groupings to keep correlated visualizations together.
Near real time monitoring of all PerfMon counters for Windows and Linux which can be collected at OS level for capacity and availability planning e.g. CPU, memory, uptime, network.
How to Enable VM Insights for log collection:
Tutorial - Monitor a hybrid machine with Azure Monitor VM insights - Azure Arc | Microsoft Learn
Define Data Collection Rules for Data Sources and Data Resources
In the example below data collection rules have been define for Performance Counters and windows event logs
Source of Data Visualization: Log analytics Perf and heartbeat table.
Source of Data collection: Log Analytics Perf Tables
Change tracking for windows and linux OS will keep a track of any changes on the system. Popular usecase in this category is services monitoring.
How to enable Change Tracking and Create Data Collection Rules for onboarded VMs
Change Tracking, Azure Arc, Multicloud, Windows, Enable change tracking (microsoft.com)
Source of Data collection: Log Analytics Table ConfigurationChange by enabling ChangeTracking using Azure Policy as documented below.
Dynamic threshold capability is available in Azure Monitor which adapts the threshold value based on the past behaviour patterns for configured alerts. This capability is powered by machine learning. Some metrics may or may not be supported for customised thresholding.
Inventory information such as serial number, model, properties, IP addresses, processor count, core count is captured by the agent and sent to the control plane in JSON format. Customers can choose the inventory details they want to project on the dashboard for monitoring.
Source of data collection: Azure Resource graph explorer type microsoft.hybridcompute/machines. For azure native VMs use microsoft.compute/machines
Monitoring connectivity from source to destination for ports such as 443, 445 source and destination subnets, availability, latency.
How to enable network connection Monitor for log collection
Source for data collection: Log Analytics table NWConnectionMonitorTestResult
Visualization dashboards for Alerts can be created on Azure monitor to manage and track all alerts on one single view as shown below
Source for data collection: Azure Resource Graph query AlertsManagementResources type microsoft.alertsmanagement/alerts
A good understanding of KQL queries for log analytics, Azure resource Graph and creating effective workbooks.
References:
Azure Monitor workbook chart visualizations - Azure Monitor | Microsoft Learn
Kusto Query Language (KQL) overview - Azure Data Explorer & Real-Time Analytics | Microsoft Learn
Overview of Azure Resource Graph - Azure Resource Graph | Microsoft Learn
You must be a registered user to add a comment. If you've already registered, sign in. Otherwise, register and sign in.