Forum Discussion
Loop through the KQL query result
Hi ,
I need to trigger an alert if windows service is stopped in one of the node.
There are 2 nodes and service will be running in both nodes or at one node .
Only If service is not running in both the node then alert need to be triggered.
I'm using the below query and its not right. because alert will be triggered if the service is stopped in one of the node as the query fetches the latest record
let status =
Event
| where TimeGenerated > ago (1d)
| where EventLog == 'System' and EventID == 7036 and Source == 'Service Control Manager' and RenderedDescription has "Apache tomcat"
| parse kind=relaxed EventData with * '<Data Name="param1">' Windows_Service_Name '</Data><Data Name="param2">' Windows_Service_State '</Data>' *
| summarize (TimeGenerated, winstatus) = arg_max(TimeGenerated, Windows_Service_State) by Windows_Service_Name, Computer;
status
| where winstatus != 'running'
| project winstatus, Windows_Service_Name, Computer, TimeGenerated
The above query works well if there's only one VM but for multiple VM's it wont work.
I tried to count the result if service is stopped in both Vms and alert trigger if count value is 2 but then again in Event logs sometimes there will be only one result (if there's no change in state of event within the time frame that used in query) so this method will not work either.
sample result for
Event
| where TimeGenerated > ago (1d)
| where EventLog == 'System' and EventID == 7036 and Source == 'Service Control Manager' and RenderedDescription has "Apache tomcat"
| parse kind=relaxed EventData with * '<Data Name="param1">' Windows_Service_Name '</Data><Data Name="param2">' Windows_Service_State '</Data>' *
| summarize (TimeGenerated, winstatus) = arg_max(TimeGenerated, Windows_Service_State) by Windows_Service_Name, Computer;
| 6/28/2021, 2:01:55.930 AM | Apache Tomcat 8.5.58 | apacheNode1 | running | |
| 6/28/2021, 1:02:54.257 AM | Apache Tomcat 8.5.58 | apacheNode2 | running |
How to loop / check if all the rows that returned for winstatus are != 'running'.
Regards,
Racheal
Racheal2k I think you tried this before?
let status = Event | where TimeGenerated > ago (1d) | where EventLog == 'System' and EventID == 7036 and Source == 'Service Control Manager' and RenderedDescription has 'WMI Performance Adapter' //"Apache tomcat" | parse kind=relaxed EventData with * '<Data Name="param1">' Windows_Service_Name '</Data><Data Name="param2">' Windows_Service_State '</Data>' * | summarize count(), (TimeGenerated, winstatus) = arg_max(TimeGenerated, Windows_Service_State) by Windows_Service_Name, Computer; status | extend winstatus = iif(winstatus == 'running',1,0) | summarize sumif(winstatus, winstatus > 0), ComputersOK = make_set_if(Computer, winstatus > 0), ComputerNotOk = make_set_if(Computer, winstatus == 0) | extend ServiceStatus = iif(sumif_winstatus > 0, "The service is running"," The Service is not runnimg")
6 Replies
- CliveWatsonFormer Employeemaybe add a last line of
| summarize anyif(winstatus !="stopped", true)- Racheal2kCopper Contributor
CliveWatson Thanks
This command is not clear to me
because I used,
| summarize anyif(winstatus !="stopped", true) --> returns false // . As per the query i think if status is not equal to stopped in any of the VM then returns true else returns false . this returns false because service is stopped in one of the VMalso checked
| summarize anyif(winstatus !="running", true) -> returns true// . As per the query i think if status is not equal to running in any of the VM then returns true else returns false . this returns true even though the service is running in one of the VMHere's the VM service status
6/28/2021, 10:00:08.173 AM stopped apacheNode1
6/28/2021, 10:07:53.470 AM running apacheNode2Modified query
let status =
Event
| where TimeGenerated > ago (1d)
| where EventLog == 'System' and EventID == 7036 and Source == 'Service Control Manager' and RenderedDescription has "Apache"
| parse kind=relaxed EventData with * '<Data Name="param1">' Windows_Service_Name '</Data><Data Name="param2">' Windows_Service_State '</Data>' *
| summarize (TimeGenerated, winstatus) = arg_max(TimeGenerated, Windows_Service_State) by Windows_Service_Name, Computer
| summarize status= anyif(winstatus != "stopped", true);
status
| where status == 'false'
| project status- Racheal2kCopper ContributorHI ,
Noticed that same query sometimes returns true and sometimes returns False.
I think it returns the status from the last record in the result set.