performance

3 Topics

How to connect to multiple Kusto clusters and add users with reader permission?
I have to add an app ID and tenant ID to grant permission to an application, and we have almost 38 public Kusto clusters. I would like to automate this process so I don't have to connect to each cluster and run the command below every time: .add database [db_name] viewers('aadapp=APP_GUID_HERE;tenant_id here') "TEAM_NAME_HERE" Is there a way I can write a script that connects to all 38 clusters one by one and runs the above command?
itshemant09
Oct 19, 2022 Place Azure Data Explorer
1.2K Views
0 likes
0 Comments
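One possible shape for the script asked about above is a loop that sends the same management command to each cluster in turn. This is only a sketch under stated assumptions: the cluster URLs, database name, and IDs are placeholders, and `run_command` is a hypothetical callable that you would supply yourself (for example, a thin wrapper around a Kusto client's management-command call). The database name is assumed to be a simple identifier that needs no quoting.

```python
from typing import Callable, Dict, Iterable


def build_add_viewer_command(database: str, app_id: str, tenant_id: str, note: str) -> str:
    """Build the `.add database ... viewers` command quoted in the question."""
    return f".add database {database} viewers ('aadapp={app_id};{tenant_id}') \"{note}\""


def grant_viewer_on_all(
    clusters: Iterable[str],
    database: str,
    app_id: str,
    tenant_id: str,
    note: str,
    run_command: Callable[[str, str, str], None],
) -> Dict[str, str]:
    """Run the command against every cluster and return a per-cluster status.

    `run_command(cluster_url, database, command)` is a hypothetical hook;
    it is expected to raise an exception if the command fails.
    """
    command = build_add_viewer_command(database, app_id, tenant_id, note)
    results: Dict[str, str] = {}
    for cluster in clusters:
        try:
            run_command(cluster, database, command)
            results[cluster] = "ok"
        except Exception as exc:
            # Record the failure and continue, so one unreachable cluster
            # does not stop the remaining ones from being processed.
            results[cluster] = f"failed: {exc}"
    return results


if __name__ == "__main__":
    # Dry run: print what would be sent instead of contacting any cluster.
    def dry_run(cluster: str, database: str, command: str) -> None:
        print(f"{cluster} [{database}]: {command}")

    placeholder_clusters = [
        "https://cluster1.example.kusto.windows.net",  # placeholder URLs
        "https://cluster2.example.kusto.windows.net",
    ]
    statuses = grant_viewer_on_all(
        placeholder_clusters, "db_name", "APP_GUID_HERE", "TENANT_ID_HERE",
        "TEAM_NAME_HERE", dry_run,
    )
    print(statuses)
```

Keeping the command execution behind a callable makes it possible to do a dry run first and to review the per-cluster results before swapping in a real client.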
Azure Data Explorer API Limits
I'm making REST API requests, and am getting "An existing connection was forcibly closed by the remote host" -- I can't find in your otherwise awesome docs what the API request limits are, could you help?
JoeCrockett
Mar 01, 2021 Place Azure Data Explorer
2.5K Views
0 likes
3 Comments
Most efficient way to identify duplicates in data?
We're moving data analytics towards Kusto, and one feature we would like to have is a sanity check that finds duplicate rows in a dataset. The problem I want to solve is to make sure we don't accidentally ingest duplicates and then report sales numbers that are too high. Our plan now is to introduce an extra column with a SHA-1 hash of the row and do something like "summarize count() by sha1 | where count_ > 1" ... but that would need to touch every column in the data set. I realize that's in the nature of the problem, but I'd like to investigate strategies that would make this as efficient as possible. Strategies I've thought of would be to first limit the dataset to a certain time window or perhaps by customerId. I also know about the ingestion tags, but I don't want to impact extents too much since this is more of a sanity check. What other strategies could we use to make this operation as efficient as possible?
abergs
Mar 14, 2019 Place Azure Data Explorer
18K Views
0 likes
1 Comment
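The hashing plan described in the question above can be illustrated outside Kusto. The following Python sketch is not KQL, and its sample rows and column names (`timestamp`, `customerId`, `amount`) are made up for illustration. It first narrows the rows to a time window, then hashes each remaining row with SHA-1 and reports the hashes seen more than once, mirroring the "summarize count() by sha1 | where count_ > 1" idea.

```python
import hashlib
from collections import Counter
from datetime import datetime
from typing import Dict, Iterable


def row_hash(row: dict) -> str:
    """SHA-1 over all columns of a row, independent of column order."""
    # Sort the keys so two rows with the same values hash identically,
    # and join with a separator that is unlikely to appear in the data.
    canonical = "\x1f".join(f"{key}={row[key]}" for key in sorted(row))
    return hashlib.sha1(canonical.encode("utf-8")).hexdigest()


def find_duplicates(
    rows: Iterable[dict],
    start: datetime,
    end: datetime,
    time_column: str = "timestamp",
) -> Dict[str, int]:
    """Return {hash: count} for rows in [start, end) that occur more than once."""
    # Filter first, so only rows inside the window are hashed.
    window = (row for row in rows if start <= row[time_column] < end)
    counts = Counter(row_hash(row) for row in window)
    return {digest: n for digest, n in counts.items() if n > 1}


if __name__ == "__main__":
    sample = [
        {"timestamp": datetime(2019, 3, 1), "customerId": 1, "amount": 10.0},
        {"timestamp": datetime(2019, 3, 1), "customerId": 1, "amount": 10.0},
        {"timestamp": datetime(2019, 3, 2), "customerId": 2, "amount": 5.0},
    ]
    print(find_duplicates(sample, datetime(2019, 3, 1), datetime(2019, 4, 1)))
```

Filtering before hashing is what keeps the check cheap: only rows inside the chosen window are ever hashed, which is the same reason the time window or customerId restriction would help in the Kusto query.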