Scaling Logic App Standard – High Throughput Batched Data Processing System
Published Jul 06 2023

In the previous post in this blog series, we discussed how Logic Apps Standard can be configured to scale for high-throughput workloads. In this post, we showcase an application built with Azure Integration Services components that streamlines the processing of a large batch of orders which must be completed within a predetermined amount of time.

The orders arrive as Service Bus messages, each representing an invoice stored securely in a blob container. The Logic App workflows respond to each Service Bus message by initiating a series of actions: retrieving the corresponding invoice from blob storage, invoking services such as a rules engine to process the invoice, and finally transmitting the transformed data to the appropriate backend systems.

To meet the scaling requirements, the solution divides the processing across two Logic Apps, each housing one or more workflows. Separating the workloads this way lets each app scale independently, resulting in faster scale-out and more efficient resource utilization.

We also present performance metrics that show the levels of scalability this application can achieve. We hope you can use this as a blueprint for building scalable applications with Logic Apps and other Azure Integration Services components.

 

  • Invoice Preprocessing Logic App

The Service Bus messages are generated by an ETL pipeline that extracts each invoice into a separate blob and then notifies Logic Apps by dropping a message in the Service Bus queue. A daily influx of 1 million invoices is ingested through this process. A workflow with a Service Bus trigger promptly picks up the messages and invokes a child workflow to initiate the preprocessing phase. The child workflow orchestrates several data transformation steps and collaborates with other services to enrich each invoice with additional data. Once preprocessing is complete, the modified invoice is saved back to a blob, and the ingestion system is notified by dropping a message on another Service Bus queue. A sketch of this receive-and-dispatch pattern is shown below.
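As a rough sketch of this pattern, the receiving workflow's definition in Logic Apps Standard might look like the following: a built-in Service Bus trigger that hands each message to a nested workflow through a Workflow action. The queue, connection, and workflow names here are placeholders, and the exact operation ID and parameter names can differ across runtime and connector versions, so treat this as illustrative rather than the exact definition used in the test.

```json
{
  "definition": {
    "$schema": "https://schema.management.azure.com/providers/Microsoft.Logic/schemas/2016-06-01/workflowdefinition.json#",
    "contentVersion": "1.0.0.0",
    "triggers": {
      "When_invoice_messages_arrive": {
        "type": "ServiceProvider",
        "inputs": {
          "parameters": { "queueName": "invoice-notifications" },
          "serviceProviderConfiguration": {
            "connectionName": "serviceBus",
            "operationId": "receiveQueueMessages",
            "serviceProviderId": "/serviceProviders/serviceBus"
          }
        }
      }
    },
    "actions": {
      "Invoke_preprocess_child_workflow": {
        "type": "Workflow",
        "inputs": {
          "host": { "workflow": { "id": "invoice-preprocess" } },
          "body": "@triggerBody()"
        },
        "runAfter": {}
      }
    },
    "outputs": {}
  },
  "kind": "Stateful"
}
```

Keeping the receiving workflow this small lets it drain the queue quickly, while the Request-triggered child workflow carries the heavier enrichment and transformation actions.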

 

  • Invoice Ingestion Logic App

A single Service Bus-triggered workflow picks up the messages coming from the invoice preprocessing system and orchestrates calls to various backend systems; a sketch of one such backend call follows.
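The backend calls in this workflow are a mix of built-in connector actions (Service Bus, SQL, Blob, Liquid) and HTTP actions. As a hedged illustration, a fragment of the workflow's actions section might look like the sketch below; the endpoint URI and payload shape are hypothetical.

```json
{
  "Compose_backend_payload": {
    "type": "Compose",
    "inputs": {
      "invoiceId": "@triggerBody()?['invoiceId']",
      "status": "preprocessed"
    },
    "runAfter": {}
  },
  "Post_invoice_to_backend": {
    "type": "Http",
    "inputs": {
      "method": "POST",
      "uri": "https://backend.example.com/api/invoices",
      "headers": { "Content-Type": "application/json" },
      "body": "@outputs('Compose_backend_payload')"
    },
    "runAfter": {
      "Compose_backend_payload": [ "Succeeded" ]
    }
  }
}
```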

 

 

[Diagram: solution architecture]

 

 

 

The two Logic Apps are configured as follows:

| | Invoice-preprocessing Logic App | Invoice Ingestion Logic App |
|---|---|---|
| Number of workflows | 2 | 1 |
| Triggers | Service Bus; Request (for nested workflow) | Service Bus |
| Workflows (action count) | Receive (25), Preprocess (40) | Invoice-ingestion (90) |
| Action types | ServiceBus, SQL, Blob, Compose, Query, Variables, MQ, HTTP, Functions, JavaScript | ServiceBus, SQL, Blob, Variables, Liquid, Query, HTTP |
| Number of storage accounts | 5 | 5 |
| Prewarmed instances | 20 | 20 |
| WS Plan | WS3 | WS3 |
| Max scale setting | 100 | 100 |
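The storage fan-out and scale-out limits in the table above are driven by configuration rather than workflow code. The sketch below groups the relevant settings into one illustrative JSON document, assuming the multi-storage-account app settings described in the Logic Apps Standard scaling guidance (Runtime.ScaleUnitsCount plus one connection string per scale unit) and the App Service site properties for prewarmed and maximum instance counts; the grouping is not an actual file format, and the exact setting names should be verified against the current documentation for your runtime version.

```json
{
  "appSettings": {
    "Runtime.ScaleUnitsCount": "5",
    "CloudStorageAccount.Workflows.ScaleUnitsDataStorage.CU00.ConnectionString": "<storage-account-1-connection-string>",
    "CloudStorageAccount.Workflows.ScaleUnitsDataStorage.CU01.ConnectionString": "<storage-account-2-connection-string>",
    "CloudStorageAccount.Workflows.ScaleUnitsDataStorage.CU02.ConnectionString": "<storage-account-3-connection-string>",
    "CloudStorageAccount.Workflows.ScaleUnitsDataStorage.CU03.ConnectionString": "<storage-account-4-connection-string>",
    "CloudStorageAccount.Workflows.ScaleUnitsDataStorage.CU04.ConnectionString": "<storage-account-5-connection-string>"
  },
  "siteConfig": {
    "minimumElasticInstanceCount": 20,
    "functionAppScaleLimit": 100
  }
}
```

Spreading runtime state across five storage accounts is what keeps the per-account transaction rate at roughly half of the storage account limits during peak load, as shown in the storage metrics later in this post.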

 

Performance characteristics

 

Invoice Preprocessor

 

Trigger Rate

  • Receiving over 50K messages/min at peak and completing about 1M messages within 40-50 minutes.
  • Kicking off nested workflow triggers at 15K/min at peak, with over 2M runs started within about 4 hours.

[Chart: trigger rate]

 

Action Execution Rate

  • 850K actions/min peak execution rate for the invoice receiving workflow.
  • 150K actions/min sustained execution rate for the invoice preprocessing workflow.

[Chart: action execution rate]

 

Job Execution Rate

  • Execution rate starting at around 300K jobs/min, ramping up to a peak of 1M jobs/min within about 40 minutes, and then settling at a sustained rate of 200K jobs/min for the preprocessing workflow.

 

[Chart: job execution rate]

 

Execution delay and instance scaling

  • The app had a prewarmed instance count of 20 and took about 40 minutes to scale out to 80 instances.
  • The 95th percentile execution delay stayed below 200ms for the most part but increased to about 400s during the compute ramp-up. The 50th percentile was 600ms at peak and less than 10ms most of the time. This is expected, since more jobs were queued during the ramp-up period than were dequeued for processing.

Execution Delay

[Chart: execution delay]

 

Instance Count

[Chart: instance count]

 

Storage Transactions

  • This application used 5 storage accounts, each with the transaction volume shown below, reaching 10K transactions/s at peak (about 50% of the storage account limit).

[Chart: storage transactions per account]

 

Storage Ingress

  • The total per-account ingress at peak was 600 Mbps.

 

[Chart: storage ingress per account]

 

Invoice Ingestion Logic App

 

Trigger Rate

  • Receiving about 10K/min preprocessed invoice messages at a sustained rate from the preprocessing Logic App for 4.5 hours (over 1M total messages).

[Chart: trigger rate]

 

Action Execution Rate

  • Ramping up to a sustained action execution rate of 400K actions/min within about an hour and continuing at that rate for more than 3 hours.

[Chart: action execution rate]

 

Job Execution Rate

  • Execution rate ramping up to about 500K jobs/min in about 40 minutes and continuing at that rate for the rest of the run.

[Chart: job execution rate]

 

Execution delay and instance scaling

  • The app had a prewarmed instance count of 20 and took about 40 minutes to scale out to its full capacity of 100 instances.
  • The 95th percentile execution delay stayed below 50ms for the most part but increased to about 60s during the compute ramp-up. The 50th percentile was 70ms at peak and less than 20ms most of the time. This is expected, since more jobs were queued during the ramp-up period than were dequeued for processing.

 

Execution Delay

[Chart: execution delay]

 

Instance count

[Chart: instance count]

 

Results Summary

 

 

| | Invoice-preprocessing Logic App | Invoice Ingestion Logic App |
|---|---|---|
| Total number of invoices processed | 1,000,000 | 1,000,000 |
| Total processing time | 4.5 hours | 4.5 hours |
| Triggers | 50K/min peak Service Bus message read; 10K/min sustained read; 1M messages received in about 40 minutes | 10K/min sustained Service Bus message read |
| Actions | 850K actions/min peak rate for the receiving workflow; 150K actions/min sustained rate for the preprocessing workflow; 65M total actions executed | 400K actions/min sustained execution rate; 90M total actions executed |
| Jobs | 1M jobs/min at peak; 200K jobs/min sustained | 500K jobs/min sustained |
| Execution delay | 95th percentile increased up to 400s during scale-out and returned below 200ms at sustained load | 95th percentile increased up to 60s during scale-out and returned below 20ms at sustained load |
| Scale out | Scaling from 20 to 80 instances took about 40 minutes | Scaling from 20 to 100 instances took about 40 minutes |

 

 

 

 

 

Remarks

  • This case study can serve as a blueprint for designing batch workload processing.
  • It demonstrates the scaling characteristics and workload capacity of a single Logic App.
  • In this example, the total processing time of 4.5 hours remained well within the expected limits for the application. However, if necessary, the same two-app configuration can be replicated across multiple scale units to achieve horizontal scalability and reduce the overall processing time. For instance, with an average action execution rate of about 500K/min in the two-app configuration, deploying a second two-app scale unit would double the aggregate action execution rate, enabling the same workload to be processed in roughly half the time.

 

 
