Enhancing Performance of the FHIR Loader

The FHIR Loader deploys with a Standard App Service plan that can support tens of thousands file imports per hour. During testing though we have been able to scale the Loader performance to hundreds of thousands of files per hour.

Note: Scaling to hundreds of thousands of files per hour requires additional scaling on the FHIR API to handle the incoming messages. High rates of 429's at the API or Cosmos data plane indicate that additional scaling is necessary.

Conditions

Load scaling and performance turning were done in several Azure Regions
Load testing was created with Synthea generator, each Patient had 1 year of data
Load testing was measured with Patient only and Patient + Encounter bundles. Patients only bundles performed marginally better
Load testing zip (compressed-archive) bundles were limited to 300 resoruces to maintain queue depth, anything over 300 produced a growing delay for commits to the service bus
For brevity, the FHIR Loader works with both the Azure API for FHIR and the Microsoft FHIR Server, this doc uses API for FHIR to represent both services

File Preparation

The Loader will process files in several formats

Bundles - Bundles are a collection of FHIR resources placed into a single file with containing context
NDJSON - a.k.a Newline delimited JSON, is a format for storing structured data that may be processed one record at a time
Zip - Compressed FHIR bundles (zip format) allows users to move large amounts of files efficienly through the Azure network

Of the three formats, zip provides the fastest load time.

Environment

The Loader relies heavily on Network based storage, therefore we recommend that you deploy the Loader in the same region as your API for FHIR service.

Resource Group

The Loader can be stored in a resource group other than the API for FHIR. It is extremely important though that all Secrity measures be implemented to protect PHI in the cloud.

Storage Container

Storage was set to Standard / Hot Access tier. Depending on how files are being moved into Azure (ie Azure Data Factor vs. AzCopy) customer may want to consider having a seperate storage account for input storage then simply moving the files from incoming storage to the Loader containers.

Additional Storage Performance ideas can be found here

App Service Plan

Within an Azure App Service, the compute resources you use are determined by the App Service plan that you run your apps on. All load testing was performed on Production scale P2V3 systems with scale out instances ranging from 5-17. Once load testing is finished the App Service Plan can be safely scaled back to reduce costs.

For more information see Azure App Service plans overview

Function App

The Loader Function app comes with a MAXCONNECTION setting of 20 processes for connecting to backend systems. Be cautious when adjusting this as queue's into Service Bus and Event Grid can increase delay to maintain consistency / order. If the number of MAXCONNECTIONS is too high, the Service Bus and Event Grid will exponentially slow down.

Our Function code follows Azure Best practices for performance and reliability

API for FHIR

Azure API for FHIR uses database to store its data. Performance of the underlying database depends on the number of Request Units (RU) selected during service provisioning or in database settings after the service has been provisioned. Throughput must be provisioned to ensure that sufficient system resources are available for your database at all times. How many RUs you need for your application depends on operations you perform. Operations can range from simple read and writes to more complex queries.

For Loader performance testing we used 50,000 RU's.

More information on Azure API for FHIR performance can be found here

Application Insights

The FHIR Loader is deploed with Application Insights and it is recommeneded that customers use Application Insights to monitor Loader performance.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

performance.md

performance.md

Enhancing Performance of the FHIR Loader

Conditions

File Preparation

Environment

Resource Group

Storage Container

App Service Plan

Function App

API for FHIR

Application Insights

Files

performance.md

Latest commit

History

performance.md

File metadata and controls

Enhancing Performance of the FHIR Loader

Conditions

File Preparation

Environment

Resource Group

Storage Container

App Service Plan

Function App

API for FHIR

Application Insights