Deployment Guide

This repository presents a solution and reference architecture for the Knowledge Mining solution accelerator. Please note that the provided code serves as a demonstration and is not an officially supported Microsoft offering.

For additional security, please review how to use Azure API Management with microservices deployed in Azure Kubernetes Service.

Prerequisites
Regional Availability
Deployment
Next Steps

Prerequisites

PowerShell (v5.1+) - available for Windows, macOS, and Linux.
Azure CLI (v2.0+) - command-line tool for managing Azure resources.

2a. kubectl - command-line tool for interacting with Kubernetes clusters.
In PowerShell, run the following command:
```
 az aks install-cli
```
2b. aks-preview - extension for Azure CLI to manage Azure Kubernetes Service.
In PowerShell, run the following command:
```
 az extension add --name aks-preview
```
Helm - package manager for Kubernetes
Docker Desktop: service to containerize and publish into Azure Container Registry. Please make sure Docker desktop is running before executing Deployment script.
Azure Access - subscription-level Owner or User Access Administrator role required.
Microsoft.Compute Registration - Ensure that Microsoft.Compute is registered in your Azure subscription by following these steps:
1. Log in to your Azure Portal.
2. Navigate to your active Azure subscription.
3. Go to Settings and select Resource Providers.
4. Check for Microsoft.Compute and click Register if it is not already registered.

Regional Availability

Due to model availability within various data center regions, the following services have been hard-coded to specific regions.

Azure Open AI (GPT 4o mini):
The solution relies on GPT-4o mini and text-embedding-3-large models which are all currently available in the 'WestUS3', 'EastUS', 'EastUS2', 'SwedenCentral' region.
Please check the model summary table and region availability if needed.
Azure AI Document Intelligence (East US):
The solution relies on a 2023-10-31-preview or later that is currently available in East US region.
The deployment region for this model is fixed in 'East US'

Deployment

The automated deployment process is very straightforward and simplified via a single deployment script that completes in approximately 10-15 minutes:

Automated Deployment Steps:

Deploy Azure resources.
Get secret information from Azure resources.
Update application configuration files with secrets.
Set Application Configuration in Azure App Configuration.
Compile application, build image, and push to Azure Container Registry.
Configure Kubernetes cluster infrastructure.
Update Kubernetes configuration files.
Deploy certificates, ingress controller and then application images from Azure Container Registry.

Execute Deployment Script:

Open PowerShell, change directory where you code cloned, then run the deploy script:

cd .\Deployment\

.\resourcedeployment.ps1

If you run into issue with PowerShell script file not being digitally signed, you can execute below command:

powershell.exe -ExecutionPolicy Bypass -File ".\resourcedeployment.ps1"

You will be prompted for the following parameters with this Screen :

Tenant ID - The Azure Active Directory (AAD) tenant ID. This is used for authenticating against Azure resources. Copy this from the Azure portal. Example: xxxxxxxx-xxxx-xxxx-xxxx-xxxxxxxxxxxx
Subscription ID - The Azure subscription ID where resources will be deployed. Copy this from the Azure portal.
Environment Name - A unique environment name (e.g., dev, test, prod). This is used to scope resource names and group deployments logically.
Resource Group Name - The Azure resource group to deploy resources into. You may either:
- Specify an existing resource group to reuse it, see below for more details, or
- Leave blank to auto-generate a new name.

Location - Azure data center where resources will be deployed.

Please check Azure resource availability and note hardcoded regions. The following locations are currently supported:

'EastUS', 'EastUS2', 'WestUS', 'WestUS2', 'WestUS3', 'CentralUS', 'NorthCentralUS', 'SouthCentralUS','WestEurope', 'NorthEurope', 'SoutheastAsia', 'EastAsia', 'JapanEast', 'JapanWest', 'AustraliaEast', 'AustraliaSoutheast', 'CentralIndia', 'SouthIndia', 'CanadaCentral','CanadaEast', 'UKSouth', 'UKWest', 'FranceCentral', 'FranceSouth', 'KoreaCentral','KoreaSouth', 'GermanyWestCentral', 'GermanyNorth', 'NorwayWest', 'NorwayEast', 'SwitzerlandNorth', 'SwitzerlandWest', 'UAENorth', 'UAECentral', 'SouthAfricaNorth','SouthAfricaWest', 'BrazilSouth','BrazilSoutheast', 'QatarCentral', 'ChinaNorth', 'ChinaEast', 'ChinaNorth2', 'ChinaEast2'

ModelLocation - Azure data center where GPT model will be deployed.
The following locations are currently available :
```
'WestUS3', 'EastUS', 'EastUS2', 'SwedenCentral'
```
Email - used for issuing certificates in Kubernetes clusters from the Let's Encrypt service. Email address should be valid.
GO ! - Deployment Script executes Azure deployment, Azure Infrastructure configuration, Application code compile and publish into Kubernetes Cluster.

Configuring a New or Existing Resource Group

➕ Creating a New Resource Group

You have two options:

Manually specify a resource group name (e.g., rg-myproject-dev)
Leave the input field blank — a new name will be auto-generated by the script

🔁 Using an Existing Resource Group

If reusing an existing Azure Resource Group:

Provide the exact name of the existing resource group
Ensure the environment name matches the original environment used for that resource group

⚠️ After deployment, please restart the AKS (Kubernetes) service to ensure updated configurations are applied when using a reused resource group.

Manual Deployment Steps:

Create Content Filter - Please follow below steps

Navigate to project in Azure OpenAI, then go to Azure AI Foundry, select Safety + security

Click on Create Content Filter and set the filters to a high threshold for the following categories: Hate, Sexual, Self-harm, Violence

Please select the checkbox of profanity

Leave all other configurations at their default settings and click on create

Deployment Complete

🥳🎉 First, congrats on finishing Deployment!

Let's check the message and configure your model's TPM rate higher to get better performance.
You can check the Application URL from the final console message.
Don't miss this Url information. This is the application's endpoint URL and it should be used for your data importing process.

Next Steps

1. Configure Azure OpenAI Rate Limits

Capacity Note:

The deployment script creates models with a setting of 1 token per minute (TPM) rate limit.

Faster performance can be achieved by increasing the TPM limit with Azure AI Foundry.

Capacity varies for regional quota limits as well as for provisioned throughput.

As a starting point, we recommend the following quota threshold be set up for this service run.

Model Name	TPM Threshold
GPT-4o-mini	100K TPM
text-embedding-3-large	200K TPM

⚠️ Warning: Insufficient quota can cause failures during the upload process. Please ensure you have the recommended capacity or request for additional capacity before start uploading the files.

Browse to the project in Azure AI Foundry, and select each of the 2 models within the Deployments menu:

Increase the TPM value for each model for faster report generation:

2. Data Uploading and Processing

After increasing the TPM limit for each model, let's upload and process the sample documents.

cd .\Deployment\

Execute uploadfiles.ps1 file with -EndpointUrl parameter as URL in console message.

.\uploadfiles.ps1 -EndpointUrl https://kmgs<your dns name>.<datacenter>.cloudapp.azure.com

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Deployment Guide

Contents

Prerequisites

Regional Availability

Deployment

Automated Deployment Steps:

Execute Deployment Script:

Configuring a New or Existing Resource Group

Manual Deployment Steps:

Deployment Complete

🥳🎉 First, congrats on finishing Deployment!

Next Steps

1. Configure Azure OpenAI Rate Limits

2. Data Uploading and Processing

FilesExpand file tree

DeploymentGuide.md

Latest commit

History

DeploymentGuide.md

File metadata and controls

Deployment Guide

Contents

Prerequisites

Regional Availability

Deployment

Automated Deployment Steps:

Execute Deployment Script:

Configuring a New or Existing Resource Group

Manual Deployment Steps:

Deployment Complete

🥳🎉 First, congrats on finishing Deployment!

Next Steps

1. Configure Azure OpenAI Rate Limits

2. Data Uploading and Processing