Deployment Guide

Pre-requisites

To deploy this solution, ensure you have access to an Azure subscription with the necessary permissions to create resource groups, resources, app registrations, and assign roles at the resource group level. This should include Contributor role at the subscription level and Role Based Access Control (RBAC) permissions at the subscription and/or resource group level.

Check the Azure Products by Region page and select a region where the following services are available:

Here are some example regions where the services are available: East US, East US2, Australia East, UK South, France Central.

Important Note for PowerShell Users

If you encounter issues running PowerShell scripts due to the policy of not being digitally signed, you can temporarily adjust the ExecutionPolicy by running the following command in an elevated PowerShell session:

Set-ExecutionPolicy -Scope Process -ExecutionPolicy Bypass

This will allow the scripts to run for the current session without permanently changing your system's policy.

Deployment Options & Steps

Pick from the options below to see step-by-step instructions for GitHub Codespaces, VS Code Dev Containers, VS Code Web, and Local Environments.

Deploy in GitHub Codespaces

GitHub Codespaces

You can run this solution using GitHub Codespaces. The button will open a web-based VS Code instance in your browser:

Open the solution accelerator (this may take several minutes):
Accept the default values on the create Codespaces page.
Open a terminal window if it is not already open.
Continue with the deploying steps.

Deploy in VS Code

VS Code Dev Containers

You can run this solution in VS Code Dev Containers, which will open the project in your local VS Code using the Dev Containers extension:

Start Docker Desktop (install it if not already installed).
Open the project:
In the VS Code window that opens, once the project files show up (this may take several minutes), open a terminal window.
Continue with the deploying steps.

Deploy in VS Code Web

Visual Studio Code Web

You can run this solution using VS Code Web, which provides a browser-based development environment with Azure integration:

Open the solution accelerator:
Sign in with your Azure account when prompted.
Select the subscription where you want to deploy the solution.
Wait for the environment to initialize (includes all deployment tools).
Once the solution opens, the AI Foundry terminal will automatically start. If prompted, choose "Overwrite with versions from template" and provide a unique environment name.
Authenticate with Azure (VS Code Web requires device code authentication):
```
azd auth login --use-device-code
```
Note: In VS Code Web environment, the regular azd auth login command may fail. Use the --use-device-code flag to authenticate via device code flow. Follow the prompts in the terminal to complete authentication.
Continue with the deploying steps.

Deploy in your local Environment

Local Environment

If you're not using one of the above options for opening the project, then you'll need to:

Make sure the following tools are installed:
- PowerShell (v7.0+) - available for Windows, macOS, and Linux.
- Azure Developer CLI (azd) (v1.15.0+)
- Python 3.11+
- Node.js 18+
- Docker Desktop
- Git
Clone the repository or download the project code via command-line:
```
azd init -t microsoft/content-generation-solution-accelerator/
```
Open the project folder in your terminal or editor.
Continue with the deploying steps.

Consider the following settings during your deployment to modify specific settings:

Configurable Deployment Settings

When you start the deployment, most parameters will have default values, but you can update the following settings:

Setting	Description	Default value
Azure Region	The region where resources will be created.	(empty)
Environment Name	A 3–20 character alphanumeric value used to generate a unique ID to prefix the resources.	env_name
GPT Model	Choose from gpt-4, gpt-4o, gpt-4o-mini.	gpt-4o-mini
GPT Model Version	The version of the selected GPT model.	2024-07-18
OpenAI API Version	The Azure OpenAI API version to use.	2025-01-01-preview
GPT Model Deployment Capacity	Configure capacity for GPT models (in thousands).	30k
DALL-E Model	DALL-E model for image generation.	dall-e-3
Image Tag	Docker image tag to deploy. Common values: `latest`, `dev`, `hotfix`.	latest
Use Local Build	Boolean flag to determine if local container builds should be used.	false
Existing Log Analytics Workspace	To reuse an existing Log Analytics Workspace ID.	(empty)
Existing Azure AI Foundry Project	To reuse an existing Azure AI Foundry Project ID instead of creating a new one.	(empty)

[Optional] Quota Recommendations

By default, the GPT-4o-mini model capacity in deployment is set to 30k tokens, so we recommend updating the following:

For GPT-4o-mini - increase the capacity to at least 150k tokens post-deployment for optimal performance.

For DALL-E 3 - ensure you have sufficient capacity for image generation requests.

Depending on your subscription quota and capacity, you can adjust quota settings to better meet your specific needs.

⚠️ Warning: Insufficient quota can cause deployment errors. Please ensure you have the recommended capacity or request additional capacity before deploying this solution.

Deploying with AZD

Once you've opened the project in Codespaces, Dev Containers, or locally, you can deploy it to Azure by following the steps in the AZD Deployment Guide

Post Deployment Steps

Add App Authentication

Follow steps in App Authentication to configure authentication in app service. Note: Authentication changes can take up to 10 minutes.
Assign RBAC Roles (if needed)

If you encounter 401/403 errors, run the RBAC assignment script and wait 5-10 minutes for propagation:
```
bash ./scripts/assign_rbac_roles.sh
```
Deleting Resources After a Failed Deployment
- Follow steps in Delete Resource Group if your deployment fails and/or you need to clean up the resources.

Troubleshooting

Common Issues and Solutions

401 Unauthorized Errors

Symptom: API calls return 401 errors

Cause: Missing RBAC role assignments

Solution: Run assign_rbac_roles.sh and wait 5-10 minutes for propagation

403 Forbidden from Cosmos DB

Symptom: Cosmos DB operations fail with 403

Cause: Missing Cosmos DB data plane role (not ARM role)

Solution: Use az cosmosdb sql role assignment create (not az role assignment create)

SSE Streaming Not Working

Symptom: Long responses timeout, no streaming updates

Causes:

HTTP/2 enabled on App Service (breaks SSE)
Proxy timeout too short

Solution:

az webapp config set -g $RESOURCE_GROUP -n <app-name> --http20-enabled false

Backend Not Accessible

Symptom: Frontend cannot reach backend API

Cause: VNet/DNS configuration issues

Solution:

Verify VNet integration is enabled on App Service
Verify private DNS zone is linked to VNet
Verify A record points to correct ACI IP
Check if ACI IP changed (run update_backend_dns.sh)

Image Generation Not Working

Symptom: DALL-E requests fail

Cause: Missing DALL-E model deployment or incorrect endpoint

Solution:

Verify DALL-E 3 deployment exists in Azure OpenAI resource
Check AZURE_OPENAI_DALLE_ENDPOINT and AZURE_OPENAI_DALLE_DEPLOYMENT environment variables

Environment Variables Reference

Backend Environment Variables (ACI)

Variable	Description	Example
AZURE_OPENAI_ENDPOINT	GPT model endpoint	https://ai-account.cognitiveservices.azure.com/
AZURE_OPENAI_DEPLOYMENT_NAME	GPT deployment name	gpt-4o-mini
AZURE_OPENAI_DALLE_ENDPOINT	DALL-E endpoint	https://dalle-account.cognitiveservices.azure.com/
AZURE_OPENAI_DALLE_DEPLOYMENT	DALL-E deployment name	dall-e-3
COSMOS_ENDPOINT	Cosmos DB endpoint	https://cosmos.documents.azure.com:443/
COSMOS_DATABASE	Database name	content-generation
AZURE_STORAGE_ACCOUNT_NAME	Storage account	storagecontentgen
AZURE_STORAGE_CONTAINER	Product images container	product-images
AZURE_STORAGE_GENERATED_CONTAINER	Generated images container	generated-images

Frontend Environment Variables (App Service)

Variable	Description	Example
BACKEND_URL	Backend API URL	http://backend.contentgen.internal:8000
WEBSITES_PORT	App Service port	3000

Sample Prompts

To help you get started, here are some sample prompts you can use with the Content Generation Solution:

"Create a product description for a new eco-friendly water bottle"
"Generate marketing copy for a summer sale campaign"
"Write social media posts promoting our latest product launch"
"Create an image for a blog post about sustainable living"
"Generate a product image showing a modern office setup"

These prompts serve as a great starting point to explore the solution's capabilities with text generation, image generation, and content management.

Architecture Overview

The solution consists of:

Backend: Python 3.11 + Quart + Hypercorn running in Azure Container Instance (ACI) with VNet integration
Frontend: React + Vite + TypeScript + Fluent UI running on Azure App Service with Node.js proxy
AI Services:
- Azure OpenAI (GPT model for text generation)
- Azure OpenAI (DALL-E 3 for image generation)
Data Services:
- Azure Cosmos DB (products catalog, conversations)
- Azure Blob Storage (product images, generated images)
Networking:
- Private VNet for backend container
- App Service with VNet integration for frontend-to-backend communication
- Private DNS zone for internal name resolution

Security Considerations

Managed Identity: The solution uses system-assigned managed identity instead of connection strings
Private VNet: Backend runs in private subnet, not exposed to internet
RBAC: Principle of least privilege - only necessary roles are assigned
No Secrets in Code: All credentials managed through Azure identity

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Deployment Guide

Pre-requisites

Important Note for PowerShell Users

Deployment Options & Steps

GitHub Codespaces

VS Code Dev Containers

Visual Studio Code Web

Local Environment

Deploying with AZD

Post Deployment Steps

Troubleshooting

401 Unauthorized Errors

403 Forbidden from Cosmos DB

SSE Streaming Not Working

Backend Not Accessible

Image Generation Not Working

Environment Variables Reference

Sample Prompts

Architecture Overview

Security Considerations

FilesExpand file tree

DEPLOYMENT.md

Latest commit

History

DEPLOYMENT.md

File metadata and controls

Deployment Guide

Pre-requisites

Important Note for PowerShell Users

Deployment Options & Steps

GitHub Codespaces

VS Code Dev Containers

Visual Studio Code Web

Local Environment

Deploying with AZD

Post Deployment Steps

Troubleshooting

401 Unauthorized Errors

403 Forbidden from Cosmos DB

SSE Streaming Not Working

Backend Not Accessible

Image Generation Not Working

Environment Variables Reference

Sample Prompts

Architecture Overview

Security Considerations