Skip to content

Commit 4ae3326

Browse files
authored
refs
1 parent 3c28435 commit 4ae3326

1 file changed

Lines changed: 23 additions & 0 deletions

File tree

0_Azure/3_AzureAI/AIFoundry/demos/5_What-Global-DataZone-StdType.md

Lines changed: 23 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -23,6 +23,10 @@ From [Enterprise trust in Azure OpenAI Service strengthened with Data Zones](htt
2323

2424
> [!TIP]
2525
> The appropriate deployment model depends on your data residency and compliance requirements. Here's a breakdown of the three main options, and we also need to consider the different ways the models are available to be deployed:
26+
> `Global Deployments`: Suitable for scenarios where compliance with regional data laws is not required, and performance and scale are the primary concerns.
27+
> - Data `at rest and in transit are not restricted to any specific region or data zone`.
28+
> - Azure may `store or process data anywhere across its global infrastructure`.
29+
> - This model offers maximum flexibility and scalability, but provides the least control over data residency and movement.
2630
> - `Regional PTU`: Use this model when you have strict regional data residency requirements. For example, if you deploy in Sweden Central, all data (`both at rest and in transit`) remains within that `region’s boundaries.`
2731
> - `Standard Deployment`: Choose this when you need to keep data at rest within a specific Azure region, but are comfortable with some flexibility for data in transit.
2832
> - You select the region (e.g., Sweden Central).
@@ -32,6 +36,25 @@ From [Enterprise trust in Azure OpenAI Service strengthened with Data Zones](htt
3236
> - You select the `zone, not the region.`
3337
> - Azure manages the region placement internally, ensuring all data remains within the selected zone.
3438
39+
| **Deployment Type** | **Scope** | **Model Type** | **Data Residency (At Rest & In Transit)** | **Latency & Performance** | **Use Case / Description** |
40+
|--------------------------------|------------------|------------------------|------------------------------------------------------------------------------------------------------------|--------------------------------------------------------------------------------------------|---------------------------------------------------------------------------------------------|
41+
| Global Standard | Global | Standard | Data at rest stays in the designated Azure geography; data in transit may be processed in any Azure region. | Low latency under normal usage; may experience **latency variation** at high volumes. | Best for general use cases where flexibility and scale are prioritized over strict residency. |
42+
| Global Provisioned Managed | Global | Managed | Same as Global Standard, but with **reserved model capacity** across global infrastructure. | More consistent latency; suitable for **high-volume workloads** needing predictable performance. | Ideal for enterprise-grade deployments with consistent traffic and performance needs. |
43+
| Global Batch | Global | Batch | Same residency as other global types; data processed asynchronously across global regions. | **Up to 24-hour turnaround**; optimized for cost and throughput, not real-time latency. | Large-scale processing like document summarization, content generation, or NLP tasks. |
44+
| Data Zone Standard | Data Zone | Standard | Data at rest stays in the designated geography; data in transit processed within the selected data zone. | Similar to Global Standard; **latency may vary** at high volumes. | Ensures compliance with zone-level regulations (e.g., EU), with moderate performance control. |
45+
| Data Zone Provisioned Managed | Data Zone | Managed | Same as Data Zone Standard, but with **reserved capacity** within the zone. | Lower latency variance; optimized for **predictable performance** in compliant environments. | Combines compliance with performance for regulated industries. |
46+
| Data Zone Batch | Data Zone | Batch | Same as other Data Zone types; batch processing within the zone. | **Up to 24-hour turnaround**; cost-effective for non-real-time workloads. | Compliant batch processing for large datasets within a data zone. |
47+
| Regional PTU | Specific Region | PTU-based | Data at rest and in transit remain strictly within the selected Azure region. | Lowest latency variance; **tightest control** over data movement. | Required for strict regional compliance (e.g., financial or government workloads). |
48+
| Standard Deployment | Specific Region | Standard | Data at rest stays in region; data in transit may leave but stays within the same data zone. | Moderate latency control; **some flexibility** in data routing. | Balanced option for regional control with some scalability. |
49+
| Data Zone Deployment | Data Zone | Standard | Both data at rest and in transit remain within the selected data zone; Azure chooses the region internally. | Similar to Data Zone Standard; **no control over specific region**, but zone compliance maintained. | Good for zone-level compliance without needing to manage region specifics. |
50+
51+
> [!TIP]
52+
> - **Global Batch** and **Data Zone Batch**: Target turnaround time is **up to 24 hours**, making them unsuitable for real-time applications.
53+
> - **Standard deployments** (Global/Data Zone/Regional): May experience **latency variation** at high usage volumes.
54+
> - **Provisioned deployments**: Offer **reserved capacity**, reducing latency variance and improving predictability.
55+
> - **Streaming**: Can reduce perceived latency by returning tokens incrementally, especially useful in chat interfaces.
56+
> - **Content Filtering**: Adds safety but may increase latency slightly.
57+
3558
<details>
3659
<summary><b>List of References</b> (Click to expand)</summary>
3760

0 commit comments

Comments
 (0)