Skip to content

Commit 13fb201

Browse files
authored
Merge eb162c1 into 8ea3ad5
2 parents 8ea3ad5 + eb162c1 commit 13fb201

5 files changed

Lines changed: 168 additions & 6 deletions

File tree

0_Azure/2_AzureAnalytics/0_Fabric/demos/30_dynamic_pipeline_nbkparameters.md

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -66,7 +66,7 @@ Last updated: 2025-03-03
6666

6767
```python
6868
# Parameters
69-
file_path = "abfss://<your-container-name>@<your-storage-account-name>.dfs.core.windows.net/<your-bronze-lakehousename>.Lakehouse/Tables/<table name>"
69+
file_path = "" # expected value format: abfss://<your-container-name>@<your-storage-account-name>.dfs.core.windows.net/<your-bronze-lakehousename>.Lakehouse/Tables/<table name>
7070
token = ""
7171
```
7272
5. **Mark as Parameter Cell**: Select the `Toggle Parameter Cell` option to mark the cell as a parameter cell

0_Azure/3_AzureAI/9_AzureOpenAI/demos/10_AssistantsPlayground.md

Lines changed: 7 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -3,7 +3,7 @@
33
[![GitHub](https://img.shields.io/badge/--181717?logo=github&logoColor=ffffff)](https://github.com/)
44
[brown9804](https://github.com/brown9804)
55

6-
Last updated: 2024-11-28
6+
Last updated: 2025-03-03
77

88
------------------------------------------
99

@@ -66,3 +66,9 @@ Example:
6666

6767
> - Using the Azure OpenAI Assistants API to build custom solutions using Azure’s GPT-4 model integrated with other Azure services (e.g., Azure Logic Apps, Azure Functions) for enhanced functionality. This approach enables you to build assistants with custom workflows, tool integration, code execution capabilities, custom instructions, code interpreters, and custom functions.
6868
- **Storage Limits**: The current limit for uploaded files is 100GB, but you can request an increase through Azure support if needed through Azure support channels.
69+
70+
71+
<div align="center">
72+
<h3 style="color: #4CAF50;">Total Visitors</h3>
73+
<img src="https://profile-counter.glitch.me/brown9804/count.svg" alt="Visitor Count" style="border: 2px solid #4CAF50; border-radius: 5px; padding: 5px;"/>
74+
</div>
Lines changed: 147 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,147 @@
1+
# Azure Open AI: Model Availability - Quick Overview
2+
3+
[![GitHub](https://img.shields.io/badge/--181717?logo=github&logoColor=ffffff)](https://github.com/)
4+
[brown9804](https://github.com/brown9804)
5+
6+
Last updated: 2025-03-03
7+
8+
------------------------------------------
9+
10+
<details>
11+
<summary><b>List of References </b> (Click to expand)</summary>
12+
13+
- [Azure OpenAI deployment types](https://learn.microsoft.com/en-us/azure/ai-services/openai/how-to/deployment-types)
14+
- [What is provisioned throughput?](https://learn.microsoft.com/en-us/azure/ai-services/openai/concepts/provisioned-throughput?tabs=global-ptum)
15+
- [Azure OpenAI provisioned Managed offering updates](https://learn.microsoft.com/en-us/azure/ai-services/openai/concepts/provisioned-migration)
16+
- [Provisioned throughput units onboarding](https://learn.microsoft.com/en-us/azure/ai-services/openai/how-to/provisioned-throughput-onboarding)
17+
- [Azure OpenAI Service pricing](https://azure.microsoft.com/en-us/pricing/details/cognitive-services/openai-service/)
18+
19+
</details>
20+
21+
## Content
22+
23+
- [Deployment Options](#deployment-options)
24+
- [When Azure OpenAI Model Availability PTU is Not Available](#when-azure-openai-model-availability-ptu-is-not-available)
25+
- [Complete a Capacity Request](#complete-a-capacity-request)
26+
- [Use a Different Model](#use-a-different-model)
27+
- [Use a Different Region/Zones or Global](#use-a-different-regionzones-or-global)
28+
- [Attempt Deployment at a Different Time](#attempt-deployment-at-a-different-time)
29+
- [Azure Reserved Instances](#azure-reserved-instances)
30+
31+
## Deployment Options
32+
33+
| **Deployment Option** | **Description** | **Pros** | **Cons** | **Technical Notes** |
34+
|-----------------------|-----------------|----------|----------|---------------------|
35+
| **Data Zones** | Data Zones allow customers to process and store their data within specific geographic boundaries, ensuring compliance with regional data residency requirements while maintaining optimal performance. | Ensures data residency compliance, optimized for regional performance. | Limited availability compared to global options. | Suitable for applications with strict data residency requirements. May require configuration of virtual networks and subnets. |
36+
| **Global Standard** | Global Standard deployments leverage Azure's global infrastructure to dynamically route customer traffic to the data center with the best availability for the customer’s inference requests. | Highest initial throughput limits, best model availability, low latency. | Potential latency variation for high volume workloads. | Ideal for applications needing high availability and low latency. Uses Azure's global load balancing and routing capabilities. |
37+
| **Provisioned Throughput Units (PTUs)** | PTUs provide guaranteed throughput by allocating specific processing capacity for your deployment. This ensures stable performance and predictable latency. | Predictable performance, allocated processing capacity, potential cost savings for high throughput workloads. | Requires accurate forecasting of capacity needs, may involve higher upfront costs. | Best for applications with consistent and high throughput requirements. Requires careful planning and capacity management. |
38+
39+
## When Azure OpenAI Model Availability PTU is Not Available
40+
41+
<img width="550" alt="image" src="https://github.com/user-attachments/assets/f1da9940-a809-4902-95ba-f524e490edbe" />
42+
43+
### Complete a Capacity Request
44+
> Submit a request to Azure for additional capacity.
45+
- **Pros**: Tailored to your specific needs.
46+
- **Cons**: May take time to process and approve.
47+
> Steps:
48+
1. **Sign in to Azure Portal**: Use your Azure subscription credentials.
49+
2. **Navigate to `Quotas`**: Enter `quotas` in the search box and select `Quotas`
50+
3. **Request Increase**: Click `Request Quota`.
51+
52+
<img width="550" alt="image" src="https://github.com/user-attachments/assets/b6cc365a-6a54-407e-850b-79b932d09e5d" />
53+
54+
4. **Submit Details**: Fill the [Azure OpenAI Service: Request for Quota Increase](https://customervoice.microsoft.com/Pages/ResponsePage.aspx?id=v4j5cvGGr0GRqy180BHbR4xPXO648sJKt4GoXAed-0pUMFE1Rk9CU084RjA0TUlVSUlMWEQzVkJDNCQlQCN0PWcu)
55+
56+
### Use a Different Model
57+
> Switch to an alternative model that is available in the desired region. Use the [Azure AI Foundry Model Catalog](https://ai.azure.com/explore/models) to discover and evaluate a wide range of models from various providers.
58+
59+
- **Pros**: Immediate availability, potentially lower costs.
60+
- **Cons**: May require adjustments to your application to accommodate the new model.
61+
> Steps:
62+
1. **Identify Alternative Models**: Check the Azure OpenAI Service models available in your region.
63+
2. **Evaluate Compatibility**: Assess the performance and compatibility of the alternative model with your application.
64+
3. **Update Application**: Make necessary adjustments to your application to integrate the new model.
65+
66+
https://github.com/user-attachments/assets/5b10270d-b5fe-40f7-9af2-df2cdc77658b
67+
68+
### Use a Different Region/Zones or Global
69+
70+
> Deploy the model in a different region or use Azure's global infrastructure to dynamically route traffic to the best available data center.
71+
- **Pros**: Increased availability, potential cost savings.
72+
- **Cons**: Possible latency issues, data residency concerns.
73+
74+
> Steps:
75+
1. **Select Region**: Choose a different region or data zone for deployment.
76+
2. **Configure Deployment**: Set up the deployment in the new region using Azure's global load balancing features.
77+
3. **Monitor Performance**: Track latency and performance to ensure it meets your application's requirements.
78+
79+
https://github.com/user-attachments/assets/341dfc6a-a996-40a5-9f73-96adafe2f92b
80+
81+
### Attempt Deployment at a Different Time
82+
> Capacity availability can change dynamically, so trying at a different time might help.
83+
- **Pros**: Simple to try.
84+
- **Cons**: No guarantee of success.
85+
> Steps:
86+
1. **Monitor Capacity Status**: Keep an eye on Azure's capacity status and availability.
87+
2. **Plan Off-Peak Deployments**: Schedule deployments during off-peak hours when capacity is more likely to be available.
88+
3. **Retry Deployment**: Attempt the deployment again at a different time if initial attempts fail.
89+
90+
### Azure Reserved Instances
91+
92+
> Reserved Instances allow you to reserve capacity in advance, ensuring availability and potentially reducing costs. This is essentially a commitment to use a certain amount of PTUs over a specified period.
93+
94+
> [!NOTE]
95+
> Azure OpenAI `Reservations are agreements` for a specific time period and compute capacity. Whether using the Pay-as-you-go model or reservations, you need to create the Azure OpenAI Capacity within a resource group. <br/><br/>
96+
> Reservations in Azure, including Azure OpenAI `reservations`, are `managed at the subscription level`. This means that the reserved capacity units (PTUs) apply to the entire subscription, not to individual resource groups. <br/>
97+
> - `Reservations`: Provide a `subscription-wide discount` for committing to a certain amount of capacity over a period of time. <br/>
98+
> - `Capacity Creation`: You create and manage Azure OpenAI `capacities within specific resource groups`, but the `cost benefits from the reservation apply at the subscription level`.
99+
100+
- **Pros**: Guaranteed capacity, potential cost savings, predictable performance.
101+
- **Cons**: Requires upfront commitment, less flexibility.
102+
103+
> Steps:
104+
1. **Evaluate Needs**: Assess your long-term capacity requirements.
105+
2. **Purchase Reserved Instances**:
106+
- Sign in to the Azure portal.
107+
- Navigate to `Cost Management + Billing` and select `Reservations + Hybrid Benefit`.
108+
- Click on `Add` and choose `Azure OpenAI Service`.
109+
- Select the region, quantity, and deployment type (Global, Data Zone, or Regional).
110+
- Add the Azure OpenAI Service SKU to your cart.
111+
- Verify the quantity of PTUs you want to purchase and complete your order.
112+
3. **Configure Deployment**: Set up your deployment using the reserved instances.
113+
4. **Monitor Usage**: Track usage to ensure it aligns with your reserved capacity.
114+
115+
> [!NOTE]
116+
> While managing resources within resource groups, the reservation’s cost benefits are applied across the entire subscription.
117+
118+
| **Aspect** | **Details** |
119+
|------------|-------------|
120+
| **Reservation** | - `Subscription Level Management`: When you make a reservation, it applies to the entire subscription. This means any resource within that subscription can benefit from the reserved capacity.<br/>- `Discounts`: The primary benefit of reservations is the cost savings. By committing to a certain amount of capacity over a period of time, you receive a discount compared to pay-as-you-go pricing.<br/>- `Flexibility`: While the reservation itself is at the subscription level, you can still create and manage individual capacities within different resource groups. The reserved capacity units are utilized by any eligible resources within the subscription. |
121+
| **Capacity** | - `Creating Capacity`: Even though the reservation is at the subscription level, you still need to create the actual Azure OpenAI capacity in the Azure portal. This capacity can be assigned to specific resource groups as needed.<br/>- `Utilizing Reservations`: When you create an Azure OpenAI capacity, it will automatically utilize the reserved capacity units from your subscription, ensuring you benefit from the cost savings. |
122+
123+
> [!NOTE]
124+
> Scope Assignment in Reservations
125+
126+
| **Level** | **Scope** | **Usage/Management** |
127+
|--------------------------|-------------------------------------------------------------------------------------------------------------------------------------------------|----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|
128+
| **Subscription Level** | - Reservations are applied at the subscription level.<br/>- Reserved capacity units (PTUs) provide a discount for any eligible resources within the entire subscription. | Any Azure OpenAI capacity created within this subscription can utilize the reserved PTUs, benefiting from the cost savings. |
129+
| **Resource Group Level** | - Reservations are managed at the subscription level.<br/>- The reserved capacity units are not directly assigned to individual resource groups but are available for any resource within the subscription. | You can still organize and manage your resources within different resource groups. |
130+
131+
> [!TIP]
132+
> Define the scope of the reservation
133+
134+
| **Scope Option** | **Description** |
135+
|--------------------------|-------------------------------------------------------------------------------------------------------|
136+
| **Single resource group**| Applies the reservation discount to the matching resources in the selected resource group only. |
137+
| **Single subscription** | Applies the reservation discount to the matching resources in the selected subscription. |
138+
| **Shared** | Applies the reservation discount to matching resources in eligible subscriptions within the billing context. |
139+
| **Management group** | Applies the reservation discount to the matching resources in the list of subscriptions that are part of both the management group and billing scope. |
140+
141+
https://github.com/user-attachments/assets/b42e4446-73ef-46fe-b84c-246b2636b391
142+
143+
144+
<div align="center">
145+
<h3 style="color: #4CAF50;">Total Visitors</h3>
146+
<img src="https://profile-counter.glitch.me/brown9804/count.svg" alt="Visitor Count" style="border: 2px solid #4CAF50; border-radius: 5px; padding: 5px;"/>
147+
</div>

0_Azure/3_AzureAI/9_AzureOpenAI/demos/4_PTUs_TPM.md

Lines changed: 7 additions & 3 deletions
Original file line numberDiff line numberDiff line change
@@ -5,7 +5,7 @@ Costa Rica
55
[![GitHub](https://img.shields.io/badge/--181717?logo=github&logoColor=ffffff)](https://github.com/)
66
[brown9804](https://github.com/brown9804)
77

8-
Last updated: 2024-11-19
8+
Last updated: 2025-03-03
99

1010
----------
1111

@@ -62,9 +62,13 @@ This means with 50 PTUs, you can process 75,000 tokens per minute.
6262

6363
> Improve accuracy of your estimate by adding multiple workloads to your PTU calculation. Each workload will be calculated and displayed as well as the aggregate total if both are running at the same time to your deployment.
6464
65-
<img width="705" alt="image" src="https://github.com/user-attachments/assets/e59c7e24-6d9b-4ea3-855c-d4f284b53e59">
65+
<img width="750" alt="image" src="https://github.com/user-attachments/assets/d7599273-b4e3-478a-b2b0-b72f8647bb0e" />
66+
67+
<img width="750" alt="image" src="https://github.com/user-attachments/assets/540a1fd2-cae1-445c-8ca8-a0123cc63d7e" />
68+
69+
https://github.com/user-attachments/assets/22f2c86b-2294-4a6d-acb6-31697ae6646c
6670

6771
<div align="center">
6872
<h3 style="color: #4CAF50;">Total Visitors</h3>
6973
<img src="https://profile-counter.glitch.me/brown9804/count.svg" alt="Visitor Count" style="border: 2px solid #4CAF50; border-radius: 5px; padding: 5px;"/>
70-
</div>
74+
</div>

0_Azure/3_AzureAI/9_AzureOpenAI/demos/8_Overview.md

Lines changed: 6 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -5,7 +5,7 @@ Costa Rica
55
[![GitHub](https://img.shields.io/badge/--181717?logo=github&logoColor=ffffff)](https://github.com/)
66
[brown9804](https://github.com/brown9804)
77

8-
Last updated: 2024-10-22
8+
Last updated: 2025-03-03
99

1010
------------------------------------------
1111

@@ -17,3 +17,8 @@ Last updated: 2024-10-22
1717

1818

1919
</details>
20+
21+
<div align="center">
22+
<h3 style="color: #4CAF50;">Total Visitors</h3>
23+
<img src="https://profile-counter.glitch.me/brown9804/count.svg" alt="Visitor Count" style="border: 2px solid #4CAF50; border-radius: 5px; padding: 5px;"/>
24+
</div>

0 commit comments

Comments
 (0)