Skip to content

Commit a15495f

Browse files
authored
both approaches overview
1 parent a93be0f commit a15495f

1 file changed

Lines changed: 34 additions & 0 deletions

File tree

0_Azure/2_AzureAnalytics/3_Databricks/1_demos/MedallionArch_Fabric+Databricks.md

Lines changed: 34 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -14,9 +14,43 @@ Last updated: 2025-02-13
1414
<details>
1515
<summary><b>List of References </b> (Click to expand)</summary>
1616

17+
- [TechExcel: Microsoft Fabric with Azure Databricks for Data Analytics (lvl 300 / CSU) lab](https://microsoft.github.io/TechExcel-Fabric-with-Databricks-for-Data-Analytics/)
18+
- [Databricks Unity Catalog tables available in Microsoft Fabric](https://blog.fabric.microsoft.com/en-us/blog/databricks-unity-catalog-tables-available-in-microsoft-fabric/)
19+
- [Integrate OneLake with Azure Databricks](https://learn.microsoft.com/en-us/fabric/onelake/onelake-azure-databricks)
20+
- [Tutorial: Configure Microsoft Fabric mirrored databases from Azure Databricks (Preview)](https://learn.microsoft.com/en-us/fabric/database/mirrored-database/azure-databricks-tutorial)
21+
- [Integrating Microsoft Fabric with Azure Databricks Delta Tables](https://techcommunity.microsoft.com/blog/fasttrackforazureblog/integrating-microsoft-fabric-with-azure-databricks-delta-tables/3916332)
22+
- [Data Intelligence End-to-End with Azure Databricks and Microsoft Fabric](https://techcommunity.microsoft.com/blog/azurearchitectureblog/data-intelligence-end-to-end-with-azure-databricks-and-microsoft-fabric/4232621)
1723

1824
</details>
1925

26+
## Overview
27+
28+
| Resource | Key Components | Details |
29+
|------------|--------------------------|-------------------------------------------------------------------------|
30+
| Databricks | - ETL Tasks<br/> - Data Ingestion | ETL Tasks: Databricks efficiently handles Extract, Transform, Load (ETL) tasks. It manages the bronze, silver, and gold layers of the medallion architecture: <br> - **Bronze Layer**: Raw data ingestion from various sources. <br> - **Silver Layer**: Data cleaning and transformation. <br> - **Gold Layer**: Aggregated and refined data ready for analysis. <br> Data Ingestion: Supports batch and streaming data ingestion, ensuring real-time data processing capabilities. <br> - **Data Cleaning**: Utilizes Apache Spark's powerful processing engine to clean and transform data at scale. <br> - **Aggregation**: Performs complex aggregations and computations, making data ready for downstream analytics. |
31+
| Fabric | - Data Integration <br> - Orchestration <br> - Monitoring and Management | Fabric seamlessly integrates with Databricks, providing a unified interface for managing data workflows. <br> - **Data Integration**: Facilitates the orchestration of data pipelines, ensuring smooth data flow between different stages of processing. <br> - **Monitoring and Management**: Offers robust monitoring and management tools to track data pipeline performance and troubleshoot issues. |
32+
33+
> Here is a [reference of a medallion architecture using only Fabric](https://github.com/MicrosoftCloudEssentials-LearningHub/MS-Fabric-Essentials-Workshop/tree/main/AzurePortal/1_MedallionArch):
34+
35+
<p align="center">
36+
<img width="650" alt="image" src="https://github.com/user-attachments/assets/15a7dbfe-524b-4aa9-9b45-3db6bca2dd03" />
37+
</p>
38+
39+
> If you need to handle `complex data transformations and large-scale data processing`, you can use our combined solution of **Fabric + Databricks**. This powerful combination leverages the strengths of both platforms to provide a robust data processing pipeline. This workshop on [Fabric with Databricks for Data Analytics](https://microsoft.github.io/TechExcel-Fabric-with-Databricks-for-Data-Analytics/) offers a comprehensive step-by-step guide on developing Medallion Architecture using Fabric and Databricks. <br/>
40+
41+
<p align="center">
42+
<img width="650" alt="image" src="https://github.com/user-attachments/assets/58431d3b-e294-46fe-89a4-92a046168ec4" />
43+
</p>
44+
45+
## Integration with Multiple Sources:
46+
47+
> In these two examples of Medallion Archuitecture, `Azure SQL Database` was used as the input source, but the solution is highly flexible and can integrate with multiple data sources, including:
48+
49+
- **Cloud Storage**: Azure Blob Storage, AWS S3, Google Cloud Storage.
50+
- **Databases**: SQL Server, MySQL, PostgreSQL, Oracle.
51+
- **Streaming Sources**: Kafka, Event Hubs, IoT Hub.
52+
- **APIs and Web Services**: REST APIs, SOAP services.
53+
2054

2155
<div align="center">
2256
<h3 style="color: #4CAF50;">Total Visitors</h3>

0 commit comments

Comments
 (0)