Skip to content
This repository was archived by the owner on Jun 29, 2019. It is now read-only.

Commit 25d9d19

Browse files
author
xibingaomsft
committed
Update
1 parent fc786b6 commit 25d9d19

1 file changed

Lines changed: 5 additions & 3 deletions

File tree

Misc/SQLDW/machine-learning-data-science-process-sqldw-walkthrough.md

Lines changed: 5 additions & 3 deletions
Original file line numberDiff line numberDiff line change
@@ -66,7 +66,7 @@ We will formulate three prediction problems based on the *tip\_amount*, namely:
6666
## <a name="setup"></a>Setting Up the Azure data science environment for advanced analytics
6767

6868

69-
In this tutorial we will demonstrate loading data to SQL DW, data exploration, feature engineering. Sample scripts are shared in GitHub.
69+
In this tutorial we will demonstrate loading data to SQL Data Warehouse, exploration data, engineering features, and building machine learning models.
7070

7171
To set up your Azure Data Science environment, follow the steps below.
7272

@@ -85,7 +85,7 @@ Follow the documentation at [https://azure.microsoft.com/en-us/documentation/art
8585
- User Name
8686
- Password
8787

88-
3. [Create an Azure ML workspace](machine-learning-create-workspace.md)
88+
3. Follow the documentation at [https://azure.microsoft.com/en-us/documentation/articles/machine-learning-create-workspace/](https://azure.microsoft.com/en-us/documentation/articles/machine-learning-create-workspace/) to create an Azure Machine Learning workspace.
8989

9090
## <a name="getdata"></a>Load the data into SQL Data Warehouse
9191

@@ -119,12 +119,14 @@ Input your credentials as prompted. After this PowerShell script is run the firs
119119

120120
Depending on the geographical location of your blob storage account, the process of copying data from public blob to your private storage account could take about 15 minutes or longer,and the process of loading data from your storage account to SQL DW could takes about 20 minutes or longer. For your information, the public blob storage account we use to share the data is located at South Central US.
121121

122+
Other than loading data to the SQL Data Warehouse, the Powershell script (1) creates a sample data table (NYCTaxi_Sample) which is going to be used in following sections and (2) parameterizes the data exploration scripts ([SQL](./SQLDW_Explorations.sql) and [IPython notebook](./SQLDW_Explorations.ipynb)) with your Data Warehouse credentials.
123+
122124
After successful execution, you will see screen like below:
123125
![][20]
124126

125127
## <a name="dbexplore"></a>Data Exploration and Feature Engineering in SQL Data Warehouse
126128

127-
In this section, we will perform data exploration and feature generation by running SQL queries directly in the **SQL Server Management Studio** or **Visual Studio**. A sample script named **SQLDW.sql** is provided on [Github](./SQLDW_Explorations.sql). Modify the script to change the database or data table name, if it is different from the default.
129+
In this section, we will perform data exploration and feature generation by running SQL queries directly in the **SQL Server Management Studio** or **Visual Studio**. A sample script named **SQLDW_Explorations.sql** is provided on [Github](./SQLDW_Explorations.sql).
128130

129131
In this exercise, we will:
130132

0 commit comments

Comments
 (0)