This repository was archived by the owner on Jun 29, 2019. It is now read-only.
Misc/SQLDW/machine-learning-data-science-process-sqldw-walkthrough.md
Lines changed: 5 additions & 3 deletions
Original file line number
Diff line number
Diff line change
@@ -66,7 +66,7 @@ We will formulate three prediction problems based on the *tip\_amount*, namely:
66
66
## <a name="setup"></a>Setting Up the Azure data science environment for advanced analytics
67
67
68
68
69
-
In this tutorial we will demonstrate loading data to SQL DW, data exploration, feature engineering. Sample scripts are shared in GitHub.
69
+
In this tutorial we will demonstrate loading data to SQL Data Warehouse, exploring data, engineering features, and building machine learning models.
70
70
71
71
To set up your Azure Data Science environment, follow the steps below.
72
72
@@ -85,7 +85,7 @@ Follow the documentation at [https://azure.microsoft.com/en-us/documentation/art
85
85
- User Name
86
86
- Password
87
87
88
-
3. [Create an Azure ML workspace](machine-learning-create-workspace.md)
88
+
3. Follow the documentation at [https://azure.microsoft.com/en-us/documentation/articles/machine-learning-create-workspace/](https://azure.microsoft.com/en-us/documentation/articles/machine-learning-create-workspace/) to create an Azure Machine Learning workspace.
89
89
90
90
## <a name="getdata"></a>Load the data into SQL Data Warehouse
91
91
@@ -119,12 +119,14 @@ Input your credentials as prompted. After this PowerShell script is run the firs
119
119
120
120
Depending on the geographical location of your blob storage account, copying the data from the public blob to your private storage account could take about 15 minutes or longer, and loading the data from your storage account to SQL DW could take about 20 minutes or longer. For your information, the public blob storage account we use to share the data is located in South Central US.
121
121
122
+
In addition to loading data into SQL Data Warehouse, the PowerShell script (1) creates a sample data table (NYCTaxi_Sample), which is used in the following sections, and (2) parameterizes the data exploration scripts ([SQL](./SQLDW_Explorations.sql) and [IPython notebook](./SQLDW_Explorations.ipynb)) with your Data Warehouse credentials.
123
+
122
124
After successful execution, you will see a screen like the one below:
123
125
![][20]
124
126
125
127
## <a name="dbexplore"></a>Data Exploration and Feature Engineering in SQL Data Warehouse
126
128
127
-
In this section, we will perform data exploration and feature generation by running SQL queries directly in the **SQL Server Management Studio** or **Visual Studio**. A sample script named **SQLDW.sql** is provided on [Github](./SQLDW_Explorations.sql). Modify the script to change the database or data table name, if it is different from the default.
129
+
In this section, we will perform data exploration and feature generation by running SQL queries directly in **SQL Server Management Studio** or **Visual Studio**. A sample script named **SQLDW_Explorations.sql** is provided on [GitHub](./SQLDW_Explorations.sql).
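
The changed text says the PowerShell script parameterizes the exploration scripts with your Data Warehouse details. The walkthrough does not show how that substitution is done, so the following is only a minimal Python sketch of the general idea, using the standard library's `string.Template`. The placeholder names (`$server`, `$database`, `$schema`, `$table`), the `dbo` schema, the server and database values, and the `parameterize_script` helper are all assumptions made for illustration; only the `NYCTaxi_Sample` table name comes from the walkthrough.

```python
from string import Template

# Hypothetical template: the real SQLDW_Explorations.sql may use
# different placeholders or a different substitution mechanism.
SCRIPT_TEMPLATE = Template(
    "-- Server: $server, database: $database\n"
    "SELECT COUNT(*) AS row_count FROM $schema.$table;\n"
)


def parameterize_script(template, **params):
    """Fill every placeholder in the template.

    Template.substitute raises KeyError if a placeholder has no value,
    so a half-parameterized script is never produced silently.
    """
    return template.substitute(**params)


# Example values; the server, database, and schema names are made up.
script = parameterize_script(
    SCRIPT_TEMPLATE,
    server="myserver.database.windows.net",
    database="mydw",
    schema="dbo",
    table="NYCTaxi_Sample",
)
print(script)
```

Using `substitute` rather than `safe_substitute` is a deliberate choice in this sketch: a missing value fails loudly instead of leaving a literal `$placeholder` in a query that would later be run against the warehouse.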