Skip to content

Latest commit

 

History

History
41 lines (30 loc) · 1.87 KB

File metadata and controls

41 lines (30 loc) · 1.87 KB
title HDFS File Source
description HDFS File Source
author chugugrace
ms.author chugu
ms.date 03/01/2017
ms.service sql
ms.subservice integration-services
ms.topic reference
f1_keywords
sql13.ssis.designer.hdfsfilesrc.f1

HDFS File Source

[!INCLUDEsqlserver-ssis]

The HDFS File Source component enables an SSIS package to read data from a HDFS file. The supported file formats are Text and Avro. (ORC sources are not supported.)

To configure the HDFS File Source, drag and drop the HDFS File Source on the data flow designer and double-click the component to open the editor.

HDFS File Source Editor

Options

Configure the following options on the General tab of the Hadoop File Source Editor dialog box.

Field Description
Hadoop Connection Specify an existing Hadoop Connection Manager or create a new one. This connection manager indicates where the HDFS files are hosted.
File Path Specify the name of the HDFS file.
File format Specify the format for the HDFS file. The available options are Text and Avro. (ORC sources are not supported.)
Column delimiter character If you select Text format, specify the column delimiter character.
Column names in the first data row If you select Text format, specify whether the first row in the file contains column names.

After you configure these options, select the Columns tab to map source columns to destination columns in the data flow.

See Also

Hadoop Connection Manager
HDFS File Destination