[
https://issues.apache.org/jira/browse/SQOOP-1647?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Qian Xu updated SQOOP-1647:
---------------------------
Attachment: SQOOP-1647.4.patch
Rebased to current code base
> FROM-side: Read data from HDFS
> ------------------------------
>
> Key: SQOOP-1647
> URL: https://issues.apache.org/jira/browse/SQOOP-1647
> Project: Sqoop
> Issue Type: Sub-task
> Components: connectors
> Affects Versions: 1.99.5
> Reporter: Qian Xu
> Assignee: Qian Xu
> Fix For: 1.99.5
>
> Attachments: SQOOP-1647.2.patch, SQOOP-1647.3.patch,
> SQOOP-1647.4.patch
>
>
> This JIRA proposes to support reading data from an HDFS dataset.
> The scope is defined as follows:
> Source: Any valid HDFS dataset
> Exposed Configuration:
> [From] Dataset URI (String, has a validation check)
> Workflow:
> # KiteFromInitializer will check dataset existence
> # Sqoop will create N KiteExtractor instances.
> # Every KiteExtractor instance will create a DatasetReader. Records will be
> read by DatasetReader. Avro data types will be converted to Sqoop expected
> data types and be written to Sqoop.
> # KiteFromDestroy expect to do nothing.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)