[ 
https://issues.apache.org/jira/browse/SQOOP-1647?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Qian Xu updated SQOOP-1647:
---------------------------
    Attachment: SQOOP-1647.4.patch

Rebased to current code base

> FROM-side: Read data from HDFS
> ------------------------------
>
>                 Key: SQOOP-1647
>                 URL: https://issues.apache.org/jira/browse/SQOOP-1647
>             Project: Sqoop
>          Issue Type: Sub-task
>          Components: connectors
>    Affects Versions: 1.99.5
>            Reporter: Qian Xu
>            Assignee: Qian Xu
>             Fix For: 1.99.5
>
>         Attachments: SQOOP-1647.2.patch, SQOOP-1647.3.patch, 
> SQOOP-1647.4.patch
>
>
> This JIRA proposes to support reading data from an HDFS dataset.
> The scope is defined as follows:
> Source: Any valid HDFS dataset
> Exposed Configuration:
> [From] Dataset URI (String, has a validation check)
> Workflow:
> # KiteFromInitializer will check dataset existence
> # Sqoop will create N KiteExtractor instances.
> # Every KiteExtractor instance will create a DatasetReader. Records will be 
> read by DatasetReader. Avro data types will be converted to Sqoop expected 
> data types and be written to Sqoop. 
> # KiteFromDestroy expect to do nothing.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to