[ https://issues.apache.org/jira/browse/KYLIN-3044?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16262514#comment-16262514 ]
Kaige Liu edited comment on KYLIN-3044 at 11/22/17 1:32 PM: ------------------------------------------------------------- Sqoop splits data to a couple of parts and import them parallel. I add a property kylin.source.jdbc.sqoop-mapper-num to specify how many splits should be divided. Sqoop would run a mapper for each split. To make each mapper gets even input, split column is chosen by following some rules: 1. Prefer ClusteredBy column 2. Prefer DistributedBy column 3. Prefer Partition date column 4. Prefer Higher cardinality column 5. Prefer numeric column 6. Pick a column at first glance Patch updated. was (Author: liukaige): Sqoop splits data to a couple of parts and import them parallel. I add a property kylin.source.jdbc.sqoop-mapper-num to specify how many splits should be divided. Sqoop would run a mapper for each split. To make each mapper gets even input, split column is chosen following some rules: 1. Prefer ClusteredBy column 2. Prefer DistributedBy column 3. Prefer Partition date column 4. Prefer Higher cardinality column 5. Prefer numeric column 6. Pick a column at first glance Patch updated. > Support SQL Server as data source > --------------------------------- > > Key: KYLIN-3044 > URL: https://issues.apache.org/jira/browse/KYLIN-3044 > Project: Kylin > Issue Type: Task > Reporter: Kaige Liu > Assignee: Kaige Liu > Attachments: KYLIN-3044-sqlserver-as-datasource.patch, > KYLIN-3044-sqlserver-as-datasource.patch > > > [KYLIN-1351|https://issues.apache.org/jira/browse/KYLIN-1351] has added > Vertica as data source. Base on the work of KYLIN-1351, I'd like to enable > SQL Server as data source of kylin. -- This message was sent by Atlassian JIRA (v6.4.14#64029)