[jira] [Commented] (KYLIN-3934) sqoop import param '--null-string' result in null value become blank string in hive table
[ https://issues.apache.org/jira/browse/KYLIN-3934?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16812382#comment-16812382 ] Shaofeng SHI commented on KYLIN-3934: - Hao, would you like to raise a PR to Kylin? Thank you! > sqoop import param '--null-string' result in null value become blank string > in hive table > - > > Key: KYLIN-3934 > URL: https://issues.apache.org/jira/browse/KYLIN-3934 > Project: Kylin > Issue Type: Bug > Components: Others >Affects Versions: v2.6.0 >Reporter: wanghao >Priority: Major > Fix For: v2.6.2 > > > when column value from jdbc is null, sqoop will write it into hive table with > blank string. > eg > jdbc: > A | B > 1 | 1 > 2 | 2 > a | null > > hive table: > A | B > 1 | 1 > 2 | 2 > a | > because of this, when I use count(distinct B), it return 3, not 2, and it can > lead to other problems > > > {code:java} > String cmd = String.format(Locale.ROOT, > "%s/bin/sqoop import" + generateSqoopConfigArgString() > + "--connect \"%s\" --driver %s --username %s --password %s --query \"%s AND > \\$CONDITIONS\" " > + "--target-dir %s/%s --split-by %s --boundary-query \"%s\" --null-string '' " > + "--fields-terminated-by '%s' --num-mappers %d", > sqoopHome, connectionUrl, driverClass, jdbcUser, jdbcPass, selectSql, > jobWorkingDir, hiveTable, > splitColumn, bquery, filedDelimiter, mapperNum); > {code} > the param '–null=string' should be ' > \\N' instead of blank string ''. > I resolved this problem by replace the param. But it needs be configured in > kylin.properties > > > -- This message was sent by Atlassian JIRA (v7.6.3#76005)
[jira] [Commented] (KYLIN-3934) sqoop import param '--null-string' result in null value become blank string in hive table
[ https://issues.apache.org/jira/browse/KYLIN-3934?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16812995#comment-16812995 ] ASF GitHub Bot commented on KYLIN-3934: --- freewh commented on pull request #587: KYLIN-3934 add config for sqoop config null-string and null-non-string URL: https://github.com/apache/kylin/pull/587 add config for sqoop config null-string and null-non-string fix build error with adding source version and target version in scala-maven-plugin This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: us...@infra.apache.org > sqoop import param '--null-string' result in null value become blank string > in hive table > - > > Key: KYLIN-3934 > URL: https://issues.apache.org/jira/browse/KYLIN-3934 > Project: Kylin > Issue Type: Bug > Components: Others >Affects Versions: v2.6.0 >Reporter: wanghao >Priority: Major > Fix For: v2.6.2 > > > when column value from jdbc is null, sqoop will write it into hive table with > blank string. > eg > jdbc: > A | B > 1 | 1 > 2 | 2 > a | null > > hive table: > A | B > 1 | 1 > 2 | 2 > a | > because of this, when I use count(distinct B), it return 3, not 2, and it can > lead to other problems > > > {code:java} > String cmd = String.format(Locale.ROOT, > "%s/bin/sqoop import" + generateSqoopConfigArgString() > + "--connect \"%s\" --driver %s --username %s --password %s --query \"%s AND > \\$CONDITIONS\" " > + "--target-dir %s/%s --split-by %s --boundary-query \"%s\" --null-string '' " > + "--fields-terminated-by '%s' --num-mappers %d", > sqoopHome, connectionUrl, driverClass, jdbcUser, jdbcPass, selectSql, > jobWorkingDir, hiveTable, > splitColumn, bquery, filedDelimiter, mapperNum); > {code} > the param '–null=string' should be ' > \\N' instead of blank string ''. > I resolved this problem by replace the param. But it needs be configured in > kylin.properties > > > -- This message was sent by Atlassian JIRA (v7.6.3#76005)
[jira] [Commented] (KYLIN-3934) sqoop import param '--null-string' result in null value become blank string in hive table
[ https://issues.apache.org/jira/browse/KYLIN-3934?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16835439#comment-16835439 ] ASF GitHub Bot commented on KYLIN-3934: --- nichunen commented on pull request #635: KYLIN-3934 add config for sqoop config null-string and null-non-string URL: https://github.com/apache/kylin/pull/635 This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: us...@infra.apache.org > sqoop import param '--null-string' result in null value become blank string > in hive table > - > > Key: KYLIN-3934 > URL: https://issues.apache.org/jira/browse/KYLIN-3934 > Project: Kylin > Issue Type: Bug > Components: Others >Affects Versions: v2.6.0 >Reporter: wanghao >Priority: Major > Fix For: v2.6.2 > > > when column value from jdbc is null, sqoop will write it into hive table with > blank string. > eg > jdbc: > A | B > 1 | 1 > 2 | 2 > a | null > > hive table: > A | B > 1 | 1 > 2 | 2 > a | > because of this, when I use count(distinct B), it return 3, not 2, and it can > lead to other problems > > > {code:java} > String cmd = String.format(Locale.ROOT, > "%s/bin/sqoop import" + generateSqoopConfigArgString() > + "--connect \"%s\" --driver %s --username %s --password %s --query \"%s AND > \\$CONDITIONS\" " > + "--target-dir %s/%s --split-by %s --boundary-query \"%s\" --null-string '' " > + "--fields-terminated-by '%s' --num-mappers %d", > sqoopHome, connectionUrl, driverClass, jdbcUser, jdbcPass, selectSql, > jobWorkingDir, hiveTable, > splitColumn, bquery, filedDelimiter, mapperNum); > {code} > the param '–null=string' should be ' > \\N' instead of blank string ''. > I resolved this problem by replace the param. But it needs be configured in > kylin.properties > > > -- This message was sent by Atlassian JIRA (v7.6.3#76005)
[jira] [Commented] (KYLIN-3934) sqoop import param '--null-string' result in null value become blank string in hive table
[ https://issues.apache.org/jira/browse/KYLIN-3934?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16835447#comment-16835447 ] ASF GitHub Bot commented on KYLIN-3934: --- nichunen commented on pull request #587: KYLIN-3934 add config for sqoop config null-string and null-non-string URL: https://github.com/apache/kylin/pull/587 This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: us...@infra.apache.org > sqoop import param '--null-string' result in null value become blank string > in hive table > - > > Key: KYLIN-3934 > URL: https://issues.apache.org/jira/browse/KYLIN-3934 > Project: Kylin > Issue Type: Bug > Components: Others >Affects Versions: v2.6.0 >Reporter: wanghao >Priority: Major > Fix For: v2.6.2 > > > when column value from jdbc is null, sqoop will write it into hive table with > blank string. > eg > jdbc: > A | B > 1 | 1 > 2 | 2 > a | null > > hive table: > A | B > 1 | 1 > 2 | 2 > a | > because of this, when I use count(distinct B), it return 3, not 2, and it can > lead to other problems > > > {code:java} > String cmd = String.format(Locale.ROOT, > "%s/bin/sqoop import" + generateSqoopConfigArgString() > + "--connect \"%s\" --driver %s --username %s --password %s --query \"%s AND > \\$CONDITIONS\" " > + "--target-dir %s/%s --split-by %s --boundary-query \"%s\" --null-string '' " > + "--fields-terminated-by '%s' --num-mappers %d", > sqoopHome, connectionUrl, driverClass, jdbcUser, jdbcPass, selectSql, > jobWorkingDir, hiveTable, > splitColumn, bquery, filedDelimiter, mapperNum); > {code} > the param '–null=string' should be ' > \\N' instead of blank string ''. > I resolved this problem by replace the param. But it needs be configured in > kylin.properties > > > -- This message was sent by Atlassian JIRA (v7.6.3#76005)
[jira] [Commented] (KYLIN-3934) sqoop import param '--null-string' result in null value become blank string in hive table
[ https://issues.apache.org/jira/browse/KYLIN-3934?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16836427#comment-16836427 ] ASF GitHub Bot commented on KYLIN-3934: --- nichunen commented on pull request #635: KYLIN-3934 add config for sqoop config null-string and null-non-string URL: https://github.com/apache/kylin/pull/635 This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: us...@infra.apache.org > sqoop import param '--null-string' result in null value become blank string > in hive table > - > > Key: KYLIN-3934 > URL: https://issues.apache.org/jira/browse/KYLIN-3934 > Project: Kylin > Issue Type: Bug > Components: Others >Affects Versions: v2.6.0 >Reporter: wanghao >Priority: Major > Fix For: v2.6.2 > > > when column value from jdbc is null, sqoop will write it into hive table with > blank string. > eg > jdbc: > A | B > 1 | 1 > 2 | 2 > a | null > > hive table: > A | B > 1 | 1 > 2 | 2 > a | > because of this, when I use count(distinct B), it return 3, not 2, and it can > lead to other problems > > > {code:java} > String cmd = String.format(Locale.ROOT, > "%s/bin/sqoop import" + generateSqoopConfigArgString() > + "--connect \"%s\" --driver %s --username %s --password %s --query \"%s AND > \\$CONDITIONS\" " > + "--target-dir %s/%s --split-by %s --boundary-query \"%s\" --null-string '' " > + "--fields-terminated-by '%s' --num-mappers %d", > sqoopHome, connectionUrl, driverClass, jdbcUser, jdbcPass, selectSql, > jobWorkingDir, hiveTable, > splitColumn, bquery, filedDelimiter, mapperNum); > {code} > the param '–null=string' should be ' > \\N' instead of blank string ''. > I resolved this problem by replace the param. But it needs be configured in > kylin.properties > > > -- This message was sent by Atlassian JIRA (v7.6.3#76005)
[jira] [Commented] (KYLIN-3934) sqoop import param '--null-string' result in null value become blank string in hive table
[ https://issues.apache.org/jira/browse/KYLIN-3934?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16836428#comment-16836428 ] ASF subversion and git services commented on KYLIN-3934: Commit 3f0ca847e09532ff07d4258f92e30fec331e672c in kylin's branch refs/heads/master from wanghao [ https://gitbox.apache.org/repos/asf?p=kylin.git;h=3f0ca84 ] KYLIN-3934 add config for sqoop config null-string and null-non-string > sqoop import param '--null-string' result in null value become blank string > in hive table > - > > Key: KYLIN-3934 > URL: https://issues.apache.org/jira/browse/KYLIN-3934 > Project: Kylin > Issue Type: Bug > Components: Others >Affects Versions: v2.6.0 >Reporter: wanghao >Priority: Major > Fix For: v2.6.2 > > > when column value from jdbc is null, sqoop will write it into hive table with > blank string. > eg > jdbc: > A | B > 1 | 1 > 2 | 2 > a | null > > hive table: > A | B > 1 | 1 > 2 | 2 > a | > because of this, when I use count(distinct B), it return 3, not 2, and it can > lead to other problems > > > {code:java} > String cmd = String.format(Locale.ROOT, > "%s/bin/sqoop import" + generateSqoopConfigArgString() > + "--connect \"%s\" --driver %s --username %s --password %s --query \"%s AND > \\$CONDITIONS\" " > + "--target-dir %s/%s --split-by %s --boundary-query \"%s\" --null-string '' " > + "--fields-terminated-by '%s' --num-mappers %d", > sqoopHome, connectionUrl, driverClass, jdbcUser, jdbcPass, selectSql, > jobWorkingDir, hiveTable, > splitColumn, bquery, filedDelimiter, mapperNum); > {code} > the param '–null=string' should be ' > \\N' instead of blank string ''. > I resolved this problem by replace the param. But it needs be configured in > kylin.properties > > > -- This message was sent by Atlassian JIRA (v7.6.3#76005)
[jira] [Commented] (KYLIN-3934) sqoop import param '--null-string' result in null value become blank string in hive table
[ https://issues.apache.org/jira/browse/KYLIN-3934?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16837097#comment-16837097 ] ASF subversion and git services commented on KYLIN-3934: Commit 53123d3e5250041203f1c815ed93533bd03a7d2b in kylin's branch refs/heads/2.6.x from wanghao [ https://gitbox.apache.org/repos/asf?p=kylin.git;h=53123d3 ] KYLIN-3934 add config for sqoop config null-string and null-non-string > sqoop import param '--null-string' result in null value become blank string > in hive table > - > > Key: KYLIN-3934 > URL: https://issues.apache.org/jira/browse/KYLIN-3934 > Project: Kylin > Issue Type: Bug > Components: Others >Affects Versions: v2.6.0 >Reporter: wanghao >Priority: Major > Fix For: v2.6.2 > > > when column value from jdbc is null, sqoop will write it into hive table with > blank string. > eg > jdbc: > A | B > 1 | 1 > 2 | 2 > a | null > > hive table: > A | B > 1 | 1 > 2 | 2 > a | > because of this, when I use count(distinct B), it return 3, not 2, and it can > lead to other problems > > > {code:java} > String cmd = String.format(Locale.ROOT, > "%s/bin/sqoop import" + generateSqoopConfigArgString() > + "--connect \"%s\" --driver %s --username %s --password %s --query \"%s AND > \\$CONDITIONS\" " > + "--target-dir %s/%s --split-by %s --boundary-query \"%s\" --null-string '' " > + "--fields-terminated-by '%s' --num-mappers %d", > sqoopHome, connectionUrl, driverClass, jdbcUser, jdbcPass, selectSql, > jobWorkingDir, hiveTable, > splitColumn, bquery, filedDelimiter, mapperNum); > {code} > the param '–null=string' should be ' > \\N' instead of blank string ''. > I resolved this problem by replace the param. But it needs be configured in > kylin.properties > > > -- This message was sent by Atlassian JIRA (v7.6.3#76005)
[jira] [Commented] (KYLIN-3934) sqoop import param '--null-string' result in null value become blank string in hive table
[ https://issues.apache.org/jira/browse/KYLIN-3934?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16837096#comment-16837096 ] nichunen commented on KYLIN-3934: - Committed to 2.6.x branch > sqoop import param '--null-string' result in null value become blank string > in hive table > - > > Key: KYLIN-3934 > URL: https://issues.apache.org/jira/browse/KYLIN-3934 > Project: Kylin > Issue Type: Bug > Components: Others >Affects Versions: v2.6.0 >Reporter: wanghao >Priority: Major > Fix For: v2.6.2 > > > when column value from jdbc is null, sqoop will write it into hive table with > blank string. > eg > jdbc: > A | B > 1 | 1 > 2 | 2 > a | null > > hive table: > A | B > 1 | 1 > 2 | 2 > a | > because of this, when I use count(distinct B), it return 3, not 2, and it can > lead to other problems > > > {code:java} > String cmd = String.format(Locale.ROOT, > "%s/bin/sqoop import" + generateSqoopConfigArgString() > + "--connect \"%s\" --driver %s --username %s --password %s --query \"%s AND > \\$CONDITIONS\" " > + "--target-dir %s/%s --split-by %s --boundary-query \"%s\" --null-string '' " > + "--fields-terminated-by '%s' --num-mappers %d", > sqoopHome, connectionUrl, driverClass, jdbcUser, jdbcPass, selectSql, > jobWorkingDir, hiveTable, > splitColumn, bquery, filedDelimiter, mapperNum); > {code} > the param '–null=string' should be ' > \\N' instead of blank string ''. > I resolved this problem by replace the param. But it needs be configured in > kylin.properties > > > -- This message was sent by Atlassian JIRA (v7.6.3#76005)
[jira] [Commented] (KYLIN-3934) sqoop import param '--null-string' result in null value become blank string in hive table
[ https://issues.apache.org/jira/browse/KYLIN-3934?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16838435#comment-16838435 ] ASF subversion and git services commented on KYLIN-3934: Commit 53123d3e5250041203f1c815ed93533bd03a7d2b in kylin's branch refs/heads/2.6.x-hadoop3.1 from wanghao [ https://gitbox.apache.org/repos/asf?p=kylin.git;h=53123d3 ] KYLIN-3934 add config for sqoop config null-string and null-non-string > sqoop import param '--null-string' result in null value become blank string > in hive table > - > > Key: KYLIN-3934 > URL: https://issues.apache.org/jira/browse/KYLIN-3934 > Project: Kylin > Issue Type: Bug > Components: Others >Affects Versions: v2.6.0 >Reporter: wanghao >Priority: Major > Fix For: v2.6.2 > > > when column value from jdbc is null, sqoop will write it into hive table with > blank string. > eg > jdbc: > A | B > 1 | 1 > 2 | 2 > a | null > > hive table: > A | B > 1 | 1 > 2 | 2 > a | > because of this, when I use count(distinct B), it return 3, not 2, and it can > lead to other problems > > > {code:java} > String cmd = String.format(Locale.ROOT, > "%s/bin/sqoop import" + generateSqoopConfigArgString() > + "--connect \"%s\" --driver %s --username %s --password %s --query \"%s AND > \\$CONDITIONS\" " > + "--target-dir %s/%s --split-by %s --boundary-query \"%s\" --null-string '' " > + "--fields-terminated-by '%s' --num-mappers %d", > sqoopHome, connectionUrl, driverClass, jdbcUser, jdbcPass, selectSql, > jobWorkingDir, hiveTable, > splitColumn, bquery, filedDelimiter, mapperNum); > {code} > the param '–null=string' should be ' > \\N' instead of blank string ''. > I resolved this problem by replace the param. But it needs be configured in > kylin.properties > > > -- This message was sent by Atlassian JIRA (v7.6.3#76005)