[ https://issues.apache.org/jira/browse/HBASE-17905?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Yi Liang updated HBASE-17905: ----------------------------- Attachment: HBASE-17905-V1.patch The reason why bulkload fail when table not exist is the {code}BulkLoadPatitioner#numPartitions {code} will return 0 if table not exist, if partition number equals 0, it means that spark repartitionAndSortWithinPartitions(Partitioner) will return nothing, and the following transformation will not be executed This patch fix the errors for bulkload fail when table not exist, and also add some log information, I wonder if we can also add a BulkLoad API that do not have tablename as parameter > [hbase-spark] bulkload does not work when table not exist > ---------------------------------------------------------- > > Key: HBASE-17905 > URL: https://issues.apache.org/jira/browse/HBASE-17905 > Project: HBase > Issue Type: Bug > Reporter: Yi Liang > Assignee: Yi Liang > Attachments: HBASE-17905-V1.patch > > > when using HBase-Spark bulkload api, an argument of tablename is needed, the > bulkload can run successfully only if table exist in HBase. If table not > exist, the bulkload can not run successfully and it even do not report any > errors or throw exception. -- This message was sent by Atlassian JIRA (v6.3.15#6346)