[ 
https://issues.apache.org/jira/browse/HBASE-17905?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Yi Liang updated HBASE-17905:
-----------------------------
    Attachment: HBASE-17905-V1.patch

The reason why bulkload fail when table not exist is the 
{code}BulkLoadPatitioner#numPartitions {code} will return 0 if table not exist, 
if partition number equals 0, it means that spark 
repartitionAndSortWithinPartitions(Partitioner) will return nothing, and the 
following transformation will not be executed

This patch fix the errors for bulkload fail when table not exist, and also add 
some log information, I wonder if we can also add a BulkLoad API that do not 
have tablename as parameter 

> [hbase-spark]  bulkload does not work when table not exist
> ----------------------------------------------------------
>
>                 Key: HBASE-17905
>                 URL: https://issues.apache.org/jira/browse/HBASE-17905
>             Project: HBase
>          Issue Type: Bug
>            Reporter: Yi Liang
>            Assignee: Yi Liang
>         Attachments: HBASE-17905-V1.patch
>
>
> when using HBase-Spark bulkload api, an argument of tablename is needed, the 
> bulkload can run successfully only if  table exist in HBase.  If table not 
> exist, the bulkload can not run successfully and it even do not report any 
> errors or throw exception. 



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)

Reply via email to