[jira] [Created] (KYLIN-4037) Can't Cleanup Data in Hbase's HDFS Storage When Deploy Apache Kylin with Standalone HBase Cluster

2019-06-10 Thread wangxiaojing (JIRA)
wangxiaojing created KYLIN-4037:
---

 Summary: Can't Cleanup Data in Hbase's HDFS Storage When Deploy 
Apache Kylin with Standalone HBase Cluster
 Key: KYLIN-4037
 URL: https://issues.apache.org/jira/browse/KYLIN-4037
 Project: Kylin
  Issue Type: Bug
  Components: Others, Tools, Build and Test
Affects Versions: v2.6.2, v2.6.1, Future, all
Reporter: wangxiaojing
 Attachments: image-2019-06-10-15-56-58-124.png

Hello,

     When Deploy Apache Kylin with Standalone HBase Cluster and use kylin 
cleanup storage tools to cleanup dirty data ,but it seems only cleanup the data 
In computer cluster HDFS/HIVE and table in hbase cluster, not clean the HDFS 
data in hbase,also have many dirdy data in hbase HDFS .

    Tools: [http://kylin.apache.org/docs/howto/howto_cleanup_storage.html]

     Version:master as well as 2.6.2 release.

     Data that should be cleaned up in the HBase HDFS cluster ,which cube or 
segment has been droped.

!image-2019-06-10-15-56-58-124.png!



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


Re: Re: Deploy Apache Kylin with Standalone HBase Cluster

2019-04-18 Thread ShaoFeng Shi
thanks for the sharing; Is the beeline + "zookeeper discoverer model"  also
okay finally?

Best regards,

Shaofeng Shi 史少锋
Apache Kylin PMC
Email: shaofeng...@apache.org

Apache Kylin FAQ: https://kylin.apache.org/docs/gettingstarted/faq.html
Join Kylin user mail group: user-subscr...@kylin.apache.org
Join Kylin dev mail group: dev-subscr...@kylin.apache.org




lk_hadoop  于2019年4月18日周四 上午10:34写道:

> I made some mistake , both cluster with NN-HA configure is OK .
>
> 2019-04-18
>
> lk_hadoop
>
>
>
> 发件人:"lk_hadoop"
> 发送时间:2019-04-17 15:20
> 主题:Re: Deploy Apache Kylin with Standalone HBase Cluster
> 收件人:"dev"
> 抄送:
>
> finally I succeed with read-write separation deployment . There are two
> point cause my failure :
> 1、when use beeline to connect hive ,should not use zookeeper discoverer
> model ,should connect to one of the hiveservers directly.
> 2、should not configure NN HA to connect to Hbase cluster , although I
> configured kylin.storage.hbase.cluster-hdfs-config-file=hbase.hdfs.xml ,
> JOB failed when step to : Convert Cuboid Data to HFile.
> Error Message :
> java.lang.RuntimeException: Could not find any configured addresses for
> URI
> hdfs://nameservice1/user/mykylin/kylin_metadata/kylin-4fdee76b-6b73-087a-b9ad-6cf17dd84aad/kylin_sales_cube/hfile
>
>
> 2019-04-17
>
> lk_hadoop
>
>
>
> 发件人:"lk_hadoop"
> 发送时间:2019-04-16 15:30
> 主题:Deploy Apache Kylin with Standalone HBase Cluster
> 收件人:"dev"
> 抄送:
>
> hi,all:
> I want to try read-write separation deployment . Is the Standalone
> HBase Cluster should use the same HDFS withe the Main Cluster ? My Hbase
> cluster is completly separate with main cluster , both cluster's HDFS is NN
> HA , I can't sucess with read-write separation deployment .
>
> 2019-04-16
>
>
> lk_hadoop


Re: Re: Deploy Apache Kylin with Standalone HBase Cluster

2019-04-17 Thread lk_hadoop
I made some mistake , both cluster with NN-HA configure is OK .

2019-04-18 

lk_hadoop 



发件人:"lk_hadoop"
发送时间:2019-04-17 15:20
主题:Re: Deploy Apache Kylin with Standalone HBase Cluster
收件人:"dev"
抄送:

finally I succeed with read-write separation deployment . There are two point 
cause my failure : 
1、when use beeline to connect hive ,should not use zookeeper discoverer model 
,should connect to one of the hiveservers directly. 
2、should not configure NN HA to connect to Hbase cluster , although I 
configured kylin.storage.hbase.cluster-hdfs-config-file=hbase.hdfs.xml , JOB 
failed when step to : Convert Cuboid Data to HFile. 
Error Message :   
java.lang.RuntimeException: Could not find any configured addresses for URI 
hdfs://nameservice1/user/mykylin/kylin_metadata/kylin-4fdee76b-6b73-087a-b9ad-6cf17dd84aad/kylin_sales_cube/hfile
 

2019-04-17  

lk_hadoop  



发件人:"lk_hadoop" 
发送时间:2019-04-16 15:30 
主题:Deploy Apache Kylin with Standalone HBase Cluster 
收件人:"dev" 
抄送: 

hi,all:  
I want to try read-write separation deployment . Is the Standalone HBase 
Cluster should use the same HDFS withe the Main Cluster ? My Hbase cluster is 
completly separate with main cluster , both cluster's HDFS is NN HA , I can't 
sucess with read-write separation deployment .  

2019-04-16  


lk_hadoop   

Re: Deploy Apache Kylin with Standalone HBase Cluster

2019-04-17 Thread lk_hadoop
finally I succeed with read-write separation deployment . There are two point 
cause my failure :
1、when use beeline to connect hive ,should not use zookeeper discoverer model 
,should connect to one of the hiveservers directly.
2、should not configure NN HA to connect to Hbase cluster , although I 
configured kylin.storage.hbase.cluster-hdfs-config-file=hbase.hdfs.xml , JOB 
failed when step to : Convert Cuboid Data to HFile.
Error Message :  
java.lang.RuntimeException: Could not find any configured addresses for URI 
hdfs://nameservice1/user/mykylin/kylin_metadata/kylin-4fdee76b-6b73-087a-b9ad-6cf17dd84aad/kylin_sales_cube/hfile

2019-04-17 

lk_hadoop 



发件人:"lk_hadoop"
发送时间:2019-04-16 15:30
主题:Deploy Apache Kylin with Standalone HBase Cluster
收件人:"dev"
抄送:

hi,all: 
I want to try read-write separation deployment . Is the Standalone HBase 
Cluster should use the same HDFS withe the Main Cluster ? My Hbase cluster is 
completly separate with main cluster , both cluster's HDFS is NN HA , I can't 
sucess with read-write separation deployment . 

2019-04-16 


lk_hadoop  

Deploy Apache Kylin with Standalone HBase Cluster

2019-04-16 Thread lk_hadoop
hi,all:
I want to try read-write separation deployment . Is the Standalone HBase 
Cluster should use the same HDFS withe the Main Cluster ? My Hbase cluster is 
completly separate with main cluster , both cluster's HDFS is NN HA , I can't 
sucess with read-write separation deployment .

2019-04-16


lk_hadoop