Yang, using HBase cluster's HDFS as the working dir is not a wise way for
this problem. It should be a bug in Kylin.

2017-05-27 14:31 GMT+08:00 Li Yang <[email protected]>:

> Maybe update kylin.properties with below can workaround?
>
> kylin.env.hdfs-working-dir=hdfs://cdh5-mini/kylin
>
>
> On Fri, May 26, 2017 at 1:57 PM, suheng.cloud (JIRA) <[email protected]>
> wrote:
>
> > suheng.cloud created KYLIN-2648:
> > -----------------------------------
> >
> >              Summary: Encounter cube merge error when deploy kylin on
> > stand alone hbase cluster
> >                  Key: KYLIN-2648
> >                  URL: https://issues.apache.org/jira/browse/KYLIN-2648
> >              Project: Kylin
> >           Issue Type: Bug
> >           Components: Job Engine
> >     Affects Versions: v2.0.0
> >          Environment: hadoop :cdh5.4.0 (both main and hbase env)
> > hbase  : hbase-1.2.0-cdh5.7.6
> > hive: apache-hive-2.1.1
> >
> > kylin version: 2.0
> >             Reporter: suheng.cloud
> >             Assignee: Dong Li
> >
> >
> > I try to deploy kylin on one node of a stand alone hbase
> > cluster(hdfs://cdh5-mini/) which seperate from main hive
> > cluster(hdfs://cdh5/),
> > According to the blog "Deploy Apache Kylin with Standalone HBase Cluster"
> > : make sure the configurations of hadoop and hive points to main cluster,
> > I clone hadoop dir to another path and modify "fs.defaultFS" in
> > core-site.xml to "hdfs://cdh5/" , and in head of kylin.sh, I export
> > HADOOP_HOME to this new path.
> > So all goes well (include cube build/refresh) until I execute cube merge.
> > The merge error occurs at step "#9 Step Name: Garbage Collection on
> HDFS".
> >
> >
> > The stacktrace  as follows:
> > 2017-05-25 17:28:07,070 INFO  [pool-9-thread-1]
> > threadpool.DefaultScheduler:114 : CubingJob{id=c6709f0b-8858-
> 4e66-a4c2-320ebc70a2e3,
> > name=kylin_sales_cube - 20120101000000_20140201000000 - MERGE - GMT+08:00
> > 2017-05-25 16:51:30, state=READY} prepare to schedule
> > 2017-05-25 17:28:07,073 INFO  [pool-9-thread-1]
> > threadpool.DefaultScheduler:117 : CubingJob{id=c6709f0b-8858-
> 4e66-a4c2-320ebc70a2e3,
> > name=kylin_sales_cube - 20120101000000_20140201000000 - MERGE - GMT+08:00
> > 2017-05-25 16:51:30, state=READY} scheduled
> > 2017-05-25 17:28:07,075 INFO  [Job c6709f0b-8858-4e66-a4c2-
> 320ebc70a2e3-128]
> > execution.AbstractExecutable:110 : Executing AbstractExecutable
> > (kylin_sales_cube - 20120101000000_20140201000000 - MERGE - GMT+08:00
> > 2017-05-25 16:51:30)
> > 2017-05-25 17:28:07,078 DEBUG [Job c6709f0b-8858-4e66-a4c2-
> 320ebc70a2e3-128]
> > dao.ExecutableDao:217 : updating job output, id: c6709f0b-8858-4e66-a4c2-
> > 320ebc70a2e3
> > 2017-05-25 17:28:07,083 INFO  [pool-9-thread-1]
> > threadpool.DefaultScheduler:124 : Job Fetcher: 0 should running, 1
> actual
> > running, 0 stopped, 1 ready, 19 already succeed, 0 error, 11 discarded, 0
> > others
> > 2017-05-25 17:28:07,083 INFO  [Job c6709f0b-8858-4e66-a4c2-
> 320ebc70a2e3-128]
> > execution.ExecutableManager:389 : job id:c6709f0b-8858-4e66-a4c2-
> 320ebc70a2e3
> > from READY to RUNNING
> > 2017-05-25 17:28:07,105 INFO  [Job c6709f0b-8858-4e66-a4c2-
> 320ebc70a2e3-128]
> > execution.AbstractExecutable:110 : Executing AbstractExecutable (Garbage
> > Collection on HDFS)
> > 2017-05-25 17:28:07,106 DEBUG [Job c6709f0b-8858-4e66-a4c2-
> 320ebc70a2e3-128]
> > dao.ExecutableDao:217 : updating job output, id: c6709f0b-8858-4e66-a4c2-
> > 320ebc70a2e3-08
> > 2017-05-25 17:28:07,111 INFO  [Job c6709f0b-8858-4e66-a4c2-
> 320ebc70a2e3-128]
> > execution.ExecutableManager:389 : job id:c6709f0b-8858-4e66-a4c2-
> 320ebc70a2e3-08
> > from READY to RUNNING
> > 2017-05-25 17:28:07,154 DEBUG [Job c6709f0b-8858-4e66-a4c2-
> 320ebc70a2e3-128]
> > steps.HDFSPathGarbageCollectionStep:78 : Drop HDFS path on FileSystem:
> > hdfs://cdh5
> > 2017-05-25 17:28:07,217 DEBUG [Job c6709f0b-8858-4e66-a4c2-
> 320ebc70a2e3-128]
> > steps.HDFSPathGarbageCollectionStep:90 : HDFS path
> > hdfs:///kylin/kylin_metadata/kylin-a11d510f-d8a5-45c1-b430-bc7def851432
> > not exists.
> > 2017-05-25 17:28:07,249 DEBUG [Job c6709f0b-8858-4e66-a4c2-
> 320ebc70a2e3-128]
> > steps.HDFSPathGarbageCollectionStep:90 : HDFS path
> > hdfs:///kylin/kylin_metadata/kylin-0c1ed2d0-f595-4f58-aaea-2dbe7b41a550
> > not exists.
> > 2017-05-25 17:28:07,320 DEBUG [Job c6709f0b-8858-4e66-a4c2-
> 320ebc70a2e3-128]
> > steps.HDFSPathGarbageCollectionStep:78 : Drop HDFS path on FileSystem:
> > hdfs://cdh5-mini
> > 2017-05-25 17:28:07,324 ERROR [Job c6709f0b-8858-4e66-a4c2-
> 320ebc70a2e3-128]
> > execution.AbstractExecutable:126 : error running Executable:
> > HDFSPathGarbageCollectionStep{id=c6709f0b-8858-4e66-a4c2-
> 320ebc70a2e3-08,
> > name=Garbage Collection on HDFS, state=RUNNING}
> > 2017-05-25 17:28:07,326 DEBUG [Job c6709f0b-8858-4e66-a4c2-
> 320ebc70a2e3-128]
> > dao.ExecutableDao:217 : updating job output, id: c6709f0b-8858-4e66-a4c2-
> > 320ebc70a2e3-08
> > 2017-05-25 17:28:07,331 DEBUG [Job c6709f0b-8858-4e66-a4c2-
> 320ebc70a2e3-128]
> > dao.ExecutableDao:217 : updating job output, id: c6709f0b-8858-4e66-a4c2-
> > 320ebc70a2e3-08
> > 2017-05-25 17:28:07,334 INFO  [Job c6709f0b-8858-4e66-a4c2-
> 320ebc70a2e3-128]
> > execution.ExecutableManager:389 : job id:c6709f0b-8858-4e66-a4c2-
> 320ebc70a2e3-08
> > from RUNNING to ERROR
> > 2017-05-25 17:28:07,335 ERROR [Job c6709f0b-8858-4e66-a4c2-
> 320ebc70a2e3-128]
> > execution.AbstractExecutable:126 : error running Executable:
> > CubingJob{id=c6709f0b-8858-4e66-a4c2-320ebc70a2e3, name=kylin_sales_cube
> > - 20120101000000_20140201000000 - MERGE - GMT+08:00 2017-05-25 16:51:30,
> > state=RUNNING}
> > 2017-05-25 17:28:07,337 DEBUG [Job c6709f0b-8858-4e66-a4c2-
> 320ebc70a2e3-128]
> > dao.ExecutableDao:217 : updating job output, id: c6709f0b-8858-4e66-a4c2-
> > 320ebc70a2e3
> > 2017-05-25 17:28:07,342 DEBUG [Job c6709f0b-8858-4e66-a4c2-
> 320ebc70a2e3-128]
> > dao.ExecutableDao:217 : updating job output, id: c6709f0b-8858-4e66-a4c2-
> > 320ebc70a2e3
> > 2017-05-25 17:28:07,344 INFO  [Job c6709f0b-8858-4e66-a4c2-
> 320ebc70a2e3-128]
> > execution.ExecutableManager:389 : job id:c6709f0b-8858-4e66-a4c2-
> 320ebc70a2e3
> > from RUNNING to ERROR
> > 2017-05-25 17:28:07,345 WARN  [Job c6709f0b-8858-4e66-a4c2-
> 320ebc70a2e3-128]
> > execution.AbstractExecutable:258 : no need to send email, user list is
> > empty
> > 2017-05-25 17:28:07,346 ERROR [pool-10-thread-1]
> > threadpool.DefaultScheduler:146 : ExecuteException
> > job:c6709f0b-8858-4e66-a4c2-320ebc70a2e3
> > org.apache.kylin.job.exception.ExecuteException: org.apache.kylin.job.
> exception.ExecuteException:
> > java.lang.IllegalArgumentException: Wrong FS:
> hdfs:/kylin/kylin_metadata/
> > kylin-a11d510f-d8a5-45c1-b430-bc7def851432, expected: hdfs://cdh5-mini
> >          at org.apache.kylin.job.execution.AbstractExecutable.
> > execute(AbstractExecutable.java:134)
> >          at org.apache.kylin.job.impl.threadpool.DefaultScheduler$
> > JobRunner.run(DefaultScheduler.java:142)
> >          at java.util.concurrent.ThreadPoolExecutor.runWorker(
> > ThreadPoolExecutor.java:1142)
> >          at java.util.concurrent.ThreadPoolExecutor$Worker.run(
> > ThreadPoolExecutor.java:617)
> >          at java.lang.Thread.run(Thread.java:745)
> > Caused by: org.apache.kylin.job.exception.ExecuteException: java.lang.
> IllegalArgumentException:
> > Wrong FS: hdfs:/kylin/kylin_metadata/kylin-a11d510f-d8a5-45c1-b430-
> bc7def851432,
> > expected: hdfs://cdh5-mini
> >          at org.apache.kylin.job.execution.AbstractExecutable.
> > execute(AbstractExecutable.java:134)
> >          at org.apache.kylin.job.execution.DefaultChainedExecutable.
> > doWork(DefaultChainedExecutable.java:64)
> >          at org.apache.kylin.job.execution.AbstractExecutable.
> > execute(AbstractExecutable.java:124)
> >          ... 4 more
> > Caused by: java.lang.IllegalArgumentException: Wrong FS:
> > hdfs:/kylin/kylin_metadata/kylin-a11d510f-d8a5-45c1-b430-bc7def851432,
> > expected: hdfs://cdh5-mini
> >          at org.apache.hadoop.fs.FileSystem.checkPath(
> FileSystem.java:658)
> >          at org.apache.hadoop.hdfs.DistributedFileSystem.getPathName(
> > DistributedFileSystem.java:194)
> >          at org.apache.hadoop.hdfs.DistributedFileSystem.access$
> > 000(DistributedFileSystem.java:106)
> >          at org.apache.hadoop.hdfs.DistributedFileSystem$19.
> > doCall(DistributedFileSystem.java:1215)
> >          at org.apache.hadoop.hdfs.DistributedFileSystem$19.
> > doCall(DistributedFileSystem.java:1211)
> >          at org.apache.hadoop.fs.FileSystemLinkResolver.resolve(
> > FileSystemLinkResolver.java:81)
> >          at org.apache.hadoop.hdfs.DistributedFileSystem.getFileStatus(
> > DistributedFileSystem.java:1211)
> >          at org.apache.hadoop.fs.FileSystem.exists(FileSystem.java:1413)
> >          at org.apache.kylin.storage.hbase.steps.
> > HDFSPathGarbageCollectionStep.dropHdfsPathOnCluster(
> > HDFSPathGarbageCollectionStep.java:85)
> >          at org.apache.kylin.storage.hbase.steps.
> > HDFSPathGarbageCollectionStep.doWork(HDFSPathGarbageCollectionStep.
> > java:65)
> >          at org.apache.kylin.job.execution.AbstractExecutable.
> > execute(AbstractExecutable.java:124)
> >          ... 6 more
> >
> >
> >
> > --
> > This message was sent by Atlassian JIRA
> > (v6.3.15#6346)
> >
>



-- 
Best regards,

Shaofeng Shi 史少锋

Reply via email to