Hi Michael,

Did you try to make the change in the default config file mapred-site.xml?
Kylin uses the default hadoop configs (unless it need some specific
parameters), the recommended place for param like "mapreduce.framework.name"
should be the default config files;

Regarding the HBase version, yes 1.1.3 hasn't been released, I will update
the statement on download page, but HBase 1.1 user should aware that the
fix of HBASE-14269 <https://issues.apache.org/jira/browse/HBASE-14269> is
only included in 1.1.3 and subsequent versions; User should upgrade once it
be released.

Thanks Michael for the sharing, it will help other users I believe.





2016-01-11 15:30 GMT+08:00 michael jones <[email protected]>:

> 1. Making it not run as mapred local I had to explictly add in the kylin
> job config, it wasnt taking it from HADOOP_HOME/etc/
>     <property>
>       <name>mapreduce.cluster.administrators</name>
>       <value>hadoop</value>
>     </property>
>     <property>
>       <name>mapreduce.framework.name</name>
>       <value>yarn</value>
>     </property>
>
> 2. Kylin was defaulting to 8020 for hdfs when making hbase requests, and
> was using 9000 when making hive request. I had to explicitly
> set kylin.hbase.cluster.fs=hdfs://HOSTXX:9000/
>
> I guess this is less of issue with Kylin but more with what combination of
> HBASE, HIVE and DFS and YARN people use. HPD solves it to a certain degree,
> but its not free of its own ness either.  Compounded with the fact that
> kylin does hbase tomcat to launch the binary, its some bit of a challenge
> to get all the ducks in line.
>
> It will be good if the wider community shares their combinations of binary
> versions that have worked for them "error free".
> FYI - i am using HBASE 1.1.1, hadoop(fs,yarn) 2.5.1,  hive 0.14 and Kylin
> 1.3 prebuilt
>
>
>
> Another thing to note is in the docs
> http://kylin.apache.org/download/
> It says requirements as Hbase 1.1.3+  (There hasnt been any + release thus
> far, even 1.1.3 (not +)  is only available RC0 is currently, Is this a
> typo?
>
> On Sun, Jan 10, 2016 at 5:08 PM, ShaoFeng Shi <[email protected]>
> wrote:
>
> > Hi Michael, I saw your origin message but had no idea;
> >
> > I don't undrestand "realized that kylin by default was trying to do
> mapred
> > local instead of on submitting to yarn clusters"; Kylin will load the
> > default hadoop configs from user's installation, so user need make sure
> the
> > server that runs Kylin has been properly configured with the cluster. You
> > mentioned that "fixed that with params in the kylin.properties and
> > yarn-site.xml", could you please elaborate which parameter is related
> with
> > this and what change you made?
> >
> > The HDFS port issue, I didn't find where Kylin hard-code 8020 (except the
> > test case configuration which assume connecting with a HDP sandbox). If
> > this is a bug, could you report a JIRA with your environment information
> as
> > well as the error logs? That would be helpful for understanding the
> issue,
> > thank you;
> >
> >
> >
> > 2016-01-10 12:44 GMT+08:00 michael jones <[email protected]>:
> >
> > > After some bit of digging realized that kylin by default was trying to
> do
> > > mapred local instead of on submitting to yarn clusters, fixed that with
> > > params in the kylin.properties and yarn-site.xml.  These errors were
> not
> > > quite intuitive in the logs, prolly need more trace in debug mode.
> > >
> > > Another issue is with default dfs port. Hadoop 2.5 onwards frequent
> usage
> > > is 9000. While older version defaults to 8020.
> > >
> > > In kylin 1.2 and 1.3 explicit override the dfs port dint work for me/
> > dint
> > > find the right property to set.
> > > Instead restart dfs to listen on 8020 instead of 9000. This solve the
> > > subsequent issue.
> > >
> > > FYI for those who are using this configuration
> > > Kylin 1.2 or 1.3
> > > Hbase 1.1.X
> > > Hadoop 2.5.x or 2.6.x
> > >
> > >
> > >
> > > On Fri, Jan 8, 2016 at 1:32 PM, michael jones <[email protected]>
> > > wrote:
> > >
> > > > I am consistently getting NPE for any cube building on stage2 of the
> > > > process, for any cube.
> > > > The sessionId consistently comes as null. Most likely i am missing
> > > setting
> > > > somthing in the conf. Am using the default kylin.properties of
> prebuit
> > > > kylin 1.3 binary download.
> > > >
> > > >
> > > >
> > >
> >
> +------------------------------------------------------------------------------------------------------+
> > > > | Extract Fact Table Distinct Columns
> > > >                              |
> > > >
> > > >
> > >
> >
> +------------------------------------------------------------------------------------------------------+
> > > > 2016-01-08 13:26:24,895 INFO  [pool-7-thread-1]
> > > Configuration.deprecation:
> > > > session.id is deprecated. Instead, use dfs.metrics.session-id
> > > > 2016-01-08 13:26:24,896 INFO  [pool-7-thread-1] jvm.JvmMetrics:
> > > > Initializing JVM Metrics with processName=JobTracker, *sessionId=*
> > > >
> > > > ==> /opt/kylin/tomcat/logs/kylin_job.log <==
> > > > [pool-7-thread-1]:[2016-01-08
> > > >
> > >
> >
> 13:26:24,905][ERROR][org.apache.kylin.job.execution.AbstractExecutable.execute(AbstractExecutable.java:109)]
> > > > - error running Executable
> > > > java.lang.NullPointerException
> > > > at org.apache.kylin.job.common.MapReduceExecutable.onExecuteStart(
> > > > *MapReduceExecutable.java:79*)
> > > > at
> > > >
> > >
> >
> org.apache.kylin.job.execution.AbstractExecutable.execute(AbstractExecutable.java:106)
> > > > at
> > > >
> > >
> >
> org.apache.kylin.job.execution.DefaultChainedExecutable.doWork(DefaultChainedExecutable.java:51)
> > > > at
> > > >
> > >
> >
> org.apache.kylin.job.execution.AbstractExecutable.execute(AbstractExecutable.java:107)
> > > > at
> > > >
> > >
> >
> org.apache.kylin.job.impl.threadpool.DefaultScheduler$JobRunner.run(DefaultScheduler.java:130)
> > > > at
> > > >
> > >
> >
> java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1142)
> > > > at
> > > >
> > >
> >
> java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:617)
> > > > at java.lang.Thread.run(Thread.java:745)
> > > >
> > > >
> > > >
> > > >
> > >
> >
> https://github.com/apache/kylin/blob/master/job/src/main/java/org/apache/kylin/job/common/MapReduceExecutable.java#L79
> > > >
> > > > Configuration conf = HadoopUtil.getCurrentConfiguration();
> > > > Job job = new Cluster(conf).getJob(JobID.forName(mrJobId));
> > > > if (job.getJobState() == JobStatus.State.FAILED) {
> > > >     //remove previous mr job info
> > > >     super.onExecuteStart(executableContext);
> > > > }
> > > >
> > > > Either Kylin is not able to get the hadoop cluster or not able to
> > submit
> > > > the job.
> > > > Anyone encountered this issue before ?  I couldnt find in the docs on
> > how
> > > > to yarn cluster endpoint properties in kylin.properties
> > > >
> > > > Thanks
> > > >
> > > >
> > > >
> > > >
> > > >
> > > > On Thu, Jan 7, 2016 at 4:59 PM, michael jones <[email protected]
> >
> > > > wrote:
> > > >
> > > >> Kyline - apache-kylin-1.3-HBase-1.1-SNAPSHOT-bin
> > > >> Hbase - 1.1.2
> > > >> Hadoop - 2.7.1
> > > >> Hive 1.2.1 - Updated hive metastore to use mysqldb  (instead of
> derby)
> > > >> Hive/Hbase by itself work fine tested with created tables etc.
> > > >>
> > > >> Kylin Sandbox/Local setup.
> > > >>
> > > >> Kylin starts properly except for this error
> > > >> java.io.FileNotFoundException:
> > > >> /opt/hadoop-2.7.1/contrib/capacity-scheduler/*.jar (No such file or
> > > >> directory)
> > > >> (I figure this is ok to ignore since the default task scheduler in
> > local
> > > >> mode should work fine ?)
> > > >>
> > > >> After running bin/sample.sh
> > > >>
> > > >> Tried to follow steps as mentioned in the tutorial, When building
> the
> > > >> cube, stage 1 successed.
> > > >> #2 Step Name: Extract Fact Table Distinct Columns
> > > >> Fails with NPE.
> > > >>
> > > >>  -conf /opt/kylin/conf/kylin_job_conf.xml -cubename kylin_sales_cube
> > > >> -output
> > > >>
> > >
> >
> /kylin/kylin_metadata/kylin-1575f8f6-98df-44da-8ac2-b480ccd0380a/kylin_sales_cube/fact_distinct_columns
> > > >> -jobname Kylin_Fact_Distinct_Columns_kylin_sales_cube_Step
> -tablename
> > > >>
> > >
> >
> default.kylin_intermediate_kylin_sales_cube_desc_19700101000000_20160112000000_1575f8f6_98df_44da_8ac2_b480ccd0380a
> > > >>
> > > >> java.lang.NullPointerException
> > > >> at
> > > >>
> > >
> >
> org.apache.kylin.job.common.MapReduceExecutable.onExecuteStart(MapReduceExecutable.java:79)
> > > >> at
> > > >>
> > >
> >
> org.apache.kylin.job.execution.AbstractExecutable.execute(AbstractExecutable.java:106)
> > > >> at
> > > >>
> > >
> >
> org.apache.kylin.job.execution.DefaultChainedExecutable.doWork(DefaultChainedExecutable.java:51)
> > > >> at
> > > >>
> > >
> >
> org.apache.kylin.job.execution.AbstractExecutable.execute(AbstractExecutable.java:107)
> > > >> at
> > > >>
> > >
> >
> org.apache.kylin.job.impl.threadpool.DefaultScheduler$JobRunner.run(DefaultScheduler.java:130)
> > > >> at
> > > >>
> > >
> >
> java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1142)
> > > >> at
> > > >>
> > >
> >
> java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:617)
> > > >> at java.lang.Thread.run(Thread.java:745)
> > > >>
> > > >>
> > > >> kylin.log
> > > >>
> > > >> [http-bio-7070-exec-4]:[2016-01-07
> > > >>
> > >
> >
> 15:31:37,797][DEBUG][org.apache.kylin.rest.filter.KylinApiFilter.logRequest(KylinApiFilter.java:120)]
> > > >> - REQUEST: REQUESTER=ADMIN;REQ_TIME=GMT-08:00 2016-01-07
> > > >>
> > >
> >
> 15:31:37;URI=/kylin/api/jobs/1575f8f6-98df-44da-8ac2-b480ccd0380a/resume;METHOD=PUT;QUERY_STRING=null;PAYLOAD=;RESP_STATUS=200;
> > > >> [pool-6-thread-1]:[2016-01-07
> > > >>
> > >
> >
> 15:32:17,880][INFO][org.apache.kylin.job.impl.threadpool.DefaultScheduler$FetcherRunner.run(DefaultScheduler.java:102)]
> > > >> - CubingJob{id=1575f8f6-98df-44da-8ac2-b480ccd0380a,
> > > name=kylin_sales_cube
> > > >> - 19700101000000_20160112000000 - BUILD - GMT-08:00 2016-01-07
> > 14:39:59,
> > > >> state=READY} prepare to schedule
> > > >> [pool-6-thread-1]:[2016-01-07
> > > >>
> > >
> >
> 15:32:17,880][INFO][org.apache.kylin.job.impl.threadpool.DefaultScheduler$FetcherRunner.run(DefaultScheduler.java:106)]
> > > >> - CubingJob{id=1575f8f6-98df-44da-8ac2-b480ccd0380a,
> > > name=kylin_sales_cube
> > > >> - 19700101000000_20160112000000 - BUILD - GMT-08:00 2016-01-07
> > 14:39:59,
> > > >> state=READY} scheduled
> > > >>
> > >
> >
> +------------------------------------------------------------------------------------------------------+[pool-6-thread-1]:[2016-01-07
> > > >>
> > >
> >
> 15:32:17,880][INFO][org.apache.kylin.job.impl.threadpool.DefaultScheduler$FetcherRunner.run(DefaultScheduler.java:112)]
> > > >> - Job Fetcher: 0 running, 1 actual running, 1 ready, 0 others
> > > >>
> > > >> | kylin_sales_cube - 19700101000000_20160112000000 - BUILD -
> GMT-08:00
> > > >> 2016-01-07 14:39:59             |
> > > >>
> > > >>
> > >
> >
> +------------------------------------------------------------------------------------------------------+
> > > >> [pool-7-thread-2]:[2016-01-07
> > > >>
> > >
> >
> 15:32:17,883][DEBUG][org.apache.kylin.common.persistence.ResourceStore.putResource(ResourceStore.java:200)]
> > > >> - Saving resource
> /execute_output/1575f8f6-98df-44da-8ac2-b480ccd0380a
> > > >> (Store kylin_metadata@hbase)
> > > >> [pool-7-thread-2]:[2016-01-07
> > > >>
> > >
> >
> 15:32:17,885][INFO][org.apache.kylin.job.manager.ExecutableManager.updateJobOutput(ExecutableManager.java:241)]
> > > >> - job id:1575f8f6-98df-44da-8ac2-b480ccd0380a from READY to RUNNING
> > > >>
> > > >>
> > >
> >
> +------------------------------------------------------------------------------------------------------+
> > > >> | Extract Fact Table Distinct Columns
> > > >>                              |
> > > >>
> > > >>
> > >
> >
> +------------------------------------------------------------------------------------------------------+
> > > >> 2016-01-07 15:32:17,917 INFO  [pool-7-thread-2] jvm.JvmMetrics:
> Cannot
> > > >> initialize JVM Metrics with processName=JobTracker, sessionId= -
> > already
> > > >> initialized
> > > >> [pool-7-thread-2]:[2016-01-07
> > > >>
> > >
> >
> 15:32:17,918][ERROR][org.apache.kylin.job.execution.AbstractExecutable.execute(AbstractExecutable.java:109)]
> > > >> - error running Executable
> > > >> java.lang.NullPointerException
> > > >>         at
> > > >>
> > >
> >
> org.apache.kylin.job.common.MapReduceExecutable.onExecuteStart(MapReduceExecutable.java:79)
> > > >>         at
> > > >>
> > >
> >
> org.apache.kylin.job.execution.AbstractExecutable.execute(AbstractExecutable.java:106)
> > > >>         at
> > > >>
> > >
> >
> org.apache.kylin.job.execution.DefaultChainedExecutable.doWork(DefaultChainedExecutable.java:51)
> > > >>         at
> > > >>
> > >
> >
> org.apache.kylin.job.execution.AbstractExecutable.execute(AbstractExecutable.java:107)
> > > >>         at
> > > >>
> > >
> >
> org.apache.kylin.job.impl.threadpool.DefaultScheduler$JobRunner.run(DefaultScheduler.java:130)
> > > >>         at
> > > >>
> > >
> >
> java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1142)
> > > >>         at
> > > >>
> > >
> >
> java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:617)
> > > >>         at java.lang.Thread.run(Thread.java:745)
> > > >> [pool-7-thread-2]:[2016-01-07
> > > >>
> > >
> >
> 15:32:17,923][DEBUG][org.apache.kylin.common.persistence.ResourceStore.putResource(ResourceStore.java:200)]
> > > >> - Saving resource
> > > /execute_output/1575f8f6-98df-44da-8ac2-b480ccd0380a-01
> > > >> (Store kylin_metadata@hbase)
> > > >> [pool-7-thread-2]:[2016-01-07
> > > >>
> > >
> >
> 15:32:17,927][DEBUG][org.apache.kylin.common.persistence.ResourceStore.putResource(ResourceStore.java:200)]
> > > >> - Saving resource
> > > /execute_output/1575f8f6-98df-44da-8ac2-b480ccd0380a-01
> > > >> (Store kylin_metadata@hbase)
> > > >> [pool-7-thread-2]:[2016-01-07
> > > >>
> > >
> >
> 15:32:17,929][INFO][org.apache.kylin.job.manager.ExecutableManager.updateJobOutput(ExecutableManager.java:241)]
> > > >> - job id:1575f8f6-98df-44da-8ac2-b480ccd0380a-01 from READY to ERROR
> > > >> [pool-7-thread-2]:[2016-01-07
> > > >>
> > >
> >
> 15:32:17,929][ERROR][org.apache.kylin.job.execution.AbstractExecutable.execute(AbstractExecutable.java:109)]
> > > >> - error running Executable
> > > >> org.apache.kylin.job.exception.ExecuteException:
> > > >> java.lang.NullPointerException
> > > >>         at
> > > >>
> > >
> >
> org.apache.kylin.job.execution.AbstractExecutable.execute(AbstractExecutable.java:111)
> > > >>         at
> > > >>
> > >
> >
> org.apache.kylin.job.execution.DefaultChainedExecutable.doWork(DefaultChainedExecutable.java:51)
> > > >>         at
> > > >>
> > >
> >
> org.apache.kylin.job.execution.AbstractExecutable.execute(AbstractExecutable.java:107)
> > > >>         at
> > > >>
> > >
> >
> org.apache.kylin.job.impl.threadpool.DefaultScheduler$JobRunner.run(DefaultScheduler.java:130)
> > > >>         at
> > > >>
> > >
> >
> java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1142)
> > > >>         at
> > > >>
> > >
> >
> java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:617)
> > > >>         at java.lang.Thread.run(Thread.java:745)
> > > >> Caused by: java.lang.NullPointerException
> > > >>         at
> > > >>
> > >
> >
> org.apache.kylin.job.common.MapReduceExecutable.onExecuteStart(MapReduceExecutable.java:79)
> > > >>         at
> > > >>
> > >
> >
> org.apache.kylin.job.execution.AbstractExecutable.execute(AbstractExecutable.java:106)
> > > >>         ... 6 more
> > > >> [pool-7-thread-2]:[2016-01-07
> > > >>
> > >
> >
> 15:32:17,933][DEBUG][org.apache.kylin.common.persistence.ResourceStore.putResource(ResourceStore.java:200)]
> > > >> - Saving resource
> /execute_output/1575f8f6-98df-44da-8ac2-b480ccd0380a
> > > >> (Store kylin_metadata@hbase)
> > > >> [pool-7-thread-2]:[2016-01-07
> > > >>
> > >
> >
> 15:32:17,937][DEBUG][org.apache.kylin.common.persistence.ResourceStore.putResource(ResourceStore.java:200)]
> > > >> - Saving resource
> /execute_output/1575f8f6-98df-44da-8ac2-b480ccd0380a
> > > >> (Store kylin_metadata@hbase)
> > > >> [pool-7-thread-2]:[2016-01-07
> > > >>
> > >
> >
> 15:32:17,938][INFO][org.apache.kylin.job.manager.ExecutableManager.updateJobOutput(ExecutableManager.java:241)]
> > > >> - job id:1575f8f6-98df-44da-8ac2-b480ccd0380a from RUNNING to ERROR
> > > >> [pool-7-thread-2]:[2016-01-07
> > > >>
> > >
> >
> 15:32:17,938][ERROR][org.apache.kylin.job.impl.threadpool.DefaultScheduler$JobRunner.run(DefaultScheduler.java:134)]
> > > >> - ExecuteException job:1575f8f6-98df-44da-8ac2-b480ccd0380a
> > > >> org.apache.kylin.job.exception.ExecuteException:
> > > >> org.apache.kylin.job.exception.ExecuteException:
> > > >> java.lang.NullPointerException
> > > >>
> > > >>
> > > >>
> > > >>
> > > >> Am I missing something setup on kylin properties or missing jars ?
> > > >>
> > > >>
> > > >>
> > > >
> > >
> >
> >
> >
> > --
> > Best regards,
> >
> > Shaofeng Shi
> >
>



-- 
Best regards,

Shaofeng Shi

Reply via email to