[ 
https://issues.apache.org/jira/browse/GOBBLIN-12?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16093032#comment-16093032
 ] 

mazhiyong commented on GOBBLIN-12:
----------------------------------

Thanks [~Zhixiong Chen],
here are my config files.
#job.conf
gobblin.yarn.work.dir=/user/yarn/gobblin-yarn

fs.uri=hdfs://mzcluster

mr.job.max.mappers=1
mr.job.root.dir=${gobblin.yarn.work.dir}/working

topic.whitelist=test

kafka.brokers=mz-bigdata-32-201:9092,mz-bigdata-32-202:9092,mz-bigdata-32-203:9092
kafka.workunit.packer.type=BI_LEVEL

bootstrap.with.offset=earliest

job.name=GobblinKafkaQuickStart
job.group=GobblinKafka
job.description=Gobblin quick start job for Kafka
job.schedule=0 0/10 * * * ?

source.class=gobblin.source.extractor.extract.kafka.KafkaSimpleSource

extract.namespace=gobblin.extract.kafka
extract.limit.enabled=true
extract.limit.type=time
extract.limit.timeLimitTimeunit=minutes
extract.limit.timeLimit=10

writer.builder.class=gobblin.writer.SimpleDataWriterBuilder
writer.file.path.type=tablename
writer.destination.type=HDFS
writer.output.format=txt
writer.fs.uri=${fs.uri}

metrics.reporting.file.enabled=true
metrics.log.dir=${gobblin.yarn.work.dir}/metrics
metrics.reporting.file.suffix=txt

data.publisher.type=gobblin.publisher.BaseDataPublisher
data.publisher.final.dir=/gobblintest/job-output
data.publisher.replace.final.dir=false

#application.conf
gobblin.yarn.app.name=gobblin-yarn
gobblin.yarn.app.master.memory.mbs=512
gobblin.yarn.initial.containers=2
gobblin.yarn.container.memory.mbs=2048
gobblin.yarn.conf.dir=${gobblin.home.dir}/conf/yarn
gobblin.yarn.lib.jars.dir=${gobblin.home.dir}/lib
gobblin.yarn.app.master.files.local=${gobblin.yarn.conf.dir}"/log4j-yarn.properties,"${gobblin.yarn.conf.dir}"/application.conf,"${gobblin.yarn.conf.dir}"/reference.conf"
gobblin.yarn.container.files.local=${gobblin.yarn.app.master.files.local}
gobblin.yarn.logs.sink.root.dir=${gobblin.home.dir}/applogs

gobblin.cluster.helix.cluster.name=gobblin-yarn
gobblin.cluster.job.conf.path=${gobblin.home.dir}/jobconf

writer.fs.uri=${fs.uri}
state.store.fs.uri=${fs.uri}

writer.destination.type=HDFS
writer.output.format=AVRO
writer.staging.dir=${gobblin.yarn.work.dir}/task-staging
writer.output.dir=${gobblin.yarn.work.dir}/task-output

data.publisher.type=gobblin.publisher.BaseDataPublisher
data.publisher.final.dir=${gobblin.yarn.work.dir}/job-output
data.publisher.replace.final.dir=false

state.store.dir=${gobblin.yarn.work.dir}/state-store

qualitychecker.row.err.file=${gobblin.yarn.work.dir}/err

job.lock.enabled=true
job.lock.type=gobblin.runtime.locks.ZookeeperBasedJobLock

metrics.log.dir=${gobblin.yarn.work.dir}/metrics

task.status.reportintervalinms=1000

metrics.enabled=true

#reference.conf
gobblin.home.dir=/opt/gobblin-dist

gobblin.yarn.app.queue=default
gobblin.yarn.app.name=gobblin-yarn
gobblin.yarn.app.master.memory.mbs=512
gobblin.yarn.app.master.cores=1
gobblin.yarn.app.report.interval.minutes=5
gobblin.yarn.max.get.app.report.failures=4
gobblin.yarn.email.notification.on.shutdown=false
gobblin.yarn.initial.containers=1
gobblin.yarn.container.memory.mbs=512
gobblin.yarn.container.cores=1
gobblin.yarn.container.affinity.enabled=true
gobblin.yarn.helix.instance.max.retries=2
gobblin.yarn.keytab.login.interval.minutes=1440
gobblin.yarn.token.renew.interval.minutes=720
gobblin.yarn.work.dir=/gobblin

gobblin.cluster.helix.cluster.name=gobblin-yarn
gobblin.cluster.zk.connection.string=${zookeeper.connection.string}
gobblin.locks.zookeeper.acquire.timeout_milliseconds=10000
gobblin.locks.zookeeper.connection_string=${zookeeper.connection.string}

fs.uri="hdfs://mzcluster"

job.execinfo.server.enabled=false

zookeeper.connection.string="mz-bigdata-32-20:2181,mz-bigdata-32-202:2181,mz-bigdata-32-203:2181"
zookeeper.session.timeout.seconds=180
zookeeper.connection.timeout.seconds=30
zookeeper.retry.backoff.seconds=1
zookeeper.retry.count.max=10
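
When double-checking a value like zookeeper.connection.string above, it can help to split it into individual host/port entries and compare each one against the real hostnames. The following is only a minimal sketch: the class and method names are made up for illustration, it is not part of Gobblin, and it ignores the optional chroot suffix that a ZooKeeper connection string may carry. It assumes ZooKeeper's default client port 2181 when an entry has no port.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical helper, not part of Gobblin or ZooKeeper.
public class ZkConnectStringSketch {

    // Splits "host1:port1,host2:port2" into {host, port} pairs.
    // Entries without a port fall back to 2181 (ZooKeeper's default client port).
    static List<String[]> parse(String connectString) {
        List<String[]> entries = new ArrayList<>();
        for (String entry : connectString.split(",")) {
            String trimmed = entry.trim();
            int colon = trimmed.lastIndexOf(':');
            if (colon < 0) {
                entries.add(new String[] {trimmed, "2181"});
            } else {
                entries.add(new String[] {trimmed.substring(0, colon), trimmed.substring(colon + 1)});
            }
        }
        return entries;
    }

    public static void main(String[] args) {
        // Value copied from the reference.conf shown above.
        String configured = "mz-bigdata-32-20:2181,mz-bigdata-32-202:2181,mz-bigdata-32-203:2181";
        for (String[] hostPort : parse(configured)) {
            System.out.println("host=" + hostPort[0] + " port=" + hostPort[1]);
        }
    }
}
```

Printing the entries one per line makes it easier to compare them with the hosts that actually run ZooKeeper.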

> Gobblin on YARN mode: AbstractJobLauncher can't lock and throws an Exception 
> for some reason
> -------------------------------------------------------------------------------------
>
>                 Key: GOBBLIN-12
>                 URL: https://issues.apache.org/jira/browse/GOBBLIN-12
>             Project: Apache Gobblin
>          Issue Type: Bug
>            Reporter: mazhiyong
>
> When I run bin/gobblin-yarn.sh start, the job throws an exception:
> ## ApplicationMaster Log
> 2017-06-11 00:45:00 CST WARN  [TaskStateCollectorService RUNNING] 
> gobblin.runtime.TaskStateCollectorService  - No output task state files found 
> in 
> hdfs://mycluster/user/yarn/GobblinYarn/application_1497109223607_0007/_taskstates/job_GobblinKafkaQuickStart_1497112920089
> 2017-06-11 00:46:00 CST INFO  [DefaultQuartzScheduler_Worker-3] 
> org.quartz.core.JobRunShell  - Job GobblinKafka.GobblinKafkaQuickStart threw 
> a JobExecutionException: 
> org.quartz.JobExecutionException: gobblin.runtime.JobException: Previous 
> instance of job GobblinKafkaQuickStart is still running, skipping this 
> scheduled run [See nested exception: gobblin.runtime.JobException: Previous 
> instance of job GobblinKafkaQuickStart is still running, skipping this 
> scheduled run]
>       at gobblin.cluster.GobblinHelixJob.executeImpl(GobblinHelixJob.java:87)
>       at gobblin.scheduler.BaseGobblinJob.execute(BaseGobblinJob.java:53)
>       at org.quartz.core.JobRunShell.run(JobRunShell.java:202)
>       at 
> org.quartz.simpl.SimpleThreadPool$WorkerThread.run(SimpleThreadPool.java:573)
> Caused by: gobblin.runtime.JobException: Previous instance of job 
> GobblinKafkaQuickStart is still running, skipping this scheduled run
>       at 
> gobblin.runtime.AbstractJobLauncher.<init>(AbstractJobLauncher.java:163)
>       at 
> gobblin.runtime.AbstractJobLauncher.<init>(AbstractJobLauncher.java:144)
>       at 
> gobblin.cluster.GobblinHelixJobLauncher.<init>(GobblinHelixJobLauncher.java:127)
>       at gobblin.cluster.GobblinHelixJob.executeImpl(GobblinHelixJob.java:65)
>       ... 3 more
> 2017-06-11 00:46:00 CST WARN  [TaskStateCollectorService RUNNING] 
> gobblin.runtime.TaskStateCollectorService  - No output task state files found 
> in 
> hdfs://mycluster/user/yarn/GobblinYarn/application_1497109223607_0007/_taskstates/job_GobblinKafkaQuickStart_1497112920089
> 2017-06-11 00:47:00 CST WARN  [TaskStateCollectorService RUNNING] 
> gobblin.runtime.TaskStateCollectorService  - No output task state files found 
> in 
> hdfs://mycluster/user/yarn/GobblinYarn/application_1497109223607_0007/_taskstates/job_GobblinKafkaQuickStart_1497112920089
> 2017-06-11 00:48:00 CST INFO  [DefaultQuartzScheduler_Worker-4] 
> org.quartz.core.JobRunShell  - Job GobblinKafka.GobblinKafkaQuickStart threw 
> a JobExecutionException: 
> org.quartz.JobExecutionException: gobblin.runtime.JobException: Previous 
> instance of job GobblinKafkaQuickStart is still running, skipping this 
> scheduled run [See nested exception: gobblin.runtime.JobException: Previous 
> instance of job GobblinKafkaQuickStart is still running, skipping this 
> scheduled run]
>       at gobblin.cluster.GobblinHelixJob.executeImpl(GobblinHelixJob.java:87)
>       at gobblin.scheduler.BaseGobblinJob.execute(BaseGobblinJob.java:53)
>       at org.quartz.core.JobRunShell.run(JobRunShell.java:202)
>       at 
> org.quartz.simpl.SimpleThreadPool$WorkerThread.run(SimpleThreadPool.java:573)
> Caused by: gobblin.runtime.JobException: Previous instance of job 
> GobblinKafkaQuickStart is still running, skipping this scheduled run
>       at 
> gobblin.runtime.AbstractJobLauncher.<init>(AbstractJobLauncher.java:163)
>       at 
> gobblin.runtime.AbstractJobLauncher.<init>(AbstractJobLauncher.java:144)
>       at 
> gobblin.cluster.GobblinHelixJobLauncher.<init>(GobblinHelixJobLauncher.java:127)
>       at gobblin.cluster.GobblinHelixJob.executeImpl(GobblinHelixJob.java:65)
>       ... 3 more
> public AbstractJobLauncher(Properties jobProps, List<? extends Tag<?>> metadataTags,
>       @Nullable SharedResourcesBroker<GobblinScopeTypes> instanceBroker)
>       throws Exception {
>     Preconditions.checkArgument(jobProps.containsKey(ConfigurationKeys.JOB_NAME_KEY),
>         "A job must have a job name specified by job.name");
>     // Add clusterIdentifier tag so that it is added to any new TaskState created
>     List<Tag<?>> clusterNameTags = Lists.newArrayList();
>     clusterNameTags.addAll(Tag.fromMap(ClusterNameTags.getClusterNameTags()));
>     GobblinMetrics.addCustomTagsToProperties(jobProps, clusterNameTags);
>     // Make a copy for both the system and job configuration properties
>     this.jobProps = new Properties();
>     this.jobProps.putAll(jobProps);
>     if (!tryLockJob(this.jobProps)) {
>       throw new JobException(String.format(
>           "Previous instance of job %s is still running, skipping this scheduled run",
>           this.jobProps.getProperty(ConfigurationKeys.JOB_NAME_KEY)));
>     }
> .......
>  
> *Github Url* : https://github.com/linkedin/gobblin/issues/1938 
> *Github Reporter* : [~phenixmzy] 
> *Github Created At* : 2017-06-10T20:02:43Z 
> *Github Updated At* : 2017-07-12T21:49:48Z 
> h3. Comments 
> ----
> [~phenixmzy] wrote on 2017-06-12T09:13:49Z : I am using ZookeeperBasedJobLock 
> and found that, in AbstractJobLauncher#tryLockJob, 
> this.jobLockOptional.isPresent() is always true and 
> this.jobLockOptional.get().tryLock() is always false, so the job can't be 
> locked and throws 
> gobblin.runtime.JobException: Previous instance of job GobblinKafkaQuickStart 
> is still running, skipping this scheduled run.
> I queried ZooKeeper:
> executing ls /locks/GobblinKafkaQuickStart/leases returns
> [_c_1a81243e-cf64-4839-9187-ef7ee65c729c-lease-0000000001];
> executing ls /locks/GobblinKafkaQuickStart/locks returns []
> But why can't the lock be acquired?
>  
> *Github Url* : 
> https://github.com/linkedin/gobblin/issues/1938#issuecomment-307733856 
> ----
> [~zxc] wrote on 2017-07-12T21:49:48Z : Did you specify `job.lock.type` 
> configuration? Can you provide the job file? It's expected that 
> `this.jobLockOptional.isPresent()` is always true as you're using a lock. The 
> problem is why `this.jobLockOptional.get().tryLock()` always returns false. 
>  
> *Github Url* : 
> https://github.com/linkedin/gobblin/issues/1938#issuecomment-314907460
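
The behaviour discussed in the comments above (a lock that is present but whose tryLock() keeps returning false) can be reproduced in isolation. This is a minimal sketch under assumptions, not Gobblin's actual code: JobLock here is a hypothetical one-method stand-in, java.util.Optional replaces whatever Optional type Gobblin uses, and the "no lock configured means proceed" branch is an assumption about tryLockJob. Only the exception message is taken from the log above.

```java
import java.util.Optional;

// Illustrative stand-in for the lock check; names other than the
// error message are assumptions, not Gobblin's real classes.
public class JobLockSketch {

    /** Hypothetical minimal lock interface. */
    interface JobLock {
        boolean tryLock();
    }

    /** With no lock configured the job proceeds; otherwise tryLock() decides. */
    static boolean tryLockJob(Optional<JobLock> jobLockOptional) {
        if (jobLockOptional.isPresent()) {
            return jobLockOptional.get().tryLock();
        }
        return true;
    }

    /** Returns the message a launcher would surface for the given lock state. */
    static String launch(String jobName, Optional<JobLock> jobLockOptional) {
        if (!tryLockJob(jobLockOptional)) {
            return String.format(
                "Previous instance of job %s is still running, skipping this scheduled run",
                jobName);
        }
        return "launched " + jobName;
    }

    public static void main(String[] args) {
        // A lock that can never be acquired, as observed in the report.
        JobLock neverAcquired = () -> false;
        System.out.println(launch("GobblinKafkaQuickStart", Optional.of(neverAcquired)));
        // No lock configured: the job is allowed to start.
        System.out.println(launch("GobblinKafkaQuickStart", Optional.empty()));
    }
}
```

The sketch only shows why every scheduled run is skipped once tryLock() keeps failing; it does not explain why the ZooKeeper-based lock cannot be acquired in the first place.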



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)
