[
https://issues.apache.org/jira/browse/FALCON-623?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14104182#comment-14104182
]
Venkatesh Seetharam commented on FALCON-623:
--------------------------------------------
I think this is handled in terms of storage but oozie needs to be setup to be
aware of both the clusters and this is documented in HiveIntegration page.
Docs:
src/site/twiki/HiveIntegration.twiki
{code}
* Oozie 4.x with Hadoop-2.x
Replication jobs are submitted to oozie on the destination cluster. Oozie runs
a table export job
on RM on source cluster. Oozie server on the target cluster must be configured
with source hadoop
configs else jobs fail with errors on secure and non-secure clusters as below:
<verbatim>
org.apache.hadoop.security.token.SecretManager$InvalidToken: Password not found
for ApplicationAttempt appattempt_1395965672651_0010_000002
</verbatim>
Make sure all oozie servers that falcon talks to has the hadoop configs
configured in oozie-site.xml
<verbatim>
<property>
<name>oozie.service.HadoopAccessorService.hadoop.configurations</name>
<value>*=/etc/hadoop/conf,arpit-new-falcon-1.cs1cloud.internal:8020=/etc/hadoop-1,arpit-new-falcon-1.cs1cloud.internal:8032=/etc/hadoop-1,arpit-new-falcon-2.cs1cloud.internal:8020=/etc/hadoop-2,arpit-new-falcon-2.cs1cloud.internal:8032=/etc/hadoop-2,arpit-new-falcon-5.cs1cloud.internal:8020=/etc/hadoop-3,arpit-new-falcon-5.cs1cloud.internal:8032=/etc/hadoop-3</value>
<description>
Comma separated AUTHORITY=HADOOP_CONF_DIR, where AUTHORITY is the
HOST:PORT of
the Hadoop service (JobTracker, HDFS). The wildcard '*' configuration
is
used when there is no exact match for an authority. The
HADOOP_CONF_DIR contains
the relevant Hadoop *-site.xml files. If the path is relative is
looked within
the Oozie configuration directory; though the path can be absolute
(i.e. to point
to Hadoop client conf/ directories in the local filesystem.
</description>
</property>
</verbatim>
{code}
Code:
org.apache.falcon.oozie.feed.FeedReplicationCoordinatorBuilder#propagateTableCopyProperties
org.apache.falcon.entity.FeedHelper#getStagingPath
> HCat replication fails on table-export
> --------------------------------------
>
> Key: FALCON-623
> URL: https://issues.apache.org/jira/browse/FALCON-623
> Project: Falcon
> Issue Type: Bug
> Components: replication
> Environment: QA
> Reporter: Karishma Gulati
>
> On scheduling a one-source, one-target HCat Replication job, table export
> fails, with error message:
> {code}
> JA008: File does not exist:
> /projects/ivory/staging/falcon/workflows/feed/raaw-logs16-73741e09/1373320570ef25b7d7c1ee474f1f0428_1408529998170/lib/falcon-client-0.6-incubating-SNAPSHOT.jar
> {code}
> Oozie track trace:
> {code}
> 2014-08-20 11:13:01,477 ERROR pool-2-thread-9 UserGroupInformation -
> SERVER[ip-192-168-138-139] PriviledgedActionException as:karishma
> (auth:PROXY) via oozie (auth:SIMPLE) cause:java.io.FileNotFoundException:
> File does not exist:
> /projects/ivory/staging/falcon/workflows/feed/raaw-logs16-105f5895/bfed9c56081276857ce86136475fc7da_1408530730861/lib/falcon-client-0.6-incubating-SNAPSHOT.jar
> 2014-08-20 11:13:01,585 WARN pool-2-thread-9 ActionStartXCommand -
> SERVER[ip-192-168-138-139] USER[karishma] GROUP[-] TOKEN[]
> APP[FALCON_FEED_REPLICATION_raaw-logs16-105f5895]
> JOB[0000078-140813072435213-oozie-oozi-W]
> ACTION[0000078-140813072435213-oozie-oozi-W@table-export] Error starting
> action [table-export]. ErrorType [ERROR], ErrorCode [JA008], Message [JA008:
> File does not exist:
> /projects/ivory/staging/falcon/workflows/feed/raaw-logs16-105f5895/bfed9c56081276857ce86136475fc7da_1408530730861/lib/falcon-client-0.6-incubating-SNAPSHOT.jar]
> org.apache.oozie.action.ActionExecutorException: JA008: File does not exist:
> /projects/ivory/staging/falcon/workflows/feed/raaw-logs16-105f5895/bfed9c56081276857ce86136475fc7da_1408530730861/lib/falcon-client-0.6-incubating-SNAPSHOT.jar
> at
> org.apache.oozie.action.ActionExecutor.convertExceptionHelper(ActionExecutor.java:412)
> at
> org.apache.oozie.action.ActionExecutor.convertException(ActionExecutor.java:396)
> at
> org.apache.oozie.action.hadoop.JavaActionExecutor.submitLauncher(JavaActionExecutor.java:930)
> at
> org.apache.oozie.action.hadoop.JavaActionExecutor.start(JavaActionExecutor.java:1085)
> at
> org.apache.oozie.command.wf.ActionStartXCommand.execute(ActionStartXCommand.java:228)
> at
> org.apache.oozie.command.wf.ActionStartXCommand.execute(ActionStartXCommand.java:63)
> at org.apache.oozie.command.XCommand.call(XCommand.java:283)
> at org.apache.oozie.command.XCommand.call(XCommand.java:352)
> at
> org.apache.oozie.command.wf.SignalXCommand.execute(SignalXCommand.java:395)
> at org.apache.oozie.command.wf.SignalXCommand.execute(SignalXCommand.java:73)
> at org.apache.oozie.command.XCommand.call(XCommand.java:283)
> at org.apache.oozie.command.XCommand.call(XCommand.java:352)
> at
> org.apache.oozie.command.wf.ActionEndXCommand.execute(ActionEndXCommand.java:273)
> at
> org.apache.oozie.command.wf.ActionEndXCommand.execute(ActionEndXCommand.java:60)
> at org.apache.oozie.command.XCommand.call(XCommand.java:283)
> at org.apache.oozie.command.XCommand.call(XCommand.java:352)
> at
> org.apache.oozie.command.wf.ActionCheckXCommand.execute(ActionCheckXCommand.java:241)
> at
> org.apache.oozie.command.wf.ActionCheckXCommand.execute(ActionCheckXCommand.java:55)
> at org.apache.oozie.command.XCommand.call(XCommand.java:283)
> at
> org.apache.oozie.service.CallableQueueService$CallableWrapper.run(CallableQueueService.java:174)
> at
> java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1146)
> at
> java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:615)
> at java.lang.Thread.run(Thread.java:701)
> Caused by: java.io.FileNotFoundException: File does not exist:
> /projects/ivory/staging/falcon/workflows/feed/raaw-logs16-105f5895/bfed9c56081276857ce86136475fc7da_1408530730861/lib/falcon-client-0.6-incubating-SNAPSHOT.jar
> at
> org.apache.hadoop.hdfs.DistributedFileSystem.getFileStatus(DistributedFileSystem.java:824)
> at
> org.apache.hadoop.filecache.DistributedCache.getFileStatus(DistributedCache.java:185)
> at
> org.apache.hadoop.filecache.TrackerDistributedCacheManager.determineTimestamps(TrackerDistributedCacheManager.java:821)
> at
> org.apache.hadoop.filecache.TrackerDistributedCacheManager.determineTimestampsAndCacheVisibilities(TrackerDistributedCacheManager.java:778)
> at
> org.apache.hadoop.mapred.JobClient.copyAndConfigureFiles(JobClient.java:852)
> at
> org.apache.hadoop.mapred.JobClient.copyAndConfigureFiles(JobClient.java:743)
> at org.apache.hadoop.mapred.JobClient.access$400(JobClient.java:174)
> at org.apache.hadoop.mapred.JobClient$2.run(JobClient.java:960)
> at org.apache.hadoop.mapred.JobClient$2.run(JobClient.java:945)
> at java.security.AccessController.doPrivileged(Native Method)
> at javax.security.auth.Subject.doAs(Subject.java:416)
> at
> org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1408)
> at
> org.apache.hadoop.mapred.JobClient.submitJobInternal(JobClient.java:945)
> at org.apache.hadoop.mapred.JobClient.submitJob(JobClient.java:919)
> at
> org.apache.oozie.action.hadoop.JavaActionExecutor.submitLauncher(JavaActionExecutor.java:915)
> ... 20 more
> {code}
> I set up falcon in distributed mode, using different clusters for source and
> target.
--
This message was sent by Atlassian JIRA
(v6.2#6252)