[ 
https://issues.apache.org/jira/browse/FALCON-93?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13791301#comment-13791301
 ] 

Srikanth Sundarrajan commented on FALCON-93:
--------------------------------------------

Can prefix be "falcon.source." & "falcon.target." instead of just source & 
target?
{code}
+                    propagateStorageProperties(srcCluster, (CatalogStorage) 
sourceStorage, props, "source");
+                    propagateStorageProperties(trgCluster, (CatalogStorage) 
targetStorage, props, "target");
{code}

Looks like all tables go through the same export path. Can export & import be 
avoided for external tables or does the export/import already take care of the 
fact that table is external and allow you to short-circuit this?
{code}
+    <decision name="replication-decision">
+        <switch>
+            <case to="table-export">
+                ${feedStorageType == "TABLE"}
+            </case>
+            <default to="replication"/>
+        </switch>
+    </decision>
{code}

Seems to be using distcp-v1. This is not desirable.
{code}
+    <!-- Table Replication - Import data and metadata from HDFS Staging into 
Target Hive -->
+    <action name="table-replication">
+        <distcp xmlns="uri:oozie:distcp-action:0.1">
+            <job-tracker>${targetJobTracker}</job-tracker>
+            <name-node>${targetNameNode}</name-node>
+            <configuration>
+                <property>
+                    <name>mapred.job.queue.name</name>
+                    <value>${queueName}</value>
+                </property>
+            </configuration>
+            <arg>${sourceStagingDir}/${nominalTime}</arg>
+            <arg>${targetStagingDir}/${nominalTime}</arg>
+        </distcp>
+        <ok to="table-import"/>
+        <error to="fail"/>
+    </action>
{code}

Looks like scenario where data from multiple sources each owning a partition 
getting merged in the target cluster isn't implemented, as the export need to 
be specific to the partition against each of the source cluster. Please confirm.

> Replication to handle hive table replication
> --------------------------------------------
>
>                 Key: FALCON-93
>                 URL: https://issues.apache.org/jira/browse/FALCON-93
>             Project: Falcon
>          Issue Type: Sub-task
>    Affects Versions: 0.3
>            Reporter: Venkatesh Seetharam
>            Assignee: Venkatesh Seetharam
>         Attachments: FALCON-93.patch, FALCON-93-r1.patch
>
>
> Data and metadata to be replicated atomically.



--
This message was sent by Atlassian JIRA
(v6.1#6144)

Reply via email to