Grant Henke created KUDU-2895:
---------------------------------

             Summary: Native Apache Atlas Support
                 Key: KUDU-2895
                 URL: https://issues.apache.org/jira/browse/KUDU-2895
             Project: Kudu
          Issue Type: New Feature
            Reporter: Grant Henke


This tracks adding lineage support to Kudu and Apache Atlas. 

A few notes based on some initial research:
 * It probably makes sense to generate a generic lineage file which can be 
consumed by Apache Atlas for lineage.
 ** This avoids the need for Java interaction in the server
 ** This is the approach Impala uses
 ** See ATLAS-3183 and 
[https://impala.apache.org/docs/build3x/html/topics/impala_lineage.html#lineage]
 * Creating lineage entries for table "DDL" initially makes sense
 ** CREATE, ALTER, DELETE
 ** This is what Hbase seems to do: [https://atlas.apache.org/Hook-HBase.html]
 ** "Only the namespace, table and column-family create/update/ delete 
operations are captured by Atlas HBase hook"
 * The need for lineage information by scans in unclear
 ** It would be super fine grained and difficult to interpret.
 ** Instead lineage from other tools doing the scanning would be more 
interpretable (Impala, Spark, etc).



--
This message was sent by Atlassian JIRA
(v7.6.14#76016)

Reply via email to