----------------------------------------------------------- This is an automatically generated e-mail. To reply, visit: https://reviews.apache.org/r/70462/ -----------------------------------------------------------
(Updated April 12, 2019, 12:39 a.m.) Review request for atlas, Ashutosh Mestry, Sridhar K, Le Ma, Madhan Neethiraj, and Sarath Subramanian. Changes ------- Adding the Atlas JIRA to the review Summary (updated) ----------------- ATLAS-3133 : Adding support for Process Executions in Atlas Bugs: ATLAS-3133 https://issues.apache.org/jira/browse/ATLAS-3133 Repository: atlas Description ------- Adding support for Process Executions in Atlas Diffs (updated) ----- addons/hive-bridge/src/main/java/org/apache/atlas/hive/hook/events/BaseHiveEvent.java 31346d0954140cd8bda690dc9079e0913f7b9d7d addons/hive-bridge/src/main/java/org/apache/atlas/hive/hook/events/CreateHiveProcess.java d61f1d74e3238e0a7474de67c0400c108d8919ea addons/hive-bridge/src/main/java/org/apache/atlas/hive/hook/events/CreateTable.java 674a89f6e4852dc30c29c5681854ec3ba8611f35 addons/hive-bridge/src/main/java/org/apache/atlas/hive/model/HiveDataTypes.java 94010d0cb1a7a5c48b71f6d77c5e1a8f5cfcf013 addons/hive-bridge/src/test/java/org/apache/atlas/hive/HiveITBase.java 002b90839f78dc843b5aca56042c3decd299bed8 addons/hive-bridge/src/test/java/org/apache/atlas/hive/hook/HiveHookIT.java 142e35561fff42f7672c5b5136af1173049580b3 addons/models/0000-Area0/0010-base_model.json 8b41ee89f3a7288bb4cdad3df6887242b40b68d2 addons/models/1000-Hadoop/1030-hive_model.json e96443382a587411d1207d6e1157ea65350cbdba Diff: https://reviews.apache.org/r/70462/diff/2/ Changes: https://reviews.apache.org/r/70462/diff/1-2/ Testing ------- We want to add support for Process Executions in Atlas. With the help of process executions attributes specific to each execution(like startTime, endTime, queryText) will be captured in the execution object. With the current model of Atlas this is lost since each execution overrides the previous one. To solve this problem. We are creating 2 new entity definitions and one relations definition. We create an entity definition called a process execution and another one called hive process execution. The hive process execution has all the attributes which are specific to each execution. We also create a relationship defintion between a hive process and a hive process execution as a many to one relation. The lineage does not get affected in this since the inputs and outputs are still attached to the hive process. Hive Process can now be thought of as a grouping of multiple executions. There is no lineage shown for a hive process execution. The criteria for grouping is based on the qualifie dName of a Hive Process. As long as the qualified Name of a hive process remains the same, each execution gets mapped to the same hive process. Thanks, Aadarsh Jajodia