[ https://issues.apache.org/jira/browse/GOBBLIN-587?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Zhixiong Chen updated GOBBLIN-587:
----------------------------------
    Description:
Currently, gobblin lineage is sent at dataset level. The task is to send partition level lineage for fs sink. An example kafka-hdfs partition lineage is
{code:java}
{
  "timestamp": 1536785248451,
  "namespace": {
    "string": "gobblin.event.lineage"
  },
  "name": "LoginEvent",
  "metadata": {
    "destination": "{\"object-type\":\"org.apache.gobblin.dataset.PartitionDescriptor\",\"object-data\":{\"dataset\":{\"object-type\":\"org.apache.gobblin.dataset.DatasetDescriptor\",\"object-data\":{\"platform\":\"hdfs\",\"metadata\":{\"branch\":\"0\"},\"name\":\"/data/tracking/LoginEvent\"}},\"name\":\"hourly/2018/09/12/12\"}}",
    "eventType": "LineageEvent",
    "source": "{\"object-type\":\"org.apache.gobblin.dataset.DatasetDescriptor\",\"object-data\":{\"platform\":\"kafka\",\"metadata\":{},\"name\":\"LoginEvent\"}}",
    "metricContextName": "org.apache.gobblin.runtime.SafeDatasetCommit.1693032310",
    "metricContextID": "1a7895b0-9e93-414e-ac0b-038f9375c82e",
    "class": "org.apache.gobblin.runtime.SafeDatasetCommit"
  }
}
{code}
Note: Lineage is not available automatically. You might have to implement the support in your source-destination pair.

  was:
Currently, gobblin lineage is sent at dataset level. The task is to send partition level lineage for fs sink. An example kafka-hdfs partition lineage is
{code:java}
{
  "timestamp": 1536785248451,
  "namespace": {
    "string": "gobblin.event.lineage"
  },
  "name": "LoginEvent",
  "metadata": {
    "destination": "{\"object-type\":\"org.apache.gobblin.dataset.PartitionDescriptor\",\"object-data\":{\"dataset\":{\"object-type\":\"org.apache.gobblin.dataset.DatasetDescriptor\",\"object-data\":{\"platform\":\"hdfs\",\"metadata\":{\"branch\":\"0\"},\"name\":\"/tmp/zhchen/data/tracking/LoginEvent\"}},\"name\":\"hourly/2018/09/12/12\"}}",
    "eventType": "LineageEvent",
    "source": "{\"object-type\":\"org.apache.gobblin.dataset.DatasetDescriptor\",\"object-data\":{\"platform\":\"kafka\",\"metadata\":{},\"name\":\"LoginEvent\"}}",
    "metricContextName": "org.apache.gobblin.runtime.SafeDatasetCommit.1693032310",
    "metricContextID": "1a7895b0-9e93-414e-ac0b-038f9375c82e",
    "class": "org.apache.gobblin.runtime.SafeDatasetCommit"
  }
}
{code}


> Implement gobblin fs sink partition level lineage
> -------------------------------------------------
>
>                 Key: GOBBLIN-587
>                 URL: https://issues.apache.org/jira/browse/GOBBLIN-587
>             Project: Apache Gobblin
>          Issue Type: Task
>            Reporter: Zhixiong Chen
>            Assignee: Zhixiong Chen
>            Priority: Major
>
> Currently, gobblin lineage is sent at dataset level. The task is to send
> partition level lineage for fs sink. An example kafka-hdfs partition lineage
> is
> {code:java}
> {
>   "timestamp": 1536785248451,
>   "namespace": {
>     "string": "gobblin.event.lineage"
>   },
>   "name": "LoginEvent",
>   "metadata": {
>     "destination":
> "{\"object-type\":\"org.apache.gobblin.dataset.PartitionDescriptor\",\"object-data\":{\"dataset\":{\"object-type\":\"org.apache.gobblin.dataset.DatasetDescriptor\",\"object-data\":{\"platform\":\"hdfs\",\"metadata\":{\"branch\":\"0\"},\"name\":\"/data/tracking/LoginEvent\"}},\"name\":\"hourly/2018/09/12/12\"}}",
>     "eventType": "LineageEvent",
>     "source":
> "{\"object-type\":\"org.apache.gobblin.dataset.DatasetDescriptor\",\"object-data\":{\"platform\":\"kafka\",\"metadata\":{},\"name\":\"LoginEvent\"}}",
>     "metricContextName":
> "org.apache.gobblin.runtime.SafeDatasetCommit.1693032310",
>     "metricContextID": "1a7895b0-9e93-414e-ac0b-038f9375c82e",
>     "class": "org.apache.gobblin.runtime.SafeDatasetCommit"
>   }
> }
> {code}
> Note: Lineage is not available automatically. You might have to implement the
> support in your source-destination pair.
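
In the example event, the "source" and "destination" values are themselves JSON documents encoded as strings, and the partition descriptor wraps a dataset descriptor. A minimal sketch of how a consumer might unpack this, assuming the event arrives as strict JSON (the trailing comma after "class" removed) and trimmed to the fields used here. The helper names decode_descriptor and summarize_lineage are hypothetical and not part of Gobblin; Python is used only for brevity.

```python
import json

# Trimmed copy of the example event from the description. The raw string
# keeps the backslash escapes so the nested descriptors stay JSON strings.
EVENT = r'''
{
  "timestamp": 1536785248451,
  "namespace": {"string": "gobblin.event.lineage"},
  "name": "LoginEvent",
  "metadata": {
    "destination": "{\"object-type\":\"org.apache.gobblin.dataset.PartitionDescriptor\",\"object-data\":{\"dataset\":{\"object-type\":\"org.apache.gobblin.dataset.DatasetDescriptor\",\"object-data\":{\"platform\":\"hdfs\",\"metadata\":{\"branch\":\"0\"},\"name\":\"/data/tracking/LoginEvent\"}},\"name\":\"hourly/2018/09/12/12\"}}",
    "eventType": "LineageEvent",
    "source": "{\"object-type\":\"org.apache.gobblin.dataset.DatasetDescriptor\",\"object-data\":{\"platform\":\"kafka\",\"metadata\":{},\"name\":\"LoginEvent\"}}"
  }
}
'''


def decode_descriptor(raw):
    """Decode one string-encoded descriptor into a flat summary dict.

    Hypothetical helper: it relies only on the field layout visible in the
    example event, not on any Gobblin API.
    """
    descriptor = json.loads(raw)  # second decoding pass: string -> object
    data = descriptor["object-data"]
    if descriptor["object-type"].endswith("PartitionDescriptor"):
        # A partition descriptor nests its parent dataset descriptor.
        dataset = data["dataset"]["object-data"]
        return {
            "platform": dataset["platform"],
            "dataset": dataset["name"],
            "partition": data["name"],
        }
    # Plain dataset descriptor: no partition information.
    return {"platform": data["platform"], "dataset": data["name"], "partition": None}


def summarize_lineage(event_json):
    """Return (source, destination) summaries for one lineage event."""
    metadata = json.loads(event_json)["metadata"]  # first decoding pass
    return decode_descriptor(metadata["source"]), decode_descriptor(metadata["destination"])


source, destination = summarize_lineage(EVENT)
print(source)
print(destination)
```

With dataset-level lineage only, the destination would decode like the source does here, with no partition; the partition name is the extra piece this task adds for the fs sink.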