> Does that make sense? Sure !
Thanks Mark 2018-01-23 15:13 GMT+01:00 Mark Payne <marka...@hotmail.com>: > Dominique, > > Thats a great question. An important distinction between NiFi's lineage > and Atlas's lineage is that NiFi provides > data-level lineage (i.e., lineage for each individual FlowFile). Atlas > provides "dataset-level" > lineage. So what gets exported to Atlas is not the lineage of each > individual FlowFile > but rather the connection between each source and destination. > > So, for example, if you pull 1,000 FlowFiles from Kafka Topic ABC and push > them to HDFS directory /path/to/my/new/file > and then you pull a million FlowFiles from Kafka Topic XYZ and push them > to HDFS directory /path/to/my/other/file > then what gets exported to Atlas is two separate lineages: a lineage from > Kafka Topic ABC to HDFS directory > /path/to/my/new/file and a lineage from Kafka Topic XYZ to HDFS directory > /path/to/my/other/file. > It will not export this lineage 1 million FlowFiles as a result of having > 1 million FlowFile traverse this lineage. It will > send it only one time. > > Does that make sense? > > Thanks > -Mark > > > > On Jan 23, 2018, at 7:15 AM, Dominique De Vito <ddv36...@gmail.com> > wrote: > > > > Hi, > > > > AFAIK lineage occurs at FlowFile level. > > That is (AFAIU) each FlowFile could have it its own lineage. > > > > So, if Nifi is reading a file as input, with 1 000 records, is Nifi > going to send to Atlas 1 000 lineages? > > > > If yes, does Nifi send these 1 000 lineages (to Atlas) in one call in a > batch way? > > > > Or, these 1 000 lineages to send corespond to 1 000 calls to Atlas ? > > > > Thanks. > > > > Dominique > > > > > > > > > > > >