[GitHub] nifi pull request #2435: NIFI-4818: Fix transit URL parsing at Hive2JDBC and...

2018-02-07 Thread asfgit
Github user asfgit closed the pull request at:

https://github.com/apache/nifi/pull/2435


---


[GitHub] nifi pull request #2435: NIFI-4818: Fix transit URL parsing at Hive2JDBC and...

2018-01-25 Thread ijokarumawak
Github user ijokarumawak commented on a diff in the pull request:

https://github.com/apache/nifi/pull/2435#discussion_r164017503
  
--- Diff: nifi-nar-bundles/nifi-atlas-bundle/nifi-atlas-reporting-task/src/main/java/org/apache/nifi/atlas/NiFiAtlasHook.java ---
@@ -255,7 +255,11 @@ public void commitMessages() {
 }
 return new Tuple<>(refQualifiedName, typedQualifiedNameToRef.get(toTypedQualifiedName(typeName, refQualifiedName)));
 }).filter(Objects::nonNull).filter(tuple -> tuple.getValue() != null)
-.collect(Collectors.toMap(Tuple::getKey, Tuple::getValue));
+// If duplication happens, use new value.
+.collect(Collectors.toMap(Tuple::getKey, Tuple::getValue, (oldValue, newValue) -> {
+logger.warn("Duplicated qualified name was found, use the new one. oldValue={}, newValue={}", new Object[]{oldValue, newValue});
+return newValue;
+}));
--- End diff --

While I was testing, I got the following exception:
```
2018-01-25 05:06:41,430 ERROR [Timer-Driven Process Thread-1] 
o.a.n.a.reporting.ReportLineageToAtlas 
ReportLineageToAtlas[id=057986ae-0161-1000-d0b0-1b890a17f5aa] Error running 
task ReportLineageToAtlas[id=057986ae-0161-1000-d0b0-1b890a17f5aa] due to 
java.lang.IllegalStateException: Duplicate key {Id='(type: fs_path, id: 
69be7a40-4ff8-4c4e-b714-2d394c14398d)', traits=[], values={}} NiFiAtlasHook.258
```
The exception means that an existing nifi_flow_path entity has more than one
entry in its inputs or outputs attribute pointing to the same entity, i.e. 
entries with an identical qualified name. This happened because I was using an 
old test environment that holds data created before the Atlas integration 
implemented de-duplication logic. However, it is safer to handle such 
duplication in case it occurs for some other reason.
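
For illustration, here is a minimal, self-contained sketch of the behaviour the diff relies on. It is not the NiFiAtlasHook code: the class name `ToMapMergeSketch`, the helper names, and the sample keys and values are hypothetical, and `Map.Entry` stands in for NiFi's `Tuple`. The two-argument `Collectors.toMap` throws `IllegalStateException` on a duplicate key, while the three-argument overload resolves the collision with a merge function.

```java
import java.util.AbstractMap.SimpleEntry;
import java.util.ArrayList;
import java.util.List;
import java.util.Map;
import java.util.stream.Collectors;

public class ToMapMergeSketch {

    // Two-argument toMap: a duplicate key makes the collector throw IllegalStateException.
    static Map<String, String> strict(List<Map.Entry<String, String>> entries) {
        return entries.stream()
                .collect(Collectors.toMap(Map.Entry::getKey, Map.Entry::getValue));
    }

    // Three-argument toMap: the merge function decides which value survives.
    // Here the newer value wins, mirroring the "use new value" choice in the diff.
    static Map<String, String> keepNewest(List<Map.Entry<String, String>> entries) {
        return entries.stream()
                .collect(Collectors.toMap(Map.Entry::getKey, Map.Entry::getValue,
                        (oldValue, newValue) -> newValue));
    }

    public static void main(String[] args) {
        // Hypothetical data: two references sharing one qualified name.
        List<Map.Entry<String, String>> refs = new ArrayList<>();
        refs.add(new SimpleEntry<>("/tmp/data@cluster", "ref-old"));
        refs.add(new SimpleEntry<>("/tmp/data@cluster", "ref-new"));

        try {
            strict(refs);
        } catch (IllegalStateException e) {
            System.out.println("strict collector failed: " + e.getMessage());
        }
        System.out.println("merged: " + keepNewest(refs));
    }
}
```

Keeping the newer value is one possible policy; keeping the older one, or failing with a more descriptive message, would be equally valid merge functions.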


---


[GitHub] nifi pull request #2435: NIFI-4818: Fix transit URL parsing at Hive2JDBC and...

2018-01-25 Thread ijokarumawak
GitHub user ijokarumawak opened a pull request:

https://github.com/apache/nifi/pull/2435

NIFI-4818: Fix transit URL parsing at Hive2JDBC and KafkaTopic for ReportLineageToAtlas

- Hive2JDBC: Handle connection parameters and multiple host entries
correctly
- KafkaTopic: Handle multiple host entries correctly
- Avoid potential "IllegalStateException: Duplicate key" exception
when NiFiAtlasHook analyzes existing NiFiFlowPath input/output entries
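
As a rough illustration of the first item, the sketch below splits a HiveServer2 JDBC URL of the general form `jdbc:hive2://host1:port1,host2:port2/dbName;sess_var_list?hive_conf_list#hive_var_list` into its host entries and database name. It is not the code in this PR: the class `Hive2UrlSketch` and its methods are hypothetical, and falling back to the `default` database when none is given is an assumption of this sketch.

```java
import java.util.ArrayList;
import java.util.List;

public class Hive2UrlSketch {

    private static final String PREFIX = "jdbc:hive2://";

    // Drops the parameter sections, which begin at the first ';', '?' or '#'.
    // This is done before looking for '/', because parameter values
    // (for example a Kerberos principal) may themselves contain '/'.
    static String withoutParameters(String s) {
        int end = s.length();
        for (char c : new char[] {';', '?', '#'}) {
            int i = s.indexOf(c);
            if (i >= 0 && i < end) {
                end = i;
            }
        }
        return s.substring(0, end);
    }

    // Returns the "hosts[/db]" portion of the URL with parameters removed.
    private static String body(String url) {
        if (!url.startsWith(PREFIX)) {
            throw new IllegalArgumentException("Not a hive2 JDBC URL: " + url);
        }
        return withoutParameters(url.substring(PREFIX.length()));
    }

    // Returns every comma-separated host entry, e.g. ["h1:10000", "h2:10000"].
    static List<String> hosts(String url) {
        String body = body(url);
        int slash = body.indexOf('/');
        String hostPart = slash < 0 ? body : body.substring(0, slash);
        List<String> result = new ArrayList<>();
        for (String host : hostPart.split(",")) {
            String trimmed = host.trim();
            if (!trimmed.isEmpty()) {
                result.add(trimmed);
            }
        }
        return result;
    }

    // Returns the database name, or "default" (an assumption) when the URL has none.
    static String database(String url) {
        String body = body(url);
        int slash = body.indexOf('/');
        String db = slash < 0 ? "" : body.substring(slash + 1);
        return db.isEmpty() ? "default" : db;
    }

    public static void main(String[] args) {
        String url = "jdbc:hive2://h1:10000,h2:10000/tpch;principal=hive/_HOST@EXAMPLE.COM";
        System.out.println(hosts(url) + " " + database(url));
    }
}
```

A comma-separated Kafka broker list could be split in the same way as the host part here.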

Thank you for submitting a contribution to Apache NiFi.

In order to streamline the review of the contribution we ask you
to ensure the following steps have been taken:

### For all changes:
- [x] Is there a JIRA ticket associated with this PR? Is it referenced 
 in the commit message?

- [x] Does your PR title start with NIFI-XXXX where XXXX is the JIRA number 
you are trying to resolve? Pay particular attention to the hyphen "-" character.

- [x] Has your PR been rebased against the latest commit within the target 
branch (typically master)?

- [x] Is your initial contribution a single, squashed commit?

### For code changes:
- [ ] Have you ensured that the full suite of tests is executed via mvn 
-Pcontrib-check clean install at the root nifi folder?
- [x] Have you written or updated unit tests to verify your changes?
- [ ] If adding new dependencies to the code, are these dependencies 
licensed in a way that is compatible for inclusion under [ASF 
2.0](http://www.apache.org/legal/resolved.html#category-a)? 
- [ ] If applicable, have you updated the LICENSE file, including the main 
LICENSE file under nifi-assembly?
- [ ] If applicable, have you updated the NOTICE file, including the main 
NOTICE file found under nifi-assembly?
- [ ] If adding new Properties, have you added .displayName in addition to 
.name (programmatic access) for each of the new properties?

### For documentation related changes:
- [ ] Have you ensured that format looks appropriate for the output in 
which it is rendered?

### Note:
Please ensure that once the PR is submitted, you check travis-ci for build 
issues and submit an update to your PR as soon as possible.


You can merge this pull request into a Git repository by running:

$ git pull https://github.com/ijokarumawak/nifi nifi-4818

Alternatively you can review and apply these changes as the patch at:

https://github.com/apache/nifi/pull/2435.patch

To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:

This closes #2435


commit 91ee885398128e796918a6fc6b98bdf442c5ebf1
Author: Koji Kawamura 
Date:   2018-01-25T04:57:01Z

NIFI-4818: Fix transit URL parsing at Hive2JDBC and KafkaTopic for 
ReportLineageToAtlas

- Hive2JDBC: Handle connection parameters and multiple host entries
correctly
- KafkaTopic: Handle multiple host entries correctly
- Avoid potential "IllegalStateException: Duplicate key" exception
when NiFiAtlasHook analyzes existing NiFiFlowPath input/output entries




---