[ https://issues.apache.org/jira/browse/TINKERPOP-1380?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15387407#comment-15387407 ]
Daniel Kuppitz edited comment on TINKERPOP-1380 at 7/21/16 9:08 AM: -------------------------------------------------------------------- I inspected the side-effects in my original traversal and they actually look good. But what I found is that {{cap("p")}} emits 2 results. {code} gremlin> g.withComputer().V().emit(cyclicPath().or().not(both())).repeat(both()).until(cyclicPath()). gremlin> aggregate("p").by(path()).cap("p") ==>{[v[0]]=1, [v[1], v[2], v[1]]=1, [v[2], v[1], v[2]]=1} ==>{[v[0]]=1, [v[1], v[2], v[1]]=1, [v[2], v[1], v[2]]=1} {code} By explicitly limiting it to 1 result, {{dedup()}} gives the expected result: {code} gremlin> g.withComputer().V().emit(cyclicPath().or().not(both())).repeat(both()).until(cyclicPath()). gremlin> aggregate("p").by(path()).cap("p").limit(1).unfold().limit(local, 1).map( gremlin> __.as("v").select("p").unfold(). gremlin> filter(unfold().where(eq("v"))). gremlin> unfold().dedup().order().by(id).fold() gremlin> ).dedup() ==>[v[1], v[2]] ==>[v[0]] {code} So it looks like it's more of a {{cap()}} problem. Perhaps we should change the title of this ticket and also split it into 3 separate tickets. We have: * {{dedup()}} sometimes doesn't dedup * some usages if {{dedup()}} lead to exceptions * {{cap}} in OLAP emits 2 results instead of just 1 * side-effects get duplicated I thought about the last one a bit longer and I think it's not a bug. The "problem" is, that the original traversal was already executed; the result is confusing, but actually what should be expected. Maybe {{clone()}} should throw an exception if the traversal was already iterated. was (Author: dkuppitz): I inspected the side-effects in my original traversal and the actually look good. But what I found is that {{cap("p")}} emits 2 results. {code} gremlin> g.withComputer().V().emit(cyclicPath().or().not(both())).repeat(both()).until(cyclicPath()). gremlin> aggregate("p").by(path()).cap("p") ==>{[v[0]]=1, [v[1], v[2], v[1]]=1, [v[2], v[1], v[2]]=1} ==>{[v[0]]=1, [v[1], v[2], v[1]]=1, [v[2], v[1], v[2]]=1} {code} By explicitly limiting it to 1 result, {{dedup()}} gives the expected result: {code} gremlin> g.withComputer().V().emit(cyclicPath().or().not(both())).repeat(both()).until(cyclicPath()). gremlin> aggregate("p").by(path()).cap("p").limit(1).unfold().limit(local, 1).map( gremlin> __.as("v").select("p").unfold(). gremlin> filter(unfold().where(eq("v"))). gremlin> unfold().dedup().order().by(id).fold() gremlin> ).dedup() ==>[v[1], v[2]] ==>[v[0]] {code} So it looks like it's more of a {{cap()}} problem. Perhaps we should change the title of this ticket and also split it into 3 separate tickets. We have: * {{dedup()}} sometimes doesn't dedup * some usages if {{dedup()}} lead to exceptions * {{cap}} in OLAP emits 2 results instead of just 1 * side-effects get duplicated I thought about the last one a bit longer and I think it's not a bug. The "problem" is, that the original traversal was already executed; the result is confusing, but actually what should be expected. Maybe {{clone()}} should throw an exception if the traversal was already iterated. > dedup() doesn't dedup in rare cases > ----------------------------------- > > Key: TINKERPOP-1380 > URL: https://issues.apache.org/jira/browse/TINKERPOP-1380 > Project: TinkerPop > Issue Type: Bug > Components: process > Affects Versions: 3.2.1 > Reporter: Daniel Kuppitz > Fix For: 3.2.2 > > > I stumbled across this issue when I tried to solve a problem on the mailing > list. It seems like a lot of steps need to be involved in order to make it > reproducible. > {code} > gremlin> :set max-iteration 10 > gremlin> > gremlin> g = TinkerFactory.createModern().traversal().withComputer() > ==>graphtraversalsource[tinkergraph[vertices:6 edges:6], graphcomputer] > gremlin> > g.V().repeat(both()).until(cyclicPath()).path().aggregate("x").cap("x").unfold().dedup() > ==>[v[1], v[2], v[1]] > ==>[v[1], v[2], v[1]] > ==>[v[1], v[3], v[1]] > ==>[v[1], v[3], v[1]] > ==>[v[1], v[4], v[1]] > ==>[v[1], v[4], v[1]] > ==>[v[2], v[1], v[2]] > ==>[v[2], v[1], v[2]] > ==>[v[3], v[1], v[3]] > ==>[v[3], v[1], v[3]] > ... > {code} > I can't reproduce it w/o using {{repeat()}}, {{aggregate()}} or {{cap()}}. It > is reproducible without {{path()}} though. And then it even gets a little > worse; check this out: > {code} > gremlin> > g.V().repeat(both()).until(cyclicPath()).aggregate("x").cap("x").unfold().dedup() > ==>v[1] > ==>v[1] > ==>v[2] > ==>v[2] > ==>v[3] > ==>v[3] > ==>v[4] > ==>v[4] > ==>v[5] > ==>v[5] > ... > gremlin> > g.V().repeat(both()).until(cyclicPath()).aggregate("x").cap("x").unfold().dedup().dedup() > java.lang.RuntimeException: java.lang.IllegalStateException: > java.lang.IllegalArgumentException: The memory can only be set() during > vertex program setup and terminate: x > Display stack trace? [yN] > {code} > The exception occurs only in OLAP mode, but also for more meaningful patterns > ({{.dedup().dedup()}} really doesn't make much sense). > For a better / larger example see: > https://groups.google.com/d/msg/gremlin-users/NMXExuvDjt0/ps7bJDYwAQAJ -- This message was sent by Atlassian JIRA (v6.3.4#6332)