Hi Guozhang,

Yes, it does seem to be fixed in trunk.  Thanks.  I should have tried that,
but I assumed that the recently released 0.10.0.1 would be pretty close to
trunk.  I can see where that was mistaken, since 0.10.0 is quite divergent
from trunk.

Mathieu


On Wed, Aug 10, 2016 at 2:39 PM, Guozhang Wang <wangg...@gmail.com> wrote:

> Hello Mathieu,
>
> I think this issue is fixed in trunk but may get missed in the 0.10.0
> branch, could you try running your program from trunk to verify if it is
> the case? If yes we can consider backportting the hotfix from trunk to
> 0.10.0 and have another bug fix release.
>
>
> Guozhang
>
> On Wed, Aug 10, 2016 at 7:32 AM, Mathieu Fenniak <
> mathieu.fenn...@replicon.com> wrote:
>
> > Hey there, Kafka Users,
> >
> > I'm trying to join two topics with Kafka Streams.  The first topic is a
> > changelog of one object, and the second is a changelog of a related
> > object.  In order to join these tables, I'm grouping the second table by
> a
> > piece of data in it that indicates what record it is related to in the
> > first table.  But I'm getting an unexpected error related to the
> > repartitioning topic for the aggregated table:
> >
> > org.apache.kafka.streams.errors.TopologyBuilderException: Invalid
> topology
> > building: External source topic not found:
> > *TableNumber2Aggregated-repartition*
> > at
> > org.apache.kafka.streams.processor.internals.StreamPartitionAssignor.
> > ensureCopartitioning(StreamPartitionAssignor.java:452)
> > at
> > org.apache.kafka.streams.processor.internals.StreamPartitionAssignor.
> > ensureCopartitioning(StreamPartitionAssignor.java:440)
> >
> >
> > (Full exception:
> > https://gist.github.com/mfenniak/11ca081191932fbb33a0c3cc32ad1686)
> >
> > It appears that the "TableNumber2Aggregated-repartition" topic *is*
> > created
> > in Kafka by the streams application, but the Kafka topic has a prefix
> that
> > matches my application id (timesheet-status).  Perhaps something is
> > prefixing the topic name, but it isn't being applied everywhere?
> >
> > $ ./kafka-topics.sh --zookeeper localhost --list
> > TableNumber1
> > TableNumber2
> > __consumer_offsets
> > timesheet-status-TableNumber2Aggregated-repartition
> >
> >
> > Here's a sample that reproduces the issue (note, I've cut out all the
> > actual mapping, grouping, and aggregating logic, but, this still
> reproduces
> > the error):
> >
> >     public static TopologyBuilder createTopology() {
> >         KStreamBuilder builder = new KStreamBuilder();
> >
> >         KTable table1Mapped = builder.table(Serdes.String(), new
> > JsonSerde(Map.class), "TableNumber1")
> >                 .mapValues((value) -> null);
> >
> >         KTable table2Aggregated = builder.table(Serdes.String(), new
> > JsonSerde(Map.class), "TableNumber2")
> >             .groupBy((key, value) -> null)
> >             .aggregate(() -> null, (k, v, t) -> null, (k, v, t) -> null,
> > new JsonSerde(Map.class), "TableNumber2Aggregated");
> >
> >         table1Mapped.join(table2Aggregated, (left, right) -> {
> >             LOG.debug("join");
> >             return null;
> >         });
> >
> >         return builder;
> >     }
> >
> > I'm using the latest Kafka Streams release, 0.10.0.1.  Any thoughts on
> how
> > I could proceed to debug or workaround this?
> >
> > Thanks all,
> >
> > Mathieu
> >
>
>
>
> --
> -- Guozhang
>

Reply via email to