Thanks. By default, can you explain me why I received the message in wrong
order? Note there are only 9 lines from 1 to 9, but on consumer side their
original order becomes messed up.

~~~sent from my cell phone, sorry if there is any typo

Hans Jespersen <h...@confluent.io> 于 2018年5月26日周六 上午12:16写道:

> If you create a topic with one partition they will be in order.
>
> Alternatively if you publish with the same key for every message they will
> be in the same order even if your topic has more than 1 partition.
>
> Either way above will work for Kafka.
>
> -hans
>
> > On May 25, 2018, at 8:56 PM, Raymond Xie <xie3208...@gmail.com> wrote:
> >
> > Hello,
> >
> > I just started learning Kafka and have the environment setup on my
> > hortonworks sandbox at home vmware.
> >
> > test.csv is what I want the producer to send out:
> >
> > more test1.csv ./kafka-console-producer.sh --broker-list
> > sandbox.hortonworks.com:6667 --topic kafka-topic2
> >
> > 1, abc
> > 2, def
> > ...
> > 8, vwx
> > 9, zzz
> >
> > What I received are all the content of test.csv, however, not in their
> > original order;
> >
> > kafka-console-consumer.sh --zookeeper 192.168.112.129:2181 --topic
> > kafka-topic2
> >
> > 2, def
> > 1, abc
> > ...
> > 9, zzz
> > 8, vwx
> >
> >
> > I read from google that partition could be the feasible solution,
> however,
> > my questions are:
> >
> > 1. for small files like this one, shall I really do the partitioning? how
> > small a partition would be acceptable to ensure the sequence?
> > 2. for big files, each partition could still contain multiple lines, how
> to
> > ensure all the lines in each partition won't get messed up on consumer
> side?
> >
> >
> > I also want to know what is the best practice to process large volume of
> > data through kafka? There should be better way other than console
> command.
> >
> > Thank you very much.
> >
> >
> >
> > *------------------------------------------------*
> > *Sincerely yours,*
> >
> >
> > *Raymond*
>

Reply via email to