Yes thats bad approach . Mostly users keep a static string for the “id” part in 
the spoutConfig. Whats the need to use randomUUID.

-- 
Harsha


On March 9, 2015 at 11:09:46 PM, Tousif (tousif.pa...@gmail.com) wrote:

Thanks Harsha,

Does zkRoot in the spoutconfig is used along with random string to create path 
in Zookeeper?

SpoutConfig(hosts, PropertyManager.getProperty("kafka.spout.topic").toString(), 
"/kafkastormroot", UUID.randomUUID().toString());

 zkroot  is prepended to the random id we give to create the unique place in ZK 
 where consumption state is stored.  If this is true than every time we deploy 
our topology it will create a unique path ? 

How will it get previous offset?


On Mon, Mar 9, 2015 at 8:36 PM, Harsha <st...@harsha.io> wrote:
If your topology has saved Kafka offset in your zookeeper it will start 
processing from that otherwise It checks spoutConfig.forceFromStart set to true 
in this case it will try to fetch data from the beginning of the queue i.e 
kafka.api.EarliestTime() . If none of the above matches it will pick users 
spoutConfig.startOffsetTime.

"kafka.api.OffsetRequest.EarliestTime() finds the beginning of the data in the 
logs and starts streaming from there, kafka.api.OffsetRequest.LatestTime() will 
only stream new messages.”

If you stopping and re-deploying the topology make sure you used the same name 
as KafkaSpout uses topology name to store and retrieve the offsets from 
zookeeper.

-- 
Harsha



On March 9, 2015 at 7:30:38 AM, Tousif (tousif.pa...@gmail.com) wrote:

If your topology has saved Kafka offset in your zookeeper it will start 
processing from that otherwise It checks spoutConfig.forceFromStart set to true 
in this case it will try to fetch data from the beginning of the queue i.e 
kafka.api.EarliestTime() . If none of the above matches it will pick users 
spoutConfig.startOffsetTime.

"kafka.api.OffsetRequest.EarliestTime() finds the beginning of the data in the 
logs and starts streaming from there, kafka.api.OffsetRequest.LatestTime() will 
only stream new messages.”

If you stopping and re-deploying the topology make sure you used the same name 
as KafkaSpout uses topology name to store and retrieve the offsets from 
zookeeper.

-- 
Harsha



--


Regards
Tousif Khazi

Reply via email to