config trident to emit no batch when no data found in kafka

2015-04-08 Thread 王天驹
hi,
i am using trident, and found that when no data found in kafka, spout still
emit empty batch
can this be turned off?
thx!


Re: Field Grouping question on optimization

2015-04-08 Thread Rajiv Onat
You may want to look at partial key grouping feature to reduce hotspots, cases 
where key cardinality makes a bolt instance straggle. This feature is in 0.9.4

-Rajiv

> On Apr 8, 2015, at 11:52 AM, Kashyap Mhaisekar  wrote:
> 
> Hi,
> My topology is like the following ->
> Spout -> Bolt A -> Bolt B -> Bolt C -> Bolt D
> 
> the groupings between Bolt C -> Bolt D is a field grouping as Bolt D is doing 
> an aggregation while everything else is a shuffleGrouping.
> 
> Use Case:
> If the spout emits 100K tuples such that the emits all are grouped on the 
> same field, then Bolt D will need to take all the load and hence becomes very 
> slow. IN this case, increasing the no. of instances of Bolt D will not help 
> as the grouping is for the instance of Bolt D.
> 
> Question: How can this be optimized?
> 
> Did anyone face such a use case? Please recommend.
> 
> Thanks
> Kashyap
> 


Field Grouping question on optimization

2015-04-08 Thread Kashyap Mhaisekar
Hi,
My topology is like the following ->
*Spout -> Bolt A -> Bolt B -> Bolt C -> Bolt D*

the groupings between* Bolt C -> Bolt D* is a field grouping as Bolt D is
doing an aggregation while everything else is a shuffleGrouping.

Use Case:
If the spout emits 100K tuples such that the emits all are grouped on the
same field, then Bolt D will need to take all the load and hence becomes
very slow. IN this case, increasing the no. of instances of Bolt D will not
help as the grouping is for the instance of Bolt D.

Question: How can this be optimized?

Did anyone face such a use case? Please recommend.

Thanks
Kashyap


Re: Opening connection to Random IPs

2015-04-08 Thread Anil Kumar
Hi Jens, Thanks for your reply.

We are not running supervisor in LXC / Docker containers.

Its running from a physical Linux machine.

-
Anil
On 8 Apr 2015 23:46, "Jens-U. Mozdzen"  wrote:

> Hi Anil,
>
> Zitat von Anil Kumar :
>
>> Hi Guys,
>>
>> We have problem while submitting topology where it is trying to open up
>> client connection to random IPs like 70.127.0.0, 90.127.0.0 etc.
>>
>
> are you running your Storm supervisors within Docker / LXC containers?
>
> Regards,
> Jens
>
>


Re: Opening connection to Random IPs

2015-04-08 Thread Jens-U. Mozdzen

Hi Anil,

Zitat von Anil Kumar :

Hi Guys,

We have problem while submitting topology where it is trying to open up
client connection to random IPs like 70.127.0.0, 90.127.0.0 etc.


are you running your Storm supervisors within Docker / LXC containers?

Regards,
Jens



Opening connection to Random IPs

2015-04-08 Thread Anil Kumar
Hi Guys,

We have problem while submitting topology where it is trying to open up
client connection to random IPs like 70.127.0.0, 90.127.0.0 etc.

Could you please help us in finding out why it is trying to do this ?

Extract from our logs are:

6845 [main-SendThread(90.127.0.0:2000)] INFO
org.apache.storm.zookeeper.ClientCnxn - Opening socket connection to server
90.127.0.0/90.127.0.0:2000. Will n

ot attempt to authenticate using SASL (unknown error)

6851 [main-SendThread(90.127.0.0:2000)] WARN
 org.apache.storm.zookeeper.ClientCnxn - Session 0x0 for server null,
unexpected error, closing socket connecti

on and attempting reconnect

java.net.NoRouteToHostException: No route to host

at sun.nio.ch.SocketChannelImpl.checkConnect(Native Method)
~[na:1.8.0_11]

at
sun.nio.ch.SocketChannelImpl.finishConnect(SocketChannelImpl.java:712)
~[na:1.8.0_11]

at
org.apache.storm.zookeeper.ClientCnxnSocketNIO.doTransport(ClientCnxnSocketNIO.java:361)
~[storm-core-0.9.3.jar:0.9.3]

at
org.apache.storm.zookeeper.ClientCnxn$SendThread.run(ClientCnxn.java:1081)
~[storm-core-0.9.3.jar:0.9.3]

-
Anil