Nick,
if you want to scale, you need to enable RSS (or use the zbalancer) and run 
one thread per queue.
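
As a rough illustration of the thread-per-queue idea (not PF_RING code), the sketch below hashes each packet's 5-tuple to pick a queue, so all packets of a flow reach the same worker. The names queue_for_packet and spread are hypothetical. CRC32 is only a stand-in for the NIC hash (hardware RSS typically uses a Toeplitz hash), and sorting the endpoints to make the hash direction-independent is an assumption made here for convenience.

```python
import zlib
from collections import defaultdict


def queue_for_packet(src_ip, src_port, dst_ip, dst_port, proto, num_queues):
    """Pick a queue index from a packet's 5-tuple (illustrative only)."""
    # Order the two endpoints so both directions of a flow map to the same queue.
    lo, hi = sorted([(src_ip, src_port), (dst_ip, dst_port)])
    key = f"{lo[0]}:{lo[1]}-{hi[0]}:{hi[1]}-{proto}".encode()
    # CRC32 is a stand-in for the NIC hash; real RSS is computed in hardware.
    return zlib.crc32(key) % num_queues


def spread(packets, num_queues):
    """Group packets by queue index; each queue would get its own thread."""
    queues = defaultdict(list)
    for pkt in packets:
        queues[queue_for_packet(*pkt, num_queues)].append(pkt)
    return queues
```

Calling spread(packets, 4) with (src_ip, src_port, dst_ip, dst_port, proto) tuples returns a mapping from queue index to the packets assigned to it. Because the split is computed per packet from its own header fields, no single dispatcher thread has to see every packet.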

Luca

> On 03 Sep 2015, at 18:24, Nick Allen <[email protected]> wrote:
> 
> I have a similar need.  I need to ingest 40+ Gbps into a Hadoop grid.  Kafka 
> is acting as my landing zone/front door for the grid.  
> 
> I tried many variations of using tcpdump, Flume, and other concoctions.  I 
> ended up building a custom pcap ingest process in C.  The app uses PF_RING ZC 
> to load balance packets across multiple threads.  I then push the packet data 
> into Kafka using librdkafka.  Both the pull from PF_RING and the push to 
> Kafka batch many packets at a time (trading latency for throughput).
> 
> With the minimal tuning that I have done, it can handle roughly 10-12 Gbps.  
> I only need to achieve 10 Gbps on a single host and then I am going to scale 
> horizontally to manage the aggregate pcap that I need to capture.
> 
> Right now, the bottleneck is the master thread in PF_RING that dispatches 
> packets off to each worker thread.  That thread pegs a single CPU core (a 
> rather beefy core, I might add).  It does not seem capable of handling 
> additional worker threads to scale beyond 10-12 Gbps.
> 
> I wish I had access to the source to review and confirm, but that is how it 
> appears with the information that I have.
> 
> On Thu, Sep 3, 2015 at 11:46 AM, Manny Veloso <[email protected]> wrote:
> Also, when you say 1k flows per second, is that 1k devices reporting their 
> flows every second? We’d need two to three orders of magnitude more 
> performance.
> -- 
> Manny Veloso
> Sr. Solutions Engineer
> Smartrg.com
> 
> From: <[email protected]> on behalf of Luca Deri <[email protected]>
> Reply-To: "[email protected]" <[email protected]>
> Date: Tuesday, September 1, 2015 at 10:52 PM
> To: "[email protected]" <[email protected]>
> Subject: Re: [Ntop-misc] nprobe and kafka?
> 
> Manny,
> we have added Kafka support to one of our development prototypes, so moving 
> it to the official nprobe should not be too difficult. The performance is 
> similar to the ZMQ or Elasticsearch implementation, so, considering the JSON 
> conversion, it is at least 1k flows/sec.
> 
> Luca
> 
>> On 01 Sep 2015, at 23:20, Manny Veloso <[email protected]> wrote:
>> 
>> Hi!
>> 
>> I’m looking to use nprobe as a bridge into Kafka. In the Splunk app, nprobe 
>> just sends data into Splunk. Is that basically the same configuration as a 
>> Kafka install?
>> 
>> Also, what kind of throughput can I expect out of nprobe? 
>> -- 
>> Manny Veloso
>> Sr. Solutions Engineer
>> Smartrg.com <http://smartrg.com/>
>> _______________________________________________
>> Ntop-misc mailing list
>> [email protected]
>> http://listgateway.unipi.it/mailman/listinfo/ntop-misc
> 
> -- 
> Nick Allen <[email protected]>

