Re: [heka] TcpOutput and buffer

Etienne Champetier Thu, 16 Apr 2015 11:14:52 -0700

2015-04-16 19:34 GMT+02:00 Rob Miller <[email protected]>:

> On 04/16/2015 04:48 AM, Etienne Champetier wrote:
>
>> Hi,
>>
>> I'm using heka to read logs with logstreamer (nginx access log) and ship
>> them to another heka via TcpOutput.
>> I'm new to heka (and this ML) so here are my questions:
>>
>> Is bufferedoutput (to disk) mandatory? (maybe i'm not understanding the
>> code correctly/missed a config)
>>
> TcpOutput always runs the messages through a disk buffer, yes. This might
> be optional in the future, but it's true for now.
>
>> In my case it produces useless IO, ie read logs from disk, split them,
>> parse them, write them to disk, read them again and send them via tcp.
>>
> In our experience the performance impact from running the messages through
> a disk buffer is small enough, and TCP connections are brittle enough, that
> it wasn't worth introducing more complexity into the code to support both
> options. We have plans to change the way buffering works entirely, however,
> so it's optionally available to all plugins, this will change your options
> with regard to the TcpOutput.
>


ok cool (issue https://github.com/mozilla-services/heka/issues/1378 i guess)
Maybe you should add a note in the documentation

 Is there plan for memory buffer output (buffered chan, ...).
>>
> There haven't been plans thus far, no. Buffered channels are always in
> play, but there's no retry, messages that don't get through will be
> dropped. I'm definitely open to considering in memory buffering options,
> but not until after we get all of the details of the disk buffering
> situation worked out.
>
>> Output buffer shrink only every ticker_interval, which is 300sec, isn't
>> it too much by default? (on first run of heka with big input logs buffer
>> can grow a lot)
>>
> Whether or not this is a reasonable default depends on how much data is
> flowing through, and how precious a commodity disk space is. I'm open to
> lowering it, however, and explaining the implications a bit better in the
> docs.
>

now that i think a bit more about it, instead of time based rotation maybe
this should be size based rotation (i've seen a buffer of ~500Mo)
Also what are the drawbacks of rotating too often?

 queue_full_action = block doesn't recover (you have to restart heka) or
>> i'm not patient enough?
>>
> No, it's not you... I've done some testing and have been able to
> reproduce. I'm currently working on a fix, more soon.
>

Thanks a lot (i've opened an issue
https://github.com/mozilla-services/heka/issues/1484)


> -r
>
>

_______________________________________________
Heka mailing list
[email protected]
https://mail.mozilla.org/listinfo/heka

Re: [heka] TcpOutput and buffer

Reply via email to