HDFS sink - rollover

2015-09-21 Thread Thomas . Beer
Mit den besten Gruessen/Best regards, Dr. Thomas Beer Architect Big Data Solutions Interior Electronics Solutions I IES BE Rechnungsadresse / Invoice address: Continental Automotive GmbH Blumenstraße 18 93055 Regensburg, Germany Mobile: +49 151 20306831 Fax: +49 941 790-8730 E-Mail: thomas.b...@

Antwort: Re: HDFS sink - rollover

2015-09-21 Thread Thomas . Beer
Eran, thanks a lot. "idleTimeout" solved my problem. Best, Thomas Von:IT CTO An: user@flume.apache.org, Datum: 21.09.2015 12:48 Betreff: Re: HDFS sink - rollover My guess would be that you don't have a steady flow of messages and you get to the interval

Re: HDFS sink - rollover

2015-09-21 Thread IT CTO
er count varies (it is not always done after 4 events, as > written before). > > Best, > Thomas > > > > > Von:thomas.b...@continental-corporation.com > An:user@flume.apache.org, > Datum: 21.09.2015 12:05 > Betreff:HDFS sink - rollover

Antwort: HDFS sink - rollover

2015-09-21 Thread Thomas . Beer
Addon: The rollover count varies (it is not always done after 4 events, as written before). Best, Thomas Von:thomas.b...@continental-corporation.com An: user@flume.apache.org, Datum: 21.09.2015 12:05 Betreff:HDFS sink - rollover Hi, I'm using the Kafka-Flume sourc

HDFS sink - rollover

2015-09-21 Thread Thomas . Beer
Hi, I'm using the Kafka-Flume source and the Flume-HDFS sink for writing SequenceFiles. I would like to rollover a SequenceFile after a specific count of events/messages was written, e.g. after 50 messages (see rollCount parameter below) a new file should be written. My configuration seems to b

Re: Flume hdfs sink rollover

2012-08-26 Thread Denny Ye
Yes, you are right. Flume uses uncompressed size to judge the case of rolling. The appropriate place to calculate size is in-memory. Normally, compression ratio of snappy might be 5x-10x, more better if there have too many duplicated data. Thus, it fits your setting, do you agree? -Regards Denny Y

Re: Flume hdfs sink rollover

2012-08-26 Thread Mohit Anchlia
On Sun, Aug 26, 2012 at 6:47 AM, Denny Ye wrote: > hi Mohit, > Why you confirm it doesn't work at time? I think it reaches to size > limitation of your setting 'hdfs.rollSize'. Each snappy file almost 5 > hundreds megabytes every 6 or 7 minutes. It fits the compression radio of > snappy for

Re: Flume hdfs sink rollover

2012-08-26 Thread Denny Ye
hi Mohit, Why you confirm it doesn't work at time? I think it reaches to size limitation of your setting 'hdfs.rollSize'. Each snappy file almost 5 hundreds megabytes every 6 or 7 minutes. It fits the compression radio of snappy format I rearraged your file order. It's well from my point

Flume hdfs sink rollover

2012-08-24 Thread Mohit Anchlia
I have rollover defined either to roll every 5G or 1+ hr but doesn't seem to be working. Could you please suggest if I got the conf incorrectly configured? foo.sinks.hdfsSink.hdfs.filePrefix = web foo.sinks.hdfsSink.hdfs.rollInterval = 4000 foo.sinks.hdfsSink.hdfs.rollCount = 0 foo.sinks.hdfsSi