Exception Handling with Flume

Souvik Bose Mon, 08 Dec 2014 03:38:14 -0800

Hello All,

I am stuck with a problem with flume version 1.4.0. I am usingspooldirectory source with a custom interceptor to process encoded gpsfiles and save it in hdfs and solr (using morphline solr sink). The maininformtion is stored on the file name itself which is coming in on thespool directory and the content is irrelevant. So I am using the custominterceptor to extract and transform the file header and store theextracted data in Json format as the output of the event.

My problem comes in:

1. When there is a 0 byte file comes in (generally files come in with a"!" symbol in the content) flume stops and throws an exception. We don'tneed the content of the file in any case, but still face exception asflume cannot handle 0 byte files.2. When there is content with some weird characters like !f!, flumestops with exception3. Even when everything is running fine, I am losing some data/ events.On closer introspection I found that some are available in hdfs but notin solr and vice versa. I am not using any processor sinkgroups likefailover or load balancing. Is it because of that?

I want to achieve a solution where I can handle any exceptions and thefile/data which causes the exception is discarded and flume processesthe next file in the spool directory. The date comes in at high velocity100 files every seconds. So manually deleting the file and retstartingflume is the regular practice I do to keep everything back on track. ButI am sure there must be some better ways to handle this case. Can youguys please suggests some better alternatives for my approach please//?/


Thanks & Regards,
Souvik Bose
///

Exception Handling with Flume

Reply via email to