Hi, I am trying to set up Flume for high availability. Rsyslog sends the same feed to two different servers, s1 and s2. On both servers, Flume agents are configured to listen for the feed from rsyslog, and both agents write it to HDFS. What I end up with in HDFS is separate files with duplicated content.
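
For reference, here is a minimal sketch of what the agent on each server looks like (the agent name, port, and HDFS path are placeholders, not my exact config):

  a1.sources = r1
  a1.channels = c1
  a1.sinks = k1

  # listen for syslog over TCP (port is a placeholder)
  a1.sources.r1.type = syslogtcp
  a1.sources.r1.host = 0.0.0.0
  a1.sources.r1.port = 5140
  a1.sources.r1.channels = c1

  a1.channels.c1.type = memory
  a1.channels.c1.capacity = 10000

  # write events to a dated HDFS directory (path is a placeholder)
  a1.sinks.k1.type = hdfs
  a1.sinks.k1.channel = c1
  a1.sinks.k1.hdfs.path = hdfs://namenode/flume/syslog/%Y-%m-%d
  a1.sinks.k1.hdfs.fileType = DataStream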
Is there a best-practice architecture for using Flume in a situation like this? What I am trying to achieve is failover: syslog is forwarded to two servers so that when one server is down, at least one agent can still transport events to HDFS, but without every event being written twice.
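
One idea I am considering is to do the failover on the rsyslog side instead of duplicating the feed, roughly like this (legacy rsyslog syntax; hostnames and port are placeholders, and I have not tested this):

  # forward everything to s1
  *.* @@s1.example.com:5140
  # only fire the next action when the previous one (s1) is suspended
  $ActionExecOnlyWhenPreviousIsSuspended on
  & @@s2.example.com:5140
  $ActionExecOnlyWhenPreviousIsSuspended off

That way s2 only receives events while s1 is unreachable, so duplicates should be limited to the switchover window. Is that a reasonable approach, or is there a better way to do this inside Flume itself?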
At the moment my fallback plan is to keep both feeds and clean out the duplicates after some time, before Hive starts using the directory.
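
For the cleanup step, I was thinking of something along these lines in Hive (table, column, and partition names are made up for illustration):

  -- rewrite a partition keeping only distinct rows
  INSERT OVERWRITE TABLE syslog_clean PARTITION (dt='2014-03-20')
  SELECT DISTINCT line FROM syslog_raw WHERE dt='2014-03-20';

The caveat being that genuinely repeated log lines would get collapsed as well, which is why I would prefer to avoid the duplicates in the first place.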
--
Margus (margusja) Roo
http://margus.roo.ee
skype: margusja
+372 51 48 780
