RE: [DISCUSS] Should we support hot reconfiguration?

Jorn Argelo - Ephorus Tue, 12 Jun 2012 09:44:18 -0700

Hi all,


Not sure if a regular user should participate in these sort of
discussions but here's my opinion nevertheless ;-)

 

I think one of the biggest flaws of OG Flume is that it's too complex
and maybe even over-engineered. For example the centralized
configuration in the Flume Master sounds good on paper but in practice
it doesn't make things easier at all (fortunately this was fixed with NG
Flume). So IMO stick to the KISS principle and keep things
down-to-earth. I have no problem restarting agents, and a HUP
construction to reload the configuration as Senthivil suggested is even
better.

 

I guess most users are using a config management system like Puppet or
Chef to deploy and configure their agents. If you keep that in mind in
terms of configuration it makes things a whole lot easier for those
users.

 

Regards,

Jorn





Van: Senthilvel Rangaswamy [mailto:[email protected]] 
Verzonden: zaterdag 9 juni 2012 1:32
Aan: [email protected]
CC: Flume Development
Onderwerp: Re: [DISCUSS] Should we support hot reconfiguration?

 

IMHO, online reconfiguration is dangerous. A typical use case for flume
is to be deployed at the very
beginning of the data source, like web servers. These are typically in
large numbers. Say you push out
a bad config and that gets picked up, it will wreak havoc on the
infrastructure.

I would like the flume to pick the new config when it is HUP'ed. This
way, it is a controlled deployment,
but at the same time not a full restart.



On Thu, Jun 7, 2012 at 3:18 PM, Eric Sammer <[email protected]>
wrote:

Flumers:

 

Flume 0.9.x supported online reconfiguration and the intention was for
the 1.x branch to do so as well (it doesn't yet). I wanted to start a
discussion around whether people are interested in this kind of
functionality or if simply restarting the daemon(s) was sufficient for
your deployment. There are two ways of thinking about it:

 

* Support reconfiguration. Agents may have multiple flows passing
through them and, ideally, adding new ones shouldn't interrupt existing
flows. Agent restarts interrupt collection and, for non-durable channels
(i.e. MemoryChannel), data *may* be lost. Reconfiguration will add
significant complexity and ultimately does not get around host level
maintenance, software upgrades, and the like.

 

* Do no support reconfiguration. Accept the fact that agents may go down
eventually, so it should be supported as a first class case. In other
words, embrace the idea of failure / maintenance and handle it by
recommending topologies of agents that include multiple agents at each
tier and simply roundrobin / failover where necessary. The only downside
is the agent tier closest to the originating source (e.g. a log4j
client); restarting that agent means the client application needs to be
able to find another agent or buffer (which impacts durability or blocks
the application).


 

We can optionally support some subset of online reconfiguration such as
only allowing new flows to be introduced or existing flows to be
"decommissioned," but not allow alteration of existing flows. Ultimately
this feature is a ton of work and adds a ton of complexity so if it's
not something folks are clambering for, we should spend our time
worrying about more pressing issues.

 

Thoughts? Comments?

-- 
Eric Sammer
twitter: esammer
data: www.cloudera.com




-- 
..Senthil

"If there's anything more important than my ego around, I want it 
 caught and shot now."
                                                    - Douglas Adams.

RE: [DISCUSS] Should we support hot reconfiguration?

Reply via email to