Great, thanks for filing that JIRA Attila. Let's continue the discussion
there.

Mike

On Tue, Jul 12, 2016 at 8:50 AM, Attila Simon <[email protected]> wrote:

> Hi Mike,
>
> I created a jira (https://issues.apache.org/jira/browse/FLUME-2954)
> and would like to look around first in the codebase where such content
> log was introduced. Based on the actual use cases we can discuss
> further what would be the best approach. Thanks for the log4j concerns
> that actually moved my standpoint a bit.
>
> Cheers,
> Attila
>
>
> Attila Simon
> Software Engineer
> Email:   [email protected]
>
>
>
>
> On Sat, Jul 9, 2016 at 3:06 AM, Mike Percy <[email protected]> wrote:
> > Hi Attila,
> > Thanks for bringing this up. I agree that we should prevent logging data
> > unless it is explicitly enabled.
> >
> > One concern I have about the log4j approach is that many people have
> > customized their log4j.properties file. As long as your proposal would
> keep
> > logging of data disabled for all of the likely customizations people
> might
> > have in place then it sounds good. However maybe you can be a little more
> > specific about how it would look in the log4j.properties file and how it
> > would look at the code level when writing to that named logger. I'm not
> > totally sure I understand your exact proposal.
> >
> > Mike
> >
> > On Tue, Jul 5, 2016 at 9:44 AM, Attila Simon <[email protected]> wrote:
> >
> >> Hi,
> >>
> >> Flume has built in functionality to log out data flowing through
> >> mainly for debugging purposes. This functionality appears in several
> >> places of the codebase. I think such functionality rise security
> >> concerns in production environments where sensitive information might
> >> be ingested so it is crucial that enabling such functionality has to
> >> be as explicit as possible (avoid implicit side effect setup).
> >> Eg: setting the level of root logger to debug/trace cause that every
> >> other logger will start logging at debug/trace including the ones
> >> logging raw data.
> >>
> >> Options to solve this issue:
> >> 1) command line option to enable data logging
> >> 2) configuration property to enable data logging globally
> >> 3) implementing a single concept which is solely responsible for
> >> logging ie a single LoggerSink (which already exists) or Interceptor
> >> 4) introduction of a new named logger instance which is configured OFF
> >> in log4j config
> >> 5) any other idea is welcomed
> >>
> >> Considering the pros and cons of the usage and implementation I would
> >> vote for 4) but I require your opinion. I'm going to open a jira to
> >> tackle this work (please let me know if there are some important
> >> fields I have to set considering 1.7 release).
> >>
> >> Cheers,
> >> Attila
> >>
>

Reply via email to