wolfgang hoschek created FLUME-1753:
---------------------------------------

             Summary: Add support for submitting multiple files per avro client 
command line invocation
                 Key: FLUME-1753
                 URL: https://issues.apache.org/jira/browse/FLUME-1753
             Project: Flume
          Issue Type: Improvement
          Components: Sinks+Sources
    Affects Versions: v1.3.0
            Reporter: wolfgang hoschek


Some of the use cases we have involve sending a bunch of little files into a 
flume pipe. Would give better perf to send them all with one flume-ng 
avro-client command line invocation, rather than N processes started up 
serially.

Perhaps have flume be able to read a list of input files from stdin, generated 
by linux find command or HDFS 'find' command or similar. Along the following 
lines:

find ... |flume-ng avro-client --inputlist=-

and from a file containing one URL per line:

flume-ng avro-client --inputlist=<file> 

and from N files:

flume-ng avro-client —-filename=<file1> —-filename=<file2> ... 
—-filename=<fileN>  

A file can be a local file or an HDFS path URL.

In our case the headers would be the same for all files. 

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

Reply via email to