Thanks Josh. Is there any performance penalty for unions, assuming that I have several hundred input files?
On Tue, Feb 12, 2013 at 4:39 PM, Josh Wills <[email protected]> wrote:
> Yeah, of course-- that's how stuff like joins work.
>
> PTable<K, V> first = pipeline.read(new TableSource<K, V>(firstFile));
> PTable<K, V> second = ...;
> PTable<K, V> union = first.union(second);
>
> etc.
>
>
> On Tue, Feb 12, 2013 at 1:36 PM, Victor Iacoban <[email protected]> wrote:
>
> > Is there any support in crunch to use multiple sequence files as pipeline
> > source?
> > something similar to standard MultipleInputs
> >
> > Thanks,
> > victor
