Thanks! It seems to work quite well using append!(d1::DataArray, d2::DataArray) from DataArrays.jl trunk. If I had a time machine to extend the Sunday, I’d work on a proper version of vcat using that, but alas. I appreciate the amazing work that has gone into Julia, HDF5.jl and DataFrames.jl. :)
On 26 Jan 2014, at 21:55, John Myles White <johnmyleswh...@gmail.com> wrote: > This is quite close to being possible, but we’re missing a few things. > > Daniel Jones recently added an append! method to DataArrays, which would let > you do this column-by-column. > > To help you out, we need to add an append! method to DataFrames as well. I’ve > wanted that badly myself lately. > > I will try to get to this today, but am already pretty overwhelmed with work > for the day. > > — John > > On Jan 26, 2014, at 11:02 AM, Joosep Pata <joosep.p...@gmail.com> wrote: > >> Is there a way to avoid copying when doing vcat(df1::DataFrame, >> df2::DataFrame, …)? I’m trying to open hundreds of files with DataFrames, >> merge all of them and save a single ~150M row x 100 col DataFrame using HDF5 >> and JLD (to be opened later using mmap), and it seems to work marvelously, >> apart from the vcat. >> Does a no-copy option exist? I’m aware of DataStreams as a concept, but as I >> understand, they’re not fully fleshed out yet. >