parallel unzip in progress

Jay Norwood Mon, 02 Apr 2012 22:29:42 -0700

I'm working on a parallel unzip. I started with Phobos std.zip, but found it too monolithic. I needed to separate out the tasks that get the directory entries, create the directory tree, get the compressed data, expand the data, and create the uncompressed files on disk. It currently unzips a 2 GB directory structure in about 18 seconds, while 7zip takes around 55 seconds. Only about 4 seconds of this is creating the directory structure and expanding the data. The other 14 seconds is writing the regular files.
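
As a rough illustration of that split into stages, here is a minimal sketch in Python rather than D. It is not the code described in this post: the function names (list_entries, create_tree, extract_one, parallel_unzip) and the worker count are made up for the example, and it does no path sanitising.

```python
import os
import zipfile
from concurrent.futures import ThreadPoolExecutor


def list_entries(zip_path):
    """Stage 1: read only the central directory (no file data is expanded)."""
    with zipfile.ZipFile(zip_path) as zf:
        return zf.infolist()


def create_tree(entries, dest):
    """Stage 2: create every directory up front, before any file is written."""
    dirs = set()
    for info in entries:
        path = os.path.join(dest, info.filename)
        dirs.add(path if info.is_dir() else os.path.dirname(path))
    for d in sorted(dirs):
        os.makedirs(d, exist_ok=True)


def extract_one(zip_path, name, dest):
    """Remaining stages for one entry: read compressed data, expand, write."""
    # Each task opens its own handle so workers never share a file position.
    # Simple, but it re-reads the central directory once per entry.
    with zipfile.ZipFile(zip_path) as zf:
        data = zf.read(name)
    out_path = os.path.join(dest, name)
    with open(out_path, "wb") as out:
        out.write(data)
    return out_path


def parallel_unzip(zip_path, dest, workers=4):
    """Run the stages in order; only the per-file work is done in parallel."""
    entries = list_entries(zip_path)
    create_tree(entries, dest)
    files = [e.filename for e in entries if not e.is_dir()]
    with ThreadPoolExecutor(max_workers=workers) as pool:
        return list(pool.map(lambda n: extract_one(zip_path, n, dest), files))
```

Creating the whole tree before any worker starts means the per-file tasks never race on directory creation, which is one reason to keep that stage separate.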

The subtasks needed to be separated not only so they can run in parallel, but also because the current std.zip implementation is a memory hog: it keeps the whole compressed and expanded data sections in memory. I was running out of memory in a 32-bit application just attempting to unzip the test file with the std.zip operations. The parallel version peaks at around 150 MB of memory during the operation.
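
One general way to keep memory bounded is to expand each entry in fixed-size chunks instead of holding whole files in memory. The sketch below shows that idea in Python; it is an assumption about the general technique, not a description of how the parallel version in this post achieves its figure. The name extract_streaming and the 64 KiB buffer size are hypothetical.

```python
import shutil
import zipfile

CHUNK = 64 * 1024  # assumed buffer size, chosen only for illustration


def extract_streaming(zip_path, name, out_path, chunk=CHUNK):
    """Expand one entry chunk by chunk instead of reading it whole."""
    with zipfile.ZipFile(zip_path) as zf:
        # zf.open() returns a file-like object that decompresses on demand,
        # so the copy buffer is bounded by `chunk`, not by the file size.
        with zf.open(name) as src, open(out_path, "wb") as dst:
            shutil.copyfileobj(src, dst, chunk)
```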

The parallel version is still missing the step of restoring the original file attributes, and I see no example in the documentation of what would normally be done. Am I missing this somewhere? I'll have to dig around...
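
For reference, each zip directory entry carries a DOS-style modification timestamp and an "external attributes" field; for archives created on Unix-like systems, the high 16 bits of that field conventionally hold the file mode. A minimal Python sketch of restoring both after extraction follows. The function name restore_attributes is hypothetical, and it deliberately restores only the basic permission bits.

```python
import os
import time
import zipfile


def restore_attributes(info, out_path):
    """Apply the stored timestamp and (if present) Unix mode to out_path.

    `info` is the zipfile.ZipInfo for the entry that was written to out_path.
    """
    # Zip stores local time with 2-second resolution; -1 lets mktime decide DST.
    mtime = time.mktime(info.date_time + (0, 0, -1))
    os.utime(out_path, (mtime, mtime))

    # The mode is only meaningful when the archive was made on a Unix-like
    # system (create_system == 3). Mask to the plain rwx permission bits.
    mode = (info.external_attr >> 16) & 0o777
    if info.create_system == 3 and mode:
        os.chmod(out_path, mode)
```

It would be called once per file after the data has been written, since writing the file afterwards would reset the modification time.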
