On Thu, 11 Oct 2007 22:52:55 +0000, RyanL wrote: > I'm a newbie with a large number of data files in multiple > directories. I want to uncompress, read, and copy the contents of > each file into one master data file. The code below seems to be doing > this perfectly. The problem is each of the data files has a header > row in the first line, which I do not want in the master file. How > can I skip that first line when writing to the master file? Any help > is much appreciated. Thank you.
Untested version with `itertools.islice()`: import glob import gzip import os from itertools import islice def main(): zipdir = 'G:/Research/Data/' outfilename = 'G:/Research/Data/master_data.txt' out_file = open(outfilename, 'w') for name in os.listdir(os.curdir): if os.path.isdir(name): os.chdir(name) for zip_name in glob.glob('*.gz'): in_file = gzip.GzipFile(zip_name, 'r') out_file.writelines(islice(in_file, 1, None)) in_file.close() os.chdir(os.pardir) out_file.close() Ciao, Marc 'BlackJack' Rintsch -- http://mail.python.org/mailman/listinfo/python-list