If you use something like this:

for line in file.readlines():

then line is a string to the next newline and it automatically detects the EOF and the same with file.readline() but that will give you one character at a time.

On 7/19/05, frank h. <[EMAIL PROTECTED]> wrote:
hello all
I am trying to write a script in python that parses a gzipped logfile

the unzipped logfiles can be very large (>2GB)

basically the statements

file = gzip.GzipFile(logfile)
data = "" ()

for line in data.striplines():
....


would do what I want, but this is not feasible becasue the gzip files
are so huge.

So I do file.readline() in a for loop, but have no idea how long to
continue, because I dont know how many lines the files contain. How do
I check for end of file when using readline() ?
simply put it in a while loop and enclose it with try: except: ?

what would be the best (fastest) approach to deal with such large gzip
files in python?

thanks
_______________________________________________
Tutor maillist  -  Tutor@python.org
http://mail.python.org/mailman/listinfo/tutor

_______________________________________________
Tutor maillist  -  Tutor@python.org
http://mail.python.org/mailman/listinfo/tutor

Reply via email to