James Dominy added the comment:
Ah, I did some digging. It turns out pbzip2 is installed on the system in
question, and more annoyingly, /usr/bin/bzip2 is a symlink to pbzip2. I didn't
realise the file was compressed by pbzip2.
Thanks for the
James Dominy added the comment:
How does one create a multi-stream bzip2 file in the first place? And how do I
tell it's multi-stream.
--
___
Python tracker
<http://bugs.python.org/is
James Dominy added the comment:
Whoops, forget to add the output from the standard binutils
$ bzcat example-file.csv.bz2 | wc -c
909602
$ bzcat example-file.csv.bz2 | md5sum
48f4b69b2b8bb0b171ebc36313eb6616 -
As you can see file sizes and hashes do not match
Changes by James Dominy :
--
title: BZ2File does decompress some .bz2 files correctly -> BZ2File doesn't
decompress some .bz2 files correctly
___
Python tracker
<http://bugs.python.org
New submission from James Dominy:
bz2.BZ2File does not decompress a file (see attached) correctly. This file can
be decompressed and compressed via stadard unix tools (bzip2 and bunzip2)
without change.
Consider ...
$ python
Python 2.7.6 (default, Dec 7 2013, 22:49:16)
[GCC 4.8.2] on
James Dominy <[EMAIL PROTECTED]> added the comment:
I've been working on a patch that allows pdb when run as a script to
split it's output such that the program being debugged uses a specified
tty for stdin/stdout, and leave the pdb.py IO on the original
stdin/stdout. I thi