URL:
<http://savannah.gnu.org/bugs/?53968>
Summary: Decompressed data is written to WARC file when using
--compression=gzip
Project: GNU Wget
Submitted by: None
Submitted on: Thu 24 May 2018 12:50:50 PM UTC
Category: Program Logic
Severity: 3 - Normal
Priority: 5 - Normal
Status: None
Privacy: Public
Assigned to: None
Originator Name: William Prescott
Originator Email: [email protected]
Open/Closed: Open
Discussion Lock: Any
Release: 1.19.5
Operating System: GNU/Linux
Reproducibility: Every Time
Fixed Release: None
Planned Release: None
Regression: None
Work Required: None
Patch Included: None
_______________________________________________________
Details:
When using the "--compression=gzip" option, Wget will write decompressed data
into "response" WARC records instead of recording the response exactly as it
was received.
This also makes it impossible to correctly decode data in the WARC file that
was transferred using chunked encoding.
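
One way to detect affected records is to check whether the recorded body still
matches the encodings named in the recorded HTTP headers. The following is a
minimal Python sketch, not part of Wget. It assumes the raw HTTP response
(status line, headers, body) has already been extracted from a "response"
record, and that the headers in the record still advertise "Content-Encoding:
gzip" and/or "Transfer-Encoding: chunked". The function names
(split_http_response, dechunk, body_matches_headers) are hypothetical.

```python
import gzip
import zlib


def split_http_response(raw: bytes):
    """Split a raw HTTP response into (lower-cased header dict, body bytes)."""
    head, _, body = raw.partition(b"\r\n\r\n")
    headers = {}
    for line in head.split(b"\r\n")[1:]:  # skip the status line
        name, _, value = line.partition(b":")
        headers[name.strip().lower().decode("ascii")] = value.strip().decode("latin-1")
    return headers, body


def dechunk(body: bytes) -> bytes:
    """Decode a chunked body; raise ValueError if the chunk framing is invalid."""
    out = bytearray()
    pos = 0
    while True:
        eol = body.find(b"\r\n", pos)
        if eol == -1:
            raise ValueError("missing chunk-size line")
        # Chunk extensions (after ';') are ignored; the size is hexadecimal.
        size = int(body[pos:eol].split(b";", 1)[0].strip(), 16)
        start = eol + 2
        if size == 0:
            return bytes(out)  # last chunk; trailers are ignored in this sketch
        chunk = body[start:start + size]
        if len(chunk) != size or body[start + size:start + size + 2] != b"\r\n":
            raise ValueError("truncated or malformed chunk")
        out += chunk
        pos = start + size + 2


def body_matches_headers(raw: bytes) -> bool:
    """Return True if the body can be decoded as the headers say it should be."""
    headers, body = split_http_response(raw)
    try:
        if "chunked" in headers.get("transfer-encoding", "").lower():
            body = dechunk(body)
        if "gzip" in headers.get("content-encoding", "").lower():
            gzip.decompress(body)
    except (ValueError, OSError, EOFError, zlib.error):
        return False
    return True
```

Under these assumptions, a record written exactly as received should return
True, while a record whose body was already decompressed should return False,
because the body is neither valid chunked framing nor valid gzip data.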
_______________________________________________________
File Attachments:
-------------------------------------------------------
Date: Thu 24 May 2018 12:50:50 PM UTC Name: example.warc.gz Size: 5KiB By:
None
WARC and terminal output from running "wget -d --compression=gzip
--warc-file=example 'http://lists.gnu.org/'"
<http://savannah.gnu.org/bugs/download.php?file_id=44208>
-------------------------------------------------------
Date: Thu 24 May 2018 12:50:50 PM UTC Name: wgetOutput.txt Size: 2KiB By:
None
WARC and terminal output from running "wget -d --compression=gzip
--warc-file=example 'http://lists.gnu.org/'"
<http://savannah.gnu.org/bugs/download.php?file_id=44209>
_______________________________________________________
Reply to this item at:
<http://savannah.gnu.org/bugs/?53968>