New submission from alter-bug-tracer <[email protected]>:
UnicodeDecodeError is thrown instead of csv.Error when parsing malformed inputs.
Examples:
1. file0
UnicodeDecodeError: 'utf-8' codec can't decode byte 0xd5 in position 0: invalid
continuation byte
Traceback (most recent call last):
File "csv_parser.py", line 6, in <module>
for row in reader:
File "/usr/local/lib/python3.8/csv.py", line 111, in __next__
row = next(self.reader)
File "/usr/local/lib/python3.8/codecs.py", line 322, in decode
(result, consumed) = self._buffer_decode(data, self.errors, final)
2. file1
UnicodeDecodeError: 'utf-8' codec can't decode byte 0x80 in position 51:
invalid start byte
Traceback (most recent call last):
File "csv_parser.py", line 6, in <module>
for row in reader:
File "/usr/local/lib/python3.8/csv.py", line 110, in __next__
self.fieldnames
File "/usr/local/lib/python3.8/csv.py", line 97, in fieldnames
self._fieldnames = next(self.reader)
File "/usr/local/lib/python3.8/codecs.py", line 322, in decode
(result, consumed) = self._buffer_decode(data, self.errors, final)
(file0, file1 and csv_parser.py attached)
----------
files: csv.zip
messages: 342939
nosy: alter-bug-tracer
priority: normal
severity: normal
status: open
title: csv: undocumented UnicodeDecodeError on malformed file
type: behavior
versions: Python 3.8
Added file: https://bugs.python.org/file48339/csv.zip
_______________________________________
Python tracker <[email protected]>
<https://bugs.python.org/issue36975>
_______________________________________
_______________________________________________
Python-bugs-list mailing list
Unsubscribe:
https://mail.python.org/mailman/options/python-bugs-list/archive%40mail-archive.com