Thanks Antione for the pointers. I switched to using io::BufferReader and
that fixed the issue. Newline options also work fine. It is an extra copy
for me, but for now I am good.

On Wed, Mar 9, 2022 at 7:46 AM Antoine Pitrou <anto...@python.org> wrote:

> On Mon, 7 Mar 2022 11:52:02 -0800
> HK Verma <hkve...@gmail.com> wrote:
>
> > Thanks Antoine. Yes I have newlines_in_values set to false. Other configs
> > also look ok.
> > However I do have rows with less number of columns than the specified
> > numbers in convert options in column types. I have my own
> > invalid_row_handler where I currently skip these rows.
> > It looks like the parser is doing a quick pass and splitting the blocks
> > using the new line separator and parsing them in parallel. This seems
> > similar to spark csv multiline option. To avoid this case, I did set
> > newlines_in_values to false and also checked if newlines are not there in
> > any of the fields.
>
> No, you should check `newlines_in_values` to true and see if the error
> goes away.
>
> Regards
>
> Antoine.
>
>
>

Reply via email to