Re: Writing empty strings to parquet files

2018-05-07 Thread scarrascoso
Hi Wes: Thanks for your message. I would say that both test_pandas_parquet_1_0_rountrip and test_pandas_parquet_2_0_rountrip (in arrow/python/pyarrow/tests/test_parquet.py) already test this. Sorry I didn’t realize this sooner. All the best, Sergio Carrascoso > On 5 May 2018, at 01:31, Wes M

Re: Writing empty strings to parquet files

2018-05-04 Thread scarrascoso
Hi Uwe: Thanks a lot for your feedback. While preparing a simple example to reproduce this issue, I have been able to get the expected behavior (empty strings properly written as ‘’ in the parquet file). So actually there’s no problem with the Parquet.write_table The problem was rather in a bu

Writing empty strings to parquet files

2018-05-02 Thread scarrascoso
Hi: I would like to know if there is any way in PyArrow to write empty string values to a parquet file. When I use Parquet.write_table, if any column contains empty string values, they end up as None in the parquet file. My process depends on these values to be properly written as empty strings