Denis created ARROW-7956:
----------------------------

             Summary: Memory leak in pyarrow.ipc.serialize_pandas/deserialize_pandas
                 Key: ARROW-7956
                 URL: https://issues.apache.org/jira/browse/ARROW-7956
             Project: Apache Arrow
          Issue Type: Bug
          Components: Python
    Affects Versions: 0.16.0, 0.15.0
            Reporter: Denis
         Attachments: loans.parquet, pyarrow_mem_leak_test.py

Python version: 3.7.4 (conda distribution)

OS: Ubuntu 18.04

pandas version is 0.24.2

numpy version is 1.16.4

 

To reproduce the issue, run the attached script pyarrow_mem_leak_test.py with 
the attached file loans.parquet placed in the working directory.
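
For readers without the attachment, a check of this kind could be sketched roughly as follows. This is not the attached script: the helper names (peak_rss, rss_growth, main) and the iteration count are made up for illustration, and peak resident set size is only one assumed way to observe the growth.

```python
import gc
import resource


def peak_rss():
    """Peak resident set size of this process (KiB on Linux, bytes on macOS)."""
    return resource.getrusage(resource.RUSAGE_SELF).ru_maxrss


def rss_growth(fn, iterations=50):
    """Call fn() repeatedly and return how much the peak RSS grew."""
    fn()  # warm-up call so one-time allocations are not counted
    gc.collect()
    before = peak_rss()
    for _ in range(iterations):
        fn()
    gc.collect()
    return peak_rss() - before


def main(path="loans.parquet"):
    # Imported here so the helpers above depend only on the standard library.
    import pandas as pd
    import pyarrow as pa

    df = pd.read_parquet(path)

    def round_trip():
        buf = pa.ipc.serialize_pandas(df)   # DataFrame -> Arrow IPC buffer
        pa.ipc.deserialize_pandas(buf)      # buffer -> DataFrame, discarded

    print("peak RSS growth:", rss_growth(round_trip))
```

Calling main() with loans.parquet in the working directory prints the growth in peak RSS over the loop. A value that keeps rising as the iteration count is increased would be consistent with a leak.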

 

Reading and writing Parquet in memory also leaks memory. To reproduce this, 
run the function test_parquet_leak() from the attached file 
pyarrow_mem_leak_test.py.
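
As an illustration only (this is not the attached test_parquet_leak()), an in-memory Parquet round trip could look something like the sketch below. The function names and the iteration count are hypothetical. The check assumes that pa.total_allocated_bytes(), which reports only Arrow's default memory pool, is a useful signal; a leak outside that pool would not show up here.

```python
import gc


def parquet_round_trip(df):
    """Write df to an in-memory Parquet buffer and read it back."""
    import pyarrow as pa
    import pyarrow.parquet as pq

    sink = pa.BufferOutputStream()
    pq.write_table(pa.Table.from_pandas(df), sink)
    table = pq.read_table(pa.BufferReader(sink.getvalue()))
    return table.to_pandas()


def arrow_pool_growth(df, iterations=50):
    """Bytes still held by Arrow's default memory pool after the loop."""
    import pyarrow as pa

    parquet_round_trip(df)  # warm-up call
    gc.collect()
    before = pa.total_allocated_bytes()
    for _ in range(iterations):
        parquet_round_trip(df)  # result is discarded immediately
    gc.collect()
    return pa.total_allocated_bytes() - before
```

With df loaded from loans.parquet, arrow_pool_growth(df) should stay near zero if every buffer is released. Comparing it with the process RSS can help tell whether the growth is inside or outside Arrow's pool.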

The memory leak is 100% reproducible.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)
