Uwe L. Korn created ARROW-362: --------------------------------- Summary: Python: Calling to_pandas on a table read from Parquet leaks memory Key: ARROW-362 URL: https://issues.apache.org/jira/browse/ARROW-362 Project: Apache Arrow Issue Type: Bug Components: Python Reporter: Uwe L. Korn
Steps to reproduce: * Read a parquet file with {{pyarrow.parquet.read_table}} * Convert the table to a DataFrame with {{to_pandas}} * Repeat the last step several time and see an ever increasing memory usage This seems to happen only in this combination. Calling {{gc.collect}} doesn't help. -- This message was sent by Atlassian JIRA (v6.3.4#6332)