[ https://issues.apache.org/jira/browse/ARROW-13254?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Weston Pace closed ARROW-13254. ------------------------------- Fix Version/s: 5.0.0 Resolution: Duplicate I'm going to go ahead and close this as a duplicate of ARROW-12983. If you try it on 5.0.0 (the next version to have the fix for ARROW-12983) or the latest nightly and the issue is still there then feel free to reopen. > [Python] Processes killed and semaphore objects leaked when reading pandas > data > ------------------------------------------------------------------------------- > > Key: ARROW-13254 > URL: https://issues.apache.org/jira/browse/ARROW-13254 > Project: Apache Arrow > Issue Type: Bug > Environment: OS name and version: macOS 11.4 > Python version: 3.8.10 > Pyarrow version: 4.0.1 > Reporter: Koyomi Akaguro > Priority: Major > Fix For: 5.0.0 > > > When I run {{pa.Table.from_pandas(df)}} for a >1G dataframe, it reports > > {{Killed: 9 > ../anaconda3/envs/py38/lib/python3.8/multiprocessing/resource_tracker.py:216: > UserWarning: resource_tracker: There appear to be 1 leaked semaphore objects > to clean up at shutdown}} > > -- This message was sent by Atlassian Jira (v8.3.4#803005)