Fred Tzeng created HDFS-14676:
---------------------------------

             Summary: Calls to libhdfs (via pyarrow) hang after a while
                 Key: HDFS-14676
                 URL: https://issues.apache.org/jira/browse/HDFS-14676
             Project: Hadoop HDFS
          Issue Type: Bug
          Components: hdfs, libhdfs
    Affects Versions: 3.0.3
         Environment: hadoop-3.0.3, Python 3.6, CentOS 7
            Reporter: Fred Tzeng

I'm using the pyarrow HDFS client in a long-running (never-exiting) app. The app opens a connection to HDFS (via libhdfs) whenever an external request comes in and destroys the connection as soon as the request has been handled. This happens many times, on separate threads, and everything works great.

The problem is that after the app has been idle for a while (perhaps hours), with no HDFS connections made during that time, the next connection attempt hangs. No exceptions are thrown. As soon as I restart my Python app, HDFS connections work fine again.

I'm using the precompiled libhdfs.so directly from the hadoop-3.0.3 distribution. Do I typically need to recompile libhdfs.so for my OS, or is the one shipped in the distribution usually fine?

I checked with the Arrow community first. They recommended that I ask the Hadoop community, since all the pyarrow client does is pass commands through to libhdfs.

Any suggestions on debugging this hang would be appreciated.
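
One possible way to narrow down a hang like the one described above is to run the connection attempt under a watchdog and dump every Python thread's stack if it does not return in time. The following is only a minimal sketch using the standard library: the names `call_with_watchdog` and `dump_python_stacks`, the 30-second timeout, and the host and port in the usage comment are all hypothetical and not part of the original report.

```python
import sys
import threading
import traceback


def dump_python_stacks(out=None):
    """Write the current Python stack of every thread to `out` (stderr by default)."""
    out = out or sys.stderr
    for thread_id, frame in sys._current_frames().items():
        print(f"--- thread {thread_id} ---", file=out)
        print("".join(traceback.format_stack(frame)), file=out)


def call_with_watchdog(connect, timeout_s=30.0):
    """Run connect() in a daemon thread; if it does not finish within
    timeout_s seconds, dump all Python stacks and raise TimeoutError."""
    result = {}

    def worker():
        try:
            result["value"] = connect()
        except BaseException as exc:  # hand any failure back to the caller
            result["error"] = exc

    # Daemon thread: a call that never returns will not block interpreter exit.
    thread = threading.Thread(target=worker, daemon=True)
    thread.start()
    thread.join(timeout_s)

    if thread.is_alive():
        dump_python_stacks()
        raise TimeoutError(f"connect() did not return within {timeout_s} s")
    if "error" in result:
        raise result["error"]
    return result["value"]


# Hypothetical usage, assuming the legacy pyarrow.hdfs.connect API and a
# made-up namenode address:
#   import pyarrow
#   fs = call_with_watchdog(lambda: pyarrow.hdfs.connect("namenode", 8020))
```

The Python stacks only show which call is blocked. Because libhdfs runs a JVM inside the Python process via JNI, a JVM thread dump of that process (for example with `jstack`) may also be needed to see where the call is stuck on the Java side.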