[ https://issues.apache.org/jira/browse/ARROW-473?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16217743#comment-16217743 ]
ASF GitHub Bot commented on ARROW-473: -------------------------------------- cpcloud commented on issue #1031: WIP ARROW-473: [C++/Python] Add public API for retrieving block locations for a particular HDFS file URL: https://github.com/apache/arrow/pull/1031#issuecomment-339140764 It takes around 2 minutes to fully start up. Is the script waiting for at least that amount of time before doing anything? ---------------------------------------------------------------- This is an automated message from the Apache Git Service. To respond to the message, please log on GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: us...@infra.apache.org > [C++/Python] Add public API for retrieving block locations for a particular > HDFS file > ------------------------------------------------------------------------------------- > > Key: ARROW-473 > URL: https://issues.apache.org/jira/browse/ARROW-473 > Project: Apache Arrow > Issue Type: New Feature > Components: C++, Python > Reporter: Wes McKinney > Labels: pull-request-available > Fix For: 0.8.0 > > > This is necessary for applications looking to schedule data-local work. > libhdfs does not have APIs to request the block locations directly, so we > need to see if the {{hdfsGetHosts}} function will do what we need. For > libhdfs3 there is a public API function -- This message was sent by Atlassian JIRA (v6.4.14#64029)