[ 
https://issues.apache.org/jira/browse/ARROW-473?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16224262#comment-16224262
 ] 

ASF GitHub Bot commented on ARROW-473:
--------------------------------------

wesm commented on issue #1031: WIP ARROW-473: [C++/Python] Add public API for 
retrieving block locations for a particular HDFS file
URL: https://github.com/apache/arrow/pull/1031#issuecomment-340315909
 
 
   @AnkitAggarwalPEC would it be OK if I took over developing the patch from 
here so we can get this closed out for 0.8.0? I have a local HDFS setup (from 
my Apache Impala development environment) I can use for testing. I'm not sure 
what's going on with the Docker setup but someone will need to investigate 
separately

----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


> [C++/Python] Add public API for retrieving block locations for a particular 
> HDFS file
> -------------------------------------------------------------------------------------
>
>                 Key: ARROW-473
>                 URL: https://issues.apache.org/jira/browse/ARROW-473
>             Project: Apache Arrow
>          Issue Type: New Feature
>          Components: C++, Python
>            Reporter: Wes McKinney
>              Labels: pull-request-available
>             Fix For: 0.8.0
>
>
> This is necessary for applications looking to schedule data-local work. 
> libhdfs does not have APIs to request the block locations directly, so we 
> need to see if the {{hdfsGetHosts}} function will do what we need. For 
> libhdfs3 there is a public API function 



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)

Reply via email to