[jira] [Commented] (ARROW-1041) [Python] Support read_pandas on a directory of Parquet files

2017-07-14 Thread Wes McKinney (JIRA)

[ 
https://issues.apache.org/jira/browse/ARROW-1041?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16088150#comment-16088150
 ] 

Wes McKinney commented on ARROW-1041:
-

[~cpcloud] can you update the scope of this issue? 

> [Python] Support read_pandas on a directory of Parquet files
> 
>
> Key: ARROW-1041
> URL: https://issues.apache.org/jira/browse/ARROW-1041
> Project: Apache Arrow
>  Issue Type: Improvement
>  Components: Python
>Reporter: Phillip Cloud
>Assignee: Phillip Cloud
> Fix For: 0.5.0
>
>




--
This message was sent by Atlassian JIRA
(v6.4.14#64029)


[jira] [Commented] (ARROW-1041) [Python] Support read_pandas on a directory of Parquet files

2017-07-14 Thread Phillip Cloud (JIRA)

[ 
https://issues.apache.org/jira/browse/ARROW-1041?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16088194#comment-16088194
 ] 

Phillip Cloud commented on ARROW-1041:
--

I think the scope here is pretty limited. {{read_pandas}} should support 
reading a directory of parquet files, which isn't implemented. I can take a 
look at implementing this at the beginning of next week. It looks like the work 
involves some forwarding of flags plus making any changes to {{read_row_group}} 
that would prevent us from reading index columns into a {{pyarrow.Table}}.

> [Python] Support read_pandas on a directory of Parquet files
> 
>
> Key: ARROW-1041
> URL: https://issues.apache.org/jira/browse/ARROW-1041
> Project: Apache Arrow
>  Issue Type: Improvement
>  Components: Python
>Reporter: Phillip Cloud
>Assignee: Phillip Cloud
> Fix For: 0.5.0
>
>




--
This message was sent by Atlassian JIRA
(v6.4.14#64029)


[jira] [Commented] (ARROW-1041) [Python] Support read_pandas on a directory of Parquet files

2017-07-14 Thread Wes McKinney (JIRA)

[ 
https://issues.apache.org/jira/browse/ARROW-1041?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16088217#comment-16088217
 ] 

Wes McKinney commented on ARROW-1041:
-

Cool; I may get to it this weekend depending on how the rest of the 0.5.0 
issues go, otherwise I'll wait for you

> [Python] Support read_pandas on a directory of Parquet files
> 
>
> Key: ARROW-1041
> URL: https://issues.apache.org/jira/browse/ARROW-1041
> Project: Apache Arrow
>  Issue Type: Improvement
>  Components: Python
>Reporter: Phillip Cloud
>Assignee: Phillip Cloud
> Fix For: 0.5.0
>
>




--
This message was sent by Atlassian JIRA
(v6.4.14#64029)


[jira] [Commented] (ARROW-1041) [Python] Support read_pandas on a directory of Parquet files

2017-07-17 Thread Wes McKinney (JIRA)

[ 
https://issues.apache.org/jira/browse/ARROW-1041?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16090817#comment-16090817
 ] 

Wes McKinney commented on ARROW-1041:
-

I'm touching this code a bunch working on ARROW-1103 so let me take a crack at 
this

> [Python] Support read_pandas on a directory of Parquet files
> 
>
> Key: ARROW-1041
> URL: https://issues.apache.org/jira/browse/ARROW-1041
> Project: Apache Arrow
>  Issue Type: Improvement
>  Components: Python
>Reporter: Phillip Cloud
>Assignee: Wes McKinney
> Fix For: 0.5.0
>
>




--
This message was sent by Atlassian JIRA
(v6.4.14#64029)


[jira] [Commented] (ARROW-1041) [Python] Support read_pandas on a directory of Parquet files

2017-07-17 Thread Wes McKinney (JIRA)

[ 
https://issues.apache.org/jira/browse/ARROW-1041?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16090984#comment-16090984
 ] 

Wes McKinney commented on ARROW-1041:
-

Included in PR: https://github.com/apache/arrow/pull/862

> [Python] Support read_pandas on a directory of Parquet files
> 
>
> Key: ARROW-1041
> URL: https://issues.apache.org/jira/browse/ARROW-1041
> Project: Apache Arrow
>  Issue Type: Improvement
>  Components: Python
>Reporter: Phillip Cloud
>Assignee: Wes McKinney
> Fix For: 0.5.0
>
>




--
This message was sent by Atlassian JIRA
(v6.4.14#64029)