[jira] [Resolved] (ARROW-7367) [Python] Use np.full instead of np.array.repeat in ParquetDatasetPiece

Wes McKinney (Jira) Mon, 06 Jan 2020 10:47:12 -0800


     [ 
https://issues.apache.org/jira/browse/ARROW-7367?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]


Wes McKinney resolved ARROW-7367.
---------------------------------
    Fix Version/s: 1.0.0
       Resolution: Fixed

Issue resolved by pull request 5999
[https://github.com/apache/arrow/pull/5999]

> [Python] Use np.full instead of np.array.repeat in ParquetDatasetPiece
> ----------------------------------------------------------------------
>
>                 Key: ARROW-7367
>                 URL: https://issues.apache.org/jira/browse/ARROW-7367
>             Project: Apache Arrow
>          Issue Type: Improvement
>          Components: Python
>            Reporter: Xavier Lacroze
>            Priority: Trivial
>              Labels: pull-request-available
>             Fix For: 1.0.0
>
>          Time Spent: 0.5h
>  Remaining Estimate: 0h
>
> For small tables (len < 100), execution time degrades slightly (about x1.4
> at len = 10); for large tables, the performance gain is substantial
> (execution time about x0.04 at len = 100_000).
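
For illustration, here is a minimal sketch of the two approaches named in the issue title. It is not the code from the pull request: the function names, the sample value, and the 'i4' dtype are assumptions chosen for the example. Both functions build a constant array of a given length; the first goes through a one-element array and repeat, the second allocates and fills the result directly with np.full.

```python
import numpy as np


def constant_array_repeat(value, n, dtype="i4"):
    # Older pattern: build a one-element array, then repeat it n times.
    # (Hypothetical helper name; dtype is an assumption for this sketch.)
    return np.array([value], dtype=dtype).repeat(n)


def constant_array_full(value, n, dtype="i4"):
    # Replacement pattern: allocate the array and fill it in one call.
    return np.full(n, value, dtype=dtype)


# Both approaches should produce identical arrays.
old = constant_array_repeat(3, 5)
new = constant_array_full(3, 5)
assert np.array_equal(old, new)
assert old.dtype == new.dtype
```

To compare timings at different lengths, something like timeit.timeit(lambda: constant_array_full(3, 100_000), number=1000) can be run against each function; the results will depend on the machine and the NumPy version.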



--
This message was sent by Atlassian Jira
(v8.3.4#803005)
