[jira] [Comment Edited] (ARROW-18400) [Python] Quadratic memory usage of Table.to_pandas with nested data

2023-01-05 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-18400?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17655102#comment-17655102 ] Joris Van den Bossche edited comment on ARROW-18400 at 1/5/23 7:08 PM:

[jira] [Comment Edited] (ARROW-18400) [Python] Quadratic memory usage of Table.to_pandas with nested data

2023-01-05 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-18400?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17655102#comment-17655102 ] Joris Van den Bossche edited comment on ARROW-18400 at 1/5/23 7:07 PM:

[jira] [Commented] (ARROW-18400) [Python] Quadratic memory usage of Table.to_pandas with nested data

2023-01-05 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-18400?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17655102#comment-17655102 ] Joris Van den Bossche commented on ARROW-18400: --- Yes, I think it has to use {{Flatten()}}

[jira] [Resolved] (ARROW-16728) [Python] Switch default and deprecate use_legacy_dataset=True in ParquetDataset

2022-12-23 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-16728?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joris Van den Bossche resolved ARROW-16728. --- Resolution: Fixed Issue resolved by pull request 14052

[jira] [Resolved] (ARROW-18363) [Docs] Include warning when viewing old docs (redirecting to stable/dev docs)

2022-12-23 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-18363?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joris Van den Bossche resolved ARROW-18363. --- Fix Version/s: 11.0.0 Resolution: Fixed Issue resolved by pull

[jira] [Resolved] (ARROW-16337) [Python] Expose parameter that determines to store Arrow schema in Parquet metadata in Python

2022-12-23 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-16337?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joris Van den Bossche resolved ARROW-16337. --- Resolution: Fixed Issue resolved by pull request 13000

[jira] [Resolved] (ARROW-18394) [CI][Python] Nightly Python pandas jobs using latest or upstream_devel fail

2022-12-22 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-18394?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joris Van den Bossche resolved ARROW-18394. --- Resolution: Fixed Issue resolved by pull request 15048

[jira] [Resolved] (ARROW-18272) [pyarrow] ParquetFile does not recognize GCS cloud path as a string

2022-12-22 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-18272?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joris Van den Bossche resolved ARROW-18272. --- Fix Version/s: 11.0.0 Resolution: Fixed Issue resolved by pull

[jira] [Commented] (ARROW-18400) [Python] Quadratic memory usage of Table.to_pandas with nested data

2022-12-22 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-18400?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17651169#comment-17651169 ] Joris Van den Bossche commented on ARROW-18400: --- A small reproducible example to

[jira] [Commented] (ARROW-18400) [Python] Quadratic memory usage of Table.to_pandas with nested data

2022-12-21 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-18400?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17650932#comment-17650932 ] Joris Van den Bossche commented on ARROW-18400: --- Using Alenka's script, I explored it a

[jira] [Updated] (ARROW-8891) [C++] Split non-cast compute kernels into a separate shared library

2022-12-16 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8891?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joris Van den Bossche updated ARROW-8891: - Priority: Critical (was: Major) > [C++] Split non-cast compute kernels into a

[jira] [Commented] (ARROW-18394) [CI][Python] Nightly Python pandas jobs using latest or upstream_devel fail

2022-12-08 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-18394?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17644867#comment-17644867 ] Joris Van den Bossche commented on ARROW-18394: --- For the failure shown above, this seems

[jira] [Created] (ARROW-18428) [Website] Enable github issues on arrow-site repo

2022-12-08 Thread Joris Van den Bossche (Jira)
Joris Van den Bossche created ARROW-18428: - Summary: [Website] Enable github issues on arrow-site repo Key: ARROW-18428 URL: https://issues.apache.org/jira/browse/ARROW-18428 Project: Apache

[jira] [Commented] (ARROW-14799) [C++] Adding tabular pretty printing of Table / RecordBatch

2022-12-07 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-14799?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17644352#comment-17644352 ] Joris Van den Bossche commented on ARROW-14799: --- If we tackle this in C++, it might be

[jira] [Resolved] (ARROW-18123) [Python] Cannot use multi-byte characters in file names in write_table

2022-12-07 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-18123?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joris Van den Bossche resolved ARROW-18123. --- Resolution: Fixed Issue resolved by pull request 14764

[jira] [Updated] (ARROW-18003) [Python] Add sort_by to RecordBatch

2022-12-07 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-18003?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joris Van den Bossche updated ARROW-18003: -- Labels: good-first-issue (was: ) > [Python] Add sort_by to RecordBatch >

[jira] [Updated] (ARROW-18003) [Python] Add sort_by to RecordBatch

2022-12-07 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-18003?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joris Van den Bossche updated ARROW-18003: -- Summary: [Python] Add sort_by to RecordBatch (was: [Python] Add sort_by to

[jira] [Resolved] (ARROW-18280) [C++][Python] Support slicing to arbitrary end in list_slice kernel

2022-12-06 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-18280?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joris Van den Bossche resolved ARROW-18280. --- Resolution: Fixed Issue resolved by pull request 14749

[jira] [Commented] (ARROW-18400) [Python] Quadratic memory usage of Table.to_pandas with nested data

2022-12-02 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-18400?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17642421#comment-17642421 ] Joris Van den Bossche commented on ARROW-18400: --- While combining the chunks before

[jira] [Commented] (ARROW-18265) [C++] Allow FieldPath to work with ListElement

2022-12-01 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-18265?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17641982#comment-17641982 ] Joris Van den Bossche commented on ARROW-18265: --- bq. However, it would be a bit tricky if

[jira] [Commented] (ARROW-18375) MIGRATION: Enable GitHub issue type labels

2022-12-01 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-18375?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17641750#comment-17641750 ] Joris Van den Bossche commented on ARROW-18375: --- bq. I use "Type: enhancement" for

[jira] [Commented] (ARROW-18375) MIGRATION: Enable GitHub issue type labels

2022-11-30 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-18375?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17641378#comment-17641378 ] Joris Van den Bossche commented on ARROW-18375: --- I interpret "enhancement" as an

[jira] [Commented] (ARROW-18375) MIGRATION: Enable GitHub issue type labels

2022-11-30 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-18375?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17641367#comment-17641367 ] Joris Van den Bossche commented on ARROW-18375: --- (I added "Type: test" and "Type: task" as

[jira] [Commented] (ARROW-18375) MIGRATION: Enable GitHub issue type labels

2022-11-30 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-18375?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17641351#comment-17641351 ] Joris Van den Bossche commented on ARROW-18375: --- We should probably also add "Type: test"

[jira] [Commented] (ARROW-18376) MIGRATION: Add component labels to GitHub

2022-11-30 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-18376?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17641152#comment-17641152 ] Joris Van den Bossche commented on ARROW-18376: --- Those are now all present as labels, so

[jira] [Resolved] (ARROW-18125) [Python] Handle pytest 8 deprecations about pytest.warns(None)

2022-11-30 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-18125?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joris Van den Bossche resolved ARROW-18125. --- Resolution: Fixed Issue resolved by pull request 14729

[jira] [Resolved] (ARROW-18399) [Python] Reduce warnings during tests

2022-11-30 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-18399?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joris Van den Bossche resolved ARROW-18399. --- Fix Version/s: 11.0.0 Resolution: Fixed Issue resolved by pull

[jira] [Assigned] (ARROW-18125) [Python] Handle pytest 8 deprecations about pytest.warns(None)

2022-11-30 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-18125?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joris Van den Bossche reassigned ARROW-18125: - Assignee: Miles Granger > [Python] Handle pytest 8 deprecations about

[jira] [Comment Edited] (ARROW-18359) PrettyPrint Improvements

2022-11-29 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-18359?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17640819#comment-17640819 ] Joris Van den Bossche edited comment on ARROW-18359 at 11/29/22 4:27 PM:

[jira] [Commented] (ARROW-18359) PrettyPrint Improvements

2022-11-29 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-18359?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17640819#comment-17640819 ] Joris Van den Bossche commented on ARROW-18359: --- Also linking

[jira] [Updated] (ARROW-17326) [Go][FlightSQL] Add Support for FlightSQL to Go

2022-11-29 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-17326?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joris Van den Bossche updated ARROW-17326: -- Component/s: (was: SQL) > [Go][FlightSQL] Add Support for FlightSQL to Go

[jira] [Updated] (ARROW-17359) [Go][FlightSQL] Create SQLite example

2022-11-29 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-17359?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joris Van den Bossche updated ARROW-17359: -- Component/s: (was: SQL) > [Go][FlightSQL] Create SQLite example >

[jira] [Updated] (ARROW-17325) AQE should use available column statistics from completed query stages

2022-11-29 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-17325?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joris Van den Bossche updated ARROW-17325: -- Component/s: Rust - Ballista (was: SQL) > AQE should use

[jira] [Updated] (ARROW-18234) [Swift] Swift implementation of Arrow

2022-11-29 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-18234?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joris Van den Bossche updated ARROW-18234: -- Component/s: (was: Swift) > [Swift] Swift implementation of Arrow >

[jira] [Updated] (ARROW-2631) [Dart] Begin a Dart language library

2022-11-29 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-2631?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joris Van den Bossche updated ARROW-2631: - Component/s: (was: Dart) > [Dart] Begin a Dart language library >

[jira] [Commented] (ARROW-18380) MIGRATION: Enable bot handling of GitHub issue linked PRs

2022-11-24 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-18380?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17638335#comment-17638335 ] Joris Van den Bossche commented on ARROW-18380: --- > Here is an example of GitHub bot

[jira] [Assigned] (ARROW-18394) [CI][Python] Nightly Python pandas jobs using latest or upstream_devel fail

2022-11-24 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-18394?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joris Van den Bossche reassigned ARROW-18394: - Assignee: Joris Van den Bossche > [CI][Python] Nightly Python pandas

[jira] [Commented] (ARROW-18399) [Python] Reduce warnings during tests

2022-11-24 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-18399?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17638262#comment-17638262 ] Joris Van den Bossche commented on ARROW-18399: --- We have ARROW-17651 and ARROW-18125

[jira] [Resolved] (ARROW-18373) MIGRATION: Enable multiple component selection in issue templates

2022-11-24 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-18373?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joris Van den Bossche resolved ARROW-18373. --- Fix Version/s: 11.0.0 Resolution: Fixed Issue resolved by pull

[jira] [Comment Edited] (ARROW-18265) [C++] Allow FieldPath to work with ListElement

2022-11-23 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-18265?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17637679#comment-17637679 ] Joris Van den Bossche edited comment on ARROW-18265 at 11/23/22 10:02 AM:

[jira] [Commented] (ARROW-18265) [C++] Allow FieldPath to work with ListElement

2022-11-23 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-18265?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17637679#comment-17637679 ] Joris Van den Bossche commented on ARROW-18265: --- Yes, but the square-bracket form is

[jira] [Commented] (ARROW-18265) [C++] Allow FieldPath to work with ListElement

2022-11-22 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-18265?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17637388#comment-17637388 ] Joris Van den Bossche commented on ARROW-18265: --- bq. I think you are referring to this:

[jira] [Comment Edited] (ARROW-18265) [C++] Allow FieldPath to work with ListElement

2022-11-22 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-18265?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17637292#comment-17637292 ] Joris Van den Bossche edited comment on ARROW-18265 at 11/22/22 2:42 PM:

[jira] [Commented] (ARROW-18265) [C++] Allow FieldPath to work with ListElement

2022-11-22 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-18265?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17637292#comment-17637292 ] Joris Van den Bossche commented on ARROW-18265: --- [~westonpace] one aspect to explicitly

[jira] [Resolved] (ARROW-18379) [Python] Change warnings to _warnings in _plasma_store_entry_point

2022-11-22 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-18379?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joris Van den Bossche resolved ARROW-18379. --- Resolution: Fixed Issue resolved by pull request 14695

[jira] [Resolved] (ARROW-17989) [C++] Enable struct_field kernel to accept string field names

2022-11-22 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-17989?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joris Van den Bossche resolved ARROW-17989. --- Fix Version/s: 11.0.0 Resolution: Fixed Issue resolved by pull

[jira] [Resolved] (ARROW-18173) [Python] Drop older versions of Pandas (<1.0)

2022-11-22 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-18173?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joris Van den Bossche resolved ARROW-18173. --- Resolution: Fixed Issue resolved by pull request 14631

[jira] [Resolved] (ARROW-18341) [Doc][Python] Update note about bundling Arrow C++ on Windows

2022-11-21 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-18341?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joris Van den Bossche resolved ARROW-18341. --- Resolution: Fixed Issue resolved by pull request 14660

[jira] [Resolved] (ARROW-18225) [Python] write_metadata does not fully use **kwargs

2022-11-21 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-18225?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joris Van den Bossche resolved ARROW-18225. --- Fix Version/s: 11.0.0 Resolution: Fixed Issue resolved by pull

[jira] [Commented] (ARROW-18363) [Docs] Include warning when viewing old docs (redirecting to stable/dev docs)

2022-11-18 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-18363?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17635876#comment-17635876 ] Joris Van den Bossche commented on ARROW-18363: --- There is also some work to include this

[jira] [Commented] (ARROW-18363) [Docs] Include warning when viewing old docs (redirecting to stable/dev docs)

2022-11-18 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-18363?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17635873#comment-17635873 ] Joris Van den Bossche commented on ARROW-18363: --- For example the MNE docs mentioned above

[jira] [Commented] (ARROW-18363) [Docs] Include warning when viewing old docs (redirecting to stable/dev docs)

2022-11-18 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-18363?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17635864#comment-17635864 ] Joris Van den Bossche commented on ARROW-18363: --- Renamed the issue to not be specific

[jira] [Updated] (ARROW-18363) [Docs] Include warning when viewing old docs (redirecting to stable/dev docs)

2022-11-18 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-18363?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joris Van den Bossche updated ARROW-18363: -- Summary: [Docs] Include warning when viewing old docs (redirecting to

[jira] [Created] (ARROW-18363) [Docs] Include warning when viewing old contributing docs (redirecting to dev docs)

2022-11-18 Thread Joris Van den Bossche (Jira)
Joris Van den Bossche created ARROW-18363: - Summary: [Docs] Include warning when viewing old contributing docs (redirecting to dev docs) Key: ARROW-18363 URL:

[jira] [Commented] (ARROW-18298) [Python] datetime shifted when using pyarrow.Table.from_pandas to load a pandas DataFrame containing datetime with timezone

2022-11-18 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-18298?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17635819#comment-17635819 ] Joris Van den Bossche commented on ARROW-18298: --- bq. I thought initially it was just how

[jira] [Updated] (ARROW-17136) [C++] HadoopFileSystem open_append_stream throwing an error if file does not exist

2022-11-18 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-17136?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joris Van den Bossche updated ARROW-17136: -- Summary: [C++] HadoopFileSystem open_append_stream throwing an error if file

[jira] [Updated] (ARROW-17136) [C++] HadoopFileSystem open_append_stream throwing an error if file does not exist

2022-11-18 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-17136?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joris Van den Bossche updated ARROW-17136: -- Labels: good-first-issue (was: ) > [C++] HadoopFileSystem open_append_stream

[jira] [Updated] (ARROW-17136) [C++] open_append_stream throwing an error if file does not exist

2022-11-18 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-17136?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joris Van den Bossche updated ARROW-17136: -- Component/s: C++ (was: Python) > [C++]

[jira] [Updated] (ARROW-17136) [C++] open_append_stream throwing an error if file does not exist

2022-11-18 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-17136?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joris Van den Bossche updated ARROW-17136: -- Summary: [C++] open_append_stream throwing an error if file does not exist

[jira] [Comment Edited] (ARROW-18276) [Python] Reading from hdfs using pyarrow 10.0.0 throws OSError: [Errno 22] Opening HDFS file

2022-11-18 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-18276?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17635303#comment-17635303 ] Joris Van den Bossche edited comment on ARROW-18276 at 11/18/22 9:43 AM:

[jira] [Commented] (ARROW-18340) [Python] PyArrow C++ header files no longer always included in installed pyarrow

2022-11-16 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-18340?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17634814#comment-17634814 ] Joris Van den Bossche commented on ARROW-18340: --- cc [~kou] [~raulcd] > [Python] PyArrow

[jira] [Updated] (ARROW-18340) [Python] PyArrow C++ header files no longer always included in installed pyarrow

2022-11-16 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-18340?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joris Van den Bossche updated ARROW-18340: -- Affects Version/s: 10.0.0 > [Python] PyArrow C++ header files no longer

[jira] [Updated] (ARROW-18340) [Python] PyArrow C++ header files no longer always included in installed pyarrow

2022-11-16 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-18340?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joris Van den Bossche updated ARROW-18340: -- Component/s: Python > [Python] PyArrow C++ header files no longer always

[jira] [Updated] (ARROW-18340) [Python] PyArrow C++ header files no longer always included in installed pyarrow

2022-11-16 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-18340?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joris Van den Bossche updated ARROW-18340: -- Fix Version/s: 11.0.0 > [Python] PyArrow C++ header files no longer always

[jira] [Created] (ARROW-18340) [Python] PyArrow C++ header files no longer always included in installed pyarrow

2022-11-16 Thread Joris Van den Bossche (Jira)
Joris Van den Bossche created ARROW-18340: - Summary: [Python] PyArrow C++ header files no longer always included in installed pyarrow Key: ARROW-18340 URL:

[jira] [Updated] (ARROW-18129) [Python] get_include() gives wrong directory in conda environment

2022-11-16 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-18129?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joris Van den Bossche updated ARROW-18129: -- Summary: [Python] get_include() gives wrong directory in conda environment

[jira] [Commented] (ARROW-15716) [Dataset][Python] Parse a list of fragment paths to gather filters

2022-11-15 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-15716?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17634264#comment-17634264 ] Joris Van den Bossche commented on ARROW-15716: --- To just OR-combine the different

[jira] [Resolved] (ARROW-18264) [Python] Add Time64Scalar.value field

2022-11-15 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-18264?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joris Van den Bossche resolved ARROW-18264. --- Fix Version/s: 11.0.0 Resolution: Fixed Issue resolved by pull

[jira] [Created] (ARROW-18329) [Python][CI] Support ORC in Windows wheels

2022-11-15 Thread Joris Van den Bossche (Jira)
Joris Van den Bossche created ARROW-18329: - Summary: [Python][CI] Support ORC in Windows wheels Key: ARROW-18329 URL: https://issues.apache.org/jira/browse/ARROW-18329 Project: Apache Arrow

[jira] [Resolved] (ARROW-18257) [Python] array of time64 type changes from Time64Type to DataType

2022-11-14 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-18257?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joris Van den Bossche resolved ARROW-18257. --- Resolution: Fixed Issue resolved by pull request 14633

[jira] [Closed] (ARROW-9538) [Python] Allow pyarrow.filesystem.resolve_filesystem_and_path to parse S3 URL

2022-11-10 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9538?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joris Van den Bossche closed ARROW-9538. Resolution: Won't Fix > [Python] Allow

[jira] [Commented] (ARROW-9538) [Python] Allow pyarrow.filesystem.resolve_filesystem_and_path to parse S3 URL

2022-11-10 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9538?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17631653#comment-17631653 ] Joris Van den Bossche commented on ARROW-9538: -- This is working for {{pyarrow.fs}}, and so

[jira] [Updated] (ARROW-18297) [Python] from/to pandas with MultiIndex raises incorrectly

2022-11-10 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-18297?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joris Van den Bossche updated ARROW-18297: -- Summary: [Python] from/to pandas with MultiIndex raises incorrectly (was:

[jira] [Resolved] (ARROW-18164) [Python] Dataset scanner does not follow default memory pool setting

2022-11-09 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-18164?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joris Van den Bossche resolved ARROW-18164. --- Fix Version/s: 11.0.0 Resolution: Fixed Issue resolved by pull

[jira] [Resolved] (ARROW-18229) [C++][Python] RecordBatchReader can be created with a 'dict' schema which then crashes on use

2022-11-09 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-18229?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joris Van den Bossche resolved ARROW-18229. --- Fix Version/s: 11.0.0 Resolution: Fixed Issue resolved by pull

[jira] [Resolved] (ARROW-17893) [Python] Bug: Wrong reading of timedelta

2022-11-09 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-17893?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joris Van den Bossche resolved ARROW-17893. --- Fix Version/s: 11.0.0 Resolution: Fixed Issue resolved by pull

[jira] [Updated] (ARROW-18295) [C++] FieldRef::FindAll/FindOne(DataType) improve error

2022-11-09 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-18295?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joris Van den Bossche updated ARROW-18295: -- Summary: [C++] FieldRef::FindAll/FindOne(DataType) improve error (was: [C++]

[jira] [Resolved] (ARROW-18238) [Python] Improve docs for S3FileSystem / bucket region resolution

2022-11-09 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-18238?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joris Van den Bossche resolved ARROW-18238. --- Resolution: Fixed Issue resolved by pull request 14599

[jira] [Resolved] (ARROW-17360) [Python] Order of columns in pyarrow.feather.read_table

2022-11-09 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-17360?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joris Van den Bossche resolved ARROW-17360. --- Fix Version/s: 11.0.0 Resolution: Fixed Issue resolved by pull

[jira] [Updated] (ARROW-18173) [Python] Drop older versions of Pandas (<1.0)

2022-11-09 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-18173?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joris Van den Bossche updated ARROW-18173: -- Priority: Critical (was: Major) > [Python] Drop older versions of Pandas

[jira] [Created] (ARROW-18293) [C++] Proxy memory pool crashes with Dataset scanning

2022-11-09 Thread Joris Van den Bossche (Jira)
Joris Van den Bossche created ARROW-18293: - Summary: [C++] Proxy memory pool crashes with Dataset scanning Key: ARROW-18293 URL: https://issues.apache.org/jira/browse/ARROW-18293 Project:

[jira] [Resolved] (ARROW-17832) [Python] Construct MapArray from sequence of dicts (instead of list of tuples)

2022-11-09 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-17832?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joris Van den Bossche resolved ARROW-17832. --- Fix Version/s: 11.0.0 Resolution: Fixed Issue resolved by pull

[jira] [Resolved] (ARROW-18246) [Python][Docs] PyArrow table join docstring typos for left and right suffix arguments

2022-11-09 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-18246?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joris Van den Bossche resolved ARROW-18246. --- Resolution: Fixed Issue resolved by pull request 14591

[jira] [Resolved] (ARROW-17892) [CI] Use Python 3.10 in AppVeyor build

2022-11-09 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-17892?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joris Van den Bossche resolved ARROW-17892. --- Fix Version/s: 11.0.0 Resolution: Fixed Issue resolved by pull

[jira] [Commented] (ARROW-18226) [Python] pyarrow.lib.ArrowInvalid: Not a Feather V1 or Arrow IPC file

2022-11-09 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-18226?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17630860#comment-17630860 ] Joris Van den Bossche commented on ARROW-18226: --- Could you please show this as the output

[jira] [Commented] (ARROW-18123) [Python] Cannot use multi-byte characters in file names in write_table

2022-11-09 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-18123?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17630859#comment-17630859 ] Joris Van den Bossche commented on ARROW-18123: --- Yes, we certainly support relative paths.

[jira] [Updated] (ARROW-18123) [Python] Cannot use multi-byte characters in file names in write_table

2022-11-09 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-18123?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joris Van den Bossche updated ARROW-18123: -- Fix Version/s: 11.0.0 > [Python] Cannot use multi-byte characters in file

[jira] [Commented] (ARROW-18257) [Python] array of time64 type changes from Time64Type to DataType

2022-11-07 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-18257?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17630252#comment-17630252 ] Joris Van den Bossche commented on ARROW-18257: --- Yes, thanks for the report! This case

[jira] [Updated] (ARROW-18257) [Python] array of time64 type changes from Time64Type to DataType

2022-11-07 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-18257?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joris Van den Bossche updated ARROW-18257: -- Fix Version/s: 11.0.0 > [Python] array of time64 type changes from Time64Type

[jira] [Updated] (ARROW-17820) [C++] Implement arithmetic kernels on List(number)

2022-11-04 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-17820?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joris Van den Bossche updated ARROW-17820: -- Summary: [C++] Implement arithmetic kernels on List(number) (was: Implement

[jira] [Commented] (ARROW-17820) Implement arithmetic kernels on List(number)

2022-11-04 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-17820?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17629186#comment-17629186 ] Joris Van den Bossche commented on ARROW-17820: --- It would be nice if we would have a way

[jira] [Updated] (ARROW-17820) Implement arithmetic kernels on List(number)

2022-11-04 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-17820?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joris Van den Bossche updated ARROW-17820: -- Labels: kernel query-engine (was: ) > Implement arithmetic kernels on

[jira] [Commented] (ARROW-18251) [CI][Python] AMD64 macOS 11 Python 3 job fails on master on pip install

2022-11-04 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-18251?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17629013#comment-17629013 ] Joris Van den Bossche commented on ARROW-18251: --- Not directly an idea. This build has been

[jira] [Commented] (ARROW-18185) [C++][Compute] Support KEEP_NULL option for compute::Filter

2022-11-04 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-18185?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17628751#comment-17628751 ] Joris Van den Bossche commented on ARROW-18185: --- > What about implementing this as an

[jira] [Updated] (ARROW-12739) [C++] Function to combine Arrays row-wise into ListArray

2022-11-04 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-12739?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joris Van den Bossche updated ARROW-12739: -- Labels: kernel query-engine (was: ) > [C++] Function to combine Arrays

[jira] [Commented] (ARROW-18229) [C++][Python] RecordBatchReader can be created with a 'dict' schema which then crashes on use

2022-11-03 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-18229?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17628423#comment-17628423 ] Joris Van den Bossche commented on ARROW-18229: --- I opened a PR to just ensure the argument

[jira] [Commented] (ARROW-18229) [C++][Python] RecordBatchReader can be created with a 'dict' schema which then crashes on use

2022-11-03 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-18229?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17628416#comment-17628416 ] Joris Van den Bossche commented on ARROW-18229: --- What is causing the segfault here is

[jira] [Assigned] (ARROW-18229) [C++][Python] RecordBatchReader can be created with a 'dict' schema which then crashes on use

2022-11-03 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-18229?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joris Van den Bossche reassigned ARROW-18229: - Assignee: Joris Van den Bossche > [C++][Python] RecordBatchReader can

[jira] [Commented] (ARROW-18226) [Python] pyarrow.lib.ArrowInvalid: Not a Feather V1 or Arrow IPC file

2022-11-03 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-18226?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17628380#comment-17628380 ] Joris Van den Bossche commented on ARROW-18226: --- Can you print {{pyarrow.__version__}} in
