[jira] [Resolved] (ARROW-9593) [Python] Add custom pickle reducers for DictionaryScalar

2020-09-28 Thread Krisztian Szucs (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9593?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Krisztian Szucs resolved ARROW-9593. Resolution: Fixed > [Python] Add custom pickle reducers for DictionaryScalar >

[jira] [Resolved] (ARROW-6281) [Python] Produce chunked arrays for nested types in pyarrow.array

2020-09-28 Thread Krisztian Szucs (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6281?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Krisztian Szucs resolved ARROW-6281. Resolution: Fixed > [Python] Produce chunked arrays for nested types in pyarrow.array > ---

[jira] [Resolved] (ARROW-10000) [C++][Python] Support constructing StructArray from list of key-value pairs

2020-09-28 Thread Krisztian Szucs (Jira)
[ https://issues.apache.org/jira/browse/ARROW-10000?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Krisztian Szucs resolved ARROW-10000. - Resolution: Fixed > [C++][Python] Support constructing StructArray from list of key-valu

[jira] [Assigned] (ARROW-10000) [C++][Python] Support constructing StructArray from list of key-value pairs

2020-09-28 Thread Krisztian Szucs (Jira)
[ https://issues.apache.org/jira/browse/ARROW-10000?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Krisztian Szucs reassigned ARROW-10000: --- Assignee: Krisztian Szucs > [C++][Python] Support constructing StructArray from lis

[jira] [Resolved] (ARROW-9996) [C++] Dictionary is unset when calling DictionaryArray.GetScalar for null values

2020-09-28 Thread Krisztian Szucs (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9996?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Krisztian Szucs resolved ARROW-9996. Resolution: Fixed > [C++] Dictionary is unset when calling DictionaryArray.GetScalar for nu

[jira] [Updated] (ARROW-9976) [Python] ArrowCapacityError when doing Table.from_pandas with large dataframe

2020-09-28 Thread Krisztian Szucs (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9976?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Krisztian Szucs updated ARROW-9976: --- Fix Version/s: 2.0.0 > [Python] ArrowCapacityError when doing Table.from_pandas with large da

[jira] [Resolved] (ARROW-2367) [Python] ListArray has trouble with sizes greater than kMaximumCapacity

2020-09-28 Thread Krisztian Szucs (Jira)
[ https://issues.apache.org/jira/browse/ARROW-2367?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Krisztian Szucs resolved ARROW-2367. Resolution: Fixed > [Python] ListArray has trouble with sizes greater than kMaximumCapacity

[jira] [Updated] (ARROW-9999) [Python] Support constructing dictionary array directly through pa.array()

2020-09-28 Thread Krisztian Szucs (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9999?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Krisztian Szucs updated ARROW-9999: --- Fix Version/s: 2.0.0 > [Python] Support constructing dictionary array directly through pa.arr

[jira] [Resolved] (ARROW-9976) [Python] ArrowCapacityError when doing Table.from_pandas with large dataframe

2020-09-28 Thread Krisztian Szucs (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9976?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Krisztian Szucs resolved ARROW-9976. Resolution: Fixed > [Python] ArrowCapacityError when doing Table.from_pandas with large dat

[jira] [Resolved] (ARROW-9999) [Python] Support constructing dictionary array directly through pa.array()

2020-09-28 Thread Krisztian Szucs (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9999?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Krisztian Szucs resolved ARROW-9999. Resolution: Fixed > [Python] Support constructing dictionary array directly through pa.arra

[jira] [Resolved] (ARROW-9994) [C++][Python] Auto chunking nested array containing binary-like fields result malformed output

2020-09-28 Thread Krisztian Szucs (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9994?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Krisztian Szucs resolved ARROW-9994. Resolution: Fixed > [C++][Python] Auto chunking nested array containing binary-like fields

[jira] [Updated] (ARROW-9993) [Python] Tzinfo - string roundtrip fails on pytz.StaticTzInfo objects

2020-09-28 Thread Krisztian Szucs (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9993?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Krisztian Szucs updated ARROW-9993: --- Fix Version/s: 2.0.0 > [Python] Tzinfo - string roundtrip fails on pytz.StaticTzInfo objects

[jira] [Updated] (ARROW-9994) [C++][Python] Auto chunking nested array containing binary-like fields result malformed output

2020-09-28 Thread Krisztian Szucs (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9994?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Krisztian Szucs updated ARROW-9994: --- Fix Version/s: 2.0.0 > [C++][Python] Auto chunking nested array containing binary-like fields

[jira] [Resolved] (ARROW-9993) [Python] Tzinfo - string roundtrip fails on pytz.StaticTzInfo objects

2020-09-28 Thread Krisztian Szucs (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9993?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Krisztian Szucs resolved ARROW-9993. Resolution: Fixed > [Python] Tzinfo - string roundtrip fails on pytz.StaticTzInfo objects >

[jira] [Commented] (ARROW-9993) [Python] Tzinfo - string roundtrip fails on pytz.StaticTzInfo objects

2020-09-28 Thread Krisztian Szucs (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9993?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17203117#comment-17203117 ] Krisztian Szucs commented on ARROW-9993: My goal was to prevent raising from a ro

[jira] [Commented] (ARROW-10058) [C++] Investigate performance of LevelsToBitmap without BMI2

2020-09-28 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-10058?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17203136#comment-17203136 ] Antoine Pitrou commented on ARROW-10058: Thanks for looking at this! Two questi

[jira] [Created] (ARROW-10114) arrow::read_json_arrow gives Error in Table__to_dataframe(x, use_threads = option_use_threads()) :SET_VECTOR_ELT() can only be applied to a 'list', not a 'integer'

2020-09-28 Thread Markus Skyttner (Jira)
Markus Skyttner created ARROW-10114: --- Summary: arrow::read_json_arrow gives Error in Table__to_dataframe(x, use_threads = option_use_threads()) :SET_VECTOR_ELT() can only be applied to a 'list', not a 'integer' Key: ARROW-10114

[jira] [Commented] (ARROW-10058) [C++] Investigate performance of LevelsToBitmap without BMI2

2020-09-28 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-10058?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17203151#comment-17203151 ] Antoine Pitrou commented on ARROW-10058: Here is an updated patch with 5 bits lo

[jira] [Updated] (ARROW-10114) arrow::read_json_arrow gives Error in Table__to_dataframe(x, use_threads = option_use_threads()) :SET_VECTOR_ELT() can only be applied to a 'list', not a 'integer'

2020-09-28 Thread Markus Skyttner (Jira)
[ https://issues.apache.org/jira/browse/ARROW-10114?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Markus Skyttner updated ARROW-10114: Description: A .jsonl file (newline separated JSON) created from open data available at [

[jira] [Resolved] (ARROW-8618) [C++] ASSIGN_OR_RAISE should move its argument

2020-09-28 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8618?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Antoine Pitrou resolved ARROW-8618. --- Resolution: Fixed Issue resolved by pull request 8264 [https://github.com/apache/arrow/pull/8

[jira] [Comment Edited] (ARROW-10058) [C++] Investigate performance of LevelsToBitmap without BMI2

2020-09-28 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-10058?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17203151#comment-17203151 ] Antoine Pitrou edited comment on ARROW-10058 at 9/28/20, 12:04 PM: ---

[jira] [Created] (ARROW-10115) [C++] CSV empty quoted string is treated as NULL

2020-09-28 Thread Maciej (Jira)
Maciej created ARROW-10115: -- Summary: [C++] CSV empty quoted string is treated as NULL Key: ARROW-10115 URL: https://issues.apache.org/jira/browse/ARROW-10115 Project: Apache Arrow Issue Type: Impro

[jira] [Updated] (ARROW-10115) [C++] CSV empty quoted string is treated as NULL

2020-09-28 Thread Maciej (Jira)
[ https://issues.apache.org/jira/browse/ARROW-10115?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Maciej updated ARROW-10115: --- Description: When parsing my CSV I have set {color:#267f99}ConvertOptions{color}::s{color:#001080}trings_can

[jira] [Updated] (ARROW-10115) [C++] CSV empty quoted string is treated as NULL

2020-09-28 Thread Maciej (Jira)
[ https://issues.apache.org/jira/browse/ARROW-10115?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Maciej updated ARROW-10115: --- Description: When parsing my CSV I have set {color:#267f99}ConvertOptions{color}::s{color:#001080}trings_can

[jira] [Updated] (ARROW-10115) [C++] CSV empty quoted string is treated as NULL

2020-09-28 Thread Maciej (Jira)
[ https://issues.apache.org/jira/browse/ARROW-10115?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Maciej updated ARROW-10115: --- Description: When parsing my CSV I have set {color:#267f99}ConvertOptions{color}::s{color:#001080}trings_can

[jira] [Commented] (ARROW-10115) [C++] CSV empty quoted string is treated as NULL

2020-09-28 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-10115?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17203193#comment-17203193 ] Antoine Pitrou commented on ARROW-10115: That sounds like a reasonable request.

[jira] [Updated] (ARROW-6043) [Python] Array equals returns incorrectly if NaNs are in arrays

2020-09-28 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6043?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-6043: -- Labels: pull-request-available (was: ) > [Python] Array equals returns incorrectly if NaNs are

[jira] [Created] (ARROW-10116) [Python][Packaging] Fix gRPC linking error in macOS wheels builds

2020-09-28 Thread Krisztian Szucs (Jira)
Krisztian Szucs created ARROW-10116: --- Summary: [Python][Packaging] Fix gRPC linking error in macOS wheels builds Key: ARROW-10116 URL: https://issues.apache.org/jira/browse/ARROW-10116 Project: Apac

[jira] [Updated] (ARROW-10116) [Python][Packaging] Fix gRPC linking error in macOS wheels builds

2020-09-28 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-10116?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-10116: --- Labels: pull-request-available (was: ) > [Python][Packaging] Fix gRPC linking error in macO

[jira] [Updated] (ARROW-9295) [Archery] Support rust clippy in the lint command

2020-09-28 Thread Krisztian Szucs (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9295?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Krisztian Szucs updated ARROW-9295: --- Fix Version/s: (was: 2.0.0) 3.0.0 > [Archery] Support rust clippy in t

[jira] [Updated] (ARROW-8459) [Dev][Archery] Use a more recent cmake-format

2020-09-28 Thread Krisztian Szucs (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8459?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Krisztian Szucs updated ARROW-8459: --- Fix Version/s: (was: 2.0.0) 3.0.0 > [Dev][Archery] Use a more recent c

[jira] [Created] (ARROW-10117) [C++] Implement work-stealing scheduler / multiple queues in ThreadPool

2020-09-28 Thread Wes McKinney (Jira)
Wes McKinney created ARROW-10117: Summary: [C++] Implement work-stealing scheduler / multiple queues in ThreadPool Key: ARROW-10117 URL: https://issues.apache.org/jira/browse/ARROW-10117 Project: Apac

[jira] [Updated] (ARROW-10114) [R] arrow::read_json_arrow gives Error in Table__to_dataframe(x, use_threads = option_use_threads()) :SET_VECTOR_ELT() can only be applied to a 'list', not a 'integer'

2020-09-28 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-10114?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-10114: - Summary: [R] arrow::read_json_arrow gives Error in Table__to_dataframe(x, use_threads = option_u

[jira] [Assigned] (ARROW-10116) [Python][Packaging] Fix gRPC linking error in macOS wheels builds

2020-09-28 Thread Apache Arrow JIRA Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-10116?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Arrow JIRA Bot reassigned ARROW-10116: - Assignee: Krisztian Szucs (was: Apache Arrow JIRA Bot) > [Python][Pack

[jira] [Assigned] (ARROW-10116) [Python][Packaging] Fix gRPC linking error in macOS wheels builds

2020-09-28 Thread Apache Arrow JIRA Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-10116?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Arrow JIRA Bot reassigned ARROW-10116: - Assignee: Apache Arrow JIRA Bot (was: Krisztian Szucs) > [Python][Pack

[jira] [Commented] (ARROW-10058) [C++] Investigate performance of LevelsToBitmap without BMI2

2020-09-28 Thread Yibo Cai (Jira)
[ https://issues.apache.org/jira/browse/ARROW-10058?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17203273#comment-17203273 ] Yibo Cai commented on ARROW-10058: -- Thanks. This is a quick version to verify the idea.

[jira] [Created] (ARROW-10118) [Rust] [DataFusion] Add support for JSON data sources

2020-09-28 Thread Andy Grove (Jira)
Andy Grove created ARROW-10118: -- Summary: [Rust] [DataFusion] Add support for JSON data sources Key: ARROW-10118 URL: https://issues.apache.org/jira/browse/ARROW-10118 Project: Apache Arrow Issu

[jira] [Commented] (ARROW-10115) [C++] CSV empty quoted string is treated as NULL

2020-09-28 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-10115?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17203279#comment-17203279 ] Neal Richardson commented on ARROW-10115: - If ParseOptions::quoting is true, the

[jira] [Commented] (ARROW-10115) [C++] CSV empty quoted string is treated as NULL

2020-09-28 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-10115?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17203283#comment-17203283 ] Antoine Pitrou commented on ARROW-10115: > If ParseOptions::quoting is true, the

[jira] [Updated] (ARROW-5679) [Python] Drop Python 3.5 from support matrix

2020-09-28 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-5679?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-5679: Fix Version/s: 3.0.0 > [Python] Drop Python 3.5 from support matrix > -

[jira] [Commented] (ARROW-5679) [Python] Drop Python 3.5 from support matrix

2020-09-28 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-5679?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17203292#comment-17203292 ] Wes McKinney commented on ARROW-5679: - I'd support dropping Python 3.5 after 2.0.0 si

[jira] [Commented] (ARROW-1846) [C++] Implement "any" reduction kernel for boolean data

2020-09-28 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-1846?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17203293#comment-17203293 ] Wes McKinney commented on ARROW-1846: - It is, but it just needs to be able to short-c

[jira] [Created] (ARROW-10119) [C++] Fix Parquet crashes on invalid input (OSS-Fuzz)

2020-09-28 Thread Antoine Pitrou (Jira)
Antoine Pitrou created ARROW-10119: -- Summary: [C++] Fix Parquet crashes on invalid input (OSS-Fuzz) Key: ARROW-10119 URL: https://issues.apache.org/jira/browse/ARROW-10119 Project: Apache Arrow

[jira] [Updated] (ARROW-10119) [C++] Fix Parquet crashes on invalid input (OSS-Fuzz)

2020-09-28 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-10119?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-10119: --- Labels: pull-request-available (was: ) > [C++] Fix Parquet crashes on invalid input (OSS-Fu

[jira] [Assigned] (ARROW-10119) [C++] Fix Parquet crashes on invalid input (OSS-Fuzz)

2020-09-28 Thread Apache Arrow JIRA Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-10119?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Arrow JIRA Bot reassigned ARROW-10119: - Assignee: Apache Arrow JIRA Bot (was: Antoine Pitrou) > [C++] Fix Parq

[jira] [Assigned] (ARROW-10119) [C++] Fix Parquet crashes on invalid input (OSS-Fuzz)

2020-09-28 Thread Apache Arrow JIRA Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-10119?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Arrow JIRA Bot reassigned ARROW-10119: - Assignee: Antoine Pitrou (was: Apache Arrow JIRA Bot) > [C++] Fix Parq

[jira] [Updated] (ARROW-9651) [C++][Dataset] Debug segfault in dataset writing on 32-bit mingw (RTools 35)

2020-09-28 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9651?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neal Richardson updated ARROW-9651: --- Fix Version/s: (was: 2.0.0) > [C++][Dataset] Debug segfault in dataset writing on 32-bit

[jira] [Updated] (ARROW-8661) [C++][Gandiva] Reduce number of files and headers

2020-09-28 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8661?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neal Richardson updated ARROW-8661: --- Fix Version/s: (was: 2.0.0) 3.0.0 > [C++][Gandiva] Reduce number of fi

[jira] [Assigned] (ARROW-9147) [C++][Dataset] Support null -> other type promotion in Dataset scanning

2020-09-28 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9147?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neal Richardson reassigned ARROW-9147: -- Assignee: Ben Kietzman > [C++][Dataset] Support null -> other type promotion in Datase

[jira] [Assigned] (ARROW-10057) [C++] Add Parquet-Arrow roundtrip tests for nested data

2020-09-28 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-10057?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Antoine Pitrou reassigned ARROW-10057: -- Assignee: Antoine Pitrou > [C++] Add Parquet-Arrow roundtrip tests for nested data >

[jira] [Updated] (ARROW-7090) [C++] AssertFieldEqual (and friends) doesn't show metadata on failure

2020-09-28 Thread Ben Kietzman (Jira)
[ https://issues.apache.org/jira/browse/ARROW-7090?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ben Kietzman updated ARROW-7090: Fix Version/s: (was: 2.0.0) 3.0.0 > [C++] AssertFieldEqual (and friends) doe

[jira] [Created] (ARROW-10120) [C++][Parquet] Create reading benchmarks for 2-level nested data

2020-09-28 Thread Antoine Pitrou (Jira)
Antoine Pitrou created ARROW-10120: -- Summary: [C++][Parquet] Create reading benchmarks for 2-level nested data Key: ARROW-10120 URL: https://issues.apache.org/jira/browse/ARROW-10120 Project: Apache

[jira] [Updated] (ARROW-7372) [C++] Allow creating dictionary array from simple JSON

2020-09-28 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-7372?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Antoine Pitrou updated ARROW-7372: -- Fix Version/s: (was: 2.0.0) 3.0.0 > [C++] Allow creating dictionary arra

[jira] [Closed] (ARROW-9208) [C++] SlowInputStream test failure in GitHub Actions

2020-09-28 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9208?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Antoine Pitrou closed ARROW-9208. - Fix Version/s: (was: 2.0.0) Resolution: Not A Problem I'm closing this right now as th

[jira] [Updated] (ARROW-9226) [Python] pyarrow.fs.HadoopFileSystem - retrieve options from core-site.xml or hdfs-site.xml if available

2020-09-28 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9226?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Antoine Pitrou updated ARROW-9226: -- Fix Version/s: (was: 2.0.0) 3.0.0 > [Python] pyarrow.fs.HadoopFileSystem

[jira] [Updated] (ARROW-9633) [C++] Do not toggle memory mapping globally in LocalFileSystem

2020-09-28 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9633?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Antoine Pitrou updated ARROW-9633: -- Fix Version/s: (was: 2.0.0) 3.0.0 > [C++] Do not toggle memory mapping g

[jira] [Assigned] (ARROW-9941) [Python] Better string representation for extension types

2020-09-28 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9941?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Antoine Pitrou reassigned ARROW-9941: - Assignee: Antoine Pitrou > [Python] Better string representation for extension types > -

[jira] [Updated] (ARROW-9421) [C++][Parquet] Redundancies SchemaManifest::GetFieldIndices

2020-09-28 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9421?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Antoine Pitrou updated ARROW-9421: -- Fix Version/s: (was: 2.0.0) 3.0.0 > [C++][Parquet] Redundancies SchemaMa

[jira] [Assigned] (ARROW-10008) [Python] pyarrow.parquet.read_table fails with predicate pushdown on categorical data with use_legacy_dataset=False

2020-09-28 Thread Ben Kietzman (Jira)
[ https://issues.apache.org/jira/browse/ARROW-10008?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ben Kietzman reassigned ARROW-10008: Assignee: Ben Kietzman > [Python] pyarrow.parquet.read_table fails with predicate pushdow

[jira] [Updated] (ARROW-9421) [C++][Parquet] Redundancies SchemaManifest::GetFieldIndices

2020-09-28 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9421?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Antoine Pitrou updated ARROW-9421: -- Priority: Trivial (was: Major) > [C++][Parquet] Redundancies SchemaManifest::GetFieldIndices >

[jira] [Commented] (ARROW-9421) [C++][Parquet] Redundancies SchemaManifest::GetFieldIndices

2020-09-28 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9421?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17203343#comment-17203343 ] Antoine Pitrou commented on ARROW-9421: --- I'm not sure what the redundancies are exa

[jira] [Commented] (ARROW-8228) [C++][Parquet] Support writing lists that have null elements that are non-empty.

2020-09-28 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8228?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17203349#comment-17203349 ] Antoine Pitrou commented on ARROW-8228: --- [~emkornfi...@gmail.com] Can you elaborate

[jira] [Resolved] (ARROW-9983) [C++][Dataset][Python] Use larger default batch size than 32K for Datasets API

2020-09-28 Thread Ben Kietzman (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9983?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ben Kietzman resolved ARROW-9983. - Assignee: Ben Kietzman Resolution: Fixed > [C++][Dataset][Python] Use larger default batch

[jira] [Assigned] (ARROW-5244) [C++] Review experimental / unstable APIs

2020-09-28 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-5244?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Antoine Pitrou reassigned ARROW-5244: - Assignee: Antoine Pitrou > [C++] Review experimental / unstable APIs > -

[jira] [Created] (ARROW-10121) [C++][Python] Variable dictionaries do not survive roundtrip to IPC stream

2020-09-28 Thread Wes McKinney (Jira)
Wes McKinney created ARROW-10121: Summary: [C++][Python] Variable dictionaries do not survive roundtrip to IPC stream Key: ARROW-10121 URL: https://issues.apache.org/jira/browse/ARROW-10121 Project: A

[jira] [Updated] (ARROW-4753) [C++] Extension types and layouts for text-optimized data structures

2020-09-28 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-4753?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Antoine Pitrou updated ARROW-4753: -- Fix Version/s: (was: 2.0.0) 3.0.0 > [C++] Extension types and layouts fo

[jira] [Updated] (ARROW-1565) [C++] Implement TopK/BottomK streaming execution nodes

2020-09-28 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-1565?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Antoine Pitrou updated ARROW-1565: -- Fix Version/s: (was: 2.0.0) 3.0.0 > [C++] Implement TopK/BottomK streami

[jira] [Assigned] (ARROW-10121) [C++][Python] Variable dictionaries do not survive roundtrip to IPC stream

2020-09-28 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-10121?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Antoine Pitrou reassigned ARROW-10121: -- Assignee: Antoine Pitrou > [C++][Python] Variable dictionaries do not survive roundtr

[jira] [Comment Edited] (ARROW-8385) [Python][Parquet] Crash on parquet.read_table on windows python 3.82

2020-09-28 Thread Andrus (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8385?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17203447#comment-17203447 ] Andrus edited comment on ARROW-8385 at 9/28/20, 6:27 PM: - Same is

[jira] [Commented] (ARROW-8385) [Python][Parquet] Crash on parquet.read_table on windows python 3.82

2020-09-28 Thread Andrus (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8385?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17203447#comment-17203447 ] Andrus commented on ARROW-8385: --- Same issue here. Calling read_table produces an exit with

[jira] [Created] (ARROW-10122) [Python] Selecting one column of multi-index results in a duplicated value column.

2020-09-28 Thread Troy Zimmerman (Jira)
Troy Zimmerman created ARROW-10122: -- Summary: [Python] Selecting one column of multi-index results in a duplicated value column. Key: ARROW-10122 URL: https://issues.apache.org/jira/browse/ARROW-10122

[jira] [Created] (ARROW-10123) `parquet.read_table` fails when `BytesIO` buffer is given as source

2020-09-28 Thread Prem Sagar Gali (Jira)
Prem Sagar Gali created ARROW-10123: --- Summary: `parquet.read_table` fails when `BytesIO` buffer is given as source Key: ARROW-10123 URL: https://issues.apache.org/jira/browse/ARROW-10123 Project: Ap

[jira] [Commented] (ARROW-10122) [Python] Selecting one column of multi-index results in a duplicated value column.

2020-09-28 Thread Troy Zimmerman (Jira)
[ https://issues.apache.org/jira/browse/ARROW-10122?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17203460#comment-17203460 ] Troy Zimmerman commented on ARROW-10122: This seems like it could be related to

[jira] [Updated] (ARROW-8426) [Rust] [Parquet] Add support for writing dictionary types

2020-09-28 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8426?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-8426: -- Labels: pull-request-available (was: ) > [Rust] [Parquet] Add support for writing dictionary t

[jira] [Comment Edited] (ARROW-8385) [Python][Parquet] Crash on parquet.read_table on windows python 3.82

2020-09-28 Thread Andrus (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8385?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17203447#comment-17203447 ] Andrus edited comment on ARROW-8385 at 9/28/20, 7:06 PM: - Same is

[jira] [Created] (ARROW-10124) Write functions don't follow umask setting

2020-09-28 Thread Charlton Callender (Jira)
Charlton Callender created ARROW-10124: -- Summary: Write functions don't follow umask setting Key: ARROW-10124 URL: https://issues.apache.org/jira/browse/ARROW-10124 Project: Apache Arrow

[jira] [Updated] (ARROW-10124) [R] Write functions don't follow umask setting

2020-09-28 Thread Charlton Callender (Jira)
[ https://issues.apache.org/jira/browse/ARROW-10124?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Charlton Callender updated ARROW-10124: --- Summary: [R] Write functions don't follow umask setting (was: Write functions don't

[jira] [Assigned] (ARROW-10124) [R] Write functions don't follow umask setting

2020-09-28 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-10124?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neal Richardson reassigned ARROW-10124: --- Assignee: Antoine Pitrou > [R] Write functions don't follow umask setting > ---

[jira] [Commented] (ARROW-10124) [R] Write functions don't follow umask setting

2020-09-28 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-10124?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17203518#comment-17203518 ] Neal Richardson commented on ARROW-10124: - [~apitrou] I think this is an issue w

[jira] [Updated] (ARROW-10088) [R] Integer64 incorrectly read into R data.table

2020-09-28 Thread Kyle Kavanagh (Jira)
[ https://issues.apache.org/jira/browse/ARROW-10088?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Kyle Kavanagh updated ARROW-10088: -- Priority: Blocker (was: Major) > [R] Integer64 incorrectly read into R data.table > -

[jira] [Updated] (ARROW-10088) [R] Integer64 incorrectly read into R data.table

2020-09-28 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-10088?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neal Richardson updated ARROW-10088: Priority: Major (was: Blocker) > [R] Integer64 incorrectly read into R data.table > -

[jira] [Updated] (ARROW-10088) [R] Integer64 incorrectly read into R data.table

2020-09-28 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-10088?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neal Richardson updated ARROW-10088: Fix Version/s: 2.0.0 > [R] Integer64 incorrectly read into R data.table >

[jira] [Updated] (ARROW-10088) [R] Integer64 incorrectly read into R data.table

2020-09-28 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-10088?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neal Richardson updated ARROW-10088: Priority: Critical (was: Major) > [R] Integer64 incorrectly read into R data.table >

[jira] [Commented] (ARROW-10088) [R] Integer64 incorrectly read into R data.table

2020-09-28 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-10088?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17203548#comment-17203548 ] Neal Richardson commented on ARROW-10088: - Ok, it looks like there are multiple

[jira] [Created] (ARROW-10125) [R] Int64 downcast check doesn't consider all chunks

2020-09-28 Thread Neal Richardson (Jira)
Neal Richardson created ARROW-10125: --- Summary: [R] Int64 downcast check doesn't consider all chunks Key: ARROW-10125 URL: https://issues.apache.org/jira/browse/ARROW-10125 Project: Apache Arrow

[jira] [Assigned] (ARROW-10125) [R] Int64 downcast check doesn't consider all chunks

2020-09-28 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-10125?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neal Richardson reassigned ARROW-10125: --- Assignee: Neal Richardson > [R] Int64 downcast check doesn't consider all chunks >

[jira] [Updated] (ARROW-10125) [R] Int64 downcast check doesn't consider all chunks

2020-09-28 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-10125?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-10125: --- Labels: pull-request-available (was: ) > [R] Int64 downcast check doesn't consider all chun

[jira] [Commented] (ARROW-10088) [R] Integer64 incorrectly read into R data.table

2020-09-28 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-10088?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17203551#comment-17203551 ] Neal Richardson commented on ARROW-10088: - Ok, split the latter issue to ARROW-1

[jira] [Updated] (ARROW-10088) [R] Integer64 incorrectly read into R data.table

2020-09-28 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-10088?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neal Richardson updated ARROW-10088: Priority: Major (was: Critical) > [R] Integer64 incorrectly read into R data.table >

[jira] [Updated] (ARROW-10088) [R] Integer64 incorrectly read into R data.table

2020-09-28 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-10088?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neal Richardson updated ARROW-10088: Description: - Original description: I've got a proprietary dataset where one of th

[jira] [Updated] (ARROW-10088) [R] Integer64 incorrectly read into R data.table

2020-09-28 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-10088?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neal Richardson updated ARROW-10088: Description: Issues with metadata$r: * Handling subclasses of integer64 when relating to

[jira] [Updated] (ARROW-10088) [R] Issues in restoring R metadata for "integer64", "data.table" classes

2020-09-28 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-10088?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neal Richardson updated ARROW-10088: Description: Issues with metadata$r: * Handling integer64 (and subclasses) when relating

[jira] [Updated] (ARROW-10088) [R] Issues in restoring R metadata for "integer64", "data.table" classes

2020-09-28 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-10088?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neal Richardson updated ARROW-10088: Summary: [R] Issues in restoring R metadata for "integer64", "data.table" classes (was: [

[jira] [Commented] (ARROW-8228) [C++][Parquet] Support writing lists that have null elements that are non-empty.

2020-09-28 Thread Micah Kornfield (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8228?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17203618#comment-17203618 ] Micah Kornfield commented on ARROW-8228: The case that isn't covered is if you ha

[jira] [Created] (ARROW-10126) Impossible to import pyarrow module in python. Generates this "ImportError: DLL load failed: The specified procedure could not be found."

2020-09-28 Thread Flavio M (Jira)
Flavio M created ARROW-10126: Summary: Impossible to import pyarrow module in python. Generates this "ImportError: DLL load failed: The specified procedure could not be found." Key: ARROW-10126 URL: https://issues.apa

[jira] [Created] (ARROW-10127) [Format] Update specification to support 256-bit Decimal types

2020-09-28 Thread Micah Kornfield (Jira)
Micah Kornfield created ARROW-10127: --- Summary: [Format] Update specification to support 256-bit Decimal types Key: ARROW-10127 URL: https://issues.apache.org/jira/browse/ARROW-10127 Project: Apache

[jira] [Updated] (ARROW-10127) [Format] Update specification to support 256-bit Decimal types

2020-09-28 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-10127?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-10127: --- Labels: pull-request-available (was: ) > [Format] Update specification to support 256-bit D

[jira] [Assigned] (ARROW-10127) [Format] Update specification to support 256-bit Decimal types

2020-09-28 Thread Apache Arrow JIRA Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-10127?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Arrow JIRA Bot reassigned ARROW-10127: - Assignee: Micah Kornfield (was: Apache Arrow JIRA Bot) > [Format] Upda

[jira] [Assigned] (ARROW-10127) [Format] Update specification to support 256-bit Decimal types

2020-09-28 Thread Apache Arrow JIRA Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-10127?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Arrow JIRA Bot reassigned ARROW-10127: - Assignee: Apache Arrow JIRA Bot (was: Micah Kornfield) > [Format] Upda

[jira] [Created] (ARROW-10128) [Rust] Dictionary-encoding is out of spec

2020-09-28 Thread Jorge (Jira)
Jorge created ARROW-10128: - Summary: [Rust] Dictionary-encoding is out of spec Key: ARROW-10128 URL: https://issues.apache.org/jira/browse/ARROW-10128 Project: Apache Arrow Issue Type: Task
