[jira] [Created] (ARROW-11365) [Rust] [Parquet] Implement parsers for v2 of the text schema
Neville Dipale created ARROW-11365:

Summary: [Rust] [Parquet] Implement parsers for v2 of the text schema
Key: ARROW-11365
URL: https://issues.apache.org/jira/browse/ARROW-11365
Project: Apache Arrow
Issue Type: Sub-task
Components: Rust
Affects Versions: 3.0.0
Reporter: Neville Dipale

V2 of the writer produces schema like:

required INT32 fieldname INTEGER(32, true);

We should support parsing this format, as it maps to logical types. I'm unsure what the implications are for fields that have a converted type (e.g. INTERVAL) but no logical type representation. We can try writing a V2 file with parquet-cpp and observing the behaviour.

-- This message was sent by Atlassian Jira (v8.3.4#803005)
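A minimal sketch of parsing one such line, for illustration only. The `PrimitiveField` struct and the `parse_primitive_field`, `parse_annotation` and `next_token` functions are hypothetical names, not the parquet crate's API. The sketch handles a single primitive field and keeps the annotation arguments as raw strings, since how they should map to logical types is the open question in this issue.

```rust
/// Hypothetical, simplified view of one primitive field in the text schema.
/// Illustrative only; these are not the parquet crate's types.
#[derive(Debug, PartialEq)]
pub struct PrimitiveField {
    pub repetition: String,
    pub physical_type: String,
    pub name: String,
    /// Annotation name and its raw arguments, e.g. ("INTEGER", ["32", "true"]).
    pub logical: Option<(String, Vec<String>)>,
}

/// Splits off the next whitespace-delimited token, returning (token, rest).
fn next_token(input: &str) -> Option<(&str, &str)> {
    let input = input.trim_start();
    if input.is_empty() {
        return None;
    }
    let end = input.find(char::is_whitespace).unwrap_or(input.len());
    Some((&input[..end], &input[end..]))
}

/// Parses `NAME` or `NAME(arg, arg, ...)` without interpreting the arguments.
fn parse_annotation(text: &str) -> Result<(String, Vec<String>), String> {
    match text.find('(') {
        None => Ok((text.to_string(), Vec::new())),
        Some(open) => {
            let inner = text[open + 1..]
                .strip_suffix(')')
                .ok_or_else(|| format!("unbalanced parentheses in {:?}", text))?;
            let args = if inner.trim().is_empty() {
                Vec::new()
            } else {
                inner.split(',').map(|a| a.trim().to_string()).collect()
            };
            Ok((text[..open].trim().to_string(), args))
        }
    }
}

/// Parses a line such as `required INT32 fieldname INTEGER(32, true);`.
pub fn parse_primitive_field(line: &str) -> Result<PrimitiveField, String> {
    let body = line
        .trim()
        .strip_suffix(';')
        .ok_or_else(|| format!("expected trailing ';' in {:?}", line))?;

    let (repetition, rest) = next_token(body).ok_or("missing repetition")?;
    if !["required", "optional", "repeated"].contains(&repetition) {
        return Err(format!("unknown repetition {:?}", repetition));
    }
    let (physical_type, rest) = next_token(rest).ok_or("missing physical type")?;
    let (name, rest) = next_token(rest).ok_or("missing field name")?;

    // Whatever follows the name is treated as the annotation; it may contain
    // spaces, as in "INTEGER(32, true)".
    let rest = rest.trim();
    let logical = if rest.is_empty() {
        None
    } else {
        Some(parse_annotation(rest)?)
    };

    Ok(PrimitiveField {
        repetition: repetition.to_string(),
        physical_type: physical_type.to_string(),
        name: name.to_string(),
        logical,
    })
}
```

Calling `parse_primitive_field("required INT32 fieldname INTEGER(32, true);")` yields the annotation name `INTEGER` with the raw arguments `32` and `true`. A real parser would still need to decide how fields that carry only a converted type are represented.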
[jira] [Created] (ARROW-11364) [Rust] Umbrella issue for parquet 2.6.0 support
Neville Dipale created ARROW-11364:

Summary: [Rust] Umbrella issue for parquet 2.6.0 support
Key: ARROW-11364
URL: https://issues.apache.org/jira/browse/ARROW-11364
Project: Apache Arrow
Issue Type: Improvement
Components: Rust
Affects Versions: 3.0.0
Reporter: Neville Dipale

This is the umbrella issue where we can collect everything related to parquet 2.6.0 support (parquet-format-rs: 2.6.1). It looks like some plumbing is needed in the type system and parsing logic to fully support writing and reading v2 of the file format.

Existing compatibility issues can also be linked to this, or added as sub-tasks.
[jira] [Created] (ARROW-11363) C++ Library Build Failure with gRPC 1.34+
Carlo Cabrera created ARROW-11363:

Summary: C++ Library Build Failure with gRPC 1.34+
Key: ARROW-11363
URL: https://issues.apache.org/jira/browse/ARROW-11363
Project: Apache Arrow
Issue Type: Bug
Components: C++
Affects Versions: 2.0.0
Environment: macOS 10.14, 10.15, 11.1
Reporter: Carlo Cabrera

I tried to update Homebrew's version of gRPC at [https://github.com/Homebrew/homebrew-core/pull/69649]. However, CI reported build failures from Apache Arrow. The reported errors include:

{{error: implicit instantiation of undefined template}}
{{error: incomplete definition of type}}
{{error: implicit instantiation of undefined template}}

Complete CI run logs are available at [https://github.com/Homebrew/homebrew-core/actions/runs/506756245].

Will Apache Arrow be compatible with the latest version of gRPC?

(As an aside, the lack of compatibility with LLVM 11 is also blocking Homebrew from packaging it for Apple M1.)
[jira] [Created] (ARROW-11362) [Rust][DataFusion] Use iterator APIs in to_array_of_size
Daniël Heres created ARROW-11362:

Summary: [Rust][DataFusion] Use iterator APIs in to_array_of_size
Key: ARROW-11362
URL: https://issues.apache.org/jira/browse/ARROW-11362
Project: Apache Arrow
Issue Type: Improvement
Reporter: Daniël Heres
Assignee: Daniël Heres

To improve performance of to_array_of_size
[jira] [Created] (ARROW-11361) [Rust] Build buffers from iterator of booleans
Tyrel Rink created ARROW-11361:

Summary: [Rust] Build buffers from iterator of booleans
Key: ARROW-11361
URL: https://issues.apache.org/jira/browse/ARROW-11361
Project: Apache Arrow
Issue Type: Improvement
Components: Rust
Reporter: Tyrel Rink
Assignee: Tyrel Rink

Simplify the creation of bit buffers in the comparison kernel by introducing a standard Iterator -> Buffer function. Most null-buffer logic can rely on byte-level boolean operations, so it may not have much utility outside of the comparison kernel?
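A minimal sketch of what an Iterator -> Buffer function could look like, using a plain `Vec<u8>` in place of Arrow's `Buffer`. The names `pack_bits` and `get_bit` are hypothetical and not part of the arrow crate. Bits are packed least-significant bit first, matching the Arrow bitmap layout.

```rust
/// Packs an iterator of booleans into bytes, least-significant bit first.
/// Returns the packed bytes together with the number of bits written.
/// Hypothetical helper: a Vec<u8> stands in for Arrow's Buffer type.
pub fn pack_bits<I: IntoIterator<Item = bool>>(iter: I) -> (Vec<u8>, usize) {
    let iter = iter.into_iter();
    // Use the iterator's lower size bound to reserve space up front.
    let (lower, _) = iter.size_hint();
    let mut bytes: Vec<u8> = Vec::with_capacity((lower + 7) / 8);
    let mut len = 0usize;

    for value in iter {
        // Start a new zeroed byte every eight bits.
        if len % 8 == 0 {
            bytes.push(0u8);
        }
        if value {
            let last = bytes.len() - 1;
            bytes[last] |= 1u8 << (len % 8);
        }
        len += 1;
    }
    (bytes, len)
}

/// Reads bit `i` back out of a packed buffer.
pub fn get_bit(bytes: &[u8], i: usize) -> bool {
    (bytes[i / 8] & (1u8 << (i % 8))) != 0
}
```

A comparison kernel could then be written as `pack_bits(left.iter().zip(right).map(|(a, b)| a == b))`, leaving the bit packing in one place.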
[jira] [Created] (ARROW-11360) [Rust] [DataFusion] Improve CSV "No files found" error message
Andy Grove created ARROW-11360:

Summary: [Rust] [DataFusion] Improve CSV "No files found" error message
Key: ARROW-11360
URL: https://issues.apache.org/jira/browse/ARROW-11360
Project: Apache Arrow
Issue Type: Improvement
Components: Rust - DataFusion
Reporter: Andy Grove
Fix For: 4.0.0

There are two places in DataFusion where the error message "No files found" is returned if no CSV files can be found in the specified directory with the specified file extension.

It would be much easier to debug issues if this error message stated the directory and file extension.
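As a rough illustration of the proposed message, the sketch below lists files by extension and names both the directory and the extension when nothing matches. The function `list_files_with_extension` is hypothetical, `std::io::Error` stands in for DataFusion's own error type, and the exact wording of the message is an assumption.

```rust
use std::fs;
use std::io;
use std::path::{Path, PathBuf};

/// Returns the files in `dir` whose names end with `ext` (e.g. ".csv").
/// Hypothetical helper; io::Error is used here instead of DataFusion's error type.
pub fn list_files_with_extension(dir: &Path, ext: &str) -> io::Result<Vec<PathBuf>> {
    let mut files = Vec::new();
    for entry in fs::read_dir(dir)? {
        let path = entry?.path();
        if path.is_file() && path.to_string_lossy().ends_with(ext) {
            files.push(path);
        }
    }

    if files.is_empty() {
        // Include the inputs so the user can see what was actually searched.
        return Err(io::Error::new(
            io::ErrorKind::NotFound,
            format!(
                "No files found at {} with file extension {}",
                dir.display(),
                ext
            ),
        ));
    }

    files.sort();
    Ok(files)
}
```

Sharing one such helper between the two call sites would also keep the message consistent.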
[jira] [Created] (ARROW-11359) [Rust] take boolean implementation is unsound
Jorge Leitão created ARROW-11359:

Summary: [Rust] take boolean implementation is unsound
Key: ARROW-11359
URL: https://issues.apache.org/jira/browse/ARROW-11359
Project: Apache Arrow
Issue Type: Bug
Components: Rust
Reporter: Jorge Leitão

The `take` of boolean does not check that the index is within bounds when it calls `values.value(index)`, nor does it check that `index` is defined (i.e. that the slot is not null).

See the description of https://github.com/apache/arrow/pull/9301 for details.
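The two missing checks can be sketched on a simplified model in which arrays are slices of `Option` values rather than Arrow's buffer-backed arrays. `take_bool` is a hypothetical function, not the kernel from the pull request, and mapping a null index to a null output is an assumption about the intended semantics.

```rust
/// Gathers `values[i]` for each index, checking both that the index slot is
/// defined and that it is within bounds. Simplified model: Option<T> slices
/// stand in for Arrow arrays with validity bitmaps.
pub fn take_bool(
    values: &[Option<bool>],
    indices: &[Option<usize>],
) -> Result<Vec<Option<bool>>, String> {
    indices
        .iter()
        .map(|index| match index {
            // A null index is never used to read from `values`.
            None => Ok(None),
            // `get` performs the bounds check instead of reading blindly.
            Some(i) => values.get(*i).copied().ok_or_else(|| {
                format!(
                    "index {} out of bounds for array of length {}",
                    i,
                    values.len()
                )
            }),
        })
        .collect()
}
```

With `values = [Some(true), None, Some(false)]`, the indices `[Some(2), None, Some(0)]` give `[Some(false), None, Some(true)]`, while `[Some(3)]` returns an error instead of reading out of bounds.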
[jira] [Created] (ARROW-11358) [Rust] Add benchmark for concatenating small arrays
Daniël Heres created ARROW-11358:

Summary: [Rust] Add benchmark for concatenating small arrays
Key: ARROW-11358
URL: https://issues.apache.org/jira/browse/ARROW-11358
Project: Apache Arrow
Issue Type: Improvement
Reporter: Daniël Heres
[jira] [Created] (ARROW-11357) [Rust] take primitive implementation is unsound
Jorge Leitão created ARROW-11357:

Summary: [Rust] take primitive implementation is unsound
Key: ARROW-11357
URL: https://issues.apache.org/jira/browse/ARROW-11357
Project: Apache Arrow
Issue Type: Bug
Components: Rust
Reporter: Jorge Leitão
Assignee: Jorge Leitão

There is an out-of-bounds read.
[jira] [Created] (ARROW-11356) [Rust]
Daniël Heres created ARROW-11356:

Summary: [Rust]
Key: ARROW-11356
URL: https://issues.apache.org/jira/browse/ARROW-11356
Project: Apache Arrow
Issue Type: Improvement
Components: Rust - DataFusion
Reporter: Daniël Heres
Assignee: Daniël Heres
[jira] [Created] (ARROW-11355) [Rust] Align Date type with spec
Jorge Leitão created ARROW-11355:

Summary: [Rust] Align Date type with spec
Key: ARROW-11355
URL: https://issues.apache.org/jira/browse/ARROW-11355
Project: Apache Arrow
Issue Type: Improvement
Components: Rust
Reporter: Jorge Leitão
Assignee: Jorge Leitão