[jira] [Created] (ARROW-11365) [Rust] [Parquet] Implement parsers for v2 of the text schema

2021-01-23 Thread Neville Dipale (Jira)
Neville Dipale created ARROW-11365:
--

 Summary: [Rust] [Parquet] Implement parsers for v2 of the text 
schema
 Key: ARROW-11365
 URL: https://issues.apache.org/jira/browse/ARROW-11365
 Project: Apache Arrow
  Issue Type: Sub-task
  Components: Rust
Affects Versions: 3.0.0
Reporter: Neville Dipale


V2 of the writer produces schema like:

    required INT32 fieldname INTEGER(32, true);

We should support parsing this format, as it maps to logical types.
I'm unsure of what the implications are for fields that don't have a logical 
type representation, but have a converted type (e.g. INTERVAL). We can try 
write a V2 file with parquet-cpp and observe the behaviour.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Created] (ARROW-11364) [Rust] Umbrella issue for parquet 2.6.0 support

2021-01-23 Thread Neville Dipale (Jira)
Neville Dipale created ARROW-11364:
--

 Summary: [Rust] Umbrella issue for parquet 2.6.0 support
 Key: ARROW-11364
 URL: https://issues.apache.org/jira/browse/ARROW-11364
 Project: Apache Arrow
  Issue Type: Improvement
  Components: Rust
Affects Versions: 3.0.0
Reporter: Neville Dipale


This is the umbrella issue where we can collect everything related to parquet 
2.6.0 support (parquet-format-rs: 2.6.1).

It looks like there's some plumbing needed on the typesystem + parsing logic to 
fully support writing and reading v2 of the file format.

Existing compatibility issues can also be linked to this, or added as sub-tasks.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Created] (ARROW-11363) C++ Library Build Failure with gRPC 1.34+

2021-01-23 Thread Carlo Cabrera (Jira)
Carlo Cabrera created ARROW-11363:
-

 Summary: C++ Library Build Failure with gRPC 1.34+
 Key: ARROW-11363
 URL: https://issues.apache.org/jira/browse/ARROW-11363
 Project: Apache Arrow
  Issue Type: Bug
  Components: C++
Affects Versions: 2.0.0
 Environment: macOS 10.14, 10.15, 11.1
Reporter: Carlo Cabrera


I tried to update Homebrew's version of gRPC at 
[https://github.com/Homebrew/homebrew-core/pull/69649]. However, CI reported 
build failures from Apache Arrow.

Among the reported errors include: 
{{error: implicit instantiation of undefined template}}
{{error: incomplete definition of type}}
{{error: implicit instantiation of undefined template}}

Complete CI run logs available at 
[https://github.com/Homebrew/homebrew-core/actions/runs/506756245].

Will Apache Arrow be compatible with the latest version of gRPC? (As an aside, 
the lack of compatibility with LLVM 11 is also blocking Homebrew from packaging 
it for Apple M1.)



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Created] (ARROW-11362) [Rust][DataFusion] Use iterator APIs in to_array_of_size

2021-01-23 Thread Jira
Daniël Heres created ARROW-11362:


 Summary: [Rust][DataFusion] Use iterator APIs in to_array_of_size
 Key: ARROW-11362
 URL: https://issues.apache.org/jira/browse/ARROW-11362
 Project: Apache Arrow
  Issue Type: Improvement
Reporter: Daniël Heres
Assignee: Daniël Heres


To improve performance of 
to_array_of_size



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Created] (ARROW-11361) [Rust] Build buffers from iterator of booleans

2021-01-23 Thread Tyrel Rink (Jira)
Tyrel Rink created ARROW-11361:
--

 Summary: [Rust] Build buffers from iterator of booleans
 Key: ARROW-11361
 URL: https://issues.apache.org/jira/browse/ARROW-11361
 Project: Apache Arrow
  Issue Type: Improvement
  Components: Rust
Reporter: Tyrel Rink
Assignee: Tyrel Rink


Simplify the creation of bit buffers in the comparison kernel by introducing a 
standard Iterator -> Buffer function.

Most null-buffer logic can rely on byte-level boolean operations, so it may not 
have much utility outside of the comparison kernel?



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Created] (ARROW-11360) [Rust] [DataFusion] Improve CSV "No files found" error message

2021-01-23 Thread Andy Grove (Jira)
Andy Grove created ARROW-11360:
--

 Summary: [Rust] [DataFusion] Improve CSV "No files found" error 
message
 Key: ARROW-11360
 URL: https://issues.apache.org/jira/browse/ARROW-11360
 Project: Apache Arrow
  Issue Type: Improvement
  Components: Rust - DataFusion
Reporter: Andy Grove
 Fix For: 4.0.0


There are two places in DataFusion where the error message "No files found" is 
returned if no CSV files can be found in the specified directory with the 
specified file extension.

It would be much easier to debug issues if this error message stated the 
directory and file extension.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Created] (ARROW-11359) [Rust] take boolean implementation is unsound

2021-01-23 Thread Jira
Jorge Leitão created ARROW-11359:


 Summary: [Rust] take boolean implementation is unsound
 Key: ARROW-11359
 URL: https://issues.apache.org/jira/browse/ARROW-11359
 Project: Apache Arrow
  Issue Type: Bug
  Components: Rust
Reporter: Jorge Leitão


The `take` of boolean does not check that the index is within bounds when calls 
`values.value(index)`, not checks that `index` is defined (i.e. the slot is not 
null).

See description https://github.com/apache/arrow/pull/9301 for details



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Created] (ARROW-11358) [Rust] Add benchmark for concatenating small arrays

2021-01-23 Thread Jira
Daniël Heres created ARROW-11358:


 Summary: [Rust] Add benchmark for concatenating small arrays
 Key: ARROW-11358
 URL: https://issues.apache.org/jira/browse/ARROW-11358
 Project: Apache Arrow
  Issue Type: Improvement
Reporter: Daniël Heres






--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Created] (ARROW-11357) [Rust] take primitive implementation is unsound

2021-01-23 Thread Jira
Jorge Leitão created ARROW-11357:


 Summary: [Rust] take primitive implementation is unsound
 Key: ARROW-11357
 URL: https://issues.apache.org/jira/browse/ARROW-11357
 Project: Apache Arrow
  Issue Type: Bug
  Components: Rust
Reporter: Jorge Leitão
Assignee: Jorge Leitão


There is an out of bound read.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Created] (ARROW-11356) [Rust]

2021-01-23 Thread Jira
Daniël Heres created ARROW-11356:


 Summary: [Rust]
 Key: ARROW-11356
 URL: https://issues.apache.org/jira/browse/ARROW-11356
 Project: Apache Arrow
  Issue Type: Improvement
  Components: Rust - DataFusion
Reporter: Daniël Heres
Assignee: Daniël Heres






--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Created] (ARROW-11355) [Rust] Align Date type with spec

2021-01-23 Thread Jira
Jorge Leitão created ARROW-11355:


 Summary: [Rust] Align Date type with spec
 Key: ARROW-11355
 URL: https://issues.apache.org/jira/browse/ARROW-11355
 Project: Apache Arrow
  Issue Type: Improvement
  Components: Rust
Reporter: Jorge Leitão
Assignee: Jorge Leitão






--
This message was sent by Atlassian Jira
(v8.3.4#803005)