[GitHub] [arrow] github-actions[bot] commented on pull request #9314: [Rust] Fix deprecation warning in clippy

2021-01-24 Thread GitBox
github-actions[bot] commented on pull request #9314: URL: https://github.com/apache/arrow/pull/9314#issuecomment-766625620 Thanks for opening a pull request! Could you open an issue for this pull request on JIRA? https://issues.apache.org/jira/browse/ARROW Then could

[GitHub] [arrow] codecov-io edited a comment on pull request #9305: ARROW-11362:[Rust][DataFusion] Use iterator APIs in to_array_of_size to improve performance

2021-01-24 Thread GitBox
codecov-io edited a comment on pull request #9305: URL: https://github.com/apache/arrow/pull/9305#issuecomment-766197876 # [Codecov](https://codecov.io/gh/apache/arrow/pull/9305?src=pr&el=h1) Report > Merging [#9305](https://codecov.io/gh/apache/arrow/pull/9305?src=pr&el=desc) (e07f7e5)

[GitHub] [arrow] jorgecarleitao opened a new pull request #9314: [Rust] Fix deprecation warning in clippy

2021-01-24 Thread GitBox
jorgecarleitao opened a new pull request #9314: URL: https://github.com/apache/arrow/pull/9314 Master and all PRs are failing with a deprecation warning in clippy. This fixes it. This is an automated message from the Apache

[GitHub] [arrow] zhztheplayer commented on pull request #7030: ARROW-7808: [Java][Dataset] Implement Dataset Java API by JNI to C++

2021-01-24 Thread GitBox
zhztheplayer commented on pull request #7030: URL: https://github.com/apache/arrow/pull/7030#issuecomment-766616108 Hi @emkornfield @pitrou I think I have addressed existing comments so would you like to take another look now? Thanks a lot. Since this was submitted long time ago (be

[GitHub] [arrow] jorgecarleitao commented on a change in pull request #9305: ARROW-11362:[Rust][DataFusion] Use iterator APIs in to_array_of_size to improve performance

2021-01-24 Thread GitBox
jorgecarleitao commented on a change in pull request #9305: URL: https://github.com/apache/arrow/pull/9305#discussion_r563510962 ## File path: rust/datafusion/src/scalar.rs ## @@ -205,28 +205,104 @@ impl ScalarValue { ScalarValue::Boolean(e) => { A

[GitHub] [arrow] jorgecarleitao commented on pull request #9232: ARROW-10818: [Rust] Implement DecimalType

2021-01-24 Thread GitBox
jorgecarleitao commented on pull request #9232: URL: https://github.com/apache/arrow/pull/9232#issuecomment-766611408 > Arrow is used inside DF, which is used to build databases on to of it. If the user defines `DECIMAL(2,2)` it's a bit overhead to use i128 as representation for it.

[GitHub] [arrow] tyrelr commented on a change in pull request #9301: ARROW-11357: [Rust]: Fix out-of-bounds reads in `take` and other undefined behavior

2021-01-24 Thread GitBox
tyrelr commented on a change in pull request #9301: URL: https://github.com/apache/arrow/pull/9301#discussion_r563502344 ## File path: rust/arrow/src/compute/kernels/take.rs ## @@ -254,6 +254,137 @@ impl Default for TakeOptions { } } +#[inline(always)] +fn maybe_usize(i

[GitHub] [arrow] jorgecarleitao closed pull request #9293: ARROW-11349: [Rust] Add from_iter_values to create arrays from (non null) values

2021-01-24 Thread GitBox
jorgecarleitao closed pull request #9293: URL: https://github.com/apache/arrow/pull/9293 This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to g

[GitHub] [arrow] jorgecarleitao commented on a change in pull request #9256: WIP: ARROW-11310: [Rust] implement JSON writer

2021-01-24 Thread GitBox
jorgecarleitao commented on a change in pull request #9256: URL: https://github.com/apache/arrow/pull/9256#discussion_r563499210 ## File path: rust/arrow/src/json/writer.rs ## @@ -0,0 +1,301 @@ +// Licensed to the Apache Software Foundation (ASF) under one +// or more contribut

[GitHub] [arrow] codecov-io edited a comment on pull request #9312: ARROW-11369: [DataFusion] Split expressions.rs

2021-01-24 Thread GitBox
codecov-io edited a comment on pull request #9312: URL: https://github.com/apache/arrow/pull/9312#issuecomment-766538103 # [Codecov](https://codecov.io/gh/apache/arrow/pull/9312?src=pr&el=h1) Report > Merging [#9312](https://codecov.io/gh/apache/arrow/pull/9312?src=pr&el=desc) (e1cf992)

[GitHub] [arrow] jorgecarleitao commented on pull request #9312: ARROW-11369: [DataFusion] Split expressions.rs

2021-01-24 Thread GitBox
jorgecarleitao commented on pull request #9312: URL: https://github.com/apache/arrow/pull/9312#issuecomment-766578945 The clippy error is unrelated. This is an automated message from the Apache Git Service. To respond to the

[GitHub] [arrow] cyb70289 commented on pull request #9310: ARROW-11367: [C++] Implement t-digest approximate quantile utility

2021-01-24 Thread GitBox
cyb70289 commented on pull request #9310: URL: https://github.com/apache/arrow/pull/9310#issuecomment-766570951 mingw64 ci failure is not relevant This is an automated message from the Apache Git Service. To respond to the me

[GitHub] [arrow] nevi-me commented on a change in pull request #9313: [Rust] [Experiment] Trigonometry kernels

2021-01-24 Thread GitBox
nevi-me commented on a change in pull request #9313: URL: https://github.com/apache/arrow/pull/9313#discussion_r563480506 ## File path: rust/arrow/src/compute/kernels/trigonometry.rs ## @@ -0,0 +1,248 @@ +// Licensed to the Apache Software Foundation (ASF) under one +// or more

[GitHub] [arrow] github-actions[bot] commented on pull request #9313: [Rust] [Experiment] Trigonometry kernels

2021-01-24 Thread GitBox
github-actions[bot] commented on pull request #9313: URL: https://github.com/apache/arrow/pull/9313#issuecomment-766567197 Thanks for opening a pull request! Could you open an issue for this pull request on JIRA? https://issues.apache.org/jira/browse/ARROW Then could

[GitHub] [arrow] nevi-me opened a new pull request #9313: [Rust] [Experiment] Trigonometry kernels

2021-01-24 Thread GitBox
nevi-me opened a new pull request #9313: URL: https://github.com/apache/arrow/pull/9313 This is on top of #9297 I was curious if (ab)using the `compute::unary` kernel would perform better on slightly complex functions. I implemented the Haversine function, which calculates the

[GitHub] [arrow] codecov-io commented on pull request #9312: ARROW-11369: [DataFusion] Split expressions.rs

2021-01-24 Thread GitBox
codecov-io commented on pull request #9312: URL: https://github.com/apache/arrow/pull/9312#issuecomment-766538103 # [Codecov](https://codecov.io/gh/apache/arrow/pull/9312?src=pr&el=h1) Report > Merging [#9312](https://codecov.io/gh/apache/arrow/pull/9312?src=pr&el=desc) (84bce69) into

[GitHub] [arrow] github-actions[bot] commented on pull request #9312: ARROW-11369: [DataFusion] Split expressions.rs

2021-01-24 Thread GitBox
github-actions[bot] commented on pull request #9312: URL: https://github.com/apache/arrow/pull/9312#issuecomment-766530454 https://issues.apache.org/jira/browse/ARROW-11369 This is an automated message from the Apache Git Ser

[GitHub] [arrow] jorgecarleitao opened a new pull request #9312: ARROW-11369: [DataFusion] Split expressions.rs

2021-01-24 Thread GitBox
jorgecarleitao opened a new pull request #9312: URL: https://github.com/apache/arrow/pull/9312 This PR aims at splitting the large `expressions.rs` module in parts, thereby making it easier to navigate and work with. No backward incompatible changes are expected from this.

[GitHub] [arrow] joeyac opened a new issue #9311: [arrow c++] How can I modify `RecordBatch` or `Table` value piped from another c++ program?

2021-01-24 Thread GitBox
joeyac opened a new issue #9311: URL: https://github.com/apache/arrow/issues/9311 writer.cc: construct table like https://arrow.apache.org/docs/cpp/examples/row_columnar_conversion.html :: VectorToColumnarTable, then use the following code write table to stdout: ``` arrow::Status Wri

[GitHub] [arrow] tyrelr commented on a change in pull request #9301: ARROW-11357: [Rust]: Fix out-of-bounds reads in `take` and other undefined behavior

2021-01-24 Thread GitBox
tyrelr commented on a change in pull request #9301: URL: https://github.com/apache/arrow/pull/9301#discussion_r563446434 ## File path: rust/arrow/src/compute/kernels/take.rs ## @@ -254,6 +254,137 @@ impl Default for TakeOptions { } } +#[inline(always)] +fn maybe_usize(i

[GitHub] [arrow] jorgecarleitao commented on a change in pull request #9301: ARROW-11357: [Rust]: Fix out-of-bounds reads in `take` and other undefined behavior

2021-01-24 Thread GitBox
jorgecarleitao commented on a change in pull request #9301: URL: https://github.com/apache/arrow/pull/9301#discussion_r563445497 ## File path: rust/arrow/src/compute/kernels/take.rs ## @@ -254,6 +254,137 @@ impl Default for TakeOptions { } } +#[inline(always)] +fn maybe

[GitHub] [arrow] tyrelr commented on a change in pull request #9301: ARROW-11357: [Rust]: Fix out-of-bounds reads in `take` and other undefined behavior

2021-01-24 Thread GitBox
tyrelr commented on a change in pull request #9301: URL: https://github.com/apache/arrow/pull/9301#discussion_r563443673 ## File path: rust/arrow/src/compute/kernels/take.rs ## @@ -254,6 +254,137 @@ impl Default for TakeOptions { } } +#[inline(always)] +fn maybe_usize(i

[GitHub] [arrow] tyrelr commented on a change in pull request #9301: ARROW-11357: [Rust]: Fix out-of-bounds reads in `take` and other undefined behavior

2021-01-24 Thread GitBox
tyrelr commented on a change in pull request #9301: URL: https://github.com/apache/arrow/pull/9301#discussion_r563443673 ## File path: rust/arrow/src/compute/kernels/take.rs ## @@ -254,6 +254,137 @@ impl Default for TakeOptions { } } +#[inline(always)] +fn maybe_usize(i

[GitHub] [arrow] github-actions[bot] commented on pull request #9310: ARROW-11367: [C++] Implement t-digest approximate quantile utility

2021-01-24 Thread GitBox
github-actions[bot] commented on pull request #9310: URL: https://github.com/apache/arrow/pull/9310#issuecomment-766501335 https://issues.apache.org/jira/browse/ARROW-11367 This is an automated message from the Apache Git Ser

[GitHub] [arrow] cyb70289 opened a new pull request #9310: ARROW-11367: [C++] Implement t-digest approximate quantile utility

2021-01-24 Thread GitBox
cyb70289 opened a new pull request #9310: URL: https://github.com/apache/arrow/pull/9310 t-Digest is a data structure to approximate accurate quantiles of arbitrary length dataset using constant space. This utility will be used in implementing approximate quantile kernel and latency

[GitHub] [arrow] WeichenXu123 commented on pull request #9187: ARROW-11223: [Java] Fix: BaseVariableWidthVector/BaseLargeVariableWidthVector setNull() and getBufferSizeFor() trigger offset buffer over

2021-01-24 Thread GitBox
WeichenXu123 commented on pull request #9187: URL: https://github.com/apache/arrow/pull/9187#issuecomment-766499542 > A better way to get an estimate of buffer size would be to include a density value for the avg number of bytes per record, similar to setInitialCapacity(int valueCount, dou

[GitHub] [arrow] houqp edited a comment on pull request #9309: ARROW-11366: [Datafusion] support boolean literal in comparison expression

2021-01-24 Thread GitBox
houqp edited a comment on pull request #9309: URL: https://github.com/apache/arrow/pull/9309#issuecomment-766489488 clippy is failing due to unrelated issue, can i fix it in the same PR or should I create a new dedicated one to address the issue? --

[GitHub] [arrow] houqp commented on pull request #9309: ARROW-11366: [Datafusion] support boolean literal in comparison expression

2021-01-24 Thread GitBox
houqp commented on pull request #9309: URL: https://github.com/apache/arrow/pull/9309#issuecomment-766489488 clippy is failing due to unrelated issue, should i fix it in the same PR or create a new dedicated one to address the issue? ---

[GitHub] [arrow] codecov-io edited a comment on pull request #9309: ARROW-11366: [Datafusion] support boolean literal in comparison expression

2021-01-24 Thread GitBox
codecov-io edited a comment on pull request #9309: URL: https://github.com/apache/arrow/pull/9309#issuecomment-766471716 # [Codecov](https://codecov.io/gh/apache/arrow/pull/9309?src=pr&el=h1) Report > Merging [#9309](https://codecov.io/gh/apache/arrow/pull/9309?src=pr&el=desc) (474e14a)

[GitHub] [arrow] codecov-io commented on pull request #9309: ARROW-11366: [Datafusion] support boolean literal in comparison expression

2021-01-24 Thread GitBox
codecov-io commented on pull request #9309: URL: https://github.com/apache/arrow/pull/9309#issuecomment-766471716 # [Codecov](https://codecov.io/gh/apache/arrow/pull/9309?src=pr&el=h1) Report > Merging [#9309](https://codecov.io/gh/apache/arrow/pull/9309?src=pr&el=desc) (5656232) into

[GitHub] [arrow] houqp commented on a change in pull request #9309: ARROW-11366: [Datafusion] support boolean literal in comparison expression

2021-01-24 Thread GitBox
houqp commented on a change in pull request #9309: URL: https://github.com/apache/arrow/pull/9309#discussion_r563404985 ## File path: rust/rustfmt.toml ## @@ -15,9 +15,10 @@ # specific language governing permissions and limitations # under the License. +edition = "2018" Re

[GitHub] [arrow] github-actions[bot] commented on pull request #9309: ARROW-11366: [Datafusion] support boolean literal in comparison expression

2021-01-24 Thread GitBox
github-actions[bot] commented on pull request #9309: URL: https://github.com/apache/arrow/pull/9309#issuecomment-766461000 https://issues.apache.org/jira/browse/ARROW-11366 This is an automated message from the Apache Git Ser

[GitHub] [arrow] houqp opened a new pull request #9309: ARROW-11366: [Datafusion] support boolean literal in comparison expression

2021-01-24 Thread GitBox
houqp opened a new pull request #9309: URL: https://github.com/apache/arrow/pull/9309 This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go t

[GitHub] [arrow] mathyingzhou commented on a change in pull request #8648: ARROW-7906: [C++] [Python] Add ORC write support

2021-01-24 Thread GitBox
mathyingzhou commented on a change in pull request #8648: URL: https://github.com/apache/arrow/pull/8648#discussion_r563395657 ## File path: cpp/src/arrow/adapters/orc/adapter_test.cc ## @@ -58,6 +73,81 @@ class MemoryOutputStream : public liborc::OutputStream { uint64_t len

[GitHub] [arrow] tyrelr commented on a change in pull request #9304: ARROW-11361: [Rust] Build MutableBuffer/Buffer from iterator of bools

2021-01-24 Thread GitBox
tyrelr commented on a change in pull request #9304: URL: https://github.com/apache/arrow/pull/9304#discussion_r563393453 ## File path: rust/arrow/src/buffer.rs ## @@ -1188,6 +1209,57 @@ impl Drop for SetLenOnDrop<'_> { } } +/// Creating a `MutableBuffer` instance by set

[GitHub] [arrow] tyrelr commented on a change in pull request #9304: ARROW-11361: [Rust] Build MutableBuffer/Buffer from iterator of bools

2021-01-24 Thread GitBox
tyrelr commented on a change in pull request #9304: URL: https://github.com/apache/arrow/pull/9304#discussion_r563393453 ## File path: rust/arrow/src/buffer.rs ## @@ -1188,6 +1209,57 @@ impl Drop for SetLenOnDrop<'_> { } } +/// Creating a `MutableBuffer` instance by set

[GitHub] [arrow] mathyingzhou commented on a change in pull request #8648: ARROW-7906: [C++] [Python] Add ORC write support

2021-01-24 Thread GitBox
mathyingzhou commented on a change in pull request #8648: URL: https://github.com/apache/arrow/pull/8648#discussion_r563393105 ## File path: cpp/src/arrow/adapters/orc/adapter_test.cc ## @@ -157,4 +249,2151 @@ TEST(TestAdapter, readIntAndStringFileMultipleStripes) { EXPECT

[GitHub] [arrow] eladroz edited a comment on pull request #9285: ARROW-10349: [Python] Build and publish aarch64 wheels [WIP]

2021-01-24 Thread GitBox
eladroz edited a comment on pull request #9285: URL: https://github.com/apache/arrow/pull/9285#issuecomment-766086589 @xhochy @kszucs hi, I've published my experience with building on AWS arm64 VM's [here](https://lists.apache.org/thread.html/r274a6f9a3d0133fe3b247ea9bb1da6c6f24fc1511508d1

[GitHub] [arrow] gangliao edited a comment on issue #9307: [Rust][DataFusion] Join Statement: Schema contains duplicate unqualified field name

2021-01-24 Thread GitBox
gangliao edited a comment on issue #9307: URL: https://github.com/apache/arrow/issues/9307#issuecomment-766417028 I tried two solutions: ```rust let sql = concat!( "SELECT b, d ", "FROM t1 JOIN t2 ON t1.a = t2.a ", "ORDER BY b A

[GitHub] [arrow] gangliao commented on issue #9307: [Rust][DataFusion] Join Statement: Schema contains duplicate unqualified field name

2021-01-24 Thread GitBox
gangliao commented on issue #9307: URL: https://github.com/apache/arrow/issues/9307#issuecomment-766417028 I tried two solutions: ```rust let sql = concat!( "SELECT b, d ", "FROM t1 JOIN t2 ON t1.a = t2.a ", "ORDER BY b ASC ",

[GitHub] [arrow] tyrelr commented on a change in pull request #9304: ARROW-11361: [Rust] Build MutableBuffer/Buffer from iterator of bools

2021-01-24 Thread GitBox
tyrelr commented on a change in pull request #9304: URL: https://github.com/apache/arrow/pull/9304#discussion_r563336446 ## File path: rust/arrow/src/buffer.rs ## @@ -1188,6 +1209,57 @@ impl Drop for SetLenOnDrop<'_> { } } +/// Creating a `MutableBuffer` instance by set

[GitHub] [arrow] tyrelr commented on a change in pull request #9304: ARROW-11361: [Rust] Build MutableBuffer/Buffer from iterator of bools

2021-01-24 Thread GitBox
tyrelr commented on a change in pull request #9304: URL: https://github.com/apache/arrow/pull/9304#discussion_r563336446 ## File path: rust/arrow/src/buffer.rs ## @@ -1188,6 +1209,57 @@ impl Drop for SetLenOnDrop<'_> { } } +/// Creating a `MutableBuffer` instance by set

[GitHub] [arrow] Dandandan edited a comment on issue #9307: [Rust][DataFusion] Join Statement: Schema contains duplicate unqualified field name

2021-01-24 Thread GitBox
Dandandan edited a comment on issue #9307: URL: https://github.com/apache/arrow/issues/9307#issuecomment-766404237 @gangliao I believe this is a SQL error, the `a` is ambiguous in the projection. Could you try: `SELECT t1.a as a, b, d`?

[GitHub] [arrow] Dandandan commented on issue #9307: [Rust][DataFusion] Join Statement: Schema contains duplicate unqualified field name

2021-01-24 Thread GitBox
Dandandan commented on issue #9307: URL: https://github.com/apache/arrow/issues/9307#issuecomment-766404237 @gangliao I believe this is a SQL error, the `a` is ambiguous in the projection. Could you try: `SELECT t1.a as a, b, d`

[GitHub] [arrow] Joey9801 commented on pull request #9308: [Rust] Fix unused variable in code example

2021-01-24 Thread GitBox
Joey9801 commented on pull request #9308: URL: https://github.com/apache/arrow/pull/9308#issuecomment-766396406 I'm not sure this fix is substantial enough that it really needs a JIRA ticket. Happy to make one and link it to this PR if I'm wrong about that though!

[GitHub] [arrow] Joey9801 opened a new pull request #9308: [Rust] Fix unused variable in code example

2021-01-24 Thread GitBox
Joey9801 opened a new pull request #9308: URL: https://github.com/apache/arrow/pull/9308 This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to g

[GitHub] [arrow] gangliao opened a new issue #9307: [Rust][DataFusion] Join Statement: Schema contains duplicate unqualified field name

2021-01-24 Thread GitBox
gangliao opened a new issue #9307: URL: https://github.com/apache/arrow/issues/9307 ```shell "Schema contains duplicate unqualified field name \'a\'") thread 'tests::simple_join' panicked at 'assertion failed: `(left == right)` left: `1`, right: `0`: the test returned a termin

[GitHub] [arrow] codecov-io edited a comment on pull request #9306: ARROW-10297: [Rust] Parameter for parquet-read to output data in json format

2021-01-24 Thread GitBox
codecov-io edited a comment on pull request #9306: URL: https://github.com/apache/arrow/pull/9306#issuecomment-766366191 # [Codecov](https://codecov.io/gh/apache/arrow/pull/9306?src=pr&el=h1) Report > Merging [#9306](https://codecov.io/gh/apache/arrow/pull/9306?src=pr&el=desc) (e9e6fe3)

[GitHub] [arrow] codecov-io edited a comment on pull request #9306: ARROW-10297: [Rust] Parameter for parquet-read to output data in json format

2021-01-24 Thread GitBox
codecov-io edited a comment on pull request #9306: URL: https://github.com/apache/arrow/pull/9306#issuecomment-766366191 # [Codecov](https://codecov.io/gh/apache/arrow/pull/9306?src=pr&el=h1) Report > Merging [#9306](https://codecov.io/gh/apache/arrow/pull/9306?src=pr&el=desc) (5fd156a)

[GitHub] [arrow] eladroz edited a comment on pull request #9285: ARROW-10349: [Python] Build and publish aarch64 wheels [WIP]

2021-01-24 Thread GitBox
eladroz edited a comment on pull request #9285: URL: https://github.com/apache/arrow/pull/9285#issuecomment-766086589 @xhochy @kszucs hi, I've published my experience with building on AWS arm64 VM's [here](https://lists.apache.org/thread.html/r274a6f9a3d0133fe3b247ea9bb1da6c6f24fc1511508d1

[GitHub] [arrow] codecov-io commented on pull request #9306: ARROW-10297: [Rust] Parameter for parquet-read to output data in json format

2021-01-24 Thread GitBox
codecov-io commented on pull request #9306: URL: https://github.com/apache/arrow/pull/9306#issuecomment-766366191 # [Codecov](https://codecov.io/gh/apache/arrow/pull/9306?src=pr&el=h1) Report > Merging [#9306](https://codecov.io/gh/apache/arrow/pull/9306?src=pr&el=desc) (1a064a3) into

[GitHub] [arrow] tyrelr commented on pull request #9215: ARROW-11270: [Rust] Array slice accessors

2021-01-24 Thread GitBox
tyrelr commented on pull request #9215: URL: https://github.com/apache/arrow/pull/9215#issuecomment-766361051 I'm still seeing a mix of inconsistent performance hits/bumps after the rebase. ``` critcmp master-67d0c2e38 array-slice-83b8938af -t 10 grouparray-

[GitHub] [arrow] github-actions[bot] commented on pull request #9306: ARROW-10297: [Rust] Add flag to show json output in parquet-read binary

2021-01-24 Thread GitBox
github-actions[bot] commented on pull request #9306: URL: https://github.com/apache/arrow/pull/9306#issuecomment-766360952 Thanks for opening a pull request! Could you open an issue for this pull request on JIRA? https://issues.apache.org/jira/browse/ARROW Then could

[GitHub] [arrow] manojkarthick opened a new pull request #9306: ARROW-10297: [Rust] Add flag to show json output in parquet-read binary

2021-01-24 Thread GitBox
manojkarthick opened a new pull request #9306: URL: https://github.com/apache/arrow/pull/9306 Add an option to print output in JSON format. in the parquet-read binary. Having json output allows for easy analysis using tools like [jq](https://stedolan.github.io/jq/). This PR builds on the c

[GitHub] [arrow] zhztheplayer commented on a change in pull request #7030: ARROW-7808: [Java][Dataset] Implement Dataset Java API by JNI to C++

2021-01-24 Thread GitBox
zhztheplayer commented on a change in pull request #7030: URL: https://github.com/apache/arrow/pull/7030#discussion_r56327 ## File path: cpp/src/arrow/memory_pool.h ## @@ -149,6 +149,43 @@ class ARROW_EXPORT ProxyMemoryPool : public MemoryPool { std::unique_ptr impl_; }

[GitHub] [arrow] nevi-me commented on a change in pull request #9297: ARROW-11354: [Rust] Speed-up cast of dates and times (2-4x)

2021-01-24 Thread GitBox
nevi-me commented on a change in pull request #9297: URL: https://github.com/apache/arrow/pull/9297#discussion_r563293232 ## File path: rust/arrow/src/compute/kernels/mod.rs ## @@ -30,3 +30,4 @@ pub mod sort; pub mod substring; pub mod take; pub mod temporal; +pub mod unary;

[GitHub] [arrow] nevi-me commented on a change in pull request #9297: ARROW-11354: [Rust] Speed-up cast of dates and times (2-4x)

2021-01-24 Thread GitBox
nevi-me commented on a change in pull request #9297: URL: https://github.com/apache/arrow/pull/9297#discussion_r563293232 ## File path: rust/arrow/src/compute/kernels/mod.rs ## @@ -30,3 +30,4 @@ pub mod sort; pub mod substring; pub mod take; pub mod temporal; +pub mod unary;

[GitHub] [arrow] nevi-me commented on a change in pull request #9297: ARROW-11354: [Rust] Speed-up cast of dates and times (2-4x)

2021-01-24 Thread GitBox
nevi-me commented on a change in pull request #9297: URL: https://github.com/apache/arrow/pull/9297#discussion_r563288134 ## File path: rust/arrow/src/compute/kernels/unary.rs ## @@ -0,0 +1,74 @@ +// Licensed to the Apache Software Foundation (ASF) under one +// or more contrib

[GitHub] [arrow] Dandandan commented on pull request #9305: ARROW-11362:[Rust][DataFusion] Use iterator APIs in to_array_of_size to improve performance

2021-01-24 Thread GitBox
Dandandan commented on pull request #9305: URL: https://github.com/apache/arrow/pull/9305#issuecomment-766343030 CI failures don't seem to be related to the changes. This is an automated message from the Apache Git Service. T

[GitHub] [arrow] ovr edited a comment on pull request #9232: ARROW-10818: [Rust] Implement DecimalType

2021-01-24 Thread GitBox
ovr edited a comment on pull request #9232: URL: https://github.com/apache/arrow/pull/9232#issuecomment-766338705 > Looking at the spec, a decimal type only supports 128 and 256 bits. So, I am not understanding why we are trying to add support for Int32,Int64,Int128,LargeDecimal here.

[GitHub] [arrow] ovr commented on pull request #9232: ARROW-10818: [Rust] Implement DecimalType

2021-01-24 Thread GitBox
ovr commented on pull request #9232: URL: https://github.com/apache/arrow/pull/9232#issuecomment-766338705 > Looking at the spec, a decimal type only supports 128 and 256 bits. So, I am not understanding why we are trying to add support for Int32,Int64,Int128,LargeDecimal here. Arro

[GitHub] [arrow] ovr commented on a change in pull request #9232: ARROW-10818: [Rust] Implement DecimalType

2021-01-24 Thread GitBox
ovr commented on a change in pull request #9232: URL: https://github.com/apache/arrow/pull/9232#discussion_r563282158 ## File path: rust/arrow/src/compute/kernels/cast.rs ## @@ -443,6 +456,13 @@ pub fn cast(array: &ArrayRef, to_type: &DataType) -> Result { ))),

[GitHub] [arrow] alamb commented on a change in pull request #9232: ARROW-10818: [Rust] Implement DecimalType

2021-01-24 Thread GitBox
alamb commented on a change in pull request #9232: URL: https://github.com/apache/arrow/pull/9232#discussion_r563278609 ## File path: rust/arrow/src/compute/kernels/cast.rs ## @@ -443,6 +456,13 @@ pub fn cast(array: &ArrayRef, to_type: &DataType) -> Result { ))),

[GitHub] [arrow] Dandandan commented on a change in pull request #9304: ARROW-11361: [Rust] Build MutableBuffer/Buffer from iterator of bools

2021-01-24 Thread GitBox
Dandandan commented on a change in pull request #9304: URL: https://github.com/apache/arrow/pull/9304#discussion_r563279700 ## File path: rust/arrow/src/buffer.rs ## @@ -1188,6 +1209,57 @@ impl Drop for SetLenOnDrop<'_> { } } +/// Creating a `MutableBuffer` instance by

[GitHub] [arrow] codecov-io edited a comment on pull request #9305: ARROW-11362:[Rust][DataFusion] Use iterator APIs in to_array_of_size to improve performance

2021-01-24 Thread GitBox
codecov-io edited a comment on pull request #9305: URL: https://github.com/apache/arrow/pull/9305#issuecomment-766197876 # [Codecov](https://codecov.io/gh/apache/arrow/pull/9305?src=pr&el=h1) Report > Merging [#9305](https://codecov.io/gh/apache/arrow/pull/9305?src=pr&el=desc) (6144a23)

[GitHub] [arrow] codecov-io edited a comment on pull request #9305: ARROW-11362:[Rust][DataFusion] Use iterator APIs in to_array_of_size to improve performance

2021-01-24 Thread GitBox
codecov-io edited a comment on pull request #9305: URL: https://github.com/apache/arrow/pull/9305#issuecomment-766197876 # [Codecov](https://codecov.io/gh/apache/arrow/pull/9305?src=pr&el=h1) Report > Merging [#9305](https://codecov.io/gh/apache/arrow/pull/9305?src=pr&el=desc) (d612b0f)

[GitHub] [arrow] Dandandan commented on pull request #9305: ARROW-11362:[Rust][DataFusion] Use iterator APIs in to_array_of_size to improve performance

2021-01-24 Thread GitBox
Dandandan commented on pull request #9305: URL: https://github.com/apache/arrow/pull/9305#issuecomment-766329608 @jorgecarleitao yes, but those are on `GroupByScalar` instead of `ScalarValue`. I think it might clean up some code by having just one `ScalarValue` enum defined and reusing fu

[GitHub] [arrow] jorgecarleitao commented on pull request #9305: ARROW-11362:[Rust][DataFusion] Use iterator APIs in to_array_of_size to improve performance

2021-01-24 Thread GitBox
jorgecarleitao commented on pull request #9305: URL: https://github.com/apache/arrow/pull/9305#issuecomment-766329191 I noticed that this is also being used in `hash_aggregate::create_batch_from_map`. This is an automated me

[GitHub] [arrow] Dandandan commented on pull request #9300: ARROW-11356: [Rust] Add method to PrimitiveArray::from_value to efficiently create an Array from a value

2021-01-24 Thread GitBox
Dandandan commented on pull request #9300: URL: https://github.com/apache/arrow/pull/9300#issuecomment-766327769 Creating an null array with known size might be another that might come up in the future, but I am witholding that until it is supported by some use case. -

[GitHub] [arrow] Dandandan closed pull request #9300: ARROW-11356: [Rust] Add method to PrimitiveArray::from_value to efficiently create an Array from a value

2021-01-24 Thread GitBox
Dandandan closed pull request #9300: URL: https://github.com/apache/arrow/pull/9300 This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to

[GitHub] [arrow] Dandandan edited a comment on pull request #9300: ARROW-11356: [Rust] Add method to PrimitiveArray::from_value to efficiently create an Array from a value

2021-01-24 Thread GitBox
Dandandan edited a comment on pull request #9300: URL: https://github.com/apache/arrow/pull/9300#issuecomment-766324078 @jorgecarleitao I agree, it makes more sense to wait until this is faster and for now use the iterator for this case (even if it is slightly more verbose). I think at

[GitHub] [arrow] Dandandan commented on pull request #9300: ARROW-11356: [Rust] Add method to PrimitiveArray::from_value to efficiently create an Array from a value

2021-01-24 Thread GitBox
Dandandan commented on pull request #9300: URL: https://github.com/apache/arrow/pull/9300#issuecomment-766324078 @jorgecarleitao I agree, it makes more sense to wait until this is faster and for now use the iterator for this case (even if it is slightl I think at some point this version

[GitHub] [arrow] jorgecarleitao closed pull request #9262: ARROW-11317: [Rust] Include the prettyprint feature in CI Coverage

2021-01-24 Thread GitBox
jorgecarleitao closed pull request #9262: URL: https://github.com/apache/arrow/pull/9262 This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to g

[GitHub] [arrow] jorgecarleitao commented on pull request #9300: ARROW-11356: [Rust] Add method to PrimitiveArray::from_value to efficiently create an Array from a value

2021-01-24 Thread GitBox
jorgecarleitao commented on pull request #9300: URL: https://github.com/apache/arrow/pull/9300#issuecomment-766321620 Do you see any difference between this one and using the iterator? I would expect them to perform equally well, since both are using `push`. --

[GitHub] [arrow] jorgecarleitao edited a comment on pull request #9232: ARROW-10818: [Rust] Implement DecimalType

2021-01-24 Thread GitBox
jorgecarleitao edited a comment on pull request #9232: URL: https://github.com/apache/arrow/pull/9232#issuecomment-766319983 Hi @ovr , I went through what is here so far. First of all, great stuff that you are taking this on. Broadly speaking, this PR currently contains the fol

[GitHub] [arrow] jorgecarleitao commented on a change in pull request #9301: ARROW-11357: [Rust]: Fix out-of-bounds reads in `take` and other undefined behavior

2021-01-24 Thread GitBox
jorgecarleitao commented on a change in pull request #9301: URL: https://github.com/apache/arrow/pull/9301#discussion_r563267292 ## File path: rust/arrow/src/datatypes.rs ## @@ -357,10 +373,12 @@ impl JsonSerializable for u16 { } impl ArrowNativeType for u16 { +#[inline

[GitHub] [arrow] jorgecarleitao edited a comment on pull request #9232: ARROW-10818: [Rust] Implement DecimalType

2021-01-24 Thread GitBox
jorgecarleitao edited a comment on pull request #9232: URL: https://github.com/apache/arrow/pull/9232#issuecomment-766319983 Hi @ovr , I went through what is here so far. First of all, great stuff that you are taking this on. Broadly speaking, this PR currently contains the fol

[GitHub] [arrow] Dandandan commented on a change in pull request #9301: ARROW-11357: [Rust]: Fix out-of-bounds reads in `take` and other undefined behavior

2021-01-24 Thread GitBox
Dandandan commented on a change in pull request #9301: URL: https://github.com/apache/arrow/pull/9301#discussion_r563265584 ## File path: rust/arrow/src/datatypes.rs ## @@ -357,10 +373,12 @@ impl JsonSerializable for u16 { } impl ArrowNativeType for u16 { +#[inline]

[GitHub] [arrow] jorgecarleitao commented on pull request #9232: ARROW-10818: [Rust] Implement DecimalType

2021-01-24 Thread GitBox
jorgecarleitao commented on pull request #9232: URL: https://github.com/apache/arrow/pull/9232#issuecomment-766319983 Hi @ovr , I went through what is here so far. First of all, great stuff that you are taking this on. Broadly speaking, this PR currently contains the following

[GitHub] [arrow] codecov-io edited a comment on pull request #9301: ARROW-11357: [Rust]: Fix out-of-bounds reads in `take` and other undefined behavior

2021-01-24 Thread GitBox
codecov-io edited a comment on pull request #9301: URL: https://github.com/apache/arrow/pull/9301#issuecomment-766090110 # [Codecov](https://codecov.io/gh/apache/arrow/pull/9301?src=pr&el=h1) Report > Merging [#9301](https://codecov.io/gh/apache/arrow/pull/9301?src=pr&el=desc) (5939a4e)

[GitHub] [arrow] jorgecarleitao commented on pull request #9301: ARROW-11357: [Rust]: Fix out-of-bounds reads in `take` and other undefined behavior

2021-01-24 Thread GitBox
jorgecarleitao commented on pull request #9301: URL: https://github.com/apache/arrow/pull/9301#issuecomment-766317722 @Dandandan , I pushed a fix and the benches were updated. For non-nulls, this is now `[-20,-30%]` faster, but still 2x slower when the indices have nulls, as the logic is n