[ 
https://issues.apache.org/jira/browse/ARROW-12521?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17332727#comment-17332727
 ] 

Diana Clarke commented on ARROW-12521:
--------------------------------------

See also:

Allow duplicates in the value_set for compute::is_in 
https://issues.apache.org/jira/browse/ARROW-12554

> [C++] arrow-compute-scalar-set-lookup-benchmark failure with new random 
> generator
> ---------------------------------------------------------------------------------
>
>                 Key: ARROW-12521
>                 URL: https://issues.apache.org/jira/browse/ARROW-12521
>             Project: Apache Arrow
>          Issue Type: Bug
>          Components: Benchmarking, C++
>            Reporter: Antoine Pitrou
>            Priority: Major
>             Fix For: 5.0.0
>
>
> Unfortunately, the value set lookup kernels don't support duplicate values in 
> the value set, which makes our benchmark for these kernels inherently fragile.
> {code}
> -- Arrow Fatal Error --
> NotImplemented: duplicate values in value_set
> subprocess.CalledProcessError: Command 
> '['/tmp/arrow-archery-du8luj93/WORKSPACE/build/release/arrow-compute-scalar-set-lookup-benchmark',
>  '--benchmark_repetitions=1', '--benchmark_out=/tmp/tmpn20z4ul9', 
> '--benchmark_out_format=json']' died with <Signals.SIGABRT: 6>.
> {code}



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

Reply via email to