[ https://issues.apache.org/jira/browse/ARROW-12521?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17332727#comment-17332727 ]
Diana Clarke commented on ARROW-12521: -------------------------------------- See also: Allow duplicates in the value_set for compute::is_in https://issues.apache.org/jira/browse/ARROW-12554 > [C++] arrow-compute-scalar-set-lookup-benchmark failure with new random > generator > --------------------------------------------------------------------------------- > > Key: ARROW-12521 > URL: https://issues.apache.org/jira/browse/ARROW-12521 > Project: Apache Arrow > Issue Type: Bug > Components: Benchmarking, C++ > Reporter: Antoine Pitrou > Priority: Major > Fix For: 5.0.0 > > > Unfortunately, the value set lookup kernels don't support duplicate values in > the value set, which makes our benchmark for these kernels inherently fragile. > {code} > -- Arrow Fatal Error -- > NotImplemented: duplicate values in value_set > subprocess.CalledProcessError: Command > '['/tmp/arrow-archery-du8luj93/WORKSPACE/build/release/arrow-compute-scalar-set-lookup-benchmark', > '--benchmark_repetitions=1', '--benchmark_out=/tmp/tmpn20z4ul9', > '--benchmark_out_format=json']' died with <Signals.SIGABRT: 6>. > {code} -- This message was sent by Atlassian Jira (v8.3.4#803005)