Re: [DISCUSS] Add SQL functions into Scala, Python and R API

2023-05-24 Thread Ryan Berti
functions? which col/lit helpers should be used when?). Are there docs describing all of the locations + standards for defining a function? If not, that'd be great to have too. Ryan Berti Senior Data Engineer | Ads DE M 7023217573 5808 W Sunset Blvd | Los Angeles, CA 90028 On Wed, May 24, 2023

Supporting Datasketches HllSketch via Spark Functions

2023-04-19 Thread Ryan Berti
s:
- hll_union(BinaryType, BinaryType) -> BinaryType
- hll_sketch_estimate(BinaryType) -> LongType

The latest set of tests failed due to some connectivity(?) issues - is there an easy way to re-drive tests without pushing a new commit? Thanks! Ryan Berti
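To make the two signatures listed above concrete, here is a minimal pure-Python sketch of a HyperLogLog-style structure whose serialized form is plain bytes. It is a toy under stated assumptions, not the DataSketches HllSketch binary format and not Spark's implementation: the precision `P`, the SHA-256 based hash, and the `hll_sketch` builder are hypothetical choices made only for illustration. Only the names `hll_union` and `hll_sketch_estimate` and their input/output shapes come from the message.

```python
import hashlib
import math

P = 12          # assumed precision for this toy: 2**P one-byte registers
M = 1 << P


def _hash64(value: str) -> int:
    """Deterministic 64-bit hash (SHA-256 prefix; an illustrative choice)."""
    return int.from_bytes(hashlib.sha256(value.encode("utf-8")).digest()[:8], "big")


def hll_sketch(values) -> bytes:
    """Hypothetical helper: build a sketch (one register per byte) from strings."""
    registers = bytearray(M)
    for v in values:
        h = _hash64(v)
        idx = h >> (64 - P)                      # top P bits select a register
        rest = h & ((1 << (64 - P)) - 1)         # remaining 64 - P bits
        rank = (64 - P) - rest.bit_length() + 1  # position of the leftmost 1-bit
        if rank > registers[idx]:
            registers[idx] = rank
    return bytes(registers)


def hll_union(a: bytes, b: bytes) -> bytes:
    """(binary, binary) -> binary: register-wise maximum of two sketches."""
    if len(a) != len(b):
        raise ValueError("sketches must use the same precision")
    return bytes(max(x, y) for x, y in zip(a, b))


def hll_sketch_estimate(sketch: bytes) -> int:
    """binary -> integer: classic HyperLogLog estimate with linear counting."""
    m = len(sketch)
    alpha = 0.7213 / (1 + 1.079 / m)
    raw = alpha * m * m / sum(2.0 ** -r for r in sketch)
    zeros = sketch.count(0)
    if raw <= 2.5 * m and zeros:
        raw = m * math.log(m / zeros)            # small-range correction
    return round(raw)
```

A usage example: build `a = hll_sketch(str(i) for i in range(10000))` and `b = hll_sketch(str(i) for i in range(5000, 15000))`, then call `hll_sketch_estimate(hll_union(a, b))`. The result approximates the number of distinct values across both inputs, and because the union is a register-wise maximum, merging is commutative and idempotent.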

Re: Implementation for approx_count_distinct_sketch and associated functions

2023-01-20 Thread Ryan Berti
his PR. I've included a format identifier in this implementation's HLL++ sketches to set us up for migrating to a cross-compatible sketch format / HLL++ implementation in the future. Thanks, Ryan Berti
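One way a format identifier can leave room for a later migration is to prefix the serialized sketch with a small header that readers check before decoding. The following is a hedged sketch of that idea only; the header layout, the `FORMAT_V1` value, and the function names are assumptions for illustration and do not describe the layout used in the PR.

```python
import struct

FORMAT_V1 = 1                     # hypothetical identifier for the current layout
HEADER = struct.Struct(">BB")     # assumed header: format id, precision


def serialize(registers: bytes, precision: int, format_id: int = FORMAT_V1) -> bytes:
    """Prefix the raw registers with a two-byte header."""
    if len(registers) != 1 << precision:
        raise ValueError("register count does not match precision")
    return HEADER.pack(format_id, precision) + registers


def deserialize(blob: bytes):
    """Check the format identifier first, then return (precision, registers)."""
    if len(blob) < HEADER.size:
        raise ValueError("sketch is too short to contain a header")
    format_id, precision = HEADER.unpack_from(blob)
    if format_id != FORMAT_V1:
        # A future format would get its own branch here instead of failing.
        raise ValueError(f"unsupported sketch format {format_id}")
    registers = blob[HEADER.size:]
    if len(registers) != 1 << precision:
        raise ValueError("register count does not match precision")
    return precision, registers
```

With a header like this, a reader can reject or dispatch on unknown formats instead of misinterpreting the bytes, which is what makes a later switch to a cross-compatible format manageable.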

Implementation for approx_count_distinct_sketch and associated functions

2023-01-11 Thread Ryan Berti
opening a PR against the main Spark repo? Thanks! Ryan Berti