Makoto Yui created HIVEMALL-259: ----------------------------------- Summary: [BUG] feature_binning does not work properly under certain conditions Key: HIVEMALL-259 URL: https://issues.apache.org/jira/browse/HIVEMALL-259 Project: Hivemall Issue Type: Bug Affects Versions: 0.5.2 Reporter: Makoto Yui Assignee: Makoto Yui Fix For: 0.6.0
feature_binning does not properly work in certain condition. It might be a bug in quantiles lookup by a different key type object at [this line|[https://github.com/apache/incubator-hivemall/blob/master/core/src/main/java/hivemall/ftvec/binning/FeatureBinningUDF.java#L133]]. {code:java} WITH extracted as ( select extract_feature(feature) as index, extract_weight(feature) as value from input l LATERAL VIEW explode(features) r as feature ), bins as ( select map(index, build_bins(value, 5, true)) as quantiles -- 5 bins with auto bin shrinking from extracted group by index ) select l.features as original, feature_binning(l.features, r.quantiles) as features from input l cross join bins r ; {code} -- This message was sent by Atlassian JIRA (v7.6.3#76005)