Makoto Yui created HIVEMALL-259:
-----------------------------------

             Summary: [BUG] feature_binning does not work properly under certain conditions
                 Key: HIVEMALL-259
                 URL: https://issues.apache.org/jira/browse/HIVEMALL-259
             Project: Hivemall
          Issue Type: Bug
    Affects Versions: 0.5.2
            Reporter: Makoto Yui
            Assignee: Makoto Yui
             Fix For: 0.6.0


 

feature_binning does not work properly under certain conditions.

It might be a bug in the quantiles lookup, where the map is queried with a key object of a different type than its keys, at [this line|https://github.com/apache/incubator-hivemall/blob/master/core/src/main/java/hivemall/ftvec/binning/FeatureBinningUDF.java#L133].
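
For illustration only, here is a minimal sketch of the suspected class of bug. It is not the Hivemall code: the class name, method names, and values are hypothetical. It relies only on the fact that java.util.Map.get accepts any Object, so a lookup with a key of a different type compiles but never matches.

```java
import java.util.HashMap;
import java.util.Map;

public class KeyTypeMismatchDemo {

    // Hypothetical stand-in for a feature -> quantiles map (not Hivemall code).
    static Map<String, double[]> buildQuantiles() {
        Map<String, double[]> quantiles = new HashMap<>();
        quantiles.put("1", new double[] {0.0, 0.5, 1.0});
        return quantiles;
    }

    // Map.get takes Object, so a key of the wrong type is accepted by the
    // compiler and simply yields null at runtime.
    static double[] lookup(Map<String, double[]> quantiles, Object key) {
        return quantiles.get(key);
    }

    public static void main(String[] args) {
        Map<String, double[]> quantiles = buildQuantiles();
        // Key of the same type as the map's keys: entry is found.
        System.out.println(lookup(quantiles, "1") != null);
        // Key of a different type (Integer instead of String): no entry is found.
        System.out.println(lookup(quantiles, Integer.valueOf(1)) != null);
    }
}
```

If this is what happens at the line above, features whose key fails the lookup would be passed through without binning.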

 
{code:sql}
WITH extracted as (
  select
    extract_feature(feature) as index,
    extract_weight(feature) as value
  from
    input l
    LATERAL VIEW explode(features) r as feature
),
bins as (
   select
     map(index, build_bins(value, 5, true)) as quantiles -- 5 bins with auto bin shrinking
   from
     extracted
   group by
     index
)
select
  l.features as original,
  feature_binning(l.features, r.quantiles) as features
from
  input l
  cross join bins r
;
{code}

--
This message was sent by Atlassian JIRA
(v7.6.3#76005)
