Github user myui commented on the issue:
https://github.com/apache/incubator-hivemall/pull/102
Need to fix the following error:
```sql
create table predicted_rf_old
as
SELECT
passengerid,
predicted.label,
predicted.probability,
predicted.probabilities
FROM (
SELECT
passengerid,
rf_ensemble(predicted) as predicted
-- hivemall v0.5-rc.1 or later
-- rf_ensemble(predicted.value, predicted.posteriori, model_weight) as
predicted
-- rf_ensemble(predicted.value, predicted.posteriori) as predicted --
avoid OOB accuracy (i.e., model_weight)
FROM (
SELECT
t.passengerid,
-- hivemall v0.4.1-alpha.2 or before
-- tree_predict(p.model, t.features, ${classification}) as predicted
-- hivemall v0.4.1-alpha.3 or later
-- tree_predict(p.model_id, p.model_type, p.pred_model, t.features,
${classification}) as predicted
-- hivemall v0.5-rc.1 or later
-- p.model_weight,
-- tree_predict(p.model_id, p.model, t.features, ${classification})
as predicted
tree_predict_v1(p.model_id, p.model_type, p.pred_model, t.features,
${classification}) as predicted -- to use the old model in v0.5-rc.1 or later
FROM (
SELECT
-- model_id, model
-- hivemall v0.4.1-alpha.3 or later
model_id, -3 as model_type, model as pred_model
-- hivemall v0.5-rc.1 or later
-- model_id, model_weight, model
FROM
model_rf
DISTRIBUTE BY rand(1)
) p
LEFT OUTER JOIN test_rf t
) t1
group by
passengerid
) t2
;
```
```
Caused by: java.lang.NullPointerException
at
hivemall.smile.tools.RandomForestEnsembleUDAF$RfEvaluatorV1.merge(RandomForestEnsembleUDAF.java:186)
at
org.apache.hadoop.hive.ql.udf.generic.GenericUDAFEvaluator.aggregate(GenericUDAFEvaluator.java:191)
at
org.apache.hadoop.hive.ql.exec.GroupByOperator.updateAggregations(GroupByOperator.java:619)
at
org.apache.hadoop.hive.ql.exec.GroupByOperator.processHashAggr(GroupByOperator.java:794)
at
org.apache.hadoop.hive.ql.exec.GroupByOperator.processKey(GroupByOperator.java:700)
at
org.apache.hadoop.hive.ql.exec.GroupByOperator.process(GroupByOperator.java:768)
... 28 more
```
---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at [email protected] or file a JIRA ticket
with INFRA.
---