Fan Hong created FLINK-30734:
--------------------------------
Summary: KBinsDiscretizer handles Double.NaN incorrectly
Key: FLINK-30734
URL: https://issues.apache.org/jira/browse/FLINK-30734
Project: Flink
Issue Type: Bug
Components: Library / Machine Learning
Reporter: Fan Hong
When the training data contains Double.NaN values and the strategy is set to
"quantile", the generated model data has Double.NaN as the right edge of the
largest bin.
My expected behavior is to ignore Double.NaN values when training, and to
support skip/error/keep strategy when transforming with generated
KBinsDiscretizerModel.
--
This message was sent by Atlassian Jira
(v8.20.10#820010)