[ https://issues.apache.org/jira/browse/FLINK-30730?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Fan Hong updated FLINK-30730:
-----------------------------
    Description: 
When training data contains null values, StringIndexer throws an exception. The reason is this method [1]: null values are neither String type nor Number type.

In StringIndexerModel, null values are also not handled correctly when performing transformation.

[1] [https://github.com/apache/flink-ml/blob/966cedd7bbab4e12d8d8b37dbd582146714e68a6/flink-ml-lib/src/main/java/org/apache/flink/ml/feature/stringindexer/StringIndexer.java#L164]

  was:
When training data contains null values, StringIndexer throws an exception. The reason is this method [1]: null values are neither String type nor Number type.

[1] [https://github.com/apache/flink-ml/blob/966cedd7bbab4e12d8d8b37dbd582146714e68a6/flink-ml-lib/src/main/java/org/apache/flink/ml/feature/stringindexer/StringIndexer.java#L164]

> StringIndexer cannot handle null values correctly
> -------------------------------------------------
>
>                 Key: FLINK-30730
>                 URL: https://issues.apache.org/jira/browse/FLINK-30730
>             Project: Flink
>          Issue Type: Bug
>          Components: Library / Machine Learning
>    Affects Versions: ml-2.1.0
>            Reporter: Fan Hong
>            Priority: Major
>
> When training data contains null values, StringIndexer throws an exception.
> The reason is this method [1]: null values are neither String type nor Number
> type.
> In StringIndexerModel, null values are also not handled correctly when
> performing transformation.
>
> [1]
> [https://github.com/apache/flink-ml/blob/966cedd7bbab4e12d8d8b37dbd582146714e68a6/flink-ml-lib/src/main/java/org/apache/flink/ml/feature/stringindexer/StringIndexer.java#L164]

--
This message was sent by Atlassian Jira
(v8.20.10#820010)
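
For illustration only, here is a minimal sketch of the failure mode described in this issue. A type check that accepts only String and Number rejects null, because `null instanceof X` is always false in Java. One possible fix is to handle null in its own branch before the type checks. The class `NullSafeKeys` and its methods are hypothetical names invented for this sketch. They are not part of Flink ML and do not reproduce the code at [1]. Skipping nulls is just one assumed policy among several.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical helper, not Flink ML code: shows one null-tolerant way to
// turn a column value into the string key an indexer would count.
final class NullSafeKeys {

    private NullSafeKeys() {}

    /** Returns the string key for a value, or null when the value itself is null. */
    static String toKey(Object value) {
        if (value == null) {
            // Assumption: null is reported back to the caller instead of
            // falling through to the "unsupported type" error below.
            return null;
        }
        if (value instanceof String) {
            return (String) value;
        }
        if (value instanceof Number) {
            return String.valueOf(value);
        }
        throw new IllegalArgumentException(
                "Only string and numeric values are supported, got: "
                        + value.getClass().getName());
    }

    /** Collects keys for a column and skips nulls (an assumed policy, not the only option). */
    static List<String> keysSkippingNulls(List<?> column) {
        List<String> keys = new ArrayList<>();
        for (Object value : column) {
            String key = toKey(value);
            if (key != null) {
                keys.add(key);
            }
        }
        return keys;
    }
}
```

The same null check would be needed on the transformation side, since a model that looks up keys built this way would otherwise meet null inputs with no matching entry. Whether nulls should be skipped, rejected with a clear error, or mapped to a reserved index is a design decision this sketch leaves open.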