[jira] [Created] (FLINK-32889) BinaryClassificationEvaluator gives wrong weighted AUC value

2023-08-17 Thread Fan Hong (Jira)
Fan Hong created FLINK-32889: Summary: BinaryClassificationEvaluator gives wrong weighted AUC value Key: FLINK-32889 URL: https://issues.apache.org/jira/browse/FLINK-32889 Project: Flink Issue

[jira] [Created] (FLINK-32810) Improve managed memory usage in ListStateWithCache

2023-08-08 Thread Fan Hong (Jira)
Fan Hong created FLINK-32810: Summary: Improve managed memory usage in ListStateWithCache Key: FLINK-32810 URL: https://issues.apache.org/jira/browse/FLINK-32810 Project: Flink Issue Type

[jira] [Created] (FLINK-31846) Support cancel final checkpoint when all tasks are finished

2023-04-18 Thread Fan Hong (Jira)
Fan Hong created FLINK-31846: Summary: Support cancel final checkpoint when all tasks are finished Key: FLINK-31846 URL: https://issues.apache.org/jira/browse/FLINK-31846 Project: Flink Issue

[jira] [Created] (FLINK-31809) Improve efficiency of ListStateWithCache#snapshotState

2023-04-14 Thread Fan Hong (Jira)
Fan Hong created FLINK-31809: Summary: Improve efficiency of ListStateWithCache#snapshotState Key: FLINK-31809 URL: https://issues.apache.org/jira/browse/FLINK-31809 Project: Flink Issue Type

Re: [DISCUSS] Releasing Flink ML 2.2.0

2023-03-30 Thread Fan Hong
Hi Dong and Zhipeng, Thanks for starting the discussion. Glad to see a new release of Flink ML. Cheers! On Fri, Mar 31, 2023 at 2:34 PM Zhipeng Zhang wrote: > Hi Dong, > > Thanks for starting the discussion. +1 for the Flink ML 2.1.0 release. >

[jira] [Created] (FLINK-31625) Possbile OOM in KBinsDiscretizer

2023-03-27 Thread Fan Hong (Jira)
Fan Hong created FLINK-31625: Summary: Possbile OOM in KBinsDiscretizer Key: FLINK-31625 URL: https://issues.apache.org/jira/browse/FLINK-31625 Project: Flink Issue Type: Bug

[jira] [Created] (FLINK-31623) Improvements on DataStreamUtils#sample

2023-03-27 Thread Fan Hong (Jira)
Fan Hong created FLINK-31623: Summary: Improvements on DataStreamUtils#sample Key: FLINK-31623 URL: https://issues.apache.org/jira/browse/FLINK-31623 Project: Flink Issue Type: Improvement

[jira] [Created] (FLINK-31189) Allow ignore less frequent values in StringIndexer

2023-02-22 Thread Fan Hong (Jira)
Fan Hong created FLINK-31189: Summary: Allow ignore less frequent values in StringIndexer Key: FLINK-31189 URL: https://issues.apache.org/jira/browse/FLINK-31189 Project: Flink Issue Type

[jira] [Created] (FLINK-31030) Support more binary classification evaluation metrics.

2023-02-12 Thread Fan Hong (Jira)
Fan Hong created FLINK-31030: Summary: Support more binary classification evaluation metrics. Key: FLINK-31030 URL: https://issues.apache.org/jira/browse/FLINK-31030 Project: Flink Issue Type

[jira] [Created] (FLINK-31029) KBinsDiscretizer gives wrong bin edges when input data contains only 2 distinct values

2023-02-12 Thread Fan Hong (Jira)
Fan Hong created FLINK-31029: Summary: KBinsDiscretizer gives wrong bin edges when input data contains only 2 distinct values Key: FLINK-31029 URL: https://issues.apache.org/jira/browse/FLINK-31029

[jira] [Created] (FLINK-31026) KBinsDiscretizer should gives binEdges wrong bin edges when all values are same.

2023-02-12 Thread Fan Hong (Jira)
Fan Hong created FLINK-31026: Summary: KBinsDiscretizer should gives binEdges wrong bin edges when all values are same. Key: FLINK-31026 URL: https://issues.apache.org/jira/browse/FLINK-31026 Project

[jira] [Created] (FLINK-31010) Add Transformer and Estimator for GBTClassifier and GBTRegressor

2023-02-10 Thread Fan Hong (Jira)
Fan Hong created FLINK-31010: Summary: Add Transformer and Estimator for GBTClassifier and GBTRegressor Key: FLINK-31010 URL: https://issues.apache.org/jira/browse/FLINK-31010 Project: Flink

[jira] [Created] (FLINK-30982) Support checkpoint mechanism in GBT

2023-02-08 Thread Fan Hong (Jira)
Fan Hong created FLINK-30982: Summary: Support checkpoint mechanism in GBT Key: FLINK-30982 URL: https://issues.apache.org/jira/browse/FLINK-30982 Project: Flink Issue Type: Sub-task

[jira] [Created] (FLINK-30957) Support other missing features

2023-02-07 Thread Fan Hong (Jira)
Fan Hong created FLINK-30957: Summary: Support other missing features Key: FLINK-30957 URL: https://issues.apache.org/jira/browse/FLINK-30957 Project: Flink Issue Type: Sub-task

[jira] [Created] (FLINK-30956) Add Python implementation of GBTClassifer and GBTRegressor.

2023-02-07 Thread Fan Hong (Jira)
Fan Hong created FLINK-30956: Summary: Add Python implementation of GBTClassifer and GBTRegressor. Key: FLINK-30956 URL: https://issues.apache.org/jira/browse/FLINK-30956 Project: Flink Issue

[jira] [Created] (FLINK-30955) Support early stopping with validation set.

2023-02-07 Thread Fan Hong (Jira)
Fan Hong created FLINK-30955: Summary: Support early stopping with validation set. Key: FLINK-30955 URL: https://issues.apache.org/jira/browse/FLINK-30955 Project: Flink Issue Type: Sub-task

[jira] [Created] (FLINK-30954) Add estimator and transformer for GBTRegressor.

2023-02-07 Thread Fan Hong (Jira)
Fan Hong created FLINK-30954: Summary: Add estimator and transformer for GBTRegressor. Key: FLINK-30954 URL: https://issues.apache.org/jira/browse/FLINK-30954 Project: Flink Issue Type: Sub-task

[jira] [Created] (FLINK-30953) Support intermediate state management and model save/load.

2023-02-07 Thread Fan Hong (Jira)
Fan Hong created FLINK-30953: Summary: Support intermediate state management and model save/load. Key: FLINK-30953 URL: https://issues.apache.org/jira/browse/FLINK-30953 Project: Flink Issue

[jira] [Created] (FLINK-30952) Add main training and transforming part.

2023-02-07 Thread Fan Hong (Jira)
Fan Hong created FLINK-30952: Summary: Add main training and transforming part. Key: FLINK-30952 URL: https://issues.apache.org/jira/browse/FLINK-30952 Project: Flink Issue Type: Sub-task

[jira] [Created] (FLINK-30939) Add public APIs and topmost framework for GBTClassifer

2023-02-07 Thread Fan Hong (Jira)
Fan Hong created FLINK-30939: Summary: Add public APIs and topmost framework for GBTClassifer Key: FLINK-30939 URL: https://issues.apache.org/jira/browse/FLINK-30939 Project: Flink Issue Type

[jira] [Created] (FLINK-30937) Add Transformer and Estimator for GBTClassifier and GBTRegressor

2023-02-07 Thread Fan Hong (Jira)
Fan Hong created FLINK-30937: Summary: Add Transformer and Estimator for GBTClassifier and GBTRegressor Key: FLINK-30937 URL: https://issues.apache.org/jira/browse/FLINK-30937 Project: Flink

[jira] [Created] (FLINK-30734) KBinsDiscretizer handles Double.NaN incorrectly

2023-01-18 Thread Fan Hong (Jira)
Fan Hong created FLINK-30734: Summary: KBinsDiscretizer handles Double.NaN incorrectly Key: FLINK-30734 URL: https://issues.apache.org/jira/browse/FLINK-30734 Project: Flink Issue Type: Bug

[jira] [Created] (FLINK-30730) StringIndexer cannot handle null values correctly when training

2023-01-17 Thread Fan Hong (Jira)
Fan Hong created FLINK-30730: Summary: StringIndexer cannot handle null values correctly when training Key: FLINK-30730 URL: https://issues.apache.org/jira/browse/FLINK-30730 Project: Flink

[jira] [Created] (FLINK-30401) Add Estimator and Transformer for MinHashLSH

2022-12-13 Thread Fan Hong (Jira)
Fan Hong created FLINK-30401: Summary: Add Estimator and Transformer for MinHashLSH Key: FLINK-30401 URL: https://issues.apache.org/jira/browse/FLINK-30401 Project: Flink Issue Type: New Feature

Re: [DISCUSS] FLIP-173: Support DAG of algorithms (Flink ML)

2021-08-19 Thread Fan Hong
some community convention. I also would like to mention the same issue exists in Proposal 1, as there are also multiple places where developers can implement algorithms. In summary, I think the first and second issue above are preference-related, and hope my thoughts can give some clues. The third

Re: [DISCUSS] FLIP-173: Support DAG of algorithms (Flink ML)

2021-08-19 Thread Fan Hong
ve. [1] https://docs.google.com/document/d/1L3aI9LjkcUPoM52liEY6uFktMnFMNFQ6kXAjnz_11do Sincerely, Fan Hong On Fri, Aug 6, 2021 at 3:56 PM Becket Qin wrote: > Hi Zhipeng, > > Yes, I agree that the key difference between the two options is how they > support MIMO. > > My main