flink-ml algorithms

2022-06-02 Thread Natia Chachkhiani
Hi, I am running OnlineKmeans from flink-ml repo on a small dataset. I've noticed that I don't get consistent results, assignments to clusters, across different runs. I have set both parallelism and globalBatchSize to 1. I am doing simple fit and transform on each data point ingested. Is the order

Re: flink-ml algorithms

2022-06-05 Thread Natia Chachkhiani
dataset and got inconsistent results too? >> >> Best regards, >> Jing >> >> On Fri, Jun 3, 2022 at 4:29 AM Natia Chachkhiani < >> natia.chachkhia...@gmail.com> wrote: >> >>> Hi, >>> >>> I am running OnlineKmeans from flink-ml re

Re: flink-ml algorithms

2022-06-06 Thread Natia Chachkhiani
Hi, I have another question. Is the implementation of kmeans in flink-ml same as Spark's StreamingKmeans? Should the accuracy/results from the same dataset be comparable between the two? On Sun, Jun 5, 2022 at 8:14 PM Natia Chachkhiani < natia.chachkhia...@gmail.com> wrote: >

flink-ml onlinekmeans

2022-06-09 Thread Natia Chachkhiani
Hi, I have a couple of questions about the onlineKmeans algorithm. I am running OnlikeKmeans on a small dataset (36k records) with k=2 and testing varying decay rates. The features are consumed in flink from kafka topic. Sample feature: [0., 5.0, 1.0, 0.0, 0.0, 0.0, 0.0] Is the implementation

unsubscribe

2023-02-27 Thread Natia Chachkhiani