For example, I may have huge different regions and every regions have many
or less points.

And I also want to real-time to analyze the newest data and older data, but
I don't want to put data into memory cuz I don't have enough memory.

What I thought I can use is partial_fit to accept streaming data when new
data comes in.

But the incoming data has hierarchical, it's hard to cluster them cuz I
don't have older and newer data together to cluster.

How to design the system better?

thx
_______________________________________________
scikit-learn mailing list
scikit-learn@python.org
https://mail.python.org/mailman/listinfo/scikit-learn

Reply via email to