For example, I may have huge different regions and every regions have many or less points.
And I also want to real-time to analyze the newest data and older data, but I don't want to put data into memory cuz I don't have enough memory. What I thought I can use is partial_fit to accept streaming data when new data comes in. But the incoming data has hierarchical, it's hard to cluster them cuz I don't have older and newer data together to cluster. How to design the system better? thx
_______________________________________________ scikit-learn mailing list scikit-learn@python.org https://mail.python.org/mailman/listinfo/scikit-learn