Hello Scikit-learn community!
I was just wondering if anyone was using Cassandra
as a datastore for scikit-learn, and what your data
pipeline architecture looks like ? Do you just use Pycassa
to get the data, and run scikit-learn off of it ?
How do you iterate through the data when modeling so that
all the data doesn't fit into memory ? (I'd like to use all
the data in our Cassandra cluster for modeling/training/etc...)
Thank you very much for any help you can give!
Harold
------------------------------------------------------------------------------
See everything from the browser to the database with AppDynamics
Get end-to-end visibility with application monitoring from AppDynamics
Isolate bottlenecks and diagnose root cause in seconds.
Start your free trial of AppDynamics Pro today!
http://pubads.g.doubleclick.net/gampad/clk?id=48808831&iu=/4140/ostg.clktrk
_______________________________________________
Scikit-learn-general mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/scikit-learn-general