Re: Run Spark with Ignite Shared RDD on Large Volume of Data

2018-12-06 Thread zaleslaw
I could make only simple advise: try Ignite KMeans clusterization over the data directly stored in Ignite Please, have look to example KMeans If you w

Re: Run Spark with Ignite Shared RDD on Large Volume of Data

2018-12-05 Thread Jia Zou
Anyone has a clue about this? Thanks a lot! Jia -- Sent from: http://apache-ignite-users.70518.x6.nabble.com/

Run Spark with Ignite Shared RDD on Large Volume of Data

2018-11-15 Thread Jia Zou
In recent I'm running Spark MLLIb KMeans with Apach Ignite 2.6.0 shared RDD on ten AWS r4.2xlarge workers. It works and runs to finish on 1 billion points (within memory), but failed with 2 billion points (exceeding available memory) My code for loading data to Ignite Shared RDD is here: https://