Hi,
Have you tried creating more column blocks? For example, with a smaller
colsPerBlock (the second argument):
BlockMatrix matrix = cmatrix.toBlockMatrix(100, 100);
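To make the effect of that second argument concrete, here is a rough sketch in plain Java (no Spark needed). It only counts how many blocks a given block size would produce, assuming the block grid is the ceiling of the matrix size divided by the block size in each dimension. The 10,000 x 10,000 dimensions and the BlockGrid class are made up for illustration; your real matrix size will differ.

```java
public final class BlockGrid {
    // Number of blocks along one dimension: ceil(size / perBlock),
    // computed with integer arithmetic.
    static long blocksAlong(long size, int perBlock) {
        return (size + perBlock - 1) / perBlock;
    }

    public static void main(String[] args) {
        // Hypothetical matrix dimensions, not taken from the original post.
        long rows = 10_000L;
        long cols = 10_000L;
        int rowsPerBlock = 100;

        // Compare the original colsPerBlock (1000) with the suggested one (100).
        for (int colsPerBlock : new int[] {1000, 100}) {
            long rowBlocks = blocksAlong(rows, rowsPerBlock);
            long colBlocks = blocksAlong(cols, colsPerBlock);
            System.out.println("toBlockMatrix(" + rowsPerBlock + ", " + colsPerBlock + "): "
                    + rowBlocks + " x " + colBlocks + " = " + (rowBlocks * colBlocks) + " blocks");
        }
    }
}
```

With these made-up dimensions, going from 1000 to 100 columns per block gives ten times as many (smaller) blocks, so the work is spread over more partitions and each block needs less memory.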
Is your data randomly spread out, or do you generally have clusters of
data points together?
On Wed, Jan 25, 2017 at 4:23 AM, Petr Shestov wrote:
Hi all!
I'm using Spark 2.0.1 with two workers (one executor each) with 20 GB each.
I run the following code:
JavaRDD<MatrixEntry> entries = ...; // filling the data
CoordinateMatrix cmatrix = new CoordinateMatrix(entries.rdd());
BlockMatrix matrix = cmatrix.toBlockMatrix(100, 1000);
BlockMatrix cooc =