It will run distributed
On Mar 2, 2016 3:00 PM, "Priya Ch" wrote:
> Hi All,
>
> I am running k-means clustering algorithm. Now, when I am running the
> algorithm as -
>
> val conf = new SparkConf
> val sc = new SparkContext(conf)
> .
> .
> val kmeans = new KMeans()
> val model = kmeans.run(RDD[
Hi All,
I am running k-means clustering algorithm. Now, when I am running the
algorithm as -
val conf = new SparkConf
val sc = new SparkContext(conf)
.
.
val kmeans = new KMeans()
val model = kmeans.run(RDD[Vector])
.
.
.
The 'kmeans' object gets created on driver. Now does *kmeans.run() *get
e
them from there
>> when they are needed.
>> Actually, it's not necessary to set so large driver memory in your case,
>> because KMeans use low memory for driver if your k is not very large.
>>
>> Cheers
>> Yanbo
>>
>> 2015-12-30 22:20 GMT+08:00 Jia
hat don't fit on disk and read them from there when
> they are needed.
> Actually, it's not necessary to set so large driver memory in your case,
> because KMeans use low memory for driver if your k is not very large.
>
> Cheers
> Yanbo
>
> 2015-12-30 22:20 GMT
t so large driver memory in your case,
because KMeans use low memory for driver if your k is not very large.
Cheers
Yanbo
2015-12-30 22:20 GMT+08:00 Jia Zou :
> I am running Spark MLLib KMeans in one EC2 M3.2xlarge instance with 8 CPU
> cores and 30GB memory. Executor memory is set to 15GB, and
I am running Spark MLLib KMeans in one EC2 M3.2xlarge instance with 8 CPU
cores and 30GB memory. Executor memory is set to 15GB, and driver memory is
set to 15GB.
The observation is that, when input data size is smaller than 15GB, the
performance is quite stable. However, when input data becomes
i want evaluate some different distance measure for time-space clustering.
so i need a api for implement my own function in java.
2015-05-19 22:08 GMT+02:00 Xiangrui Meng :
> Just curious, what distance measure do you need? -Xiangrui
>
> On Mon, May 11, 2015 at 8:28 AM, Jaonary Rabarisoa
> wrote
Just curious, what distance measure do you need? -Xiangrui
On Mon, May 11, 2015 at 8:28 AM, Jaonary Rabarisoa wrote:
> take a look at this
> https://github.com/derrickburns/generalized-kmeans-clustering
>
> Best,
>
> Jao
>
> On Mon, May 11, 2015 at 3:55 PM, Driesprong, Fokko
> wrote:
>>
>> Hi Pa
take a look at this
https://github.com/derrickburns/generalized-kmeans-clustering
Best,
Jao
On Mon, May 11, 2015 at 3:55 PM, Driesprong, Fokko
wrote:
> Hi Paul,
>
> I would say that it should be possible, but you'll need a different
> distance measure which conforms to your coordinate system.
Hi Paul,
I would say that it should be possible, but you'll need a different
distance measure which conforms to your coordinate system.
2015-05-11 14:59 GMT+02:00 Pa Rö :
> hi,
>
> it is possible to use a custom distance measure and a other data typ as
> vector?
> i want cluster temporal geo dat
hi,
it is possible to use a custom distance measure and a other data typ as
vector?
i want cluster temporal geo datas.
best regards
paul
View this message in context:
http://apache-spark-user-list.1001560.n3.nabble.com/Spark-MLLib-KMeans-Top-Terms-tp22154.html
Sent from the Apache Spark User List mailing list archive at Nabble.com.
-
To unsubscribe, e-mail:
12 matches
Mail list logo