Re: ANOVA test in Spark

2016-05-28 Thread cyberjog
If a specific algorithm is not available in Spark, perhaps you can use R or
Python (scikit-learn): pipe your data to it and get the model back.

I'm currently trying this, and it works fine.
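
As a rough illustration of this route (a sketch under assumptions, not code from the thread): suppose the data has already been reduced to per-group counts, sums and sums of squares, which is the kind of aggregation Spark handles well. The one-way ANOVA F statistic can then be computed in plain Python on the small result. The names `summarize` and `one_way_anova_f` and the toy rows are hypothetical; a library such as SciPy (`scipy.stats.f_oneway`) would also give the p-value.

```python
from collections import defaultdict
from typing import Dict, Iterable, Tuple

Summary = Tuple[int, float, float]  # (count, sum, sum of squares)


def summarize(rows: Iterable[Tuple[str, float]]) -> Dict[str, Summary]:
    """Reduce (group, value) pairs to per-group (n, sum, sum_sq).

    In Spark this step would be an aggregation by key; here it runs locally.
    """
    acc = defaultdict(lambda: [0, 0.0, 0.0])
    for group, value in rows:
        entry = acc[group]
        entry[0] += 1
        entry[1] += value
        entry[2] += value * value
    return {g: (e[0], e[1], e[2]) for g, e in acc.items()}


def one_way_anova_f(summaries: Dict[str, Summary]) -> float:
    """Return the one-way ANOVA F statistic from per-group summaries."""
    k = len(summaries)
    n_total = sum(n for n, _, _ in summaries.values())
    if k < 2 or n_total <= k:
        raise ValueError("need at least 2 groups and more rows than groups")
    grand_mean = sum(s for _, s, _ in summaries.values()) / n_total

    ss_between = 0.0  # variation of group means around the grand mean
    ss_within = 0.0   # variation of values around their own group mean
    for n, total, total_sq in summaries.values():
        mean = total / n
        ss_between += n * (mean - grand_mean) ** 2
        ss_within += total_sq - n * mean * mean
    if ss_within == 0:
        raise ValueError("no within-group variation; F is undefined")

    return (ss_between / (k - 1)) / (ss_within / (n_total - k))


# Toy data, made up for illustration only.
rows = [
    ("a", 1.0), ("a", 2.0), ("a", 3.0),
    ("b", 2.0), ("b", 3.0), ("b", 4.0),
    ("c", 5.0), ("c", 6.0), ("c", 7.0),
]
f_stat = one_way_anova_f(summarize(rows))
print(f_stat)  # about 13 for this toy data
```

Working from summaries keeps the data that leaves Spark tiny (one triple per group), at the cost of some numerical precision compared with a two-pass computation.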



--
View this message in context: 
http://apache-spark-user-list.1001560.n3.nabble.com/ANOVA-test-in-Spark-tp26949p27043.html
Sent from the Apache Spark User List mailing list archive at Nabble.com.

-
To unsubscribe, e-mail: user-unsubscr...@spark.apache.org
For additional commands, e-mail: user-h...@spark.apache.org



Local vs Standalone cluster production deployment

2016-05-28 Thread cyberjog
Hi,

I have a question about production deployment modes for Spark.

I have 3 applications that I would like to run independently on a single
machine, and the drivers need to run on that same machine.

My resources are also limited: about 4-5 GB of RAM and 3-4 cores.

For deployment in standalone mode, I believe I need, per application:

App 1: 1 driver JVM, 1 worker node (1 executor)
App 2: 1 driver JVM, 1 worker node (1 executor)
App 3: 1 driver JVM, 1 worker node (1 executor)

The issue is that this requires 6 JVMs running in parallel, for which I do
not have sufficient CPU/memory resources.


Hence I am leaning towards a local mode deployment, and would like to know
whether anybody is using local mode, where the driver and executor run in a
single JVM, in production.

Are there any inherent issues with using local mode for production systems?
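
A minimal sketch of what the local-mode option could look like, assuming the three applications are launched with `spark-submit`. The script names, the 1 GB held back for the OS, and the even split of cores and memory are hypothetical. In local mode the executor runs inside the driver JVM, so `--driver-memory` sets the heap for the whole application.

```python
from pathlib import PurePath
from typing import List


def plan_local_mode(
    app_paths: List[str],
    total_cores: int,
    total_mem_mb: int,
    reserved_mb: int = 1024,  # assumption: memory left for the OS
) -> List[List[str]]:
    """Build one spark-submit command per app, splitting resources evenly."""
    if not app_paths:
        raise ValueError("no applications given")
    cores_each = max(1, total_cores // len(app_paths))
    heap_each_mb = (total_mem_mb - reserved_mb) // len(app_paths)
    if heap_each_mb <= 0:
        raise ValueError("not enough memory for the requested applications")

    commands = []
    for path in app_paths:
        commands.append([
            "spark-submit",
            "--master", f"local[{cores_each}]",   # driver and executor in one JVM
            "--name", PurePath(path).stem,
            "--driver-memory", f"{heap_each_mb}m",
            path,
        ])
    return commands


# Hypothetical scripts on a 3-core, 4 GB machine: 3 JVMs in total.
commands = plan_local_mode(
    ["app1.py", "app2.py", "app3.py"], total_cores=3, total_mem_mb=4096
)
for cmd in commands:
    print(" ".join(cmd))
```

This gives one JVM per application instead of a separate driver and executor for each, which is the saving the question is after. Whether the fault-tolerance trade-off is acceptable is a separate matter.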



--
View this message in context: 
http://apache-spark-user-list.1001560.n3.nabble.com/local-Vs-Standalonecluster-production-deployment-tp27042.html