Hi Daniel
Well said
Regards
Vineel
On Tue, Jul 14, 2015, 6:11 AM Daniel Darabos
daniel.dara...@lynxanalytics.com wrote:
Hi Shahid,
To be honest I think this question is better suited for Stack Overflow
than for a PhD thesis.
On Tue, Jul 14, 2015 at 7:42 AM, shahid ashraf sha...@trialx.com wrote:
Hi,
I have a 10-node cluster. I loaded the data onto HDFS, and the number of
partitions I get is 9. I am running a Spark application, and it gets stuck on
one of the tasks. Looking at the UI, it seems the application is not using all
nodes for the computation. Attached is a screenshot of the tasks; it seems
tasks are put on each node more than once. Of the 9 tasks, 8 complete in
7-8 minutes and one takes around 30 minutes, which delays the
results.
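
One way to check whether the single slow task corresponds to an oversized partition is to compare per-partition record counts. The following is only a minimal sketch under stated assumptions: the counts are made-up numbers, and `skew_report` and its `threshold` parameter are hypothetical names, not part of Spark. The slow task could equally be caused by something else (for example, expensive records or a slow node), so this only covers the data-size explanation.

```python
from statistics import median


def skew_report(partition_sizes, threshold=2.0):
    """Flag partitions whose record count exceeds `threshold` times the median.

    `partition_sizes` is a list of record counts, one per partition.
    """
    # Nothing to report for an empty list or for all-empty partitions.
    if not partition_sizes or max(partition_sizes) == 0:
        return {"median": 0, "max": 0, "ratio": 0.0, "skewed": []}

    med = median(partition_sizes)
    largest = max(partition_sizes)
    # Ratio of the largest partition to the median one; infinite if the
    # median partition is empty.
    ratio = largest / med if med > 0 else float("inf")
    skewed = [i for i, n in enumerate(partition_sizes) if n > threshold * med]
    return {"median": med, "max": largest, "ratio": ratio, "skewed": skewed}


# Hypothetical counts for 9 partitions (assumption, not data from this thread).
# In PySpark, real counts could be collected with something like:
#   rdd.mapPartitions(lambda it: [sum(1 for _ in it)]).collect()
sizes = [100_000] * 8 + [400_000]
report = skew_report(sizes)
print(report)
```

If one partition turns out to be much larger than the rest, calling `repartition` on the RDD with a partition count at or above the number of available cores may spread the work more evenly, at the cost of a shuffle.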
On Tue, Jul 14, 2015 at 10:48 AM, Shashidhar Rao
raoshashidhar...@gmail.com wrote:
Hi,
I am doing my PhD thesis on large-scale machine learning, e.g. online
learning, batch learning, and mini-batch learning.
Could somebody help me with ideas, especially in the context of Spark and
the above learning methods?
Some ideas would be improvements to existing algorithms, implementing new
features (especially for the above learning methods), and algorithms that have
not been implemented yet.
If somebody could help me with some ideas, it would really accelerate my
work.
A few pointers to research papers regarding Spark or Mahout would also help.
Thanks in advance.
Regards
--
with Regards
Shahid Ashraf
-
To unsubscribe, e-mail: user-unsubscr...@spark.apache.org
For additional commands, e-mail: user-h...@spark.apache.org