from:"Jeremy Davis"

Limit on the number of Jobs per Application

2018-05-30 Thread Jeremy Davis

I have an application that runs many thousands of univariate GLM regressions and seems to break down after completing around 25K jobs. Plenty of resources are free (disk, network, memory, CPU), but eventually it schedules on only a few threads (out of 400+ possible on the cluster). No task failu

Re: OutOfMemory while calculating window functions

2016-09-21 Thread Jeremy Davis

0))) .withColumn("min"+(-win),min("value").over(baseWin.rowsBetween(-win,0))) .withColumn("max"+(-win),max("value").over(baseWin.rowsBetween(-win,0))) } resultFrame.show() } } > On Sep 20, 2016, at 10:26 PM, Jeremy Dav

Re: Need Help Diagnosing/operating/tuning

2015-11-22 Thread Jeremy Davis

It seems like the problem is related to --executor-cores. Is there possibly some sort of race condition when using multiple cores per executor? On Nov 22, 2015, at 12:38 PM, Jeremy Davis <jda...@marketshare.com> wrote: Hello, I’m at a loss trying to diagnose why my spark job is f

Need Help Diagnosing/operating/tuning

2015-11-22 Thread Jeremy Davis

Hello, I’m at a loss trying to diagnose why my Spark job is failing (it works fine on small data). It is failing during the repartition, or on the subsequent steps, which then seem to fail and fall back to repartitioning. I’ve tried adjusting every parameter I can find, but have had no success. I

4 matches
