Re: [Help] Codegen Stage grows beyond 64 KB

2018-06-16 Thread Aakash Basu
Hi, I already went through it, that's one use case. I've a complex and very big pipeline of multiple jobs under one spark session. Not getting, on how to solve this, as it is happening over Logistic Regression and Random Forest models, which I'm just using from Spark ML package rather than doing

Re: [Help] Codegen Stage grows beyond 64 KB

2018-06-16 Thread vaquar khan
Hi Akash, Please check stackoverflow. https://stackoverflow.com/questions/41098953/codegen-grows-beyond-64-kb-error-when-normalizing-large-pyspark-dataframe Regards, Vaquar khan On Sat, Jun 16, 2018 at 3:27 PM, Aakash Basu wrote: > Hi guys, > > I'm getting an error when I'm feature

Re: Not able to sort out environment settings to start spark from windows

2018-06-16 Thread Raymond Xie
Thank you. But there is no special char or space, I actually copied it from Program Files to the root to ensure no space in the path. ** *Sincerely yours,* *Raymond* On Sat, Jun 16, 2018 at 3:42 PM, vaquar khan wrote: > Plz check ur Java Home

[Help] Codegen Stage grows beyond 64 KB

2018-06-16 Thread Aakash Basu
Hi guys, I'm getting an error when I'm feature engineering on 30+ columns to create about 200+ columns. It is not failing the job, but the ERROR shows. I want to know how can I avoid this. Spark - 2.3.1 Python - 3.6 Cluster Config - 1 Master - 32 GB RAM, 16 Cores 4 Slaves - 16 GB RAM, 8 Cores

Re: Not able to sort out environment settings to start spark from windows

2018-06-16 Thread vaquar khan
Plz check ur Java Home path . May be spacial char or space on ur path. Regards, Vaquar khan On Sat, Jun 16, 2018, 1:36 PM Raymond Xie wrote: > I am trying to run spark-shell in Windows but receive error of: > > \Java\jre1.8.0_151\bin\java was unexpected at this time. > > Environment: > >

Not able to sort out environment settings to start spark from windows

2018-06-16 Thread Raymond Xie
I am trying to run spark-shell in Windows but receive error of: \Java\jre1.8.0_151\bin\java was unexpected at this time. Environment: System variables: SPARK_HOME: c:\spark Path: C:\Program Files (x86)\Common