GraphFrames and IPython notebook issue - No module named graphframes

2016-04-25 Thread Camelia Elena Ciolac
In a terminal: ipython notebook --profile=nbserver. I open http://127.0.0.1:/ in the browser, and in my IPython notebook I have, among others: jar_path = '/home/camelia/spark_extra_jars/spark-csv_2.11-1.2.0.jar,/home/camelia/spark_extra_jars/commons-csv-1.2.jar,/home/camelia/spark_extra_jars

What happens in the master or slave launch?

2015-10-07 Thread Camelia Elena Ciolac
Hello, I have a question about two scenarios: 1) In the first scenario (if I'm connected to the target node), the master starts successfully. Its log contains: Spark Command: /usr/opt/java/jdk1.7.0_07/jre/bin/java -cp /home/camelia/spark-1.4.1-bin-hadoop2.6/sbin/../conf/:/home/camelia

What happens in the master or slave launch?

2015-10-07 Thread camelia
Hello, I have a question about two scenarios: 1) In the first scenario (if I'm connected to the target node), the master starts successfully. Its log contains: Spark Command: /usr/opt/java/jdk1.7.0_07/jre/bin/java -cp /home/camelia/spark-1.4.1-bin-hadoop2.6/sbin/../conf/:/home/camelia

Need for advice - performance improvement and out of memory resolution

2015-09-30 Thread Camelia Elena Ciolac
t I paste below: === df_result_12.head(1) ------- Py4JJavaError Traceback (most recent call last) in () > 1 df_result_12.head(1) /home/camelia/spark-1.4.1-bin-hadoop2.

Need for advice - performance improvement and out of memory resolution

2015-09-30 Thread Camelia Elena Ciolac
I paste below: === df_result_12.head(1) ------- Py4JJavaError Traceback (most recent call last) in () > 1 df_result_12.head(1) /home/camelia/spark-1.4.1-bin-hadoop2.

Need for advice - performance improvement and out of memory resolution

2015-09-30 Thread camelia
df_result_12.head(1) ------- Py4JJavaError Traceback (most recent call last) in () > 1 df_result_12.head(1) /home/camelia/spark-1.4.1-bin-hadoop2.6/python