I started my pyspark shell with the following command (I am using Spark 1.6):

    bin/pyspark --packages graphframes:graphframes:0.1.0-spark1.6
I have also copied http://dl.bintray.com/spark-packages/maven/graphframes/graphframes/0.1.0-spark1.6/graphframes-0.1.0-spark1.6.jar to the lib directory of Spark.

I was getting the following error:

    >>> from graphframes import *
    Traceback (most recent call last):
      File "<stdin>", line 1, in <module>
    zipimport.ZipImportError: can't find module 'graphframes'

So, as suggested in similar questions, I extracted the graphframes Python directory and copied it to the local directory where I am running pyspark. The import itself then succeeds:

    >>> from graphframes import *

However, I am still not able to create a GraphFrame:

    >>> g = GraphFrame(v, e)
    Traceback (most recent call last):
      File "<stdin>", line 1, in <module>
    NameError: name 'GraphFrame' is not defined

I am also getting the following error:

    >>> from graphframes.examples import Graphs
    Traceback (most recent call last):
      File "<stdin>", line 1, in <module>
    ImportError: Bad magic number in graphframes/examples.pyc

Any help will be highly appreciated.

- Arun
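
For reference, the extraction step described above can be sketched in Python roughly as follows. This is only an illustration under stated assumptions: the jar path and target directory are guesses at a typical setup, the `extract_python_sources` helper is hypothetical (it is not part of graphframes or Spark), and it assumes the jar ships `.py` sources alongside the compiled files. It deliberately skips `.pyc` files, because a "Bad magic number" error generally means a `.pyc` was compiled by a different Python version than the one importing it.

```python
import os
import zipfile


def extract_python_sources(jar_path, package, dest_dir):
    """Extract only the .py files of `package` from a jar (a zip archive).

    Hypothetical helper for illustration; returns the extracted member names.
    """
    prefix = package.rstrip("/") + "/"
    extracted = []
    with zipfile.ZipFile(jar_path) as jar:
        for name in jar.namelist():
            # Keep source files only; compiled .pyc files are left behind.
            if name.startswith(prefix) and name.endswith(".py"):
                jar.extract(name, dest_dir)
                extracted.append(name)
    return sorted(extracted)


if __name__ == "__main__":
    # Assumed location of the downloaded jar; adjust to the actual setup.
    jar = "lib/graphframes-0.1.0-spark1.6.jar"
    if os.path.exists(jar):
        print(extract_python_sources(jar, "graphframes", "."))
```

With only the `.py` sources in the working directory, Python recompiles them with the interpreter that pyspark is actually using, which may or may not be enough to resolve the errors above.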