I tried locally with the example described in the latest guide: http://54.82.157.211:4000/mllib-naive-bayes.html , and it worked fine. Do you mind sharing the code you used? -Xiangrui
On Thu, Apr 24, 2014 at 1:57 PM, John King <usedforprinting...@gmail.com> wrote: > Yes, I got it running for large RDD (~7 million lines) and mapping. Just > received this error when trying to classify. > > > On Thu, Apr 24, 2014 at 4:32 PM, Xiangrui Meng <men...@gmail.com> wrote: >> >> Is your Spark cluster running? Try to start with generating simple >> RDDs and counting. -Xiangrui >> >> On Thu, Apr 24, 2014 at 11:38 AM, John King >> <usedforprinting...@gmail.com> wrote: >> > I receive this error: >> > >> > Traceback (most recent call last): >> > >> > File "<stdin>", line 1, in <module> >> > >> > File >> > "/home/ubuntu/spark-1.0.0-rc2/python/pyspark/mllib/classification.py", >> > line >> > 178, in train >> > >> > ans = sc._jvm.PythonMLLibAPI().trainNaiveBayes(dataBytes._jrdd, >> > lambda_) >> > >> > File >> > >> > "/home/ubuntu/spark-1.0.0-rc2/python/lib/py4j-0.8.1-src.zip/py4j/java_gateway.py", >> > line 535, in __call__ >> > >> > File >> > >> > "/home/ubuntu/spark-1.0.0-rc2/python/lib/py4j-0.8.1-src.zip/py4j/java_gateway.py", >> > line 368, in send_command >> > >> > File >> > >> > "/home/ubuntu/spark-1.0.0-rc2/python/lib/py4j-0.8.1-src.zip/py4j/java_gateway.py", >> > line 361, in send_command >> > >> > File >> > >> > "/home/ubuntu/spark-1.0.0-rc2/python/lib/py4j-0.8.1-src.zip/py4j/java_gateway.py", >> > line 317, in _get_connection >> > >> > File >> > >> > "/home/ubuntu/spark-1.0.0-rc2/python/lib/py4j-0.8.1-src.zip/py4j/java_gateway.py", >> > line 324, in _create_connection >> > >> > File >> > >> > "/home/ubuntu/spark-1.0.0-rc2/python/lib/py4j-0.8.1-src.zip/py4j/java_gateway.py", >> > line 431, in start >> > >> > py4j.protocol.Py4JNetworkError: An error occurred while trying to >> > connect to >> > the Java server > >