I recently started using Spark version 1.3.0 in standalone mode (with Scala
2.10.3), and I'm running into an odd problem. I'm loading data from a file
using sc.textFile, doing some conversion of the data, and then clustering
it. When I do this with a small file (10 lines, 9 KB), it works fine, and
the program terminates.

However, when I load my full data file (400,000 lines, 167 MB), the process
hangs sometime after the last line of my main() method. (The last line is a
print statement, and it gets printed.) I am creating quite a few objects, so
I don't know if perhaps the garbage collection is just taking a really long
time? (5+ minutes; I haven't had the patience to let it go longer than
that.)

Is there anything I should try to fix this, or to help diagnose the issue?

Thanks,
Michael



--
View this message in context: 
http://apache-spark-user-list.1001560.n3.nabble.com/Spark-hanging-after-main-method-completes-tp22544.html
Sent from the Apache Spark User List mailing list archive at Nabble.com.

---------------------------------------------------------------------
To unsubscribe, e-mail: user-unsubscr...@spark.apache.org
For additional commands, e-mail: user-h...@spark.apache.org

Reply via email to