You are probably casually sending UIMA objects from the driver to executors in a closure. You'll have to design your program so that you do not need to ship these objects to or from the remote task workers.
On Fri, Nov 21, 2014 at 8:39 AM, jatinpreet <jatinpr...@gmail.com> wrote: > Hi, > > I am planning to use UIMA library to process data in my RDDs. I have had bad > experiences while using third party libraries inside worker tasks. The > system gets plagued with Serialization issues. But as UIMA classes are not > necessarily Serializable, I am not sure if it will work. > > Please explain which classes need to be Serializable and which of them can > be left as it is? A clear understanding will help me a lot. > > Thanks, > Jatin > > > > ----- > Novice Big Data Programmer > -- > View this message in context: > http://apache-spark-user-list.1001560.n3.nabble.com/Spark-serialization-issues-with-third-party-libraries-tp19454.html > Sent from the Apache Spark User List mailing list archive at Nabble.com. > > --------------------------------------------------------------------- > To unsubscribe, e-mail: user-unsubscr...@spark.apache.org > For additional commands, e-mail: user-h...@spark.apache.org > --------------------------------------------------------------------- To unsubscribe, e-mail: user-unsubscr...@spark.apache.org For additional commands, e-mail: user-h...@spark.apache.org