Hi, I have a quick serialization question. I'm trying to read a date range of input files with Spark, and I get a serialization error when the input path is built by an object that generates the date range. Specifically, my code uses DateTimeFormat from the Joda-Time package, and the formatter is not serializable. How do I get Spark to compute the input path eagerly, rather than lazily, so that it doesn't run into the serialization issue?
Code:

    import org.joda.time.{DateTime, Period}
    import org.joda.time.format.DateTimeFormat

    object Range {
      val now = new DateTime
      // Joda-Time formatter; this is the non-serializable piece
      val dateFormatter = DateTimeFormat.forPattern("MMddyyyy")

      // Every date from `from` to `to` (inclusive), advancing by `step`
      def dateRange(from: DateTime, to: DateTime, step: Period): Iterator[DateTime] =
        Iterator.iterate(from)(_.plus(step)).takeWhile(!_.isAfter(to))

      // Comma-separated MMddyyyy strings for the last 22 days through today
      def getRange: String = {
        dateRange(now.minusDays(22), now, Period.days(1)).map(dateFormatter.print(_)).mkString(",")
      }
    }

    val notWorking = "path/output_{" + Range.getRange + "}/*/*"
    val working = "path/output_{08122014,08132014,08142014,08152014,08162014,08172014,08182014,08192014,08202014,08212014,08222014,08232014,08242014,08252014,08262014,08272014,08282014,08292014,08302014,08312014,09012014,09022014,09032014,09042014}/*/*"

    val lines = sc.textFile(working).count

--
View this message in context: http://apache-spark-user-list.1001560.n3.nabble.com/Serialize-input-path-tp13519.html
Sent from the Apache Spark User List mailing list archive at Nabble.com.
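
To make the goal concrete, here is a minimal sketch of what I mean by computing the path eagerly. It is not my actual code: EagerRange, datesCsv and inputPath are hypothetical names, and I have swapped in java.time (Java 8) for Joda-Time so the snippet runs without extra dependencies. The idea is that the formatter is created locally instead of being stored as a field, and the path is bound to a plain String before it reaches textFile. Whether this is enough to avoid the serialization error in the shell is an assumption I have not verified.

```scala
import java.time.LocalDate
import java.time.format.DateTimeFormatter

// Hypothetical helper (not from the code above): it holds no fields, so there
// is no non-serializable state for a closure to capture.
object EagerRange {
  // Builds "MMddyyyy,MMddyyyy,..." for every day from `from` to `to`, inclusive.
  def datesCsv(from: LocalDate, to: LocalDate): String = {
    // The formatter is created locally rather than stored as an object field.
    val fmt = DateTimeFormatter.ofPattern("MMddyyyy")
    Iterator.iterate(from)(_.plusDays(1))
      .takeWhile(!_.isAfter(to))
      .map(d => fmt.format(d))
      .mkString(",")
  }

  // Wraps the date list in the same glob shape as the paths above.
  def inputPath(from: LocalDate, to: LocalDate): String =
    "path/output_{" + datesCsv(from, to) + "}/*/*"
}

// Evaluated once, here; `path` is an ordinary String from this point on.
val path = EagerRange.inputPath(LocalDate.of(2014, 8, 12), LocalDate.of(2014, 8, 14))
// val lines = sc.textFile(path).count   // needs a live SparkContext
```

With this shape, the only thing handed to sc.textFile is a String, which is the behavior I am hoping to get from the Joda-Time version as well.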