Re: [SQL] Why does spark.read.csv.cache give me a WARN about cache but not text?!

2016-08-17 Thread Jacek Laskowski
Hi Michael,

Thanks a lot for your help. See below explains for csv and text. Do you see anything worth investigating?

scala> spark.read.csv("people.csv").cache.explain(extended = true)
== Parsed Logical Plan ==
Relation[_c0#39,_c1#40,_c2#41,_c3#42] csv

== Analyzed Logical Plan ==
_c0: string,

Re: [SQL] Why does spark.read.csv.cache give me a WARN about cache but not text?!

2016-08-16 Thread Michael Armbrust
Try running explain on each of these. My guess would be that caching is broken in some cases.

On Tue, Aug 16, 2016 at 6:05 PM, Jacek Laskowski wrote:
> Hi,
>
> Can anyone explain why spark.read.csv("people.csv").cache.show ends up
> with a WARN while

[SQL] Why does spark.read.csv.cache give me a WARN about cache but not text?!

2016-08-16 Thread Jacek Laskowski
Hi,

Can anyone explain why spark.read.csv("people.csv").cache.show ends up with a WARN while spark.read.text("people.csv").cache.show does not? It happens in 2.0 and today's build.

scala> sc.version
res5: String = 2.1.0-SNAPSHOT

scala> spark.read.csv("people.csv").cache.show