Hi, I am trying to answer a simple query with SparkSQL over a Parquet file. When I execute the query several times, the first run takes about 2s, while later runs take <0.1s.
Judging by the log file, the later runs don't seem to load the data from disk. However, I didn't enable any caching explicitly. Does SparkSQL use any automatic caching? Is there any way to check this? Thank you! Best, Wenlei