Hi,

I am running a simple query with SparkSQL over a Parquet file.
When I execute the query several times, the first run takes about 2s,
while later runs take <0.1s.

Judging by the log file, the later runs don't seem to load the data
from disk. However, I didn't enable any cache explicitly. Does SparkSQL
use any automatic caching? Is there any way to check this?

Thank you!

Best,
Wenlei
