Hi,
Correct me if I were wrong. It looks like, the current version of
Spark-SQL is *tuple-at-a-time* module. Basically, each time the physical
operator produces a tuple by recursively call child->execute .
There are papers that illustrate the benefits of vectorized query
engine. And Hive-Stinger also embrace this style.
So, the question is, will Spark-SQL give a support to vectorized query
execution someday?
Thanks
