Hi
I am reading the Spark SQL code. What are streamedPlan and buildPlan
in the HashJoin trait for?
  protected lazy val (buildPlan, streamedPlan) = buildSide match {
    case BuildLeft => (left, right)
    case BuildRight => (right, left)
  }
https://github.com/apache/spark/blob/master/sql/core/src/main/scala
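
For context, here is a rough sketch of the usual build/stream split in a hash join, written with plain Scala collections rather than Spark's operators. The object HashJoinSketch and the function hashJoin are hypothetical names made up for this illustration; only BuildLeft and BuildRight mirror the names in the snippet above, and the sketch assumes a simple inner equi-join. It is not Spark's actual implementation.

```scala
// Hypothetical, simplified illustration; NOT Spark's HashJoin code.
object HashJoinSketch {
  sealed trait BuildSide
  case object BuildLeft extends BuildSide
  case object BuildRight extends BuildSide

  // Inner equi-join of two keyed sequences.
  // Output pairs are always ordered (leftValue, rightValue).
  def hashJoin[K, L, R](
      left: Seq[(K, L)],
      right: Seq[(K, R)],
      buildSide: BuildSide): Seq[(L, R)] = buildSide match {
    case BuildLeft =>
      // Build side: load every left row into an in-memory hash table.
      val table: Map[K, Seq[L]] =
        left.groupBy(_._1).map { case (k, rows) => k -> rows.map(_._2) }
      // Streamed side: walk the right rows one by one and probe the table.
      right.flatMap { case (k, r) =>
        table.getOrElse(k, Seq.empty[L]).map(l => (l, r))
      }
    case BuildRight =>
      // Same idea with the roles swapped: hash the right, stream the left.
      val table: Map[K, Seq[R]] =
        right.groupBy(_._1).map { case (k, rows) => k -> rows.map(_._2) }
      left.flatMap { case (k, l) =>
        table.getOrElse(k, Seq.empty[R]).map(r => (l, r))
      }
  }

  def main(args: Array[String]): Unit = {
    val users = Seq(1 -> "ann", 2 -> "bob")
    val orders = Seq(1 -> "book", 1 -> "pen", 3 -> "cup")
    println(hashJoin(users, orders, BuildLeft))
    println(hashJoin(users, orders, BuildRight))
  }
}
```

Either build side yields the same set of joined pairs, though possibly in a different order. The choice only decides which input is held in memory as the hash table and which is iterated, so the smaller input is normally the better build side.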
I would like to bring this back up for consideration. I'm open to adding
types for all the parameters, but it does seem onerous, and in the case of
Python, we don't do that. Do you feel strongly about adding them?
On Sat, May 30, 2015 at 8:04 PM Reynold Xin wrote:
> We added all the typetags fo
Hello Lubo,
The idea of timeouts is to make a best-effort, last-resort attempt to
process a key when it has not received data for a while. With a processing
time timeout of 1 minute, the system guarantees that it will not time out
unless at least 1 minute has passed. Defining a precise timing on w
Hi all
I have a question about the stateful operations [map/flatMap]GroupsWithState
in Structured Streaming. The issue is as follows:
Take the StructuredSessionization example: first I input two words, such as
apache and spark, in batch 0, then input another word, Hadoop, in batch 1 until
timeout