Why does it have to be a stream?
> Am 18.11.2018 um 23:29 schrieb Nicolas Paris :
>
> Hi
>
> I have pdf to load into spark with at least
> format. I have considered some options:
>
> - spark streaming does not provide a native file stream for binary with
> variable size (binaryRecordStream
Hi
I have pdf to load into spark with at least
format. I have considered some options:
- spark streaming does not provide a native file stream for binary with
variable size (binaryRecordStream specifies a constant size) and I
would have to write my own receiver.
- Structured streaming
Severity: Low
Vendor: The Apache Software Foundation
Versions Affected:
All versions of Apache Spark
Description:
Spark's standalone resource manager accepts code to execute on a 'master' host,
that then runs that code on 'worker' hosts. The master itself does not, by
design, execute user code.