Dell - Internal Use - Confidential
While writing to Kafka from Storm, the hdfs bolt provides a nice way to batch
the messages , rotate files, file name convention etc as shown below.
Do you know of something similar in Spark Streaming or do we have to roll our
own? If anyone attempted this can
Dell - Internal Use - Confidential
Dell - Internal Use - Confidential
While writing to Kafka from Storm, the hdfs bolt provides a nice way to batch
the messages , rotate files, file name convention etc as shown below.
Do you know of something similar in Spark Streaming or do we have to roll our
Dell - Internal Use - Confidential
Did you check https://drive.google.com/file/d/0B7tmGAdbfMI2OXl6azYySk5iTGM/edit
and
http://spark.apache.org/docs/latest/job-scheduling.html#dynamic-resource-allocation
Not sure if the spark kafka receiver emits metrics on the lag, check this link
out