Re: Structured Streaming - HDFS State Store Performance Issues

2020-01-14 Thread Gourav Sengupta
Hi Will, have you tried using S3 as state store with the option in EMR enabled for faster file sync, also there is an option now of using FSx Lustre. Thanks and Regards, Gourav Sengupta On Wed, Jan 15, 2020 at 5:17 AM William Briggs wrote: > Hi all, I've got a problem that really has me

Structured Streaming - HDFS State Store Performance Issues

2020-01-14 Thread William Briggs
Hi all, I've got a problem that really has me stumped. I'm running a Structured Streaming query that reads from Kafka, performs some transformations and stateful aggregations (using flatMapGroupsWithState), and outputs any updated aggregates to another Kafka topic. I'm running this job using