unsubscribe
- To unsubscribe e-mail: dev-unsubscr...@spark.apache.org
Missing data in spark output
Hello Everyone, We are recently observing an intermittent data loss in the spark with output to GCS (google cloud storage). When there are missing rows, they are accompanied by duplicate rows. The re-run of the job doesn't have any duplicate or missing rows. Since it's hard to debug, we are first