[jira] [Assigned] (SPARK-19517) KafkaSource fails to initialize partition offsets
[ https://issues.apache.org/jira/browse/SPARK-19517?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Shixiong Zhu reassigned SPARK-19517:
-----------------------------------

    Assignee: Roberto Agostino Vitillo

> KafkaSource fails to initialize partition offsets
> -------------------------------------------------
>
>                 Key: SPARK-19517
>                 URL: https://issues.apache.org/jira/browse/SPARK-19517
>             Project: Spark
>          Issue Type: Bug
>          Components: Structured Streaming
>    Affects Versions: 2.1.0
>            Reporter: Roberto Agostino Vitillo
>            Assignee: Roberto Agostino Vitillo
>            Priority: Blocker
>             Fix For: 2.1.1, 2.2.0
>
>         Attachments: SPARK-19517ProposalforfixingKafkaOffsetMetadata.pdf
>
>
> A Kafka source with many partitions can cause the check-pointing logic to
> fail on restart. I got the following exception when trying to restart a
> Structured Streaming app that reads from a Kafka topic with a hundred
> partitions.
> {code}
> 17/02/08 15:10:09 ERROR StreamExecution: Query [id = 24e2a21a-4545-4a3e-80ea-bbe777d883ab, runId = 025609c9-d59c-4de3-88b3-5d5f7eda4a66] terminated with error
> java.lang.IllegalArgumentException: Expected e.g. {"topicA":{"0":23,"1":-1},"topicB":{"0":-2}}, got {"telemetry":{"92":302854
>   at org.apache.spark.sql.kafka010.JsonUtils$.partitionOffsets(JsonUtils.scala:74)
>   at org.apache.spark.sql.kafka010.KafkaSourceOffset$.apply(KafkaSourceOffset.scala:59)
>   at org.apache.spark.sql.kafka010.KafkaSource$$anon$1.deserialize(KafkaSource.scala:134)
>   at org.apache.spark.sql.kafka010.KafkaSource$$anon$1.deserialize(KafkaSource.scala:123)
>   at org.apache.spark.sql.execution.streaming.HDFSMetadataLog.get(HDFSMetadataLog.scala:237)
>   at org.apache.spark.sql.kafka010.KafkaSource.initialPartitionOffsets$lzycompute(KafkaSource.scala:138)
>   at org.apache.spark.sql.kafka010.KafkaSource.initialPartitionOffsets(KafkaSource.scala:121)
> …
> {code}

--
This message was sent by Atlassian JIRA
(v6.3.15#6346)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org
For additional commands, e-mail: issues-h...@spark.apache.org
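Editorial note on the exception in the report: the message shows the expected shape of the stored offsets (a JSON object mapping topic to partition to offset) next to a received string that stops in the middle of the object. The Python sketch below is only an illustration of why such a string cannot be turned back into partition offsets. It is not the Spark code in JsonUtils.scala; the function name parse_partition_offsets and the extra partition "0":17 in the complete sample are hypothetical. The report itself does not say where the string was cut off, so the sketch makes no claim about the cause.

```python
import json


def parse_partition_offsets(text):
    """Parse '{"topic": {"partition": offset}}' into {(topic, partition): offset}.

    Hypothetical helper for illustration; it mimics only the error message
    format seen in the report, not Spark's actual implementation.
    """
    try:
        raw = json.loads(text)
        return {
            (topic, int(partition)): int(offset)
            for topic, partitions in raw.items()
            for partition, offset in partitions.items()
        }
    except (ValueError, AttributeError, TypeError) as err:
        # json.JSONDecodeError is a subclass of ValueError, so a string that
        # ends mid-object is reported here.
        raise ValueError(
            'Expected e.g. {"topicA":{"0":23,"1":-1},"topicB":{"0":-2}}, got '
            + text
        ) from err


# A complete offsets string (the second partition is made up for the example).
complete = '{"telemetry":{"92":302854,"0":17}}'
offsets = parse_partition_offsets(complete)

# The string exactly as it appears in the exception message of the report.
truncated = '{"telemetry":{"92":302854'
try:
    parse_partition_offsets(truncated)
    truncated_failed = False
except ValueError:
    truncated_failed = True
```

With the complete string the helper returns one entry per topic and partition; with the cut-off string it raises, which matches the kind of failure shown in the stack trace.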
[jira] [Assigned] (SPARK-19517) KafkaSource fails to initialize partition offsets
[ https://issues.apache.org/jira/browse/SPARK-19517?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Apache Spark reassigned SPARK-19517:
-----------------------------------

    Assignee:     (was: Apache Spark)

> KafkaSource fails to initialize partition offsets
> -------------------------------------------------
>
>                 Key: SPARK-19517
>                 URL: https://issues.apache.org/jira/browse/SPARK-19517
>             Project: Spark
>          Issue Type: Bug
>          Components: Structured Streaming
>    Affects Versions: 2.1.0
>            Reporter: Roberto Agostino Vitillo
>            Priority: Critical
[jira] [Assigned] (SPARK-19517) KafkaSource fails to initialize partition offsets
[ https://issues.apache.org/jira/browse/SPARK-19517?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Apache Spark reassigned SPARK-19517:
-----------------------------------

    Assignee: Apache Spark

> KafkaSource fails to initialize partition offsets
> -------------------------------------------------
>
>                 Key: SPARK-19517
>                 URL: https://issues.apache.org/jira/browse/SPARK-19517
>             Project: Spark
>          Issue Type: Bug
>          Components: Structured Streaming
>    Affects Versions: 2.1.0
>            Reporter: Roberto Agostino Vitillo
>            Assignee: Apache Spark
>            Priority: Critical