Bruce Robbins created SPARK-39184: ------------------------------------- Summary: ArrayIndexOutOfBoundsException for some date/time sequences in some time-zones Key: SPARK-39184 URL: https://issues.apache.org/jira/browse/SPARK-39184 Project: Spark Issue Type: Bug Components: SQL Affects Versions: 3.2.1, 3.3.0, 3.4.0 Reporter: Bruce Robbins
The following query gets an {{ArrayIndexOutOfBoundsException}} when run from the {{America/Los_Angeles}} time-zone: {noformat} spark-sql> select sequence(timestamp'2022-03-13 00:00:00', timestamp'2022-03-16 03:00:00', interval 1 day 1 hour) as x; 22/05/13 14:47:27 ERROR SparkSQLDriver: Failed in [select sequence(timestamp'2022-03-13 00:00:00', timestamp'2022-03-16 03:00:00', interval 1 day 1 hour) as x] java.lang.ArrayIndexOutOfBoundsException: 3 {noformat} In fact, any such query will get an {{ArrayIndexOutOfBoundsException}} if the start-stop period in your time-zone includes more instances of "spring forward" than instances of "fall back" and the start-stop period is evenly divisible by the interval. In the {{America/Los_Angeles}} time-zone, examples include: {noformat} -- This query encompasses 2 instances of "spring forward" but only one -- instance of "fall back". select sequence( timestamp'2022-03-13', timestamp'2022-03-13' + (interval '42' hours * 209), interval '42' hours) as x; {noformat} {noformat} select sequence( timestamp'2022-03-13', timestamp'2022-03-13' + (interval '31' hours * 11), interval '31' hours) as x; {noformat} -- This message was sent by Atlassian Jira (v8.20.7#820007) --------------------------------------------------------------------- To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org For additional commands, e-mail: issues-h...@spark.apache.org