[ https://issues.apache.org/jira/browse/SPARK-37832?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Dongjoon Hyun resolved SPARK-37832.
-----------------------------------
    Fix Version/s: 3.3.0
       Resolution: Fixed

Issue resolved by pull request 35120
[https://github.com/apache/spark/pull/35120]

> Orc struct serializer should look up field converters in an array rather than a linked list
> -------------------------------------------------------------------------------------------
>
>                 Key: SPARK-37832
>                 URL: https://issues.apache.org/jira/browse/SPARK-37832
>             Project: Spark
>          Issue Type: Improvement
>          Components: SQL
>    Affects Versions: 3.3.0
>            Reporter: Bruce Robbins
>            Assignee: Bruce Robbins
>            Priority: Major
>             Fix For: 3.3.0
>
>
> The OrcSerializer's struct converter uses an index to look up each field
> converter in a linked list. An indexed lookup in a linked list walks n/2
> nodes on average, so serializing one row costs about n*(n/2) node
> traversals (where n is the field count).
> Simply converting the linked list to an array, where an indexed lookup takes
> constant time, brings performance gains, especially for wide structs.
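
For illustration only, here is a minimal Python sketch of the cost difference described in the issue. It is not Spark's code (OrcSerializer is written in Scala), and every name in it is hypothetical. It counts how many elements are visited when each of n field converters is fetched once by index, first from a singly linked list and then from an array-backed list.

```python
class Node:
    """One cell of a singly linked list (hypothetical stand-in for a list node)."""

    def __init__(self, value, next_node=None):
        self.value = value
        self.next = next_node


def build_linked_list(values):
    """Build a singly linked list from `values` and return its head."""
    head = None
    for value in reversed(values):
        head = Node(value, head)
    return head


def linked_lookup(head, index):
    """Return (value, visited): reaching position `index` visits index + 1 nodes."""
    node, visited = head, 1
    for _ in range(index):
        node = node.next
        visited += 1
    return node.value, visited


def visits_per_row_linked(field_count):
    """Nodes visited when every field converter is looked up once by index."""
    head = build_linked_list([f"converter_{i}" for i in range(field_count)])
    total = 0
    for i in range(field_count):
        _, visited = linked_lookup(head, i)
        total += visited
    return total


def visits_per_row_array(field_count):
    """Elements visited with an array: one direct access per field."""
    converters = [f"converter_{i}" for i in range(field_count)]
    total = 0
    for i in range(field_count):
        _ = converters[i]  # direct indexed access, no traversal
        total += 1
    return total


if __name__ == "__main__":
    n = 100
    print(visits_per_row_linked(n))  # 5050, i.e. n * (n + 1) / 2
    print(visits_per_row_array(n))   # 100, i.e. n
```

For a struct with 100 fields, the linked-list version visits 5050 nodes per row against 100 array accesses, which matches the roughly n*(n/2) growth the issue describes and suggests why the gain is largest for wide structs.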