[ https://issues.apache.org/jira/browse/SPARK-37832?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17470258#comment-17470258 ]
Apache Spark commented on SPARK-37832: -------------------------------------- User 'bersprockets' has created a pull request for this issue: https://github.com/apache/spark/pull/35120 > Orc struct serializer should look up field converters in an array rather than > a linked list > ------------------------------------------------------------------------------------------- > > Key: SPARK-37832 > URL: https://issues.apache.org/jira/browse/SPARK-37832 > Project: Spark > Issue Type: Improvement > Components: SQL > Affects Versions: 3.3.0 > Reporter: Bruce Robbins > Priority: Major > > The OrcSerializer's struct converter uses an index to look up a field > converter in a linked list, resulting in a n*(n/2) average complexity per row > (where n is the field count). > Simply converting the linked list to an array brings performance gains, > especially for wide structs. -- This message was sent by Atlassian Jira (v8.20.1#820001) --------------------------------------------------------------------- To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org For additional commands, e-mail: issues-h...@spark.apache.org