sandeep-katta opened a new pull request #32284:
URL: https://github.com/apache/spark/pull/32284


   ### What changes were proposed in this pull request?
   
   As a part of the SPARK-26837 pruning of nested fields from object 
serializers are supported. But it is missed to handle case insensitivity nature 
of spark
   
   In this PR I have resolved the column names to be pruned based on 
`spark.sql.caseSensitive ` config
   **Exception Before Fix**
   
   ```
   Caused by: java.lang.ArrayIndexOutOfBoundsException: 0
     at org.apache.spark.sql.types.StructType.apply(StructType.scala:414)
     at 
org.apache.spark.sql.catalyst.optimizer.ObjectSerializerPruning$$anonfun$apply$4.$anonfun$applyOrElse$3(objects.scala:216)
     at 
scala.collection.TraversableLike.$anonfun$map$1(TraversableLike.scala:238)
     at scala.collection.immutable.List.foreach(List.scala:392)
     at scala.collection.TraversableLike.map(TraversableLike.scala:238)
     at scala.collection.TraversableLike.map$(TraversableLike.scala:231)
     at scala.collection.immutable.List.map(List.scala:298)
     at 
org.apache.spark.sql.catalyst.optimizer.ObjectSerializerPruning$$anonfun$apply$4.applyOrElse(objects.scala:215)
     at 
org.apache.spark.sql.catalyst.optimizer.ObjectSerializerPruning$$anonfun$apply$4.applyOrElse(objects.scala:203)
     at 
org.apache.spark.sql.catalyst.trees.TreeNode.$anonfun$transformDown$1(TreeNode.scala:309)
     at 
org.apache.spark.sql.catalyst.trees.CurrentOrigin$.withOrigin(TreeNode.scala:72)
     at 
org.apache.spark.sql.catalyst.trees.TreeNode.transformDown(TreeNode.scala:309)
     at 
   ```
   
   ### Why are the changes needed?
   After Upgrade to Spark 3 `foreachBatch` API throws` 
java.lang.ArrayIndexOutOfBoundsException`. This issue will be fixed using this 
PR
   
   
   ### Does this PR introduce _any_ user-facing change?
   No, Infact fixes the regression
   
   
   ### How was this patch tested?
   Added tests and also tested verified manually


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org

Reply via email to