huaxingao commented on code in PR #5638:
URL: https://github.com/apache/iceberg/pull/5638#discussion_r958586790
##########
spark/v3.3/spark/src/test/java/org/apache/iceberg/spark/source/TestSparkTable.java:
##########
@@ -47,14 +55,39 @@ public void removeTable() {
@Test
public void testTableEquality() throws NoSuchTableException {
- CatalogManager catalogManager = spark.sessionState().catalogManager();
- TableCatalog catalog = (TableCatalog) catalogManager.catalog(catalogName);
- Identifier identifier = Identifier.of(tableIdent.namespace().levels(),
tableIdent.name());
- SparkTable table1 = (SparkTable) catalog.loadTable(identifier);
- SparkTable table2 = (SparkTable) catalog.loadTable(identifier);
-
+ SparkTable table1 = loadTable();
+ SparkTable table2 = loadTable();
// different instances pointing to the same table must be equivalent
Assert.assertNotSame("References must be different", table1, table2);
Assert.assertEquals("Tables must be equivalent", table1, table2);
}
+
+ @Test
+ public void testOverwriteFilterConversions() throws NoSuchTableException {
Review Comment:
Thanks a lot for taking a look at this PR!
I looked at the real-world usage (`INSERT OVERWRITE` or
`DataFrameWriterV2.overwrite`) and realized that actually Spark will throw
`AnalysisException` if the overwrite filters are on invalid columns. So there
is no need to bind the filters. I will close this PR.
The reason I did this PR is because I was trying to address this
[comment](https://github.com/apache/iceberg/pull/5302#discussion_r950580132).
Now since there is no need to bind the filters in
`SparkFilters.convert(Filter[] filters)`, I will add back the
`SparkV2Filters.convert(Predicate[] predicates)`.
I am also wondering if this
[bind](https://github.com/apache/iceberg/blob/master/spark/v3.3/spark/src/main/java/org/apache/iceberg/spark/source/SparkScanBuilder.java#L122)
is needed. If the filter expression is on invalid columns, Spark throws
`AnalysisException` before it reaches here. Shall I remove this bind?
--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
To unsubscribe, e-mail: [email protected]
For queries about this service, please contact Infrastructure at:
[email protected]
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]