nastra commented on code in PR #6801:
URL: https://github.com/apache/iceberg/pull/6801#discussion_r1105482313
##########
spark/v3.3/spark-extensions/src/test/java/org/apache/iceberg/spark/extensions/TestRewriteDataFilesProcedure.java:
##########
@@ -499,13 +553,15 @@ public void testBinPackTableWithSpecialChars() {
public void testSortTableWithSpecialChars() {
Assume.assumeTrue(catalogName.equals(SparkCatalogConfig.HADOOP.catalogName()));
- sql(
- "CREATE TABLE %s (c1 int, c2 string, c3 string) USING iceberg",
- tableName(QUOTED_SPECIAL_CHARS_TABLE_NAME));
+ String tblName = tableName(QUOTED_SPECIAL_CHARS_TABLE_NAME);
+ sql("CREATE TABLE %s (c1 int, c2 string, c3 string) USING iceberg",
tblName);
- insertData(tableName(QUOTED_SPECIAL_CHARS_TABLE_NAME), 10);
+ insertData(tblName, 10);
+ // TODO: metadata table access currently fails with special chars in the
table name
+ // long dataSizeBefore = testDataSize(tblName);
Review Comment:
this seems like a bug to me where the table with special characters can't be
found when running `SELECT sum(file_size_in_bytes) from %s.files`. It fails
with
```
Caused by: java.io.FileNotFoundException: File
file:/tmp/warehouse2890706410427132468.tmp/default/table:with.special:chars/metadata/2ae7cad2-3cff-4c92-9536-9bf9652f119d-m1.avro
does not exist
at
org.apache.hadoop.fs.RawLocalFileSystem.deprecatedGetFileStatus(RawLocalFileSystem.java:779)
at
org.apache.hadoop.fs.RawLocalFileSystem.getFileLinkStatusInternal(RawLocalFileSystem.java:1100)
at
org.apache.hadoop.fs.RawLocalFileSystem.getFileStatus(RawLocalFileSystem.java:769)
at
org.apache.hadoop.fs.FilterFileSystem.getFileStatus(FilterFileSystem.java:462)
at
org.apache.hadoop.fs.ChecksumFileSystem$ChecksumFSInputChecker.<init>(ChecksumFileSystem.java:160)
at
org.apache.hadoop.fs.ChecksumFileSystem.open(ChecksumFileSystem.java:372)
at org.apache.hadoop.fs.FileSystem.open(FileSystem.java:976)
at
org.apache.iceberg.hadoop.HadoopInputFile.newStream(HadoopInputFile.java:183)
```
--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
To unsubscribe, e-mail: [email protected]
For queries about this service, please contact Infrastructure at:
[email protected]
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]