[I] Gravitino Spark Connector — Iceberg multi-level namespace issue (1.2.0) [gravitino]

via GitHub Mon, 06 Apr 2026 22:09:05 -0700


markhoerth opened a new issue, #10698:
URL: https://github.com/apache/gravitino/issues/10698


   When attempting to query an Iceberg catalog through the Gravitino Spark 
connector, the following error is thrown:
   
   spark-sql (nyc_taxi)> use iceberg_nyc.nyc_taxi
                       > ;
   26/04/07 05:05:22 ERROR SparkSQLDriver: Failed in [use iceberg_nyc.nyc_taxi
   ]
   java.lang.IllegalArgumentException: Doesn't support multi level namespaces: 
iceberg_nyc.nyc_taxi
           at 
org.apache.gravitino.shaded.com.google.common.base.Preconditions.checkArgument(Preconditions.java:143)
           at 
org.apache.gravitino.spark.connector.catalog.BaseCatalog.validateNamespace(BaseCatalog.java:533)
           at 
org.apache.gravitino.spark.connector.catalog.BaseCatalog.loadNamespaceMetadata(BaseCatalog.java:358)
           at 
org.apache.spark.sql.connector.catalog.SupportsNamespaces.namespaceExists(SupportsNamespaces.java:98)
           at 
org.apache.spark.sql.connector.catalog.CatalogManager.assertNamespaceExist(CatalogManager.scala:109)
           at 
org.apache.spark.sql.connector.catalog.CatalogManager.setCurrentNamespace(CatalogManager.scala:127)
   
   Where iceberg_nyc is the Gravitino catalog, nyc_taxi is the schema, and 
yellow_trips is the table — standard three-level catalog.schema.table 
addressing. The same catalog and table are queryable via Trino without issue. 
   
   trino> show catalogs
       -> ;
       Catalog
   ---------------
    gravitino
    hive_lakefs
    hive_nyc
    iceberg_nyc
    jmx
    memory
    postgres_demo
    system
    tpcds
    tpch
   (10 rows)
   
   Query 20260407_050630_00002_df6yw, FINISHED, 1 node
   Splits: 7 total, 7 done (100.00%)
   0.04 [0 rows, 0B] [0 rows/s, 0B/s]
   
   trino> show schemas in iceberg_nyc;
          Schema
   --------------------
    information_schema
    nyc_taxi
    view_test
   (3 rows)
   
   Query 20260407_050644_00003_df6yw, FINISHED, 1 node
   Splits: 7 total, 7 done (100.00%)
   0.18 [3 rows, 50B] [16 rows/s, 272B/s]
   
   trino> show tables in iceberg_nyc.nyc_taxi
       -> ;
       Table
   --------------
    yellow_trips
   (1 row)
   
   Query 20260407_050717_00004_df6yw, FINISHED, 1 node
   Splits: 7 total, 7 done (100.00%)
   0.18 [1 rows, 30B] [5 rows/s, 166B/s]
   
   trino> select * from iceberg_nyc.nyc_tax.yellow_trips;
   Query 20260407_050733_00005_df6yw failed: line 1:15: Schema 'nyc_tax' does 
not exist
   select * from iceberg_nyc.nyc_tax.yellow_trips
   
   trino>  select * from iceberg_nyc.nyc_taxi.yellow_trips;
    vendorid |    tpep_pickup_datetime    |   tpep_dropoff_datetime    | 
passenger_count | trip_distance | ratecodeid | store>
   
----------+----------------------------+----------------------------+-----------------+---------------+------------+------>
           2 | 2024-01-24 15:17:12.000000 | 2024-01-24 15:34:53.000000 |        
       1 |          3.33 |          1 | N    >
           2 | 2024-01-24 15:52:24.000000 | 2024-01-24 16:01:39.000000 |        
       1 |          1.61 |          1 | N    >
           2 | 2024-01-24 15:08:55.000000 | 2024-01-24 15:31:35.000000 |        
       1 |          4.38 |          1 | N    >
           2 | 2024-01-24 15:42:55.000000 | 2024-01-24 15:51:35.000000 |        
       1 |          0.95 |          1 | N    
   
   The problem is specific to the Gravitino Spark connector in 1.2.0. 


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]

[I] Gravitino Spark Connector — Iceberg multi-level namespace issue (1.2.0) [gravitino]

Reply via email to