purna344 opened a new issue, #9404:
URL: https://github.com/apache/iceberg/issues/9404

   ### Feature Request / Improvement
   
   We are using Snowflake Iceberg to read the data from the S3 location and 
that is working fine for the non partitioned data. 
   
   But If the data is partitioned and that partitioned is stored similar to 
legacy HDFS format like `test3/country=USA/part-*1.c000.zstd.parquet`  where   
partitioned column "Country" and its value "USA" is stored in the file path and 
this column is not stored inside the parquet file, then Snowflake Iceberg is 
unable to read this data. Whereas other frameworks like Spark, Databricks are 
able to read this data. We contacted the Snowflake team and they said Iceberg 
is not able to recognize the partitioned column info in the folder path and 
Iceberg is expecting the partitioned column information inside the parquet 
file. 
   
   We try to read the same data by using Apace Spark Iceberg and it is working 
and if we try to access same data using Snowflake Iceberg then it is failing 
and unable to recognize the partition details. 
   
   To support this folder format through Snowflake Iceberg what changes needs 
to be done at application side like any config settings etc? 
   
   Please let us know how to fix this issue. 
   
   ### Query engine
   
   None


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: issues-unsubscr...@iceberg.apache.org.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscr...@iceberg.apache.org
For additional commands, e-mail: issues-h...@iceberg.apache.org

Reply via email to