RussellSpitzer commented on issue #13698: URL: https://github.com/apache/iceberg/issues/13698#issuecomment-3133661258
The Spark issue would mostly cause problems for applications based on Spark Connect or the Spark Thrift server, since a bad file could essentially break one of these longer-running processes. For users of normal Spark applications with standalone drivers or apps, it would just be a normal failure. So it is probably not a security issue in this context: if someone can get those files into your table, you probably have other issues that could break or corrupt a table ... maybe.

As for which file format to use, Parquet is definitely what folks should be using at the moment. We have a large pool of engineers working on that, as opposed to ORC, which we are struggling to find folks to work on. We have actively talked about removing ORC from the main Iceberg repo due to lack of engineers.

-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the URL above to go to the specific comment.

To unsubscribe, e-mail: issues-unsubscr...@iceberg.apache.org
For queries about this service, please contact Infrastructure at: us...@infra.apache.org