Hi user@, I'm working on a project using the 7.0.0 Java arrow-dataset library and noticing that if I try to create a DataSet from a uri pointing to a file in Google Cloud Storage (e.g. "gs://my-bucket/my-file.parquet") I'm getting the following abbreviated stacktrace (with my bucket/object names redacted):
java.lang.RuntimeException: Unrecognized filesystem type in URI: gs://<bucketname>/<filename>.parquet at org.apache.arrow.dataset.file.JniWrapper.makeFileSystemDatasetFactory(Native Method) at org.apache.arrow.dataset.file.FileSystemDatasetFactory.createNative(FileSystemDatasetFactory.java:35) at org.apache.arrow.dataset.file.FileSystemDatasetFactory.<init>(FileSystemDatasetFactory.java:31) ... Looking at the Arrow cpp source, it seems the most likely culprit is the included libarrow_dataset_jni shared library in the arrow-dataset-7.0.0.jar was built without GCS support. Is this a mistake or a known issue? Anyone know? Thanks, Dave Voutila
