Riddle4045 opened a new issue, #8870:
URL: https://github.com/apache/hudi/issues/8870

   **Describe the problem you faced**
   TL;DR: Trino returns 0 records from a Hudi table even though I can see data in the object store.
   
   I am writing Hudi tables to ABFS from a Flink job. Reduced code:
   
   ```java
           DataStream<RowData> fares = env.addSource(new TaxiFareGenerator()).map(
                   event -> GenericRowData.of(
                           event.getRideId(),
                           event.getDriverId(),
                           event.getTaxiId(),
                           event.getStartTime(),
                           event.getTip(),
                           event.getTolls(),
                           event.getTotalFare()//,
                           //    event.getPaymentType()
                   ));
   
           String targetTable = "TaxiFare";
           String outputPath = String.join("/",basePath, "hudi4");
           Map<String, String> options = new HashMap<>();
   
           options.put(FlinkOptions.PATH.key(), outputPath);
           options.put(FlinkOptions.TABLE_TYPE.key(), HoodieTableType.MERGE_ON_READ.name());
   
           HoodiePipeline.Builder builder = HoodiePipeline.builder(targetTable)
                   .column("rideId BIGINT")
                   .column("driverId BIGINT")
                   .column("taxiId BIGINT")
                   .column("startTime BIGINT")
                   .column("tip FLOAT")
                   .column("tolls FLOAT")
                   .column("totalFare FLOAT")
                   .pk("driverId")
                   .options(options);
   
           builder.sink(fares, false);
           env.execute("Hudi Table");
   ```
   
   I sync these tables to the Hive Metastore (HMS) using the Hudi sync tool (`HiveSyncTool`):
   ```
   2023-06-01T13:15:09,757 INFO [main] org.apache.hudi.hive.HiveSyncTool - Sync complete for **hudi5_ro**
   2023-06-01T13:15:09,757 INFO [main] org.apache.hudi.hive.HiveSyncTool - Trying to sync hoodie table hudi5_rt with base path abfs://flink@****.dfs.core.windows.net/flink/click_events/hudi4 of type MERGE_ON_READ
   2023-06-01T13:15:11,977 INFO [main] org.apache.hudi.hive.HiveSyncTool - Sync table hudi5_rt for the first time.
   2023-06-01T13:15:17,712 INFO [main] org.apache.hudi.hive.HiveSyncTool - Last commit time synced was found to be null
   2023-06-01T13:15:17,712 INFO [main] org.apache.hudi.hive.HiveSyncTool - Sync all partitions given the last commit time synced is empty or before the start of the active timeline. Listing all partitions in abfs://flink@****.dfs.core.windows.net/flink/click_events/hudi4, file system: AzureBlobFileSystem{uri=abfs://flink@****.dfs.core.windows.net, user='ispatw', primaryUserGroup='ispatw'}
   2023-06-01T13:15:24,755 INFO [main] org.apache.hudi.hive.HiveSyncTool - Sync complete for **hudi5_rt**
   2023-06-01T13:15:24,761 INFO [main] org.apache.hadoop.hive.metastore.HiveMetaStoreClient - Closed a connection to metastore, current connections: 0
   ```
   
   I can see data streaming into the ABFS location:
   
![image](https://github.com/apache/hudi/assets/3648351/66b233bf-6b14-43b0-a462-d903895ad664)
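
Because the table is MERGE_ON_READ, it may be worth checking which kind of files are arriving in that location, since a read-optimized (`_ro`) query only sees base files. The following is a minimal sketch for classifying a file listing, assuming Hudi's usual naming in which base files end in `.parquet` and delta log file names start with a dot and contain `.log.`. The class name `HudiFileKinds` and the sample file names are hypothetical and are not taken from the listing in the screenshot.

```java
import java.util.List;

// Hypothetical helper: classifies file names from a Hudi table path.
// Assumes the default Parquet base file format and Hudi's ".<id>_<instant>.log.<n>_<token>" log naming.
class HudiFileKinds {
    enum Kind { BASE, LOG, OTHER }

    // Base files end in ".parquet"; delta log files are hidden files containing ".log.".
    static Kind classify(String fileName) {
        if (fileName.endsWith(".parquet")) {
            return Kind.BASE;
        }
        if (fileName.startsWith(".") && fileName.contains(".log.")) {
            return Kind.LOG;
        }
        return Kind.OTHER;
    }

    // Counts how many names in the listing fall into the given kind.
    static long count(List<String> names, Kind kind) {
        return names.stream().filter(n -> classify(n) == kind).count();
    }

    public static void main(String[] args) {
        // Made-up names for illustration only.
        List<String> names = List.of(
                ".abc-0_20230601131500000.log.1_1-2-3",
                ".abc-0_20230601131600000.log.1_1-2-3",
                ".hoodie_partition_metadata");
        System.out.println("base files: " + count(names, Kind.BASE));
        System.out.println("log files:  " + count(names, Kind.LOG));
    }
}
```

If a real listing contains only log files and no base files, an empty result from a read-optimized query would be consistent with compaction not having run yet. That is only one possible explanation, not a confirmed diagnosis.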
   
   When I query the tables using Trino, they return no records:
   
![image](https://github.com/apache/hudi/assets/3648351/5d85d470-a818-450e-997a-f79ab4158475)
   
   
   **Expected behavior**
   
   Trino should return the records that have been written to the table.
   
   **Environment Description**
   
   * Hudi version : 0.13
   * Trino version : 410
   * Storage (HDFS/S3/GCS..) : ABFS
   
   
   

