GitHub user A-Mongy edited a comment on the discussion: Velox, GCS and cURL | CURL error [77]=Problem with the SSL CA cert (path? access rights?)
Thanks @zhouyuan for getting back to me on this!

I have tried changing the permissions on the `etc/ssl/` folder:

```
total 24
drwxr-xr-x 1 root root 4096 Apr 15 04:41 certs
-rwxr-xr-x 1 root root 12332 Oct 27 2024 openssl.cnf
drwxr-xr-x 1 root root 4096 Oct 27 2024 private
```

So it shouldn't be a permission issue.

I have also tried the nightly jars, and I still see the same problem.

Here is an updated log of the error, now that I have turned on debug mode:

```
20250618 00:30:14.379505 91 HiveConnectorUtil.cpp:286] No need to check type lowercase modeVARCHAR
I20250618 00:30:14.379514 91 HiveConnectorUtil.cpp:286] No need to check type lowercase modeVARCHAR
I20250618 00:30:14.379588 91 HiveDataSource.cpp:306] Adding split Hive: gs://bucket/Tenants/default/source/OnlineStore/salesorderdetail/_delta_log/00000001748882003640.checkpoint.parquet 0 - 14245
I20250618 00:30:14.384384 91 FileHandle.cpp:65] Generating file handle for: gs://bucket/Tenants/default/source/OnlineStore/salesorderdetail/_delta_log/00000001748882003640.checkpoint.parquet uuid: 1
I20250618 00:30:14.384630 96 EventBase.h:883] expCoeff_ -5e-07 folly::EventBase::SmoothLoopTime::SmoothLoopTime(std::chrono::microseconds)
I20250618 00:30:14.384725 96 EventBase.cpp:543] EventBase(): Starting loop.
I20250618 00:30:14.384757 96 EventBase.cpp:794] EventBase 0x7ee85c00a760 virtual void folly::EventBase::bumpHandlingTime() (loop) latest 18446744073709551576 next 18446744073709551577
I20250618 00:30:14.384773 96 EventBase.cpp:1058] latest 18446744073709551576 next 18446744073709551577
I20250618 00:30:14.384784 96 EventBase.cpp:801] EventBase 0x7ee85c00a760 virtual void folly::EventBase::bumpHandlingTime() (loop) startWork_ 29679038962635
I20250618 00:30:14.384800 96 EventBase.cpp:794] EventBase 0x7ee85c00a760 virtual void folly::EventBase::bumpHandlingTime() (loop) latest 18446744073709551577 next 18446744073709551577
I20250618 00:30:14.384812 96 EventBase.cpp:1058] latest 18446744073709551577 next 18446744073709551577
I20250618 00:30:14.384876 96 EventBase.cpp:1058] latest 18446744073709551577 next 18446744073709551577
I20250618 00:30:14.384821 96 EventBase.cpp:655] EventBase 0x7ee85c00a760 did not timeout loop time guess: 76 idle time: 40 busy time: 36 avgLoopTime: 0.001368 maxLatencyLoopTime: 0.001368 maxLatency_: 0us notificationQueueSize: 1 nothingHandledYet(): 0
I20250618 00:30:14.384897 96 EventBase.cpp:686] EventBase 0x7ee85c00a760 loop time: 0
I20250618 00:30:14.384910 96 EventBase.cpp:794] EventBase 0x7ee85c00a760 virtual void folly::EventBase::bumpHandlingTime() (loop) latest 18446744073709551577 next 18446744073709551578
I20250618 00:30:14.384919 96 EventBase.cpp:1058] latest 18446744073709551577 next 18446744073709551578
I20250618 00:30:14.384930 96 EventBase.cpp:801] EventBase 0x7ee85c00a760 virtual void folly::EventBase::bumpHandlingTime() (loop) startWork_ 29679039108575
I20250618 00:30:14.384951 96 EventBase.cpp:794] EventBase 0x7ee85c00a760 virtual void folly::EventBase::bumpHandlingTime() (loop) latest 18446744073709551578 next 18446744073709551578
I20250618 00:30:14.384963 96 EventBase.cpp:1058] latest 18446744073709551578 next 18446744073709551578
I20250618 00:30:14.384982 96 EventBase.cpp:1058] latest 18446744073709551578 next 18446744073709551578
I20250618 00:30:14.384972 96 EventBase.cpp:655] EventBase 0x7ee85c00a760 did not timeout loop time guess: 151 idle time: 109 busy time: 42 avgLoopTime: 0.008853 maxLatencyLoopTime: 0.008853 maxLatency_: 0us notificationQueueSize: 0 nothingHandledYet(): 0
I20250618 00:30:14.384994 96 EventBase.cpp:686] EventBase 0x7ee85c00a760 loop time: 0
E20250618 00:30:14.421082 91 Exceptions.h:66] Line: /work/ep/build-velox/build/velox_ep/velox/connectors/hive/storage_adapters/gcs/GcsFileSystem.cpp:62, Function:checkGcsStatus, Expression: Failed to get GCS object due to: Path:'gs://bucket/Tenants/default/source/OnlineStore/salesorderdetail/_delta_log/00000001748882003640.checkpoint.parquet', SDK Error Type:, GCS Status Code:Unknown error, Message:'Permanent error, with a last message of PerformWork() - CURL error [77]=Problem with the SSL CA cert (path? access rights?)', Source: RUNTIME, ErrorCode: INVALID_STATE
I20250618 00:30:14.421433 91 Task.cpp:2402] Terminating task Gluten_Stage_0_TID_0_VTID_0 with state Failed after running for 43ms
I20250618 00:30:14.421552 91 Task.cpp:1343] All drivers (1) finished for task Gluten_Stage_0_TID_0_VTID_0 after running for 43ms
25/06/18 00:30:14 ERROR TaskResources: Task 0 failed by error:
org.apache.gluten.exception.GlutenException: org.apache.gluten.exception.GlutenException: Exception: VeloxRuntimeError
Error Source: RUNTIME
Error Code: INVALID_STATE
Reason: Failed to get GCS object due to: Path:'gs://bucket/Tenants/default/source/OnlineStore/salesorderdetail/_delta_log/00000001748882003640.checkpoint.parquet', SDK Error Type:, GCS Status Code:Unknown error, Message:'Permanent error, with a last message of PerformWork() - CURL error [77]=Problem with the SSL CA cert (path? access rights?)'
Retriable: False
Context: Split [Hive: gs://bucket/Tenants/default/source/OnlineStore/salesorderdetail/_delta_log/00000001748882003640.checkpoint.parquet 0 - 14245] Task Gluten_Stage_0_TID_0_VTID_0
Additional Context: Operator: TableScan[0] 0
Function: checkGcsStatus
File: /work/ep/build-velox/build/velox_ep/velox/connectors/hive/storage_adapters/gcs/GcsFileSystem.cpp
Line: 62
Stack trace:
# 0  _ZN8facebook5velox7process10StackTraceC1Ei
# 1  _ZN8facebook5velox14VeloxExceptionC1EPKcmS3_St17basic_string_viewIcSt11char_traitsIcEES7_S7_S7_bNS1_4TypeES7_
# 2  _ZN8facebook5velox6detail14veloxCheckFailINS0_17VeloxRuntimeErrorERKSsEEvRKNS1_18VeloxCheckFailArgsET0_
# 3  _ZN8facebook5velox12_GLOBAL__N_114checkGcsStatusEN6google5cloud5v2_226StatusERKSt17basic_string_viewIcSt11char_traitsIcEERKSsSD_
# 4  _ZNK8facebook5velox12_GLOBAL__N_111GcsReadFile13preadInternalEmmPc
# 5  _ZNK8facebook5velox12_GLOBAL__N_111GcsReadFile6preadvEmRKSt6vectorIN5folly5RangeIPcEESaIS7_EEPNS0_11filesystems4File7IoStatsE
# 6  _ZN8facebook5velox4dwio6common19ReadFileInputStream4readERKSt6vectorIN5folly5RangeIPcEESaIS8_EEmNS2_10MetricsLog11MetricsTypeE
# 7  _ZN8facebook5velox4dwio6common19DirectCoalescedLoad8loadDataEb
# 8  _ZN8facebook5velox5cache13CoalescedLoad12loadOrFutureEPN5folly10SemiFutureIbEEb
# 9  _ZN8facebook5velox4dwio6common17DirectInputStream12loadPositionEv
# 10 _ZN8facebook5velox4dwio6common17DirectInputStream4NextEPPKvPi
# 11 _ZN8facebook5velox7parquet10ReaderBase16loadFileMetaDataEv
# 12 _ZN8facebook5velox7parquet10ReaderBaseC1ESt10unique_ptrINS0_4dwio6common13BufferedInputESt14default_deleteIS6_EERKNS5_13ReaderOptionsE
# 13 _ZN8facebook5velox7parquet13ParquetReaderC2ESt10unique_ptrINS0_4dwio6common13BufferedInputESt14default_deleteIS6_EERKNS5_13ReaderOptionsE
# 14 _ZN8facebook5velox7parquet20ParquetReaderFactory12createReaderESt10unique_ptrINS0_4dwio6common13BufferedInputESt14default_deleteIS6_EERKNS5_13ReaderOptionsE
# 15 _ZN8facebook5velox9connector4hive11SplitReader12createReaderEv
# 16 _ZN8facebook5velox9connector4hive11SplitReader12prepareSplitESt10shared_ptrINS0_6common14MetadataFilterEERNS0_4dwio6common17RuntimeStatisticsE
# 17 _ZN8facebook5velox9connector4hive14HiveDataSource8addSplitESt10shared_ptrINS1_14ConnectorSplitEE
# 18 _ZN8facebook5velox4exec9TableScan8getSplitEv
# 19 _ZN8facebook5velox4exec9TableScan9getOutputEv
# 20 _ZZN8facebook5velox4exec6Driver11runInternalERSt10shared_ptrIS2_ERS3_INS1_13BlockingStateEERS3_INS0_9RowVectorEEENKUlvE3_clEv
# 21 _ZN8facebook5velox4exec6Driver11runInternalERSt10shared_ptrIS2_ERS3_INS1_13BlockingStateEERS3_INS0_9RowVectorEE
# 22 _ZN8facebook5velox4exec6Driver4nextEPN5folly10SemiFutureINS3_4UnitEEERPNS1_8OperatorERNS1_14BlockingReasonE
# 23 _ZN8facebook5velox4exec4Task4nextEPN5folly10SemiFutureINS3_4UnitEEE
# 24 _ZN6gluten24WholeStageResultIterator4nextEv
# 25 Java_org_apache_gluten_vectorized_ColumnarBatchOutIterator_nativeHasNext
# 26 0x00007ee91c6b9930
```

Notice that the GCS bucket is also mentioned in these two lines:

```
I20250618 00:30:14.379588 91 HiveDataSource.cpp:306] Adding split Hive: gs://bucket/Tenants/default/source/OnlineStore/salesorderdetail/_delta_log/00000001748882003640.checkpoint.parquet 0 - 14245
I20250618 00:30:14.384384 91 FileHandle.cpp:65] Generating file handle for: gs://bucket/Tenants/default/source/OnlineStore/salesorderdetail/_delta_log/00000001748882003640.checkpoint.parquet uuid: 1
```

So I would say Gluten is able to access the GCS bucket? What do you think?

GitHub link: https://github.com/apache/incubator-gluten/discussions/9946#discussioncomment-13502613
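
Since cURL error 77 is about reading the CA certificate file rather than the directory, one further check that might help narrow this down is whether a CA bundle file exists and is readable at the locations a TLS client commonly looks for. The following is a minimal Python sketch of such a check. The candidate paths are an assumption (common distro defaults, not something taken from the Gluten or Velox build), and the helper names `check_ca_bundle` and `report` are hypothetical. It does not tell you which path the bundled libcurl was actually compiled to use.

```python
import os

# Assumed candidate list: common CA bundle locations on popular Linux
# distributions. Adjust to whatever path your libcurl build expects.
CANDIDATE_CA_BUNDLES = [
    "/etc/ssl/certs/ca-certificates.crt",  # Debian/Ubuntu
    "/etc/pki/tls/certs/ca-bundle.crt",    # RHEL/CentOS/Fedora
    "/etc/ssl/ca-bundle.pem",              # openSUSE
    "/etc/ssl/cert.pem",                   # Alpine
]


def check_ca_bundle(path):
    """Return a short status string for one candidate CA bundle path."""
    if not os.path.exists(path):
        return "missing"
    if not os.path.isfile(path):
        return "not a regular file"
    if not os.access(path, os.R_OK):
        return "not readable"
    if os.path.getsize(path) == 0:
        return "empty"
    return "ok"


def report(paths=CANDIDATE_CA_BUNDLES):
    """Map each candidate path to its status."""
    return {p: check_ca_bundle(p) for p in paths}


if __name__ == "__main__":
    for path, status in report().items():
        print(f"{status:>20}  {path}")
```

Running this inside the same container and as the same user as the Spark executor would show whether any of these bundle files is missing or unreadable there, even though the `etc/ssl/` directory itself has the expected permissions.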
