[jira] [Commented] (HUDI-735) Improve deltastreamer error message when case mismatch of commandline arguments.
[ https://issues.apache.org/jira/browse/HUDI-735?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17458717#comment-17458717 ] sivabalan narayanan commented on HUDI-735: -- [~nicholasjiang] : Sorry, is it just the error msg is not very apt or are you running into any issues as such. if you don't mind, can you clarify please. > Improve deltastreamer error message when case mismatch of commandline > arguments. > > > Key: HUDI-735 > URL: https://issues.apache.org/jira/browse/HUDI-735 > Project: Apache Hudi > Issue Type: Improvement > Components: Code Cleanup, DeltaStreamer, Usability >Reporter: Vinoth Chandar >Assignee: Nicholas Jiang >Priority: Major > Labels: sev:normal, user-support-issues > > Team, > When following the blog "Change Capture Using AWS Database Migration > Service and Hudi" with my own data set, the initial load works perfectly. > When issuing the command with the DMS CDC files on S3, I get the following > error: > {code} > 20/03/24 17:56:28 ERROR HoodieDeltaStreamer: Got error running delta sync > once. Shutting down > org.apache.hudi.exception.HoodieException: Please provide a valid schema > provider class! at > org.apache.hudi.utilities.sources.InputBatch.getSchemaProvider(InputBatch.java:53) > at > org.apache.hudi.utilities.deltastreamer.DeltaSync.readFromSource(DeltaSync.java:312) > at > org.apache.hudi.utilities.deltastreamer.DeltaSync.syncOnce(DeltaSync.java:226) > {code} > I tried using the --schemaprovider-class > org.apache.hudi.utilities.schema.FilebasedSchemaProvider.Source and provide > the schema. The error does not occur but there are no write to Hudi. > I am not performing any transformations (other than the DMS transform) and > using default record key strategy. > If the team has any pointers, please let me know. > Thank you! > --- > Thank you Vinoth. I was able to find the issue. All my column names were in > high caps case. I switched column names and table names to lower case and > it works perfectly. -- This message was sent by Atlassian Jira (v8.20.1#820001)
[jira] [Commented] (HUDI-735) Improve deltastreamer error message when case mismatch of commandline arguments.
[ https://issues.apache.org/jira/browse/HUDI-735?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17194105#comment-17194105 ] Pratyaksh Sharma commented on HUDI-735: --- Anything else needs to be done here? As per the description of this Jira, guess we can close this? [~vinoth] > Improve deltastreamer error message when case mismatch of commandline > arguments. > > > Key: HUDI-735 > URL: https://issues.apache.org/jira/browse/HUDI-735 > Project: Apache Hudi > Issue Type: Improvement > Components: DeltaStreamer, Usability, Utilities >Reporter: Vinoth Chandar >Assignee: Nicholas Jiang >Priority: Major > Labels: bug-bash-0.6.0 > > Team, > When following the blog "Change Capture Using AWS Database Migration > Service and Hudi" with my own data set, the initial load works perfectly. > When issuing the command with the DMS CDC files on S3, I get the following > error: > {code} > 20/03/24 17:56:28 ERROR HoodieDeltaStreamer: Got error running delta sync > once. Shutting down > org.apache.hudi.exception.HoodieException: Please provide a valid schema > provider class! at > org.apache.hudi.utilities.sources.InputBatch.getSchemaProvider(InputBatch.java:53) > at > org.apache.hudi.utilities.deltastreamer.DeltaSync.readFromSource(DeltaSync.java:312) > at > org.apache.hudi.utilities.deltastreamer.DeltaSync.syncOnce(DeltaSync.java:226) > {code} > I tried using the --schemaprovider-class > org.apache.hudi.utilities.schema.FilebasedSchemaProvider.Source and provide > the schema. The error does not occur but there are no write to Hudi. > I am not performing any transformations (other than the DMS transform) and > using default record key strategy. > If the team has any pointers, please let me know. > Thank you! > --- > Thank you Vinoth. I was able to find the issue. All my column names were in > high caps case. I switched column names and table names to lower case and > it works perfectly. -- This message was sent by Atlassian Jira (v8.3.4#803005)
[jira] [Commented] (HUDI-735) Improve deltastreamer error message when case mismatch of commandline arguments.
[ https://issues.apache.org/jira/browse/HUDI-735?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17114952#comment-17114952 ] sivabalan narayanan commented on HUDI-735: -- [~nicholasjiang]: did you get a chance to work on it. If you have any PR, do update the jira. > Improve deltastreamer error message when case mismatch of commandline > arguments. > > > Key: HUDI-735 > URL: https://issues.apache.org/jira/browse/HUDI-735 > Project: Apache Hudi > Issue Type: Improvement > Components: DeltaStreamer, Usability, Utilities >Reporter: Vinoth Chandar >Assignee: Nicholas Jiang >Priority: Major > Labels: bug-bash-0.6.0 > > Team, > When following the blog "Change Capture Using AWS Database Migration > Service and Hudi" with my own data set, the initial load works perfectly. > When issuing the command with the DMS CDC files on S3, I get the following > error: > {code} > 20/03/24 17:56:28 ERROR HoodieDeltaStreamer: Got error running delta sync > once. Shutting down > org.apache.hudi.exception.HoodieException: Please provide a valid schema > provider class! at > org.apache.hudi.utilities.sources.InputBatch.getSchemaProvider(InputBatch.java:53) > at > org.apache.hudi.utilities.deltastreamer.DeltaSync.readFromSource(DeltaSync.java:312) > at > org.apache.hudi.utilities.deltastreamer.DeltaSync.syncOnce(DeltaSync.java:226) > {code} > I tried using the --schemaprovider-class > org.apache.hudi.utilities.schema.FilebasedSchemaProvider.Source and provide > the schema. The error does not occur but there are no write to Hudi. > I am not performing any transformations (other than the DMS transform) and > using default record key strategy. > If the team has any pointers, please let me know. > Thank you! > --- > Thank you Vinoth. I was able to find the issue. All my column names were in > high caps case. I switched column names and table names to lower case and > it works perfectly. -- This message was sent by Atlassian Jira (v8.3.4#803005)