amaranathv commented on issue #741: java.lang.NoClassDefFoundError: 
com/uber/hoodie/exception/HoodieException
URL: https://github.com/apache/incubator-hudi/issues/741#issuecomment-503164955
 
 
   I am getting invalid data path now.
   
    InValid data 
paths=[file:///mapr/user/avenka23/delta-streamer/output5/2018/08/31/fe957979-67f8-4b68-a371-6c789e81339c-0_0-21-23_20190618093523.parquet]
   
   Is there any property needs to be set or environment variables?
   
   I am trying to mimic the docker demo using deltastreamer command line to 
ingest data from json into hoodie data set.
   
   
   $ spark-submit --master yarn  --class 
com.uber.hoodie.utilities.deltastreamer.HoodieDeltaStreamer `ls 
/mapr/user/avenka23/hoodie/incubator-hudi/packaging/hoodie-utilities-bundle/target/hoodie-utilities-bundle*-SNAPSHOT.jar`
   --props 
file:///mapr/user/avenka23/delta-streamer/config/dfs-source.properties   
--schemaprovider-class com.uber.hoodie.utilities.schema.FilebasedSchemaProvider 
  --source-class com.uber.hoodie.utilities.sources.JsonDFSSource   
--source-ordering-field ts   --target-base-path 
file:///mapr/user/avenka23/delta-streamer/output5  --target-table 
stock_ticks_cow   --storage-type COPY_ON_WRITE
   19/06/18 09:35:00 WARN Client: Neither spark.yarn.jars nor 
spark.yarn.archive is set, falling back to uploading libraries under SPARK_HOME.
   ERROR StatusLogger No log4j2 configuration file found. Using default 
configuration: logging only errors to the console.
   19/06/18 09:35:19 WARN SparkContext: Using an existing SparkContext; some 
configuration may not take effect.
   19/06/18 09:35:28 WARN HoodieTable: InValid data 
paths=[file:///mapr/user/avenka23/delta-streamer/output5/2018/08/31/fe957979-67f8-4b68-a371-6c789e81339c-0_0-21-23_20190618093523.parquet]
   

----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

Reply via email to