Chhaya Vankhede created GOBBLIN-1059: ----------------------------------------
Summary: MySQL to HDFS ingestion Key: GOBBLIN-1059 URL: https://issues.apache.org/jira/browse/GOBBLIN-1059 Project: Apache Gobblin Issue Type: Bug Components: gobblin-sql Environment: linux subsystem, hadoop 2.3.0, Reporter: Chhaya Vankhede Assignee: Shirshanka Das I am tring to ingest data from mysql to hdfs but getting below error. The job file and properties are as below mysql.pull ``` # Job properties job.name=GobblinMySql job.group=MySql job.description=Data pull from MySql # Extract properties extract.namespace=demodb extract.table.type=snapshot_only extract.table.name=user # Property to consider the extract as full dump extract.is.full=true # Source properties source.querybased.schema=user source.entity=user source.querybased.extract.type=snapshot ``` mysql.propertes ``` # Source properties - source class to extract data from Mysql Source source.class=org.apache.gobblin.source.extractor.extract.jdbc.MysqlSource # Source properties source.max.number.of.partitions=1 source.querybased.partition.interval=1 source.querybased.is.compression=true source.querybased.watermark.type=timestamp # Source connection properties source.conn.driver=com.mysql.jdbc.Driver source.conn.username=root source.conn.password=root source.conn.host=localhost source.conn.port=3306 source.conn.timeout=1500 # Converter properties - Record from mysql source will be processed by the below series of converters converter.classes=org.apache.gobblin.converter.avro.JsonIntermediateToAvroConverter # date columns format converter.avro.timestamp.format=yyyy-MM-dd HH:mm:ss'.0' converter.avro.date.format=yyyy-MM-dd converter.avro.time.format=HH:mm:ss # Qualitychecker properties qualitychecker.task.policies=org.apache.gobblin.policies.count.RowCountPolicy,gobblin.policies.schema.SchemaCompatibilityPolicy qualitychecker.task.policy.types=OPTIONAL,OPTIONAL # Publisher properties data.publisher.type=org.apache.gobblin.publisher.BaseDataPublisher ``` below is error log ``` GMT ERROR [JobScheduler-0] org.apache.gobblin.scheduler.JobScheduler$NonScheduledJobRunner 637 - Failed to run job GobblinMySql org.apache.gobblin.runtime.JobException: Failed to run job GobblinMySql at org.apache.gobblin.scheduler.JobScheduler.runJob(JobScheduler.java:445) at org.apache.gobblin.scheduler.JobScheduler$NonScheduledJobRunner.run(JobScheduler.java:635) at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1149) at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:624) at java.lang.Thread.run(Thread.java:748) Caused by: java.lang.RuntimeException: Failed to create job launcher: java.lang.ClassNotFoundException: org.apache.gobblin.source.extractor.extract.jdbc.MysqlSource at org.apache.gobblin.runtime.JobLauncherFactory.newJobLauncher(JobLauncherFactory.java:158) at org.apache.gobblin.runtime.JobLauncherFactory.newJobLauncher(JobLauncherFactory.java:107) at org.apache.gobblin.runtime.JobLauncherFactory.newJobLauncher(JobLauncherFactory.java:85) at org.apache.gobblin.runtime.JobLauncherFactory.newJobLauncher(JobLauncherFactory.java:68) at org.apache.gobblin.scheduler.JobScheduler.buildJobLauncher(JobScheduler.java:450) at org.apache.gobblin.scheduler.JobScheduler.runJob(JobScheduler.java:443) ... 4 more Caused by: java.lang.ClassNotFoundException: org.apache.gobblin.source.extractor.extract.jdbc.MysqlSource at java.net.URLClassLoader.findClass(URLClassLoader.java:382) at java.lang.ClassLoader.loadClass(ClassLoader.java:419) at sun.misc.Launcher$AppClassLoader.loadClass(Launcher.java:352) at java.lang.ClassLoader.loadClass(ClassLoader.java:352) at java.lang.Class.forName0(Native Method) at java.lang.Class.forName(Class.java:264) at org.apache.gobblin.runtime.JobContext.createSource(JobContext.java:240) at org.apache.gobblin.runtime.JobContext.<init>(JobContext.java:172) at org.apache.gobblin.runtime.AbstractJobLauncher.<init>(AbstractJobLauncher.java:201) at org.apache.gobblin.runtime.local.LocalJobLauncher.<init>(LocalJobLauncher.java:86) at org.apache.gobblin.runtime.JobLauncherFactory.newJobLauncher(JobLauncherFactory.java:144) ... 9 more 2020-02-25 08:02:45 GMT ERROR [JobScheduler-0] org.apache.gobblin.scheduler.JobScheduler$NonScheduledJobRunner 637 - Failed to run job GobblinMySql org.apache.gobblin.runtime.JobException: Failed to run job GobblinMySql at org.apache.gobblin.scheduler.JobScheduler.runJob(JobScheduler.java:445) at org.apache.gobblin.scheduler.JobScheduler$NonScheduledJobRunner.run(JobScheduler.java:635) at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1149) at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:624) at java.lang.Thread.run(Thread.java:748) Caused by: java.lang.RuntimeException: Failed to create job launcher: java.lang.ClassNotFoundException: org.apache.gobblin.source.extractor.extract.jdbc.MysqlSource at org.apache.gobblin.runtime.JobLauncherFactory.newJobLauncher(JobLauncherFactory.java:158) at org.apache.gobblin.runtime.JobLauncherFactory.newJobLauncher(JobLauncherFactory.java:107) at org.apache.gobblin.runtime.JobLauncherFactory.newJobLauncher(JobLauncherFactory.java:85) at org.apache.gobblin.runtime.JobLauncherFactory.newJobLauncher(JobLauncherFactory.java:68) at org.apache.gobblin.scheduler.JobScheduler.buildJobLauncher(JobScheduler.java:450) at org.apache.gobblin.scheduler.JobScheduler.runJob(JobScheduler.java:443) ... 4 more Caused by: java.lang.ClassNotFoundException: org.apache.gobblin.source.extractor.extract.jdbc.MysqlSource at java.net.URLClassLoader.findClass(URLClassLoader.java:382) at java.lang.ClassLoader.loadClass(ClassLoader.java:419) at sun.misc.Launcher$AppClassLoader.loadClass(Launcher.java:352) at java.lang.ClassLoader.loadClass(ClassLoader.java:352) at java.lang.Class.forName0(Native Method) at java.lang.Class.forName(Class.java:264) at org.apache.gobblin.runtime.JobContext.createSource(JobContext.java:240) at org.apache.gobblin.runtime.JobContext.<init>(JobContext.java:172) at org.apache.gobblin.runtime.AbstractJobLauncher.<init>(AbstractJobLauncher.java:201) at org.apache.gobblin.runtime.local.LocalJobLauncher.<init>(LocalJobLauncher.java:86) at org.apache.gobblin.runtime.JobLauncherFactory.newJobLauncher(JobLauncherFactory.java:144) ... 9 more``` using command `bin/gobblin.sh service standalone start --jars /mnt/C/Users/name/incubator-gobblin/build/gobblin-sql/libs/gobblin-sql-0.15.0.jar` How to solve it? -- This message was sent by Atlassian Jira (v8.3.4#803005)