Kip Kohn created GOBBLIN-2054:
---------------------------------
Summary: `CommitActivityImpl` fails for job types (sources) other
than Iceberg-Distcp
Key: GOBBLIN-2054
URL: https://issues.apache.org/jira/browse/GOBBLIN-2054
Project: Apache Gobblin
Issue Type: New Feature
Components: gobblin-core
Reporter: Kip Kohn
Assignee: Abhishek Tiwari
gobblin-on-temporal execution has been failing for other job types than
iceberg-distcp (which uses `CopySource`). in particular Commit fails with:
{code}
java.lang.IllegalArgumentException: Missing required property writer.output.dir
at
com.google.common.base.Preconditions.checkArgument(Preconditions.java:122)
at
org.apache.gobblin.util.WriterUtils.getWriterOutputDir(WriterUtils.java:121)
at
org.apache.gobblin.publisher.BaseDataPublisher.publishData(BaseDataPublisher.java:390)
at
org.apache.gobblin.publisher.BaseDataPublisher.publishMultiTaskData(BaseDataPublisher.java:379)
at
org.apache.gobblin.publisher.BaseDataPublisher.publishData(BaseDataPublisher.java:366)
at
org.apache.gobblin.publisher.DataPublisher.publish(DataPublisher.java:81)
at
org.apache.gobblin.runtime.SafeDatasetCommit.commitDataset(SafeDatasetCommit.java:260)
at
org.apache.gobblin.runtime.SafeDatasetCommit.call(SafeDatasetCommit.java:168)
at
org.apache.gobblin.runtime.SafeDatasetCommit.call(SafeDatasetCommit.java:64)
at java.util.concurrent.FutureTask.run(FutureTask.java:266)
at
java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:511)
at java.util.concurrent.FutureTask.run(FutureTask.java:266)
at
org.apache.gobblin.util.executors.MDCPropagatingRunnable.run(MDCPropagatingRunnable.java:39)
at
java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1149)
at
java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:624)
at java.lang.Thread.run(Thread.java:748)
{code}
this is odd because that same prop had already been used prior to commit, while
processing the `WorkUnit`! moreover logging shows it to be present within the
`JobState`
anyway, even when using a private build that hard-coded that property, this
later error arises:
{code}
Caused by: java.lang.IllegalArgumentException: Can not create a Path from a
null string
at org.apache.hadoop.fs.Path.checkPathArg(Path.java:159)
at org.apache.hadoop.fs.Path.<init>(Path.java:175)
at org.apache.hadoop.fs.Path.<init>(Path.java:110)
at
org.apache.gobblin.runtime.FsDatasetStateStore.sanitizeDatasetStatestoreNameFromDatasetURN(FsDatasetStateStore.java:175)
at
org.apache.gobblin.runtime.FsDatasetStateStore.persistDatasetState(FsDatasetStateStore.java:386)
at
org.apache.gobblin.runtime.FsDatasetStateStore.persistDatasetState(FsDatasetStateStore.java:90)
at
org.apache.gobblin.runtime.SafeDatasetCommit.persistDatasetState(SafeDatasetCommit.java:418)
at
org.apache.gobblin.runtime.SafeDatasetCommit.call(SafeDatasetCommit.java:191)
... 8 more
{code}
--
This message was sent by Atlassian Jira
(v8.20.10#820010)