[
https://issues.apache.org/jira/browse/CRUNCH-509?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14485376#comment-14485376
]
Micah Whitacre commented on CRUNCH-509:
---------------------------------------
[~tomwhite] unfortunately just naming things doesn't fix the issue. The
outputs aren't in the location necessary for things like materialize to work
with Spark.
[~jwills] Thanks for the hint. I was going to try taking a stab at this change
today and will play around.
> Crunch with Spark doesn't name all outputs
> ------------------------------------------
>
> Key: CRUNCH-509
> URL: https://issues.apache.org/jira/browse/CRUNCH-509
> Project: Crunch
> Issue Type: Bug
> Components: Core
> Affects Versions: 0.11.0
> Reporter: Micah Whitacre
> Assignee: Josh Wills
> Fix For: 0.12.0
>
>
> Crunch currently does not "name" all outputs when running with a
> SparkPipeline. This becomes a problem as some Targets (based on CRUNCH-82)
> have coded in checked to ensure that the name must be populated.
> Specifically the implementation I'm running into issues with is the Kite
> DatasetTarget[2].
> Need to read up a bit on context to see if it is a Crunch/Kite issue or where
> it is easiest/correct to fix. [~jwills] or [~tomwhite] feedback would be
> welcome.
> [1] -
> https://github.com/apache/crunch/blob/3ab0b078c47f23b3ba893fdfb05fd723f663d02b/crunch-spark/src/main/java/org/apache/crunch/impl/spark/SparkRuntime.java#L337
> [2] -
> https://github.com/kite-sdk/kite/blob/e080f0237e7383a16fff8547ad43387ccf55c473/kite-data/kite-data-crunch/src/main/java/org/kitesdk/data/crunch/DatasetTarget.java#L178
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)