[ https://issues.apache.org/jira/browse/CRUNCH-556?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Josh Wills updated CRUNCH-556:
------------------------------
Attachment: CRUNCH-556b.patch
After beating my head against the crunch-spark inside of crunch-hbase
approach, I switched to a crunch-hbase inside of crunch-spark approach that
worked in about 2 seconds. So, there's that. Will push this to master presently.
> Fix total sorts in Crunch-on-Spark
> ----------------------------------
>
> Key: CRUNCH-556
> URL: https://issues.apache.org/jira/browse/CRUNCH-556
> Project: Crunch
> Issue Type: Bug
> Components: Spark
> Affects Versions: 0.13.0
> Reporter: Josh Wills
> Fix For: 0.14.0
>
> Attachments: CRUNCH-556.patch, CRUNCH-556b.patch
>
>
> From the user mailing list: performing a total sort to create an HFile
> with Crunch on Spark throws the following exception:
> https://gist.github.com/anonymous/920c000f20229eaa76d8
> The problem can be traced to the partitioner not being configured with the
> path to the partition file, which is stored in the GroupingOptions extra
> configuration settings. These settings are passed correctly for MR jobs,
> but not for Spark jobs.
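
The failure mode described above can be sketched in plain Java. This is only an illustration, not the Crunch or Hadoop code: the class, the method names, the property key, and the path are all hypothetical, and configurations are modeled as simple maps. It shows a partitioner setup step that needs the partition file path, and how it fails when the extra configuration settings are never merged into the job configuration.

```java
import java.util.HashMap;
import java.util.Map;

public class PartitionerConfigSketch {
    // Hypothetical key; stands in for whatever property the real partitioner reads.
    static final String PARTITION_FILE_KEY = "sketch.partition.file";

    /** Returns a copy of the job configuration with the extra settings merged in. */
    static Map<String, String> withExtraConf(Map<String, String> jobConf,
                                             Map<String, String> extraConf) {
        Map<String, String> merged = new HashMap<>(jobConf);
        merged.putAll(extraConf);
        return merged;
    }

    /** Mimics a partitioner setup step that cannot proceed without the partition file path. */
    static String resolvePartitionFile(Map<String, String> conf) {
        String path = conf.get(PARTITION_FILE_KEY);
        if (path == null) {
            throw new IllegalStateException("partition file path not configured");
        }
        return path;
    }

    public static void main(String[] args) {
        Map<String, String> jobConf = new HashMap<>();
        Map<String, String> extraConf = new HashMap<>();
        extraConf.put(PARTITION_FILE_KEY, "/tmp/partitions.lst"); // illustrative path only

        // Extra settings merged into the job configuration: setup succeeds.
        System.out.println(resolvePartitionFile(withExtraConf(jobConf, extraConf)));

        // Extra settings dropped: setup fails because the path is missing.
        try {
            resolvePartitionFile(jobConf);
        } catch (IllegalStateException e) {
            System.out.println("failed: " + e.getMessage());
        }
    }
}
```

In this sketch the fix amounts to making sure the merge step happens on every execution path before the partitioner is set up; how the actual patch achieves that is not shown here.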