[
https://issues.apache.org/jira/browse/PIG-4621?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16150432#comment-16150432
]
Adam Szita commented on PIG-4621:
---------------------------------
[~YaShock]: the tests returned a few errors:
# Spark mode:
#* All rank-related tests: these fail because of a missing null check in
RankConverter around {{illustrator.getData().clear();
illustrator.getEquivalenceClasses().get(0).clear();}}
#* TestPigContext#testImportList: the failure comes from POLoad being passed to
the LoadConverter#ToTupleFunction constructor. It looks like when a POLoad
instance is passed, the SparkEngineConf instance (which is also passed here) is
for some reason not deserialized on the Spark end in time, yet we need the UDF
import list before we can tell which classes to resolve. As a result Pig is
unable to resolve the TestUDF2 UDF. I propose we make an exception for POLoad
and do the work of the sparkIllustrate method inside
LoadConverter#ToTupleFunction. (We can pass the operator key of POLoad instead
of the op itself, so that we can still find the appropriate Illustrator instance.)
# Tez mode:
#* we're missing some assumeFalse calls in TestGrunt#testIllustrateScript(1-7);
these cases should be skipped in Tez mode: {{Assume.assumeFalse("Skip this test
for TEZ. See PIG-3993", Util.isTezExecType(cluster.getExecType()));}}
# MR mode:
#* no failures
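
For the rank tests, the guard could look roughly like the sketch below. It assumes the illustrator reference itself is what can be null outside of ILLUSTRATE runs. {{IllustratorStub}} and {{RankIllustrateGuard}} are hypothetical stand-ins with simplified types, not the real Pig classes, and the extra emptiness check on the equivalence classes is my own precaution rather than something the test failures proved necessary.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical stand-in for Pig's Illustrator; only the two getters
// used in RankConverter are modeled, with simplified element types.
class IllustratorStub {
    private final List<String> data = new ArrayList<>();
    private final List<List<String>> equivalenceClasses = new ArrayList<>();

    List<String> getData() { return data; }
    List<List<String>> getEquivalenceClasses() { return equivalenceClasses; }
}

// Hypothetical helper showing where the null check would go.
class RankIllustrateGuard {
    // Clears illustrator state only when an illustrator is attached.
    // Returns true if state was cleared, false if the call was a no-op.
    static boolean clearIfPresent(IllustratorStub illustrator) {
        if (illustrator == null) {
            return false; // assumed case: not running under ILLUSTRATE
        }
        illustrator.getData().clear();
        List<List<String>> classes = illustrator.getEquivalenceClasses();
        // Assumption: also tolerate a missing or empty list before get(0).
        if (classes != null && !classes.isEmpty()) {
            classes.get(0).clear();
        }
        return true;
    }
}
```

In RankConverter the same shape would simply wrap the two existing clear() calls in an {{if (illustrator != null)}} block.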
+ a few smaller comments:
* POSort#sparkIllustrate: the method is indented with tabs on some lines, please
change them to spaces
* POJoinGroupSpark#sparkIllustratorMarkup: when catching ExecException, let's log
an error instead of calling printStackTrace
* TezExecutionEngine: please remove the unused import LocalSparkSimulator
Can you please upload a patch with these fixes?
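
For the POJoinGroupSpark comment, here is a rough sketch of what I mean by logging instead of printStackTrace. It uses java.util.logging only to stay self-contained; Pig code normally goes through its own logging setup, so treat the logger calls as an assumption. {{IllustratorMarkupSketch}}, {{ExecExceptionStub}} and {{Markup}} are hypothetical names for illustration only.

```java
import java.util.logging.Level;
import java.util.logging.Logger;

// Hypothetical sketch; not the actual POJoinGroupSpark code.
class IllustratorMarkupSketch {
    private static final Logger LOG =
            Logger.getLogger(IllustratorMarkupSketch.class.getName());

    // Hypothetical stand-in for Pig's ExecException.
    static class ExecExceptionStub extends Exception {
        ExecExceptionStub(String message) { super(message); }
    }

    // Hypothetical markup step that may fail with the checked exception.
    interface Markup {
        void apply() throws ExecExceptionStub;
    }

    // Runs the markup step; on failure, logs the error together with
    // the exception (so the stack trace ends up in the log) and
    // reports failure to the caller instead of printing to stderr.
    static boolean markupLoggingErrors(Markup markup) {
        try {
            markup.apply();
            return true;
        } catch (ExecExceptionStub e) {
            LOG.log(Level.SEVERE, "Illustrator markup failed", e);
            return false;
        }
    }
}
```

Passing the exception to the logger keeps the stack trace while letting the log configuration decide where it goes.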
> Enable Illustrate in spark
> --------------------------
>
> Key: PIG-4621
> URL: https://issues.apache.org/jira/browse/PIG-4621
> Project: Pig
> Issue Type: Sub-task
> Components: spark
> Reporter: liyunzhang_intel
> Assignee: Jakov Rabinovits
> Labels: MissingFeature
> Fix For: spark-branch
>
> Attachments: PIG-4621.patch
>
>
> Currently we don't support illustrate in Spark mode.
> For how illustrate works, see:
> http://pig.apache.org/docs/r0.7.0/piglatin_ref2.html#ILLUSTRATE