[
https://issues.apache.org/jira/browse/PIG-2661?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13263312#comment-13263312
]
Dmitriy V. Ryaboy commented on PIG-2661:
----------------------------------------
Daniel,
We didn't use to generate the first MR job prior to 9. THe change is that we
now see loading with a schema (load foo as (a:int, b:chararray)) as a
projection, and perform it prior to sampling. I think we can at least get this
back (it can cost us a LOT of time -- if all of your loaders provide a schema,
this means piping the whole dataset through an MR job and out to disk) -- by
special casing the map-only, foreach + projection only job.
D
> Pig uses an extra job for loading data in Pigmix L9
> ---------------------------------------------------
>
> Key: PIG-2661
> URL: https://issues.apache.org/jira/browse/PIG-2661
> Project: Pig
> Issue Type: Improvement
> Affects Versions: 0.9.0
> Reporter: Jie Li
>
> See
> https://issues.apache.org/jira/browse/PIG-200?focusedCommentId=13260155&page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#comment-13260155
--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators:
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira