Re: How can I track the actual map and reduce tasks executed in pig?

2012-03-12 Thread Yongzhi Wang
I found the option. As long as it can disable all optimizations, it will be helpful. Thanks very much!
-t, -optimizer_off - Turn optimizations off. The following values are supported:
    SplitFilter - Split filter conditions
    PushUpFilter - Filter as early as possible
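As an illustration of the help text quoted above, here is a minimal sketch that assembles a pig command line with individual optimizer rules switched off. It assumes the -optimizer_off flag may be repeated once per rule; the script name myscript.pig and the helper pig_command are hypothetical and used only for this example. It builds and prints the command and does not run Pig.

```python
import shlex


def pig_command(script, disabled_rules=("SplitFilter", "PushUpFilter")):
    """Build an argv list for pig with the given optimizer rules disabled.

    Assumption: -optimizer_off is passed once per rule name. The rule
    names used by default are the two shown in the quoted help text.
    """
    argv = ["pig"]
    for rule in disabled_rules:
        argv += ["-optimizer_off", rule]
    argv.append(script)  # hypothetical script path
    return argv


# Print the command as a single shell-quoted line for copy and paste.
print(shlex.join(pig_command("myscript.pig")))
# pig -optimizer_off SplitFilter -optimizer_off PushUpFilter myscript.pig
```

Running pig -h, as suggested elsewhere in this thread, lists the rule names supported by the installed version.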

[jira] [Commented] (PIG-2317) Ruby/Jruby UDFs

2012-03-12 Thread Jonathan Coveney (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/PIG-2317?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13228154#comment-13228154 ] Jonathan Coveney commented on PIG-2317: Awesome Daniel, those were some key changes.

[jira] [Commented] (PIG-2565) Support IMPORT for macros stored in S3 Buckets

2012-03-12 Thread Thejas M Nair (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/PIG-2565?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13228137#comment-13228137 ] Thejas M Nair commented on PIG-2565: Comments on the patch - Changes look good. But I th

[jira] [Commented] (PIG-2570) LimitOptimizer fails with dynamic LIMIT argument

2012-03-12 Thread Thejas M Nair (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/PIG-2570?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13228133#comment-13228133 ] Thejas M Nair commented on PIG-2570: It is possible to have multiple scalar variables in

[jira] [Updated] (PIG-2543) PigStats.isSuccessful returns false if embedded pig script has sh commands

2012-03-12 Thread Daniel Dai (Updated) (JIRA)
[ https://issues.apache.org/jira/browse/PIG-2543?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Daniel Dai updated PIG-2543: Attachment: PIG-2543-2.patch PIG-2543-2.patch fixes unit test failures. > PigStats.isSuccessfu

[jira] [Updated] (PIG-2573) Automagically setting parallelism based on input file size does not work with HCatalog

2012-03-12 Thread Travis Crawford (Updated) (JIRA)
[ https://issues.apache.org/jira/browse/PIG-2573?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Travis Crawford updated PIG-2573: Attachment: PIG-2573_get_size_from_stats_if_possible_2.diff Updated patch renamed method, and total

Re: How can I track the actual map and reduce tasks executed in pig?

2012-03-12 Thread Dmitriy Ryaboy
There is a switch to turn off all optimizations. I don't recall what it is off the top of my head, but you can easily find out by running pig -h. Does that help?
On Mar 12, 2012, at 2:58 PM, Yongzhi Wang wrote:
> I found in the Physical plan, there are 3 split operator generated. I think
> tha

[jira] [Commented] (PIG-2573) Automagically setting parallelism based on input file size does not work with HCatalog

2012-03-12 Thread Travis Crawford (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/PIG-2573?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13227962#comment-13227962 ] Travis Crawford commented on PIG-2573: PIGSTORAGEWITHSTATISTICS COMMENT: Originally I

Re: How can I track the actual map and reduce tasks executed in pig?

2012-03-12 Thread Yongzhi Wang
I found that 3 split operators are generated in the physical plan. I think that is the reason for the 3 map tasks. Is it possible that in the future Pig could provide a parameter or syntax to control whether optimization is applied? Sometimes the one-to-one translation from the pig script to the Ma

[jira] [Updated] (PIG-2317) Ruby/Jruby UDFs

2012-03-12 Thread Daniel Dai (Updated) (JIRA)
[ https://issues.apache.org/jira/browse/PIG-2317?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Daniel Dai updated PIG-2317: Attachment: PIG-2317-8_plus.patch I tried the latest patch and found a couple of issues in e2e tests in mapreduce m

[jira] [Created] (PIG-2580) JsonStorage.java to also implement FileInputLoadFunc

2012-03-12 Thread Daniel Rule (Created) (JIRA)
JsonStorage.java to also implement FileInputLoadFunc
Key: PIG-2580
URL: https://issues.apache.org/jira/browse/PIG-2580
Project: Pig
Issue Type: Improvement
Components: build