[ 
https://issues.apache.org/jira/browse/PIG-3325?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Dmitriy V. Ryaboy updated PIG-3325:
-----------------------------------

    Attachment: PIG-3325.2.patch

Updating with a patch.

Results:
||Num Tuples in Bag || Trunk avg || Patch 1 avg || Patch 2 avg ||
| 1 | round: 0.00 | round: 0.00 | round: 0.00 |
| 20 | round: 0.01 | round: 0.00 | round: 0.00 |
| 100 | round: 0.13 | round: 0.00 | round: 0.00
| 1000 | round: 0.19 | round: 1.20 | round: 0.03 |

I also ran Mark's bench test in a loop 10 times (again, to account for jit 
effects).

Results are as follows:

My Patch, Mark's test
 7050 ns
 450 ns
 440 ns
 550 ns
 440 ns
 440 ns
 440 ns
 440 ns
 440 ns
 540 ns
 410 ns
 440 ns
 440 ns
 430 ns
 460 ns
 
 
 Trunk, Mark's test
 243240 ns
 156640 ns
 25440 ns
 23470 ns
 18930 ns
 20710 ns
 16890 ns
 20210 ns
 17630 ns
 17900 ns
 21420 ns
 22550 ns
 22900 ns
 19800 ns
 16770 ns
 
 Mark's patch, Mark's Test
 8480 ns
 2750 ns
 2690 ns
 2760 ns
 3270 ns
 3590 ns
 6530 ns
 5900 ns
 6340 ns
 5410 ns
 5400 ns
 5420 ns
 5670 ns
 5410 ns
 5420 ns
                
> Adding a tuple to a bag is slow
> -------------------------------
>
>                 Key: PIG-3325
>                 URL: https://issues.apache.org/jira/browse/PIG-3325
>             Project: Pig
>          Issue Type: Bug
>    Affects Versions: 0.11, 0.11.1, 0.11.2
>            Reporter: Mark Wagner
>            Assignee: Mark Wagner
>            Priority: Critical
>         Attachments: PIG-3325.2.patch, PIG-3325.demo.patch, 
> PIG-3325.optimize.1.patch
>
>
> The time it takes to add a tuple to a bag has increased significantly, 
> causing some jobs to take about 50x longer compared to 0.10.1. I've tracked 
> this down to PIG-2923, which has made adding a tuple heavier weight (it now 
> includes some memory estimation).

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

Reply via email to