[
https://issues.apache.org/jira/browse/HCATALOG-448?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13413934#comment-13413934
]
Mithun Radhakrishnan commented on HCATALOG-448:
-----------------------------------------------
This is interesting. Comparing the time taken to store 50 MB of data, using
HCatStorer for the HCat runs:
Pig without HCat: 25 seconds
Pig with HCat0.2: 40 seconds
Pig with HCat0.4: 4 minutes 8 seconds
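
To put those timings in relative terms, here is a rough sketch that converts them to seconds and derives slowdown factors and throughput. It assumes "50 megs" means 50 MB and that each figure is a single wall-clock measurement; the names `timings_s`, `DATA_MB`, `slowdown` and `throughput_mb_per_s` are made up for illustration and are not part of Pig or HCatalog.

```python
# Wall-clock timings quoted above, converted to seconds.
timings_s = {
    "Pig without HCat": 25,
    "Pig with HCat0.2": 40,
    "Pig with HCat0.4": 4 * 60 + 8,  # 4 minutes 8 seconds
}

# Assumption: "50 megs" is taken to mean 50 MB of input.
DATA_MB = 50


def slowdown(run, baseline):
    """How many times longer `run` took than `baseline`."""
    return timings_s[run] / timings_s[baseline]


def throughput_mb_per_s(run):
    """Approximate MB stored per second for the given run."""
    return DATA_MB / timings_s[run]


for name, secs in timings_s.items():
    print(
        f"{name}: {secs} s, "
        f"{throughput_mb_per_s(name):.2f} MB/s, "
        f"{slowdown(name, 'Pig without HCat'):.2f}x vs plain Pig"
    )

print(
    "HCat0.4 vs HCat0.2: "
    f"{slowdown('Pig with HCat0.4', 'Pig with HCat0.2'):.1f}x"
)
```

On these numbers the HCat0.4 run takes 248 seconds, about 6.2 times as long as the HCat0.2 run. That comes from one run per configuration, so treat it as indicative only.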
I borrowed the Pig script from Arup Malakar's tests:
A = load '/tmp/myth/test/part-00000' as (key:chararray, val:chararray);
store A into 'hcat_02_rcfile_table' using org.apache.hcatalog.pig.HCatStorer();
This is with Pig 0.9.3, storing into a table declared "stored as rcfile". The
next step is to profile HCatStorer.
> HCatStorer performance is 4x slower in HCat 0.4 than HCat 0.2
> -------------------------------------------------------------
>
> Key: HCATALOG-448
> URL: https://issues.apache.org/jira/browse/HCATALOG-448
> Project: HCatalog
> Issue Type: Bug
> Affects Versions: 0.4.1
> Reporter: Rohini Palaniswamy
> Assignee: Mithun Radhakrishnan
> Priority: Critical
>