[jira] [Commented] (PHOENIX-1711) Improve performance of CSV loader

James Taylor (JIRA) Wed, 11 Mar 2015 16:41:02 -0700

    [ 
https://issues.apache.org/jira/browse/PHOENIX-1711?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14357819#comment-14357819
 ]


James Taylor commented on PHOENIX-1711:
---------------------------------------

Thanks for getting back to us, [~tulasip]. So this translates to a 50% 
improvement on the ~40% of overall time spent in 
csvUpsertExecutor.execute(...)? So roughly 20% better throughput?

If folks think this is worth it, I can clean up the patch, generalize it a bit 
for the regular UPSERT VALUES case too, and put it up for review.
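
For anyone checking the estimate above, here is a minimal sketch of the arithmetic. It assumes that "50% improvement" means the time spent in csvUpsertExecutor.execute(...) is halved and that the other mapper steps are unaffected; the class and method names are hypothetical and are not part of Phoenix. It uses the 39% share from the issue description.

```java
// Hypothetical helper, not part of Phoenix: estimates the overall effect of
// speeding up a single step that accounts for a known share of run time.
public class ThroughputEstimate {

    /** Fraction of the original run time left after one step gets faster. */
    public static double remainingTime(double stepShare, double stepReduction) {
        // Only the affected step shrinks; everything else is assumed unchanged.
        return 1.0 - stepShare * stepReduction;
    }

    /** Relative gain in records per unit time implied by the shorter run. */
    public static double throughputGain(double stepShare, double stepReduction) {
        return 1.0 / remainingTime(stepShare, stepReduction) - 1.0;
    }

    public static void main(String[] args) {
        double stepShare = 0.39;     // share of csvUpsertExecutor.execute(...) per the issue
        double stepReduction = 0.50; // assumption: time in that step is halved

        System.out.printf("Run time saved:  %.1f%%%n",
                100.0 * (1.0 - remainingTime(stepShare, stepReduction)));
        System.out.printf("Throughput gain: %.1f%%%n",
                100.0 * throughputGain(stepShare, stepReduction));
    }
}
```

Under these assumptions the total run time drops by about 20%, which works out to somewhat more than 20% in records processed per unit time, since throughput is the inverse of run time.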

> Improve performance of CSV loader
> ---------------------------------
>
>                 Key: PHOENIX-1711
>                 URL: https://issues.apache.org/jira/browse/PHOENIX-1711
>             Project: Phoenix
>          Issue Type: Bug
>            Reporter: James Taylor
>         Attachments: PHOENIX-1711.patch, PHOENIX-1711_4.0.patch
>
>
> Here is a breakdown of percentage execution time for some of the steps in the 
> mapper:
> csvParser: 18%
> csvUpsertExecutor.execute(ImmutableList.of(csvRecord)): 39%
> PhoenixRuntime.getUncommittedDataIterator(conn, true): 9%
> while (uncommittedDataIterator.hasNext()): 15%
> Read IO & custom processing: 19%
> See details here: http://s.apache.org/6rl



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
