[
https://issues.apache.org/jira/browse/PHOENIX-1711?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14357819#comment-14357819
]
James Taylor commented on PHOENIX-1711:
---------------------------------------
Thanks for getting back to us, [~tulasip]. So this translates to 50%
improvement on the overall ~40% for csvUpsertExecutor.execute(...)? So 20%
better throughput?
If folks think this is worth it, I can cleanup the patch, generalize it a bit
for the regular UPSERT VALUES case too, and put it up for review.
> Improve performance of CSV loader
> ---------------------------------
>
> Key: PHOENIX-1711
> URL: https://issues.apache.org/jira/browse/PHOENIX-1711
> Project: Phoenix
> Issue Type: Bug
> Reporter: James Taylor
> Attachments: PHOENIX-1711.patch, PHOENIX-1711_4.0.patch
>
>
> Here is a break-up of percentage execution time for some of the steps inthe
> mapper:
> csvParser: 18%
> csvUpsertExecutor.execute(ImmutableList.of(csvRecord)): 39%
> PhoenixRuntime.getUncommittedDataIterator(conn, true): 9%
> while (uncommittedDataIterator.hasNext()): 15%
> Read IO & custom processing: 19%
> See details here: http://s.apache.org/6rl
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)