[ https://issues.apache.org/jira/browse/PHOENIX-2938?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15424898#comment-15424898 ]
Kalyan commented on PHOENIX-2938: --------------------------------- Thanks Josh Mahonin for your suggestions. i will work on these changes .. after the below bug fix. PHOENIX-3192 : phoenix-spark dataframe issue with combination of "column family + column name" otherwise i need to refactor again. > HFile support for SparkSQL DataFrame saves > ------------------------------------------ > > Key: PHOENIX-2938 > URL: https://issues.apache.org/jira/browse/PHOENIX-2938 > Project: Phoenix > Issue Type: Improvement > Reporter: Chris Tarnas > Assignee: Kalyan > Priority: Minor > > Currently when saving a DataFrame in Spark it is persisted as upserts. Having > an option to do saves natively via HFiles, as the MapReduce loader does, > would be a great performance improvement for large bulk loads. The current > work around to reduce the load on the regionservers would be to save to csv > from Spark then load via the MapReduce loader. -- This message was sent by Atlassian JIRA (v6.3.4#6332)