Dan Burkert has submitted this change and it was merged. Change subject: KUDU-1533 Spark Kudu Rdd/Dataframe upsert ......................................................................
KUDU-1533 Spark Kudu Rdd/Dataframe upsert This patch improves the Kudu SparkSQL integration in two ways: 1) Removed support for all SaveMode's except Append for the creatableRelationProvider trait of DefaultSource. This is an improvement because the other modes cannot be correctly implemented for Kudu without support for table truncation and because some modes require auto-table creation and, in that case, there's no satisfactory mechanism to specify things like the partition schema. 2) Added {insert, update, upsert, delete}Rows methods to KuduContext. This is the now preferred way to write to Kudu tables. Additionally, inserts to Kudu tables from Spark SQL using DefaultSource are now upserts by default. They can be returned to being strict inserts with the operation parameter. These changes may break some existing clients, so they have been documented in the release notes. Additionally, the enhancements to the KuduContext API, and its preferred status over using the DefaultSource to write to Kudu tables, have been documented in the examples and the release notes. Change-Id: Ib8e0d50fb74dc2ce5e757e8a56fc1e863f699822 Reviewed-on: http://gerrit.cloudera.org:8080/3871 Tested-by: Kudu Jenkins Reviewed-by: Dan Burkert <d...@cloudera.com> --- M docs/developing.adoc M docs/release_notes.adoc M java/kudu-spark/src/main/scala/org/apache/kudu/spark/kudu/DefaultSource.scala M java/kudu-spark/src/main/scala/org/apache/kudu/spark/kudu/KuduContext.scala M java/kudu-spark/src/main/scala/org/apache/kudu/spark/kudu/package.scala M java/kudu-spark/src/test/scala/org/apache/kudu/spark/kudu/DefaultSourceTest.scala 6 files changed, 239 insertions(+), 103 deletions(-) Approvals: Dan Burkert: Looks good to me, approved Kudu Jenkins: Verified -- To view, visit http://gerrit.cloudera.org:8080/3871 To unsubscribe, visit http://gerrit.cloudera.org:8080/settings Gerrit-MessageType: merged Gerrit-Change-Id: Ib8e0d50fb74dc2ce5e757e8a56fc1e863f699822 Gerrit-PatchSet: 6 Gerrit-Project: kudu Gerrit-Branch: master Gerrit-Owner: Will Berkeley <wdberke...@gmail.com> Gerrit-Reviewer: Chris George <chris.geo...@rms.com> Gerrit-Reviewer: Dan Burkert <d...@cloudera.com> Gerrit-Reviewer: Kudu Jenkins Gerrit-Reviewer: Ram Mettu <ram.me...@rms.com> Gerrit-Reviewer: Will Berkeley <wdberke...@gmail.com>