[GitHub] [hudi] lw309637554 commented on pull request #1810: [HUDI-875] Abstract hudi-sync-common, and support hudi-hive-sync

2020-08-05 Thread GitBox


lw309637554 commented on pull request #1810:
URL: https://github.com/apache/hudi/pull/1810#issuecomment-669248590


   > @lw309637554 this seems ready?
   
   yes, but the test failed for some reason, i rerun it



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org




[GitHub] [hudi] lw309637554 commented on pull request #1810: [HUDI-875] Abstract hudi-sync-common, and support hudi-hive-sync

2020-08-01 Thread GitBox


lw309637554 commented on pull request #1810:
URL: https://github.com/apache/hudi/pull/1810#issuecomment-667553349


   > @lw309637554 would you please rebase and fix the conflicts.
   
   okay, done



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org




[GitHub] [hudi] lw309637554 commented on pull request #1810: [HUDI-875] Abstract hudi-sync-common, and support hudi-hive-sync

2020-07-24 Thread GitBox


lw309637554 commented on pull request #1810:
URL: https://github.com/apache/hudi/pull/1810#issuecomment-663442107


   > couple
   
   okay ,thanks 



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org




[GitHub] [hudi] lw309637554 commented on pull request #1810: [HUDI-875] Abstract hudi-sync-common, and support hudi-hive-sync

2020-07-21 Thread GitBox


lw309637554 commented on pull request #1810:
URL: https://github.com/apache/hudi/pull/1810#issuecomment-661736470


   @vinothchandar The pr is ready overall. Can you help to review ?



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org




[GitHub] [hudi] lw309637554 commented on pull request #1810: [HUDI-875] Abstract hudi-sync-common, and support hudi-hive-sync

2020-07-16 Thread GitBox


lw309637554 commented on pull request #1810:
URL: https://github.com/apache/hudi/pull/1810#issuecomment-659747098


   > > this is a good point. I think remove hive dependency from hudi-spark and 
hudi-utilities is a another work . we can open another issue resolve it
   > 
   > sounds good. follow up ticket: 
https://issues.apache.org/jira/browse/HUDI-1101
   
   very nice



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org




[GitHub] [hudi] lw309637554 commented on pull request #1810: [HUDI-875] Abstract hudi-sync-common, and support hudi-hive-sync

2020-07-15 Thread GitBox


lw309637554 commented on pull request #1810:
URL: https://github.com/apache/hudi/pull/1810#issuecomment-659182664


   > Hi @lw309637554 , I totally understand the importance of the backward 
capability. IMO, that will be great if we can remove the hive dependency from 
`hudi-spark` and `hudi-utilities`. If we treat `syncHive` separately, we still 
need to include some Hive related packages in these two modules.
   > I had this dependency issue before while I was testing the delta streamer. 
I didn't use Hive at all but I need to resolve some Hive dependency conflicts 
in my production environment. So I'd incline to sacrifice some backward 
capability and move all the Hive related packages to `hudi-hive-sync`. Do you 
think this is possible?
   > Happy to hear what you guys think.
   
   this is a good point. I think remove hive dependency from hudi-spark and  
hudi-utilities is a another   work . we can open another issue resolve it 



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org




[GitHub] [hudi] lw309637554 commented on pull request #1810: [HUDI-875] Abstract hudi-sync-common, and support hudi-hive-sync

2020-07-15 Thread GitBox


lw309637554 commented on pull request #1810:
URL: https://github.com/apache/hudi/pull/1810#issuecomment-658740024


   @vinothchandar @garyli1019  I think in this PR  the hudi-sync abstract is 
ready.  Expect your review. Thanks
   
   1. the module abstract is 
   hudi-sync
hudi-hive-sync/
hudi-dla-sync/
   
   2. about backwards compatible .
   a. about users code compatible: because do not modify the hivesync class 
name. Users do not need to modify these old code.
   b. about users pom dependency compatible: because do not modify the module 
name hudi-hive-sync. Users not need to modify pom dependency
   c. about users local jar files compatible: if users's local jars is do not 
shaded the hudi-hive-sync's
   Indirect dependence. Old users just need add the hudi-sync-common.jar to 
their directory. Just like hudi-utilities-bundle modiry in this PR.
   d. about sync parameters, just add new parameters such as --enable-sync as 
new default parameter,also backwards compatible   --enable-hive-sync.
   
   3. some others works ,such as update the doc. will do in other issues.
   
   what about your suggestion?



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org




[GitHub] [hudi] lw309637554 commented on pull request #1810: [HUDI-875] Abstract hudi-sync-common, and support hudi-hive-sync

2020-07-09 Thread GitBox


lw309637554 commented on pull request #1810:
URL: https://github.com/apache/hudi/pull/1810#issuecomment-656092089


   @vinothchandar  I  meet some mistakes, just opened a new PR.  
   In the new PR, I fix the build break. This is my thinks about backwards 
compatible .
   a.  about users code compatible: because do not modify the hivesync class 
name. Users do not need to modify these old code.
   b. about users  pom dependency compatible: because do not modify the module 
name hudi-hive-sync. Users not need to modify pom dependency
   c. about users local jar files compatible: if users's local jars is do not 
shaded the hudi-hive-sync's 
   Indirect dependence. Old users just need add the hudi-sync-common.jar to 
their directory. Just like hudi-utilities-bundle modiry in this PR.
   Reply: we can do this to put the hudi-sync-common base class to hudi-common
   d. about sync parameters, just add new parameters. Compatible is ok.
   
   what about your suggestion?



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org




[GitHub] [hudi] lw309637554 commented on pull request #1810: [HUDI-875] Abstract hudi-sync-common, and support hudi-hive-sync

2020-07-09 Thread GitBox


lw309637554 commented on pull request #1810:
URL: https://github.com/apache/hudi/pull/1810#issuecomment-656086937


   > @lw309637554 Thanks for your contribution! Overall looks good.
   > Open to discuss where to put these modules. I vote to put the base class 
to `hudi-common` and have separate modules for different query engines.
   
   thanks so much for your very valuable suggestion.



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org