[Question] HoodieROTablePathFilter not accept dir path

2020-09-08 Thread Raymond Xu
https://github.com/apache/hudi/blob/9bcd3221fd440081dbae70e89d08539c3b484862/hudi-hadoop-mr/src/main/java/org/apache/hudi/hadoop/HoodieROTablePathFilter.java#L120-L121 As shown in the 2 lines above, it does not seem to work with directory Path. It should work for both `new Path("base/partition")`

Re: [DISCUSS] enable cross AZ consistency and quality checks of hudi datasets

2020-09-08 Thread Satish Kotha
Hi folks, Any thoughts on this? At a high level, we want to change high watermark commit through a property to perform pre-commit and post-commit hooks. Is this useful for anyone else? On Thu, Sep 3, 2020 at 11:12 AM Sanjay Sundaresan wrote: > Hello folks, > > We have a use case to make sure da

Re: [DISCUSS] New Community Weekly Sync up Time

2020-09-08 Thread Vinoth Chandar
Anyone else wants to chime in for a new time, that works for everyone? Personally, I can do this time. love to hear more inputs. On Wed, Sep 2, 2020 at 10:16 AM Pratyaksh Sharma wrote: > Hi everyone, > > Currently we are having weekly sync ups between 9 PM - 10 PM PST on > tuesdays. Since I h

Re: [DISCUSS] Formalizing the release process

2020-09-08 Thread Vinoth Chandar
>, bit skeptical on minor version releases every month, but nvm. guess its just a rough estimate. That's an aspirational goal that we should try to hit. We have all worked on teams/projects that shipped at that cadence regularly. It's a matter of getting our test infrastructure and processes strea

Re: [DISCUSS] New Community Weekly Sync up Time

2020-09-08 Thread Raymond Xu
I'm ok with 1 hr earlier. On Tue, Sep 8, 2020, 5:09 PM Vinoth Chandar wrote: > Anyone else wants to chime in for a new time, that works for everyone? > > Personally, I can do this time. > > love to hear more inputs. > > On Wed, Sep 2, 2020 at 10:16 AM Pratyaksh Sharma > wrote: > > > Hi everyon

Re: [DISCUSS] New Community Weekly Sync up Time

2020-09-08 Thread Mehrotra, Udit
I am okay with this too. On 9/8/20, 5:33 PM, "Raymond Xu" wrote: I'm ok with 1 hr earlier. On Tue, Sep 8, 202

Re: [DISCUSS] Formalizing the release process

2020-09-08 Thread Mehrotra, Udit
+1 on the process. On 9/8/20, 5:11 PM, "Vinoth Chandar" wrote: >, bit skeptical on minor version releases every mo

Hudi CLI AWS Glue & S3 Tables

2020-09-08 Thread Adam
Hey guys, I'm trying to use the Hudi CLI to connect to tables stored on S3 using the Glue metastore. Using a tip from Ashish M G on Slack, I added the dependencies, re-built and was able t

Re: [DISCUSS] Formalizing the release process

2020-09-08 Thread Raymond Xu
+1. Also a bit skeptical on monthly minor releases. But can give it a try. On Tue, Sep 8, 2020 at 5:55 PM Mehrotra, Udit wrote: > +1 on the process. > > On 9/8/20, 5:11 PM, "Vinoth Chandar" wrote: > > CAUTION: This email originated from outside of the organization. Do > not click links or

Re: [DISCUSS] Formalizing the release process

2020-09-08 Thread Vinoth Chandar
Would love to understand the general skepticism a bit more. Is it rooted more on hitting those in the short term? or even in the longer run with a better test infrastructure in place? On Tue, Sep 8, 2020 at 6:42 PM Raymond Xu wrote: > +1. Also a bit skeptical on monthly minor releases. But can g

Re: [DISCUSS] New Community Weekly Sync up Time

2020-09-08 Thread Balaji Varadarajan
+1 On Tuesday, September 8, 2020, 05:54:52 PM PDT, Mehrotra, Udit wrote: I am okay with this too. On 9/8/20, 5:33 PM, "Raymond Xu" wrote:     CAUTION: This email originated from outside of the organization. Do not click links or open attachments unless you can confirm the sender an

[Question] Redundant release tag?

2020-09-08 Thread Raymond Xu
I think there is a mistakenly created version tag 0.60 in JIRA; the number does not seem to follow the release format. Anyone care to delete this? https://issues.apache.org/jira/projects/HUDI/versions/12348551

Re: [Question] HoodieROTablePathFilter not accept dir path

2020-09-08 Thread Balaji Varadarajan
Hi Raymond, IIRC, we need to give a glob path for HoodieROTablePathFilter to work correctly (e.g. "base/partition/*"). The path cache is at the partition level, not the table level, so we need to extract the partition path correctly to use it as the lookup key. To extract partition-path, the ch
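
The lookup-key point in this reply can be illustrated with a small sketch. This is a Python illustration rather than Hudi's Java code, and it assumes (hypothetically) that the filter derives its cache key by taking the immediate parent of whatever path it is handed; the function name `partition_key` is invented for the example.

```python
import posixpath


def partition_key(path: str) -> str:
    """Hypothetical key derivation: use the immediate parent of the path.

    This mirrors the assumption stated above, not Hudi's actual implementation.
    """
    return posixpath.dirname(path.rstrip("/"))


# A data file inside the partition yields the partition directory as the key.
file_key = partition_key("base/partition/file1.parquet")  # "base/partition"

# The partition directory itself yields its parent, i.e. the wrong key.
dir_key = partition_key("base/partition")  # "base"
```

Under that assumption, file paths such as those matched by "base/partition/*" resolve to the partition-level key, while a bare directory path resolves one level too high.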

Re: [Question] Redundant release tag?

2020-09-08 Thread Balaji Varadarajan
Deleted. Thanks, Balaji.V On Tuesday, September 8, 2020, 08:51:36 PM PDT, Raymond Xu wrote: I think there is a mistakenly created version tag 0.60 in JIRA; the number does not seem to follow the release format. Anyone care to delete this? https://issues.apache.org/jira/projects/HUDI/vers

Request to Add in Contributor list

2020-09-08 Thread Mani Jindal
Hi team, please guide me on how I can request contributor access for JIRA so that I can assign JIRA tickets to myself and contribute to the Hudi community. JIRA Username: manijndl77 Email: manijn...@gmail.com Full Name: Mani Jindal Thanks and regards, Mani Jindal

20200908 Weekly Sync Minutes

2020-09-08 Thread Vinoth Chandar
https://cwiki.apache.org/confluence/display/HUDI/20200908+Weekly+Sync+Minutes Please find this week's sync notes

Re: [DISCUSS] Formalizing the release process

2020-09-08 Thread Pratyaksh Sharma
Missed this thread, the plan looks good to me as well. On Wed, Sep 9, 2020 at 8:31 AM Vinoth Chandar wrote: > Would love to understand the general skepticism a bit more. > Is it rooted more on hitting those in the short term? or even in the longer > run with a better test infrastructure in place