Great!

On Mon, May 4, 2020 at 5:43 PM Allen Underwood
<allen.underw...@broadcom.com.invalid> wrote:

> Hi  Vinoth,
>
> Yes I was going to set some things up in the morning. I’ll let you know
> how it turns out and if it’s worth a PR I’ll get one together.
>
> Thanks again for your help!
>
> Allen
>
> On Mon, May 4, 2020 at 8:40 PM Vinoth Chandar <vin...@apache.org> wrote:
>
>> Thanks both!
>>
>> @allen heard this many times :) hear you. You could write a small class
>> yourself with your custom logic and throw it in there?
>>
>> If you think there is a way to fix the key generator in Hudi to be more
>> resilient to these (e.g taking in a list of supported patterns vs just the
>> one), let us know.
>>
>> On Mon, May 4, 2020 at 3:08 PM Allen Underwood
>> <allen.underw...@broadcom.com.invalid> wrote:
>>
>> > Hi Vinoth - that was extremely helpful...I almost had it working,
>> HOWEVER,
>> > it appears I have dates that some have the ms on the end and others
>> > don't....so if I pick adding a time format with them, then the ones
>> without
>> > the fail and vice versa....Good times.
>> >
>> > After I figure this out I'll see if I can put this information somewhere
>> > easy to find.
>> >
>> > On Mon, May 4, 2020 at 12:23 PM Vinoth Chandar <vin...@apache.org>
>> wrote:
>> >
>> >> Hi Allen,
>> >>
>> >> You are able to configure the key generator for deltastreamer using
>> this
>> >> property (either via a file or --config )
>> >> hoodie.datasource.write.keygenerator.class
>> >>
>> >> You might be interested in this built-in generator.
>> >>
>> >>
>> https://github.com/apache/incubator-hudi/blob/master/hudi-utilities/src/main/java/org/apache/hudi/utilities/keygen/TimestampBasedKeyGenerator.java#L64
>> >> It takes let you configure a field as a recordKey, and if you can parse
>> >> your timestamp using Java SimpleDateFormat, you can specify the
>> datetime
>> >> field and a pattern to parse it into..
>> >>
>> >> Happy to make this work for you.
>> >>
>> >> community, any volunteers to faq/document this? :)
>> >>
>> >>
>> >> On Mon, May 4, 2020 at 9:11 AM Allen Underwood
>> >> <allen.underw...@broadcom.com.invalid> wrote:
>> >>
>> >> > I’ve tried to do my due diligence by googling / searching this slack
>> and
>> >> > I’ve come up empty.Is there a way through configuration /
>> deltastreamer
>> >> > to extract a custom partition key?Basically I have a datetime field
>> in a
>> >> > Kafka Source that has an ISO8601 datetime….is there a way to extract
>> a
>> >> > partition value out of that?I found this after some Googling, but
>> this
>> >> > seems like it’d only be useful if I wanted to write my own writer
>> >> > application:
>> >> >
>> >> >
>> >>
>> https://github.com/apache/incubator-hudi/blob/master/hudi-spark/src/main/java/org/apache/hudi/keygen/ComplexKeyGenerator.javaAny
>> >> > way to do what I need through configuration of the spark job / hudi
>> >> > configuration?
>> >> >
>> hudi-spark/src/main/java/org/apache/hudi/keygen/ComplexKeyGenerator.java
>> >> > <
>> >>
>> https://github.com/apache/incubator-hudi/blob/master/hudi-spark/src/main/java/org/apache/hudi/keygen/ComplexKeyGenerator.java
>> >> >
>> >> >
>> >> > --
>> >> > *Allen Underwood*
>> >> >
>> >>
>> >
>> >
>> > --
>> > *Allen Underwood*
>> > Principal Software Engineer
>> > Broadcom | Symantec Enterprise Division
>> > *Mobile*: 404.808.5926
>> >
>>
> --
> *Allen Underwood*
> Principal Software Engineer
> Broadcom | Symantec Enterprise Division
> *Mobile*: 404.808.5926
>

Reply via email to