Re: [jira] [Created] (AMATERASU-52) Implement AmaContext.datastores

Arun Manivannan Wed, 30 Jan 2019 08:02:30 -0800

Hi,

I realised that making data classes for the config may not be the right
approach considering we won't be able to know all the properties in
advance. Would you consider a Map instead?


I have updated the PR to reflect this proposal.  The datasets.yaml sticks
to the format that I mentioned in yesterday's mail.  Please have a look and
let me know if this works.
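
A minimal sketch of the Map-based idea, assuming the nested datasets.yaml layout quoted below (dataset type, then dataset name, then free-form properties). The DatasetConfig alias and the property helper are hypothetical names used for illustration only, not the code in the PR, and the map literal stands in for whatever the YAML parser would return.

```kotlin
// Hypothetical alias: dataset type -> dataset name -> free-form properties.
typealias DatasetConfig = Map<String, Map<String, Map<String, String>>>

// Stand-in for the parsed datasets.yaml; values are copied from the quoted example.
val datasets: DatasetConfig = mapOf(
    "hive" to mapOf(
        "transactions" to mapOf(
            "uri" to "/user/somepath",
            "format" to "parquet",
            "database" to "transations_daily",
            "table" to "transx"
        )
    ),
    "file" to mapOf(
        "users" to mapOf(
            "uri" to "s3://filestore",
            "format" to "parquet",
            "mode" to "overwrite"
        )
    )
)

// Look up one property; returns null when the type, name, or property is absent.
fun property(config: DatasetConfig, type: String, name: String, key: String): String? =
    config[type]?.get(name)?.get(key)

fun main() {
    println(property(datasets, "hive", "transactions", "table")) // transx
    println(property(datasets, "file", "users", "mode"))         // overwrite
    println(property(datasets, "file", "users", "table"))        // null
}
```

Because the innermost level is a plain Map, a new property on a dataset needs no change to the types, which is the trade-off against data classes: flexibility in exchange for compile-time checking of property names.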

Regards,
Arun

On Tue, Jan 29, 2019 at 10:05 PM Arun Manivannan <[email protected]> wrote:

> Makes sense, Nadav. I have been toying with the idea of having the
> structure like this. I am trying to make it work on konf (argggh!!) though.
> Do you think this sounds reasonable?
>
>
> datasets:
>   hive:
>     transactions:
>       uri: /user/somepath
>       format: parquet
>       database: transations_daily
>       table: transx
>
>     second_transactions:
>       uri: /seconduser/somepath
>       format: avro
>       database: transations_monthly
>       table: avro_table
>   file:
>     users:
>       uri: s3://filestore
>       format: parquet
>       mode: overwrite
>
>
>
> Cheers,
> Arun
>
>
> On Tue, Jan 29, 2019 at 1:45 PM Nadav Har Tzvi <[email protected]>
> wrote:
>
>> Hey Arun,
>>
>> I kinda feel like the datastores yaml is somewhat obscure. I propose the
>> following structure.
>>
>> Instead of
>>
>> datasets:
>>   hive:
>>     - key: transactions
>>       uri: /user/somepath
>>       format: parquet
>>       database: transations_daily
>>       table: transx
>>
>>     - key: second_transactions
>>       uri: /seconduser/somepath
>>       format: avro
>>       database: transations_monthly
>>       table: avro_table
>>   file:
>>     - key: users
>>       uri: s3://filestore
>>       format: parquet
>>       mode: overwrite
>>
>> I would have
>>
>> datasets:
>>   - key: transactions
>>     uri: /user/somepath
>>     format: parquet
>>     database: transations_daily
>>     table: transx
>>     type: hive
>>   - key: second_transactions
>>     uri: /seconduser/somepath
>>     format: avro
>>     database: transations_monthly
>>     table: avro_table
>>     type: hive
>>   - key: users
>>     uri: s3://filestore
>>     format: parquet
>>     mode: overwrite
>>     type: file
>>
>> In my opinion it is more straightforward and uniform, and I think it is
>> also simpler to handle in code.
>> What do you think?
>>
>> Cheers,
>> Nadav
>>
>>
>>
>> On Mon, 14 Jan 2019 at 00:57, Yaniv Rodenski <[email protected]> wrote:
>>
>> > Hi Arun,
>> >
>> > I've added my comments to the PR, but good call, I agree @Nadav Har Tzvi
>> > <[email protected]> should at least review as you both need to
>> > maintain compatible APIs.
>> >
>> > Cheers,
>> > Yaniv
>> >
>> > On Sun, Jan 13, 2019 at 10:21 PM Arun Manivannan <[email protected]> wrote:
>> >
>> >> Hi Guy, Yaniv and Nadav,
>> >>
>> >> This PR <https://github.com/apache/incubator-amaterasu/pull/39> just
>> >> captures part of the issue - the datasets.yaml, ConfigManager and the
>> >> test cases. The integration with the AmaContext is yet to be done, but I
>> >> would like to get your thoughts on the implementation.
>> >>
>> >> Guy - could you help shed some light on the syntax and the idiomatic
>> >> side of Kotlin itself? Newbie here.
>> >>
>> >> Cheers,
>> >> Arun
>> >>
>> >> On Fri, Oct 12, 2018 at 7:15 PM Yaniv Rodenski (JIRA) <[email protected]> wrote:
>> >>
>> >> > Yaniv Rodenski created AMATERASU-52:
>> >> > ---------------------------------------
>> >> >
>> >> >              Summary: Implement AmaContext.datastores
>> >> >                  Key: AMATERASU-52
>> >> >                  URL: https://issues.apache.org/jira/browse/AMATERASU-52
>> >> >              Project: AMATERASU
>> >> >           Issue Type: Task
>> >> >             Reporter: Yaniv Rodenski
>> >> >             Assignee: Arun Manivannan
>> >> >              Fix For: 0.2.1-incubating
>> >> >
>> >> >
>> >> > AmaContext.datastores should contain the data from datastores.yaml
>> >> >
>> >> >
>> >> >
>> >> > --
>> >> > This message was sent by Atlassian JIRA
>> >> > (v7.6.3#76005)
>> >> >
>> >>
>> >
>> >
>> > --
>> > Yaniv Rodenski
>> >
>> > +61 477 778 405
>> > [email protected]
>> >
>> >
>>
>
