Re: [Agenda] Drill developer meetup 2018

hanu mapr Thu, 01 Nov 2018 13:58:56 -0700

Hello All,

There was typo for the year in the mail. It should be 2018 instead of 2019.
Thanks Aman for correcting it.


Regards,
-Hanu

On Thu, Nov 1, 2018 at 6:30 AM Charles Givre <[email protected]> wrote:

> Hi Hanumath,
> This looks great!!  Will you be streaming the event for those of us not in
> the Bay Area?
> Thx,
> — C
>
> > On Nov 1, 2018, at 00:10, Hanumath Rao Maduri <[email protected]>
> wrote:
> >
> > Drill Developers,
> >
> >
> > I am quite excited to announce the details of the Drill developers day
> > 2018. I have consolidated the topics from our earlier discussions and
> > prioritized them according to the votes.
> >
> >
> > MapR has offered to host it on Nov 14th in Training room downstairs.
> >
> >
> > Here is the exact location
> >
> >
> > Training Room at
> >
> > 4555 Great America Pkwy, Suite 201, Santa Clara, CA, 95054.
> >
> >
> > Please find the agenda for the meetup.
> >
> >
> >
> > *Lunch starts at 12:00PM.*
> >
> >
> > *[12:25 - 12:40] Welcome *
> >
> >   - Recap on last year's activities
> >   - Preview of this year's focus
> >
> > *[12:40 - 1:00] Storage plugins*
> >
> >
> >
> >   - Adding new storage plugins for the following:
> >      - Netflix Iceberg, Kudu(some code already exists), Cassandra,
> >      Elasticsearch, Carbondata, ORC/XML file formats, Spark
> >      RDD/DataFrames/Datasets, Graph databases & more
> >   - Improving documentation related to Storage plugins
> >
> >
> > *[1:00 - 1:45] Schema discovery & Evolution*
> >
> >
> >
> >   - Creation, management of schema
> >   - Handling schema changes in certain common cases
> >   - Handling NULL values elegantly
> >   - Schema learning (similar to MSGpack plugin)
> >   - Query hints
> >
> > *[1:45 - 2:30] Metadata Management*
> >
> >
> >
> >   - Defining an abstraction layer for various types of metadata: views,
> >   schema, statistics, security
> >   - Underlying storage for metadata: what are the options and their
> >   trade-offs?
> >   - Hive metastore
> >   - Parquet metadata cache (parquet specific for row group metadata)
> >   - Ease of using the parquet files generated by other engines (like
> spark)
> >
> >
> > *[2:30 - 2:45] Break*
> >
> >
> > *[2:45 - 4:00] Resource management*
> >
> >
> >
> >   - Resource limits per query
> >   - Optimal memory assignment for blocking operators based on stats
> >   - Enhancing the blocking and exchange operators to live within memory
> >   limits
> >   - Aligning with admission control/queueing (YARN concepts)
> >   - Query scheduling based on queues using tagging and costing
> >   - Drill on kubernetes
> >
> >
> > *[4:00 - 4:20] Apache Arrow*
> >
> >   - Benefits of integrating Apache Drill with Apache Arrow
> >   - Possible trade-offs & implementation hurdles
> >
> > *[4:20 - 4:40] **Performance Improvements*
> >
> >   - Efficient handling of Broadcast/Semi/Anti Semi join
> >   - Drill Statistics handling
> >   - Optimizing complex Parquet reader
> >
> > Thanks,
> > -Hanu
>
>

Re: [Agenda] Drill developer meetup 2018

Reply via email to