Concur +1 From me! -- C
> On Jul 6, 2023, at 12:46 PM, Ted Dunning <[email protected]> wrote: > > That's a cool abstract. > > > > On Thu, Jul 6, 2023 at 8:29 AM Mike Beckerle <[email protected]> wrote: > >> I decided the only way to force getting this Drill + Daffodil integration >> done, or at least started, is to have a deadline. >> >> So I submitted this abstract below for the upcoming "Community over Code" >> (formerly known as ApacheCon) conference this fall (Oct 7-10) >> >> I'm hoping this forces some of the refactoring that is gating other efforts >> and fixes in Daffodil at the same time. >> >> *Direct Query of Arbitrary Data Formats using Apache Drill and Apache >> Daffodil* >> >> >> *Suppose you have data in an ad-hoc data format like **EDIFACT, ISO8583, >> Asterix, some COBOL FD, or any other kind of data. **You can now describe >> it with a Data Format Description Language (DFDL) schema, then u**sing >> Apache Drill, you can directly query that data and those queries can also >> incorporate data from any of Apache Drill's other array of data sources. >> **This >> talk will describe the integration of Apache Drill with Apache Daffodil's >> DFDL implementation. **This deep integration implements Drill's metadata >> model in terms of the Daffodil DFDL metadata model, and implements Drill's >> data model in terms of the Daffodil DFDL Infoset API. This enables Drill >> queries to operate intelligently on DFDL-described data without the cost of >> data conversion into an expensive intermediate form like JSON or XML. **The >> talk will highlight the specific challenges in this integration and the >> lessons learned that are applicable to integration of other Apache projects >> having their own metadata and data models. * >>
