I need to familiarize myself with the design first. Before I start this work,
I will discuss it with you all again.

BR,
TyrantLucifer


Zongwen Li <[email protected]> wrote on Fri, Aug 12, 2022 at 10:42:

> These factories should be enabled so that configuration is easier for users
> and more features can be supported.
>
> Below I briefly explain each factory:
> 1. CatalogFactory: serves the same goal as the user-defined schema; it is
> used to obtain the structural information of a table.
> 2. DeserializationFormatFactory & SerializationFormatFactory: used to let
> Kafka, file, and other unstructured connectors process data in various
> formats.
> 3. TableSourceFactory: used to support multi-table reading and metadata
> columns. It is an upper-level extension of source.
> 4. TableSinkFactory: at the same level as TableSourceFactory, but it
> currently has no supported features.
>
> You can review the flowchart in this proposal and refine the entire
> process: https://github.com/apache/incubator-seatunnel/issues/1608
>
> Best,
> Zongwen Li
>
> JUN GAO <[email protected]> wrote on Fri, Aug 12, 2022 at 10:24:
>
> > Should we enable `org.apache.seatunnel.api.table.factory` and optimize the
> > design? I think @Lucifer Tyrant can do this.
> >
> >
> > Zongwen Li <[email protected]> wrote on Fri, Aug 12, 2022 at 10:15:
> >
> > > The Format feature already exists; the problem is that we haven't enabled
> > > it yet.
> > > In fact, none of the features in the
> > > `org.apache.seatunnel.api.table.factory` package under the
> > > `seatunnel-api` module are enabled, and they are mutually dependent.
> > >
> > > Best,
> > > Zongwen Li
> > >
> > > 范佳 <[email protected]> wrote on Thu, Aug 11, 2022 at 15:32:
> > >
> > > > +1
> > > >
> > > > Thanks for opening this discussion.
> > > >
> > > > ________________________
> > > >
> > > > Jia Fan
> > > >
> > > > > On Aug 11, 2022, at 14:35, Lucifer Tyrant <[email protected]> wrote:
> > > > >
> > > > > Hi SeaTunnel community,
> > > > >
> > > > > According to PR #2392, SeaTunnel has a new feature to parse schema
> > > > > information. Next, I want to discuss with you all how to define the data
> > > > > format in the source connector configuration.
> > > > >
> > > > > In my opinion, the config can be defined as follows:
> > > > >
> > > > > schema {
> > > > >  fields {}
> > > > >  format = xxx
> > > > >  delimiter = xxx
> > > > > }
> > > > >
> > > > > We can abstract a new interface to parse the format information in the
> > > > > config. This interface can use *seatunnel-formats* to convert upstream raw
> > > > > data into a SeaTunnel row. What do you think?
> > > > >
> > > > > TyrantLucifer
> > > > >
> > > >
> > > >
> > >
> >
> >
> > --
> >
> > Best Regards
> >
> > ------------
> >
> > EricJoy2048
> > [email protected]
> >
>
