[Rust][Datafusion] Dataframe state make public

2023-09-21 Thread Jaroslaw Nowosad
Hi, Looking for comments/your view: Would it be possible to: 1. patch datafusion dataframe to make df.state public 2. patch datafusion adding method to dataframe ie: df.transform_logical_plan(mut self, new_plan) -> df where some original plan could be modified / injected with NewPlanNode

[RUST][Ballista] UDF/UDAF in Ballista

2023-06-27 Thread Jaroslaw Nowosad
Hi, Quick question: is UDF/UDAF working in Ballista? I saw "TODO" in the executor part : ```rust // TODO add logic to dynamically load UDF/UDAFs libs from files scalar_functions: HashMap::new(), aggregate_functions: HashMap::new(), ``` To create an example library and add reading functionality

Re: [RUST][Datafusion][Ballista] Writer

2023-04-02 Thread Jaroslaw Nowosad
listing tables. If you are looking for support > beyond this, I'd like to hear the use for more help. > > Mete. > > On Sat, Apr 1, 2023 at 11:07 PM Jaroslaw Nowosad wrote: > > > Hi, > > > > Looking for advice: > > I'm looking into creating a writer

[RUST][Datafusion][Ballista] Writer

2023-04-01 Thread Jaroslaw Nowosad
Hi, Looking for advice: I'm looking into creating a writer part for ballista. There is a data source but not a sink. I started looking into object store -> put/put_multipart. But looks like simple context extension is not enough - do I need to extend logical/physical plan? If you have any

Re: [RUST][Datafusion] SQL UDF in Datafusion

2023-01-13 Thread Jaroslaw Nowosad
- https://github.com/dask-contrib/dask-sql > [2] - > > https://github.com/dask-contrib/dask-sql/blob/main/dask_planner/src/parser.rs#L385 > > On Thu, Jan 12, 2023 at 10:36 AM Jaroslaw Nowosad > wrote: > > > Hi all, > > > > I had a task to investi

[RUST][Datafusion] SQL UDF in Datafusion

2023-01-12 Thread Jaroslaw Nowosad
Hi all, I had a task to investigate how to extend Datafusion to add UDFs written in plain SQL. Reason behind: there is quite a big bunch of SQL UDF in existing java (spark) solutions, however we are starting to move into the Rust ecosystem and Datafussion/Arrow/Ballista looks like the proper

[Datafusion] Streaming - integration with kafka - kafka_writer

2022-06-24 Thread Jaroslaw Nowosad
Hi, I am just trying to integrate datafusion with kafka, final goal is to have end-to-end streaming. But I started from a "different side" -> step 1 is to publish output to kafka, so I copied code/ created kafka publisher: