Re: [Java][Discuss]: consensus for JDK 8 deprecation

2023-10-06 Thread Jacob Wujciak-Jens
>From a release engineer perspective (without java knowledge) I agree with Micah, I'd rather make a patch release for an older version if needed but modernize the codebase and simplify CI! On Sat, Oct 7, 2023 at 5:27 AM Micah Kornfield wrote: > I think given the stability of Arrow Java,

Re: [Java][Discuss]: consensus for JDK 8 deprecation

2023-10-06 Thread Micah Kornfield
I think given the stability of Arrow Java, dropping support probably makes sense. If a bug comes up or consumers really need to new features we can always make a patch release of an older version. On Thu, Oct 5, 2023 at 3:13 PM Dane Pitkin wrote: > I also learned today that Apache Spark has

Re: [Vote][Format] (new proposal) C data interface format string for ListView and LargeListView arrays

2023-10-06 Thread Dewey Dunnington
+1! On Fri, Oct 6, 2023, 8:03 PM Matt Topol wrote: > +1 > > On Fri, Oct 6, 2023, 6:55 PM Benjamin Kietzman > wrote: > > > +1 > > > > On Fri, Oct 6, 2023, 17:27 Felipe Oliveira Carvalho > > > wrote: > > > > > Hello, > > > > > > I'm writing to propose "+vl" and "+vL" as format strings for

Re: [Vote][Format] (new proposal) C data interface format string for ListView and LargeListView arrays

2023-10-06 Thread Matt Topol
+1 On Fri, Oct 6, 2023, 6:55 PM Benjamin Kietzman wrote: > +1 > > On Fri, Oct 6, 2023, 17:27 Felipe Oliveira Carvalho > wrote: > > > Hello, > > > > I'm writing to propose "+vl" and "+vL" as format strings for list-view > and > > large list-view arrays passing through the Arrow C data interface

Re: [Vote][Format] (new proposal) C data interface format string for ListView and LargeListView arrays

2023-10-06 Thread Benjamin Kietzman
+1 On Fri, Oct 6, 2023, 17:27 Felipe Oliveira Carvalho wrote: > Hello, > > I'm writing to propose "+vl" and "+vL" as format strings for list-view and > large list-view arrays passing through the Arrow C data interface [1]. > > The previous proposal was considered a bad idea because existing

[Vote][Format] (new proposal) C data interface format string for ListView and LargeListView arrays

2023-10-06 Thread Felipe Oliveira Carvalho
Hello, I'm writing to propose "+vl" and "+vL" as format strings for list-view and large list-view arrays passing through the Arrow C data interface [1]. The previous proposal was considered a bad idea because existing parsers of these format strings might be looking at only the first `l` (or

Re: [DISCUSS][C++] Raw pointer string views

2023-10-06 Thread Neal Richardson
Agreed, it's unfortunately not just a simple tradeoff. We have discussed this a bit in [1] and in several other threads around this topic. If we say that Arrow is about interchange and not execution, so we shouldn't adopt the pointer version that DuckDB uses, that means we're also making

Re: [Vote][Format] C data interface format string for ListView and LargeListView arrays

2023-10-06 Thread Antoine Pitrou
Le 06/10/2023 à 17:54, Felipe Oliveira Carvalho a écrit : Hello, Since existing C Data Interface integrations sometimes don't parse beyond the first `l` (or `L`) I'm going to start a new [VOTE] thread with Dewey's suggestion: Regardless of which format string we choose for ListView, a bug

Re: [Vote][Format] C data interface format string for ListView and LargeListView arrays

2023-10-06 Thread Felipe Oliveira Carvalho
Hello, Since existing C Data Interface integrations sometimes don't parse beyond the first `l` (or `L`) I'm going to start a new [VOTE] thread with Dewey's suggestion: +vl and +vL If anyone objects to that and has a different suggestion, reply here so I don't have to spam the list with too many

Re: [DISCUSS][C++] Raw pointer string views

2023-10-06 Thread Weston Pace
> I feel the broader question here is what is Arrow's intended use case - interchange or execution The line between interchange and execution is not always clear. For example, I think we would like Arrow to be considered as a standard for UDF libraries. On Fri, Oct 6, 2023 at 7:34 AM Mark

Re: [DISCUSS][C++] Raw pointer string views

2023-10-06 Thread Mark Raasveldt
For the index vs pointer question - DuckDB went with pointers as they are more flexible, and DuckDB was designed to consume data (and strings) from a wide variety of formats in a wide variety of languages. Pointers allows us to easily zero-copy from e.g. Python strings, R strings, Arrow

Re: [DISCUSS][Rust][DataFusion][HiveMetaStore] Possible Metastore integration with Data Fusion

2023-10-06 Thread Andrew Lamb
You might be able to find relevant examples in the Delta lake Rust implementation[1] which I believe features DataFusion catalog integration You might also find the "Catalogs, Schemas, and Tables" section of the library developers guide[2] helpful. [1] https://github.com/delta-io/delta-rs [2]

Re: [DISCUSS][C++] Raw pointer string views

2023-10-06 Thread Andrew Lamb
Given I don't see any input from the DuckDB / Velox development team (this discussion seems primarily Arrow developers) I have filed a ticket in DuckDB requesting their consideration[1] and tried to bump the attention of the existing ticket in Velox[2]. Perhaps their input will provide a way

Re: [VOTE][Format] Variable shape tensor canonical extension type

2023-10-06 Thread Joris Van den Bossche
Worth noting that here were some minor changes made to the spec while the vote was active: - The "uniform_dimensions" metadata key was removed, since this can also be inferred from the "uniform_shape" information - The shape of non-constant dimensions in the "uniform_shape" entry is now

Re: [VOTE][Format] Variable shape tensor canonical extension type

2023-10-06 Thread Rok Mihevc
Hey All, We have 4 binding +1 votes, no non-binding +1 votes, and no -1 votes, so the vote passes. Thanks everyone for your work and participation on this! As a follow up we will: [ ] merge changes to the format ( https://github.com/apache/arrow/pull/37166/files) [ ] merge C++ and Python