[jira] [Created] (ARROW-2577) [Plasma] Add ASV benchmarks
Philipp Moritz created ARROW-2577: - Summary: [Plasma] Add ASV benchmarks Key: ARROW-2577 URL: https://issues.apache.org/jira/browse/ARROW-2577 Project: Apache Arrow Issue Type: Improvement Reporter: Philipp Moritz We are about to merge some PRs that potentially impact plasma performance, so we should set up ASV to track the changes. -- This message was sent by Atlassian JIRA (v7.6.3#76005)
Re: PyArrow and Parquet DELTA_BINARY_PACKED
Hello Feras, `DELTA_BINARY_PACKED` is at the moment only implemented in parquet-cpp on the read path. The necessary encoder implementation for this code is missing at the moment. The change in file size is something I also don't understand. The only difference between the two versions is that with version 2, we encode uint32 columns in version 1 as INT64 whereas in version two, we can encode them as UINT32. This type was not available in version 1. It would be nice, if you could narrow down the issue to e.g. the column which causes the increase in size. You might also use the Java parquet-tools or parquet-cli to inspect the size statistics of the parts of the individual Parquet file. Uwe On Fri, May 11, 2018, at 3:07 AM, Feras Salim wrote: > Hi, I was wondering if I'm missing something or currently the > `DELTA_BINARY_PACKED` is only available for reading when it comes to > parquet files, I can't find a way for the writer to encode timestamp data > with `DELTA_BINARY_PACKED`, furthermore I seem to get about 10% increase in > final file size when I change from ver 1 to ver 2 without changing anything > else about the schema or data.
Re: [VOTE] Accept donation of Arrow Ruby bindings
+1, thanks for the code donation and building the Ruby bindings. Uwe On Sat, May 12, 2018, at 8:53 AM, Kouhei Sutou wrote: > Hi, > > Thanks for starting the vote! > > +1 > > > Thanks, > -- > kou > > In> "[VOTE] Accept donation of Arrow Ruby bindings" on Fri, 11 May 2018 > 18:47:52 -0400, > Wes McKinney wrote: > > > Dear all, > > > > Arrow PMC member Kouhei Sutou has developed Ruby bindings to the GLib > > C interface for Apache Arrow > > > > * https://github.com/red-data-tools/red-arrow > > * https://github.com/red-data-tools/red-arrow-gpu > > > > He is proposing to pull these projects into Apache Arrow to develop > > them all in the same place > > > > https://github.com/apache/arrow/pull/1990 > > > > We are proposing to accept this code into the Apache project. If the > > vote passes, the PMC and Kou will work together to complete the ASF IP > > Clearance process (http://incubator.apache.org/ip-clearance/) and > > import the Ruby bindings for inclusion in a future release: > > > > [ ] +1 : Accept contribution of Ruby bindings > > [ ] 0 : No opinion > > [ ] -1 : Reject contribution because... > > > > Here is my vote: +1 > > > > The vote will be open for at least 72 hours. > > > > Thanks, > > Wes
[jira] [Created] (ARROW-2576) [GLib] Add abs functions for Decimal128.
yosuke shiro created ARROW-2576: --- Summary: [GLib] Add abs functions for Decimal128. Key: ARROW-2576 URL: https://issues.apache.org/jira/browse/ARROW-2576 Project: Apache Arrow Issue Type: New Feature Reporter: yosuke shiro -- This message was sent by Atlassian JIRA (v7.6.3#76005)