[jira] [Created] (ARROW-2577) [Plasma] Add ASV benchmarks

2018-05-13 Thread Philipp Moritz (JIRA)
Philipp Moritz created ARROW-2577:
-

 Summary: [Plasma] Add ASV benchmarks
 Key: ARROW-2577
 URL: https://issues.apache.org/jira/browse/ARROW-2577
 Project: Apache Arrow
  Issue Type: Improvement
Reporter: Philipp Moritz


We are about to merge some PRs that potentially impact Plasma performance, so 
we should set up ASV benchmarks to track the changes.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


Re: PyArrow and Parquet DELTA_BINARY_PACKED

2018-05-13 Thread Uwe L. Korn
Hello Feras,

`DELTA_BINARY_PACKED` is currently implemented in parquet-cpp only on the 
read path. The encoder needed to write this encoding has not been implemented 
yet.

I don't understand the change in file size either. The only difference 
between the two versions is that version 1 encodes uint32 columns as INT64, 
whereas version 2 can encode them as UINT32, a type that was not available in 
version 1. It would help if you could narrow the issue down, e.g. to the 
column that causes the increase in size. You might also use the Java 
parquet-tools or parquet-cli to inspect the size statistics of the individual 
parts of the Parquet file.
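
If you would rather stay in Python, the per-column sizes can probably also be 
read from the file metadata. The following is only a rough sketch: the helper 
names (`column_sizes`, `size_increase`, `compare_files`) are made up for 
illustration, and it assumes your pyarrow version exposes 
`pyarrow.parquet.read_metadata` and the column chunk attributes 
`path_in_schema`, `total_compressed_size` and `total_uncompressed_size`.

```python
from collections import defaultdict


def column_sizes(metadata):
    """Sum compressed and uncompressed bytes per column over all row groups.

    `metadata` is assumed to behave like pyarrow.parquet.FileMetaData.
    """
    totals = defaultdict(lambda: {"compressed": 0, "uncompressed": 0})
    for rg_index in range(metadata.num_row_groups):
        row_group = metadata.row_group(rg_index)
        for col_index in range(row_group.num_columns):
            chunk = row_group.column(col_index)
            entry = totals[chunk.path_in_schema]
            entry["compressed"] += chunk.total_compressed_size
            entry["uncompressed"] += chunk.total_uncompressed_size
    return dict(totals)


def size_increase(sizes_v1, sizes_v2):
    """Return (column, growth in compressed bytes) pairs, largest growth first."""
    growth = {
        name: sizes_v2[name]["compressed"] - sizes_v1[name]["compressed"]
        for name in sizes_v1
        if name in sizes_v2
    }
    return sorted(growth.items(), key=lambda item: item[1], reverse=True)


def compare_files(path_v1, path_v2):
    """Compare two Parquet files written from the same data (hypothetical paths)."""
    # Imported lazily so the helpers above do not require pyarrow themselves.
    import pyarrow.parquet as pq

    return size_increase(
        column_sizes(pq.read_metadata(path_v1)),
        column_sizes(pq.read_metadata(path_v2)),
    )
```

Calling `compare_files` on the version 1 and version 2 files should list the 
columns that grew the most first, which would show whether a single column 
accounts for the roughly 10% increase.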

Uwe

On Fri, May 11, 2018, at 3:07 AM, Feras Salim wrote:
> Hi, I was wondering if I'm missing something or currently the
> `DELTA_BINARY_PACKED` is only available for reading when it comes to
> parquet files, I can't find a way for the writer to encode timestamp data
> with `DELTA_BINARY_PACKED`, furthermore I seem to get about 10% increase in
> final file size when I change from ver 1 to ver 2 without changing anything
> else about the schema or data.


Re: [VOTE] Accept donation of Arrow Ruby bindings

2018-05-13 Thread Uwe L. Korn
+1, thanks for the code donation and building the Ruby bindings.

Uwe

On Sat, May 12, 2018, at 8:53 AM, Kouhei Sutou wrote:
> Hi,
> 
> Thanks for starting the vote!
> 
> +1
> 
> 
> Thanks,
> --
> kou
> 
> In 
>   "[VOTE] Accept donation of Arrow Ruby bindings" on Fri, 11 May 2018 
> 18:47:52 -0400,
>   Wes McKinney  wrote:
> 
> > Dear all,
> > 
> > Arrow PMC member Kouhei Sutou has developed Ruby bindings to the GLib
> > C interface for Apache Arrow
> > 
> >  * https://github.com/red-data-tools/red-arrow
> >  * https://github.com/red-data-tools/red-arrow-gpu
> > 
> > He is proposing to pull these projects into Apache Arrow to develop
> > them all in the same place
> > 
> > https://github.com/apache/arrow/pull/1990
> > 
> > We are proposing to accept this code into the Apache project. If the
> > vote passes, the PMC and Kou will work together to complete the ASF IP
> > Clearance process (http://incubator.apache.org/ip-clearance/) and
> > import the Ruby bindings for inclusion in a future release:
> > 
> > [ ] +1 : Accept contribution of Ruby bindings
> > [ ]  0 : No opinion
> > [ ] -1 : Reject contribution because...
> > 
> > Here is my vote: +1
> > 
> > The vote will be open for at least 72 hours.
> > 
> > Thanks,
> > Wes


[jira] [Created] (ARROW-2576) [GLib] Add abs functions for Decimal128.

2018-05-13 Thread yosuke shiro (JIRA)
yosuke shiro created ARROW-2576:
---

 Summary: [GLib] Add abs functions for Decimal128.
 Key: ARROW-2576
 URL: https://issues.apache.org/jira/browse/ARROW-2576
 Project: Apache Arrow
  Issue Type: New Feature
Reporter: yosuke shiro





