Re: Working towards 0.10.0 release candidate

2018-07-23 Thread Craig Ching
Wes and Li, Thanks for your patience, and sorry for the noise! I did have an apache jira account, but it's been captured by a previous employer. I have created a new account and I can see the jira page now. Thanks! Cheers, Craig On Mon, Jul 23, 2018 at 10:18 PM Wes McKinney wrote: > hi Crai

Re: Working towards 0.10.0 release candidate

2018-07-23 Thread Wes McKinney
hi Craig -- do you have a JIRA account? I think that page should be visible to all logged in JIRA users. Let me know if not. - Wes On Mon, Jul 23, 2018 at 10:48 PM, Craig Ching wrote: > Thanks for this, Wes, I appreciate it! > > On Mon, Jul 23, 2018 at 8:16 PM Wes McKinney wrote: > >> You can a

Re: Working towards 0.10.0 release candidate

2018-07-23 Thread Craig Ching
Thanks for this, Wes, I appreciate it! On Mon, Jul 23, 2018 at 8:16 PM Wes McKinney wrote: > You can also have a look at > > https://cwiki.apache.org/confluence/display/ARROW/Arrow+0.10.0+Release > > On Mon, Jul 23, 2018 at 8:40 PM, Li Jin wrote: > > Craig, > > > > If you click the link > > htt

Re: Working towards 0.10.0 release candidate

2018-07-23 Thread Craig Ching
HI Li, Sorry, I should have said this originally, but that link leads me to a page that says "Sorry, you can't view this page" Cheers, Craig On Mon, Jul 23, 2018 at 7:40 PM Li Jin wrote: > Craig, > > If you click the link > https://issues.apache.org/jira/projects/ARROW/versions/12342562 you ca

[jira] [Created] (ARROW-2902) [Python] HDFS Docker integration tests leave around files created by root

2018-07-23 Thread Wes McKinney (JIRA)
Wes McKinney created ARROW-2902: --- Summary: [Python] HDFS Docker integration tests leave around files created by root Key: ARROW-2902 URL: https://issues.apache.org/jira/browse/ARROW-2902 Project: Apache

Re: Working towards 0.10.0 release candidate

2018-07-23 Thread Wes McKinney
You can also have a look at https://cwiki.apache.org/confluence/display/ARROW/Arrow+0.10.0+Release On Mon, Jul 23, 2018 at 8:40 PM, Li Jin wrote: > Craig, > > If you click the link > https://issues.apache.org/jira/projects/ARROW/versions/12342562 you can > view issues under "issues in progress"

Re: Working towards 0.10.0 release candidate

2018-07-23 Thread Li Jin
Craig, If you click the link https://issues.apache.org/jira/projects/ARROW/versions/12342562 you can view issues under "issues in progress" and "issues to do" On Mon, Jul 23, 2018 at 8:30 PM, Craig Ching wrote: > Hi Li, > > Is there a better link to see the 16 issues required for 0.10 to be > r

Re: Working towards 0.10.0 release candidate

2018-07-23 Thread Craig Ching
Hi Li, Is there a better link to see the 16 issues required for 0.10 to be released? Cheers, Craig On Mon, Jul 23, 2018 at 5:47 PM Li Jin wrote: > Sounds good. I will stand by now. > > On Mon, Jul 23, 2018 at 1:40 PM, Wes McKinney wrote: > > > hi Li -- thanks. We are working on these issues;

Re: [JAVA] SIMD vectorized fill of ArrowBuf from Java primitive type array?

2018-07-23 Thread Siddharth Teotia
Also look here to see how validity and data are copied independently between two vectors bypassing all Arrow APIs and directly manipulating memory. The link points to

Re: [JAVA] SIMD vectorized fill of ArrowBuf from Java primitive type array?

2018-07-23 Thread Siddharth Teotia
Eric, you can take a look here how we try to optimize the copy (validity and data) in/out of vectors. We try to start with word-wise copy (64 column values and thus 64 valid

Re: [JAVA] SIMD vectorized fill of ArrowBuf from Java primitive type array?

2018-07-23 Thread Wes McKinney
hi Eric, Antoine recently did some work on faster bitsetting in C++ by unrolling the main loop to set one byte at a time https://github.com/apache/arrow/blob/27b869ae5df31f3be61e76e9d96ea7d9b557/cpp/src/arrow/util/bit-util.h#L598 This yielded major speedups when setting a lot of bits. A simi

[jira] [Created] (ARROW-2901) [Java] Build is failing on Java9

2018-07-23 Thread Wes McKinney (JIRA)
Wes McKinney created ARROW-2901: --- Summary: [Java] Build is failing on Java9 Key: ARROW-2901 URL: https://issues.apache.org/jira/browse/ARROW-2901 Project: Apache Arrow Issue Type: Bug

Re: Working towards 0.10.0 release candidate

2018-07-23 Thread Li Jin
Sounds good. I will stand by now. On Mon, Jul 23, 2018 at 1:40 PM, Wes McKinney wrote: > hi Li -- thanks. We are working on these issues; I will report in on > the status at the end of the week. If anything is C++/Python or > Website-related, please don't move anything out of the milestone > wit

Arrow Slack channel export

2018-07-23 Thread Wes McKinney
I exported all the data from our Slack channel, it's here (1.3 MB): https://drive.google.com/file/d/1ED61i-nPxetKF-JzrYhtz5WpTHnyvK2V/view Someone other PMC members probably should store a copy somewhere for archival purposes Thanks, Wes

[JAVA] SIMD vectorized fill of ArrowBuf from Java primitive type array?

2018-07-23 Thread Eric Wohlstadter
Hi all, I work on a project that uses Arrow streaming format to transfer data between Java processes. We're also following the progress on Java support for Plasma, and may decide use Plasma also. We typically uses a pattern like this to fill Arrow vectors from Java arrays: int[] inputValues

Proof of concept work and conferences

2018-07-23 Thread White4, Ryan (STATCAN)
Hello, I am working on some proof-of-concept with Arrow. I am just getting started, but I've been trolling for a few months, reading the documentation, etc... I pitched the idea of using a standard data format in a conference abstract which has been accepted (https://www.adrfconference.org/). I

Re: [DISCUSS] Contribution of Gandiva to Apache Arrow

2018-07-23 Thread Wes McKinney
hi Jacques, I can say that I for one am excited to see this work happen in Apache Arrow, and I would like to contribute to the initiative. Some responses to queries inline: On Mon, Jul 23, 2018 at 2:50 PM, Jacques Nadeau wrote: > Hey All, > > As I mentioned on some other threads, since the start

[jira] [Created] (ARROW-2900) [Python] Improve performance of appending nested NumPy arrays in builtin_convert.cc

2018-07-23 Thread Wes McKinney (JIRA)
Wes McKinney created ARROW-2900: --- Summary: [Python] Improve performance of appending nested NumPy arrays in builtin_convert.cc Key: ARROW-2900 URL: https://issues.apache.org/jira/browse/ARROW-2900 Proje

[jira] [Created] (ARROW-2899) [Python] Implement type unification abstraction for type inference code

2018-07-23 Thread Wes McKinney (JIRA)
Wes McKinney created ARROW-2899: --- Summary: [Python] Implement type unification abstraction for type inference code Key: ARROW-2899 URL: https://issues.apache.org/jira/browse/ARROW-2899 Project: Apache A

[jira] [Created] (ARROW-2898) [Packaging] Setuptools_scm just shipped a new version which fails to parse `apache-arrow-` tag

2018-07-23 Thread Krisztian Szucs (JIRA)
Krisztian Szucs created ARROW-2898: -- Summary: [Packaging] Setuptools_scm just shipped a new version which fails to parse `apache-arrow-` tag Key: ARROW-2898 URL: https://issues.apache.org/jira/browse/ARROW-2898

[DISCUSS] Contribution of Gandiva to Apache Arrow

2018-07-23 Thread Jacques Nadeau
Hey All, As I mentioned on some other threads, since the start, the goal of the Dremio team was to contribute the Gandiva back into the Arrow project once we made some progress in turning it into something real. I think we're getting close to this point and wanted to have a bit of a discussion abo

Re: Working towards 0.10.0 release candidate

2018-07-23 Thread Wes McKinney
hi Li -- thanks. We are working on these issues; I will report in on the status at the end of the week. If anything is C++/Python or Website-related, please don't move anything out of the milestone without checking with me. The most at risk JIRA for the 0.10 release AFAICT is ARROW-2704: https://g

Working towards 0.10.0 release candidate

2018-07-23 Thread Li Jin
Hi All, As we get close to cut 0.10.0 release candidate, I want to clean up the remaining 16 open issues. Here is the board: https://issues.apache.org/jira/projects/ARROW/versions/12342562 For the two with PR open, I'd assume we can include them in the release once review is finished. For the r

Re: Spam in the Arrow wiki

2018-07-23 Thread Uwe L. Korn
> @Uwe, how are you seeing the spam? I haven't gotten any notifications about it I've watched the wiki and get every change in there as a mail. This has normally a delay of ~15min. Uwe

Re: Spam in the Arrow wiki

2018-07-23 Thread Wes McKinney
We can enable specific users to edit pages without making them admins, I believe. If anyone wants to edit the wiki, please let us know so we can add you to the permissions list for now. @Uwe, how are you seeing the spam? I haven't gotten any notifications about it On Mon, Jul 23, 2018 at 11:15 AM

Spam in the Arrow wiki

2018-07-23 Thread Uwe L. Korn
Hello all, I have just deactivated the permissions for non-Admins to edit pages in the Arrow wiki due to some spam bots posting. I'm not sure what the correct way forward is but sadly there are always some problems :( Cheers Uwe

[jira] [Created] (ARROW-2897) Organize supported Ubuntu versions

2018-07-23 Thread okkez (JIRA)
okkez created ARROW-2897: Summary: Organize supported Ubuntu versions Key: ARROW-2897 URL: https://issues.apache.org/jira/browse/ARROW-2897 Project: Apache Arrow Issue Type: Improvement Com