Re: Potential resource for large scale testing

2015-09-25 Thread Edmon Begoli
Steven - please send me your contact info (email for now, preferably Apache if you have one or Dremio) to ebegoliATutkDOTedu. Thank you, Edmon On Fri, Sep 25, 2015 at 12:18 PM, Jacques Nadeau wrote: > That is great news! From the Dremio side, I propose working with Steven. > Let's start taking

Re: Potential resource for large scale testing

2015-09-25 Thread Jacques Nadeau
That is great news! From the Dremio side, I propose working with Steven. Let's start taking advantage of this awesome resource! -- Jacques Nadeau CTO and Co-Founder, Dremio On Wed, Sep 23, 2015 at 5:34 PM, Edmon Begoli wrote: > This request has been approved. I will get more details tomorrow. >

Re: Potential resource for large scale testing

2015-09-23 Thread Edmon Begoli
This request has been approved. I will get more details tomorrow. I could add a few members of the Drill team to the resource, maybe one person from MapR and one from Dremio, who can have access and can assist in configuring (or instructing resource sysadmins on) how to run the big tests, if desired. Th

Re: Potential resource for large scale testing

2015-09-18 Thread Edmon Begoli
I requested 5000 hours a year on Beacon for Apache Drill for high performance benchmarking, testing and optimization. I will let you know of the resolution pretty soon. I expect these resources to be awarded to the project. On Fri, Sep 18, 2015 at 6:22 PM, Parth Chandra wrote: > +1 on running t

Re: Potential resource for large scale testing

2015-09-18 Thread Parth Chandra
+1 on running the build and tests. If we need to run some kind of stress tests, we could consider running TPC-H/TPC-DS at large scale factors. On Fri, Sep 18, 2015 at 2:24 PM, Jacques Nadeau wrote: > Not offhand. It really depends on how the time would work. For example, it > would be nice if we

Re: Potential resource for large scale testing

2015-09-18 Thread Jacques Nadeau
Not offhand. It really depends on how the time would work. For example, it would be nice if we had an automated, perfectly fresh (no .m2/repo) nightly build and full test suite run so people can always check the status. Maybe we use this hardware for that? -- Jacques Nadeau CTO and Co-Founder, Dre

Re: Potential resource for large scale testing

2015-09-18 Thread Edmon Begoli
We could use JICS/NICS resources to run memory stress tests - jobs requiring high RAM. Also, cluster stress tests. That is expensive to do on AWS. Plus we can provide some sys admin support. On Friday, September 18, 2015, rahul challapalli wrote: > Edmon, > > We do have the tests available now

Re: Potential resource for large scale testing

2015-09-18 Thread rahul challapalli
Edmon, We do have the tests available now [1]. Jacques, You expressed interest in making these tests available on an Amazon cluster so that users need not have physical hardware required to run these tests. Do you have any specific thoughts on how to leverage the resources that Edmon is willing

Re: Potential resource for large scale testing

2015-09-17 Thread Edmon Begoli
Yesterday I discussed with my team at JICS the idea of bringing a large compute resource to the project, and there was a general consensus that it can be committed. I will request and hopefully commit a pretty large set of clustered CPU/storage resources for the needs of the Drill project. I will be t

Re: Potential resource for large scale testing

2015-09-05 Thread Edmon Begoli
Ted, It is actually very easy and painless to do what I am proposing. I probably made it sound far more bureaucratic/legalistic than it really is. Researchers and projects from across the globe can apply for cycles on Beacon or any other HPC platform we run. (Beacon is by far the best and we alre

Re: Potential resource for large scale testing

2015-09-05 Thread Ted Dunning
Edmon, This is very interesting. I am sure that public acknowledgements of contributions are easily managed. What might be even more useful for you would be small scale publications, especially about the problems of shoe-horning real-world data objects into the quasi-relational model of Drill.

Re: Potential resource for large scale testing

2015-09-04 Thread Edmon Begoli
I can work with my institution and the NSF so that we commit time on the Beacon supercomputing cluster to Apache and the Drill project. Maybe 20 hours a month for 4-5 nodes. I have discretionary hours that I can put in, and I can, with our HPC admins, create deploy scripts on a few clustered machi

Re: Potential resource for large scale testing

2015-08-31 Thread Jacques Nadeau
I spent a bunch of time looking at the Phi coprocessors and forgot to get back to the thread. I'd love it if someone spent some time looking at leveraging them (since Drill is frequently processor bound). Any takers? -- Jacques Nadeau CTO and Co-Founder, Dremio On Mon, Aug 31, 2015 at 10:24 PM

Re: Potential resource for large scale testing

2015-08-31 Thread Parth Chandra
Hi Edmon, Sorry no one seems to have got back to you on this. We are in the process of publishing a test suite for regression testing Drill, and the cluster you have (even a few nodes) would be a great resource for folks to run the test suite. Rahul, et al are working on this and I would sugges

Potential resource for large scale testing

2015-08-25 Thread Edmon Begoli
Hey folks, As we discussed today on a hangout, this is a machine that we have at JICS/NICS where I have Drill installed and where I could set up a test cluster over a few nodes. https://www.nics.tennessee.edu/computing-resources/beacon/configuration Note that each node is: - 2x8-core Intel® Xeon®