Re: Request for feedback on work intent for non-equijoin support

2015-05-15 Thread Andres.Quiroz
Hello, At this point, I have implemented a standalone version of the 1-bucket-theta join algorithm described in the northeastern paper on Hadoop MR, and would like to start porting it to Hive. I have been looking at the code and believe that the main goal would be to implement a new

Re: Request for feedback on work intent for non-equijoin support

2015-05-15 Thread Thejas Nair
Hi Andres, Glad to hear about the progress! Vikram is a hive join implementation expert. He can guide you through this. We can setup a webex or google hangout and discuss this. Does sometime next week work for you ? (Please let us know some hours that work for you, in Pacific time zone).

Re: Request for feedback on work intent for non-equijoin support

2015-05-15 Thread Andres.Quiroz
Ok, that would be great! Except for Monday and Friday, I could meet any day next week in the afternoon (Pacific time), since it is the end of the day for me. Thanks a lot, Andrés On 5/15/15, 4:13 PM, Thejas Nair thejas.n...@gmail.com wrote: Hi Andres, Glad to hear about the progress! Vikram

RE: Request for feedback on work intent for non-equijoin support

2015-04-08 Thread Andres.Quiroz
So, I'd like to get started on this. The description in the design doc and the theta join paper from Northeastern seem like a good place to start, to have a baseline that I can later use for the more specific join algorithms I want to try. I created a JIRA account, and my username is

RE: Request for feedback on work intent for non-equijoin support

2015-04-08 Thread Xu, Cheng A
You can start your work from JoinOperator. Before that, you should follow the steps in https://cwiki.apache.org/confluence/display/Hive/GettingStarted -Original Message- From: andres.qui...@parc.com [mailto:andres.qui...@parc.com] Sent: Wednesday, April 08, 2015 8:49 PM To:

Re: Request for feedback on work intent for non-equijoin support

2015-04-08 Thread Thejas Nair
Yes, the theta join paper in northeastern is a good place to start. There is also a presentation from the folks in youtube, which is also very useful. I had a look at this issue as well earlier, and I had written up a rough proposal. I had not organized the document well enough for sharing

Re: Request for feedback on work intent for non-equijoin support

2015-04-08 Thread Thejas Nair
I don't have cycles for working on it in the next month or two. Maybe after that. On Wed, Apr 8, 2015 at 2:16 PM, andres.qui...@parc.com wrote: This is certainly very helpful, thank you. Do you have any cycles to devote to this issue at the moment, or in the near future? -Original

RE: Request for feedback on work intent for non-equijoin support

2015-04-08 Thread Andres.Quiroz
This is certainly very helpful, thank you. Do you have any cycles to devote to this issue at the moment, or in the near future? -Original Message- From: Thejas Nair [mailto:thejas.n...@gmail.com] Sent: Wednesday, April 08, 2015 2:32 PM To: dev Subject: Re: Request for feedback on work

RE: Request for feedback on work intent for non-equijoin support

2015-04-02 Thread Andres.Quiroz
This is a great pointer, Szehon and Brock, thank you. I will catch up with the material on theta joins and circle back. Andrés -Original Message- From: Brock Noland [mailto:br...@apache.org] Sent: Thursday, April 02, 2015 1:31 AM To: dev@hive.apache.org Subject: Re: Request for

RE: Request for feedback on work intent for non-equijoin support

2015-04-01 Thread Andres.Quiroz
Dear Lefty, Thank you very much for pointing that out and for your initial pointers. Here is the missing link: ftp.parc.com/pub/torres/Hiperfuse/extended_hiperfuse.pdf Regards, Andrés -Original Message- From: Lefty Leverenz [mailto:leftylever...@gmail.com] Sent: Wednesday, April 01,

Re: Request for feedback on work intent for non-equijoin support

2015-04-01 Thread Lefty Leverenz
Andrés, I followed that link and got the dread 404 Not Found: The requested URI /pub/torres/Hiperfuse/extended_hiperfuse.pdf was not found on this server. -- Lefty On Wed, Apr 1, 2015 at 7:23 PM, andres.qui...@parc.com wrote: Dear Lefty, Thank you very much for pointing that out and for

Re: Request for feedback on work intent for non-equijoin support

2015-04-01 Thread Szehon Ho
From Hive side, there has been some thought on the subject here: https://cwiki.apache.org/confluence/display/Hive/Theta+Join, it has some ideas but nobody has gotten around to giving it a try. It might be of interest. Thanks Szehon On Wed, Apr 1, 2015 at 10:05 PM, Lefty Leverenz

Re: Request for feedback on work intent for non-equijoin support

2015-04-01 Thread Lefty Leverenz
D'oh! Thanks Chao. -- Lefty On Thu, Apr 2, 2015 at 12:59 AM, Chao Sun c...@cloudera.com wrote: Hey Lefty, You need to use the ftp protocol, not http. After clicking the link, you'll need to remove http://; from the address bar. Best, Chao On Wed, Apr 1, 2015 at 9:41 PM, Lefty

Re: Request for feedback on work intent for non-equijoin support

2015-04-01 Thread Brock Noland
Nice, it'd be great if someone finally implemented this :) On Wed, Apr 1, 2015 at 10:10 PM, Szehon Ho sze...@cloudera.com wrote: From Hive side, there has been some thought on the subject here: https://cwiki.apache.org/confluence/display/Hive/Theta+Join, it has some ideas but nobody has gotten

Re: Request for feedback on work intent for non-equijoin support

2015-04-01 Thread Chao Sun
Hey Lefty, You need to use the ftp protocol, not http. After clicking the link, you'll need to remove http://; from the address bar. Best, Chao On Wed, Apr 1, 2015 at 9:41 PM, Lefty Leverenz leftylever...@gmail.com wrote: Andrés, I followed that link and got the dread 404 Not Found: The

Re: Request for feedback on work intent for non-equijoin support

2015-03-31 Thread Lefty Leverenz
Hello Andres, the link to your paper is missing: In our preliminary work, which you can find here (pointer to the paper) ... You can find general information about contributing to Hive in the wiki: Resources for Contributors

Request for feedback on work intent for non-equijoin support

2015-03-31 Thread Andres.Quiroz
Dear Hive development community members, I am interested in learning more about the current support for non-equijoins in Hive and/or other Hadoop SQL engines, and in getting feedback about community interest in more extensive support for such a feature. I intend to work on this challenge,