Hello,
At this point, I have implemented a standalone version of the
1-bucket-theta join algorithm described in the Northeastern paper on
Hadoop MR, and would like to start porting it to Hive.
I have been looking at the code and believe that the main goal would be to
implement a new
Hi Andres,
Glad to hear about the progress!
Vikram is a Hive join implementation expert. He can guide you through this.
We can set up a WebEx or Google Hangout to discuss this. Does sometime
next week work for you? (Please let us know some hours that work for
you, in the Pacific time zone.)
Ok, that would be great! Except for Monday and Friday, I could meet any
day next week in the afternoon (Pacific time), since it is the end of the
day for me.
Thanks a lot,
Andrés
On 5/15/15, 4:13 PM, Thejas Nair thejas.n...@gmail.com wrote:
Hi Andres,
Glad to hear about the progress!
Vikram
So, I'd like to get started on this. The description in the design doc and the
theta join paper from Northeastern seem like a good place to start, to have a
baseline that I can later use for the more specific join algorithms I want to
try.
I created a JIRA account, and my username is
You can start your work from JoinOperator. Before that, you should follow the
steps in https://cwiki.apache.org/confluence/display/Hive/GettingStarted
-Original Message-
From: andres.qui...@parc.com [mailto:andres.qui...@parc.com]
Sent: Wednesday, April 08, 2015 8:49 PM
To:
Yes, the theta join paper from Northeastern is a good place to start.
There is also a presentation from the same folks on YouTube, which is
very useful.
I had a look at this issue as well earlier, and I had written up a
rough proposal. I had not organized the document well enough for
sharing
I don't have cycles for working on it in the next month or two. Maybe
after that.
On Wed, Apr 8, 2015 at 2:16 PM, andres.qui...@parc.com wrote:
This is certainly very helpful, thank you. Do you have any cycles to devote
to this issue at the moment, or in the near future?
-Original
This is certainly very helpful, thank you. Do you have any cycles to devote to
this issue at the moment, or in the near future?
-Original Message-
From: Thejas Nair [mailto:thejas.n...@gmail.com]
Sent: Wednesday, April 08, 2015 2:32 PM
To: dev
Subject: Re: Request for feedback on work
This is a great pointer, Szehon and Brock, thank you. I will catch up with the
material on theta joins and circle back.
Andrés
-Original Message-
From: Brock Noland [mailto:br...@apache.org]
Sent: Thursday, April 02, 2015 1:31 AM
To: dev@hive.apache.org
Subject: Re: Request for
Dear Lefty,
Thank you very much for pointing that out and for your initial pointers. Here
is the missing link:
ftp.parc.com/pub/torres/Hiperfuse/extended_hiperfuse.pdf
Regards,
Andrés
-Original Message-
From: Lefty Leverenz [mailto:leftylever...@gmail.com]
Sent: Wednesday, April 01,
Andrés, I followed that link and got the dread 404 Not Found:
The requested URI /pub/torres/Hiperfuse/extended_hiperfuse.pdf was not
found on this server.
-- Lefty
On Wed, Apr 1, 2015 at 7:23 PM, andres.qui...@parc.com wrote:
Dear Lefty,
Thank you very much for pointing that out and for
On the Hive side, there has been some thought on the subject here:
https://cwiki.apache.org/confluence/display/Hive/Theta+Join. It has some
ideas, but nobody has gotten around to giving it a try. It might be of
interest.
Thanks
Szehon
On Wed, Apr 1, 2015 at 10:05 PM, Lefty Leverenz
D'oh! Thanks Chao.
-- Lefty
On Thu, Apr 2, 2015 at 12:59 AM, Chao Sun c...@cloudera.com wrote:
Hey Lefty,
You need to use the ftp protocol, not http.
After clicking the link, you'll need to remove "http://" from the address
bar.
Best,
Chao
On Wed, Apr 1, 2015 at 9:41 PM, Lefty
Nice, it'd be great if someone finally implemented this :)
On Wed, Apr 1, 2015 at 10:10 PM, Szehon Ho sze...@cloudera.com wrote:
From Hive side, there has been some thought on the subject here:
https://cwiki.apache.org/confluence/display/Hive/Theta+Join, it has some
ideas but nobody has gotten
Hey Lefty,
You need to use the ftp protocol, not http.
After clicking the link, you'll need to remove "http://" from the address
bar.
Best,
Chao
On Wed, Apr 1, 2015 at 9:41 PM, Lefty Leverenz leftylever...@gmail.com
wrote:
Andrés, I followed that link and got the dread 404 Not Found:
The
Hello Andres, the link to your paper is missing:
In our preliminary work, which you can find here (pointer to the paper) ...
You can find general information about contributing to Hive in the
wiki: Resources for Contributors
Dear Hive development community members,
I am interested in learning more about the current support for non-equijoins in
Hive and/or other Hadoop SQL engines, and in getting feedback about community
interest in more extensive support for such a feature. I intend to work on this
challenge,