[jira] Commented: (HIVE-1434) Cassandra Storage Handler
[ https://issues.apache.org/jira/browse/HIVE-1434?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12902616#action_12902616 ]

Edward Capriolo commented on HIVE-1434:
---------------------------------------

Regarding Maven: I am on the fence about it. We do not actually need all the libs I included. Having them in a tarball sounds good, but setting up a Maven repo for this purpose alone seems like a lot of work.

{quote}
Should we attempt to factor out the HBase commonality immediately, or commit the overlapping code and then do refactoring as a followup? I'm fine either way; I can give suggestions on how to create the reusable abstract bases and where to package+name them.
{quote}

If you can point to specific instances, then sure. The code may be 99% the same, but that one nuance is going to make the abstractions confusing and useless.

I await further review.

Cassandra Storage Handler
-------------------------

Key: HIVE-1434
URL: https://issues.apache.org/jira/browse/HIVE-1434
Project: Hadoop Hive
Issue Type: New Feature
Affects Versions: 0.7.0
Reporter: Edward Capriolo
Assignee: Edward Capriolo
Fix For: 0.7.0
Attachments: cas-handle.tar.gz, hive-1434-1.txt, hive-1434-2-patch.txt, hive-1434-3-patch.txt, hive-1434-4-patch.txt

Add a Cassandra storage handler.

--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.
[jira] Commented: (HIVE-1434) Cassandra Storage Handler
[ https://issues.apache.org/jira/browse/HIVE-1434?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12902621#action_12902621 ]

John Sichi commented on HIVE-1434:
----------------------------------

Regarding the dependencies: if we use the same mechanism as Hadoop, then we don't need a Maven repo. We just point Ivy at the tarball location. See the target ivy-retrieve-hadoop-source in build-common.xml, and the various ivy.xml files in subdirs.

If you can get this working against a standard Apache mirror download, I can start working on getting the files hosted on mirror.facebook.net, which has had better availability in the past.

For the refactor, let's do it in a followup and also talk with the Hypertable folks to plan it out, since I think they also had to copy a lot of code. I think it will be possible to do it in a way that is useful and understandable, since we now have three instances to work from.
[jira] Commented: (HIVE-1434) Cassandra Storage Handler
[ https://issues.apache.org/jira/browse/HIVE-1434?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12902688#action_12902688 ]

John Sichi commented on HIVE-1434:
----------------------------------

@Ed: to clarify about the tarball: we would just use a standard Cassandra distribution, e.g.

http://apache.opensourceresources.org/cassandra/0.6.4/apache-cassandra-0.6.4-bin.tar.gz
[jira] Commented: (HIVE-1434) Cassandra Storage Handler
[ https://issues.apache.org/jira/browse/HIVE-1434?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12899129#action_12899129 ]

John Sichi commented on HIVE-1434:
----------------------------------

I'll start taking a closer look at this one; it may take me a few days.
[jira] Commented: (HIVE-1434) Cassandra Storage Handler
[ https://issues.apache.org/jira/browse/HIVE-1434?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12884384#action_12884384 ]

Edward Capriolo commented on HIVE-1434:
---------------------------------------

I actually got pretty far with this by simply duplicating the logic in the HBase storage handler. Unfortunately, I hit a snag. Cassandra is not using the deprecated mapred.* API; its input format uses mapreduce.*. I have seen a few tickets for this, and as far as I know Hive is 100% mapred. So to get this done, we either have to wait until Hive is converted to mapreduce, or I have to write an old-school mapred-based input format for Cassandra.

@John, am I wrong? Is there a way to work with mapreduce input formats that I am not understanding?
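
Editor's sketch: one common way to bridge the two API styles discussed in the comment above is an adapter that presents a mapreduce.*-style input format through a mapred.*-style interface. The Python below only models the shape of that idea. Every class and function in it (Split, Holder, NewStyleInputFormat, NewStyleReader, OldStyleAdapter, OldStyleReaderAdapter, read_all) is a hypothetical stand-in, not a Hadoop, Hive, or Cassandra API, and nothing here is taken from the HIVE-1434 patches. The modelled difference is that the newer style creates a reader, initializes it, and then polls it for the current key and value, while the older style hands the reader caller-owned containers to fill on each call.

```python
from dataclasses import dataclass
from typing import Any, Dict, List, Tuple

Row = Tuple[Any, Any]


@dataclass
class Split:
    start: int  # index of the first row in this split
    end: int    # index one past the last row in this split


class Holder:
    """Caller-owned mutable container, filled in place by an old-style reader."""

    def __init__(self) -> None:
        self.value: Any = None


class NewStyleReader:
    """Stand-in for a new-style reader: initialize once, then poll."""

    def __init__(self, rows: List[Row]) -> None:
        self._rows = rows
        self._pos = 0
        self._end = 0
        self._current: Row = (None, None)

    def initialize(self, split: Split, context: Dict[str, Any]) -> None:
        self._pos, self._end = split.start, split.end

    def next_key_value(self) -> bool:
        if self._pos >= self._end:
            return False
        self._current = self._rows[self._pos]
        self._pos += 1
        return True

    def get_current_key(self) -> Any:
        return self._current[0]

    def get_current_value(self) -> Any:
        return self._current[1]


class NewStyleInputFormat:
    """Stand-in for a new-style input format over an in-memory row list."""

    def __init__(self, rows: List[Row]) -> None:
        self._rows = rows

    def get_splits(self, context: Dict[str, Any]) -> List[Split]:
        size = context["split_size"]  # hypothetical setting, must be > 0
        total = len(self._rows)
        return [Split(i, min(i + size, total)) for i in range(0, total, size)]

    def create_record_reader(self, split: Split, context: Dict[str, Any]) -> NewStyleReader:
        return NewStyleReader(self._rows)


class OldStyleReaderAdapter:
    """Exposes a new-style reader through an old-style next(key, value) call."""

    def __init__(self, reader: NewStyleReader) -> None:
        self._reader = reader

    def next(self, key: Holder, value: Holder) -> bool:
        if not self._reader.next_key_value():
            return False
        key.value = self._reader.get_current_key()
        value.value = self._reader.get_current_value()
        return True


class OldStyleAdapter:
    """Exposes a new-style input format through old-style method shapes."""

    def __init__(self, wrapped: NewStyleInputFormat) -> None:
        self._wrapped = wrapped

    def get_splits(self, job_conf: Dict[str, Any], num_splits_hint: int) -> List[Split]:
        # The hint is ignored in this sketch; the wrapped format decides.
        return self._wrapped.get_splits(dict(job_conf))

    def get_record_reader(self, split: Split, job_conf: Dict[str, Any]) -> OldStyleReaderAdapter:
        context = dict(job_conf)
        reader = self._wrapped.create_record_reader(split, context)
        reader.initialize(split, context)  # new style needs an explicit init step
        return OldStyleReaderAdapter(reader)


def read_all(fmt: OldStyleAdapter, job_conf: Dict[str, Any]) -> List[Row]:
    """Drive the adapter the way an old-style consumer would."""
    out: List[Row] = []
    for split in fmt.get_splits(job_conf, 1):
        reader = fmt.get_record_reader(split, job_conf)
        key, value = Holder(), Holder()
        while reader.next(key, value):
            out.append((key.value, value.value))
    return out
```

A consumer written against the old style would wrap the format once, for example OldStyleAdapter(NewStyleInputFormat(rows)), and then call read_all on it with a configuration containing split_size. Whether such a wrapper is practical for the real Cassandra input format depends on details this sketch does not model, such as split serialization and task context construction.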