RE: Questions about Hive

Hamilton, Robert (Austin) Mon, 17 Sep 2012 07:49:55 -0700

Hello, something :)
Regarding the JDBC-style access: I understand this approach has some
limitations, but here is an example.
You will need to make sure the Hive service is running:
https://cwiki.apache.org/Hive/hiveserver.html
Here is some sample code that I've used for testing. It is not the best Java in
the world, but it gets the job done.
You will need to make sure the Hive and Hadoop jars are on the classpath. Note
that you will have to edit the connectionString.



import java.sql.*;

public class RunSQL {
    private static String driverName = "org.apache.hadoop.hive.jdbc.HiveDriver";
    // Edit this to point at your own Hive server host, port and database.
    private static String connectionString =
            "jdbc:hive://myserver.hp.com:10000/default";

    // Nothing in main throws HiveException, so only SQLException is declared.
    public static void main(String[] args) throws SQLException {

        if (args.length < 1) {
            System.err.println("Usage: java RunSQL \"<query>\"");
            System.exit(1);
        }
        String sqlToRun = args[0];

        // Load the Hive JDBC driver; this fails if the Hive jars are not on the classpath.
        try {
            Class.forName(driverName);
        } catch (ClassNotFoundException e) {
            e.printStackTrace();
            System.exit(1);
        }

        // try-with-resources (Java 7+) closes the connection and statement on exit.
        try (Connection con = DriverManager.getConnection(connectionString);
             Statement stmt = con.createStatement()) {
            System.out.println("Connected.");

            System.out.println("Running: " + sqlToRun);
            try (ResultSet res = stmt.executeQuery(sqlToRun)) {
                ResultSetMetaData meta = res.getMetaData();
                int numberOfColumns = meta.getColumnCount();

                System.out.println("Result:");
                while (res.next()) {
                    // JDBC column indexes are 1-based.
                    for (int i = 1; i <= numberOfColumns; i++) {
                        System.out.print("\t" + res.getString(i));
                    }
                    System.out.println();
                }
            }
        }

    }
}
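
If you would rather not edit the connectionString by hand each time, a small
helper along these lines could assemble it from a host, port and database name.
This is only a sketch: the HiveUrl class and its build method are made up for
illustration (they are not part of Hive), and the URL layout simply mirrors the
jdbc:hive://host:port/database string used above.

```java
// Hypothetical helper, not part of Hive: assembles the JDBC URL used by RunSQL.
class HiveUrl {

    // Assumes the URL layout from the sample above: jdbc:hive://host:port/database
    static String build(String host, int port, String database) {
        if (host == null || host.isEmpty()) {
            throw new IllegalArgumentException("host must not be empty");
        }
        if (port < 1 || port > 65535) {
            throw new IllegalArgumentException("port out of range: " + port);
        }
        return "jdbc:hive://" + host + ":" + port + "/" + database;
    }

    public static void main(String[] args) {
        // Example values only; substitute your own server details.
        System.out.println(build("myserver.hp.com", 10000, "default"));
    }
}
```

The returned string could then be passed to DriverManager.getConnection in
place of the hard-coded connectionString.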

From: Something Something [mailto:mailinglist...@gmail.com]
Sent: Monday, September 17, 2012 12:39 AM
To: hive-u...@hadoop.apache.org
Subject: Questions about Hive

Note:  I am a newbie to Hive.

Can someone please answer the following questions?

1)  Does Hive provide APIs (like HBase does) that can be used to retrieve data 
from the tables in Hive from a Java program?  I heard somewhere that the data 
can be accessed with JDBC (style) APIs.  True?

2)  I don't see how I can add indexes on the tables, so does that mean a query 
such as the following will trigger a MR job that will search files on HDFS 
sequentially?

hive> SELECT a.foo FROM invites a WHERE a.ds='2008-08-15';

3)  Has anyone compared the performance of Hive against NoSQL databases such 
as HBase or MongoDB?  I understand it's not exactly an apples-to-apples 
comparison, but still...

Thanks.
