[jira] Updated: (HIVE-802) Bug in DataNucleus prevents Hive from building if inside a dir with '+' in it

2010-09-08 Thread Carl Steinbach (JIRA)

 [ 
https://issues.apache.org/jira/browse/HIVE-802?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Carl Steinbach updated HIVE-802:


Attachment: datanucleus-core-1.1.2-patched.jar

> Bug in DataNucleus prevents Hive from building if inside a dir with '+' in it
> -
>
> Key: HIVE-802
> URL: https://issues.apache.org/jira/browse/HIVE-802
> Project: Hadoop Hive
>  Issue Type: Bug
>  Components: Build Infrastructure
>Affects Versions: 0.5.0
>Reporter: Todd Lipcon
>Assignee: Arvind Prabhakar
> Attachments: datanucleus-core-1.1.2-patched.jar
>
>
> There's a bug in DataNucleus that causes this issue:
> http://www.jpox.org/servlet/jira/browse/NUCCORE-371
> To reproduce, simply put your Hive source tree in a directory whose path 
> contains a '+' character.
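
The issue text does not say why DataNucleus trips over the '+'. One common cause of this class of bug in Java code is decoding a file URL with form-style URL decoding, which treats '+' as an encoded space. The sketch below only illustrates that general pitfall; whether NUCCORE-371 is exactly this is an assumption, and the class and method names are hypothetical.

```java
import java.io.UnsupportedEncodingException;
import java.net.URI;
import java.net.URLDecoder;

// Hypothetical demo class; not taken from DataNucleus or Hive.
class PlusPathDemo {

    // Form-style decoding: every '+' is turned into a space,
    // so the decoded path no longer matches the directory on disk.
    static String decodeAsForm(String path) {
        try {
            return URLDecoder.decode(path, "UTF-8");
        } catch (UnsupportedEncodingException e) {
            // UTF-8 is always available on a conforming JVM.
            throw new IllegalStateException(e);
        }
    }

    // URI-based decoding: only %-escapes are decoded, '+' is left alone.
    static String decodeAsUri(String fileUri) {
        return URI.create(fileUri).getPath();
    }

    public static void main(String[] args) {
        System.out.println(decodeAsForm("/home/user/c++/hive"));     // prints "/home/user/c  /hive"
        System.out.println(decodeAsUri("file:/home/user/c++/hive")); // prints "/home/user/c++/hive"
    }
}
```

If the failure does come from form-style decoding, going through java.net.URI (or java.io.File) instead would keep the '+' intact.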

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.



[jira] Updated: (HIVE-1622) Use CombineHiveInputFormat for the merge job if hive.merge.mapredfiles=true

2010-09-08 Thread Namit Jain (JIRA)

 [ 
https://issues.apache.org/jira/browse/HIVE-1622?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Namit Jain updated HIVE-1622:
-

   Status: Resolved  (was: Patch Available)
 Hadoop Flags: [Reviewed]
Fix Version/s: 0.7.0
   Resolution: Fixed

Committed. Thanks Ning

> Use CombineHiveInputFormat for the merge job if hive.merge.mapredfiles=true
> ---
>
> Key: HIVE-1622
> URL: https://issues.apache.org/jira/browse/HIVE-1622
> Project: Hadoop Hive
>  Issue Type: Improvement
>Reporter: Ning Zhang
>Assignee: Ning Zhang
> Fix For: 0.7.0
>
> Attachments: HIVE-1622.patch
>
>
> Currently map-only merge (using CombineHiveInputFormat) is only enabled for 
> merging files generated by mappers. It should be used for files generated by 
> reducers as well.




[jira] Updated: (HIVE-1624) Patch to allows scripts in S3 location

2010-09-08 Thread Vaibhav Aggarwal (JIRA)

 [ 
https://issues.apache.org/jira/browse/HIVE-1624?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Vaibhav Aggarwal updated HIVE-1624:
---

Attachment: HIVE-1624.patch

> Patch to allows scripts in S3 location
> --
>
> Key: HIVE-1624
> URL: https://issues.apache.org/jira/browse/HIVE-1624
> Project: Hadoop Hive
>  Issue Type: New Feature
>Reporter: Vaibhav Aggarwal
> Attachments: HIVE-1624.patch
>
>
> I want to submit a patch which allows users to run scripts located in S3.
> This patch enables Hive to download Hive scripts located in S3 buckets 
> and execute them. This saves users the effort of copying scripts to HDFS 
> before executing them.
> Thanks
> Vaibhav




[jira] Created: (HIVE-1624) Patch to allows scripts in S3 location

2010-09-08 Thread Vaibhav Aggarwal (JIRA)
Patch to allows scripts in S3 location
--

 Key: HIVE-1624
 URL: https://issues.apache.org/jira/browse/HIVE-1624
 Project: Hadoop Hive
  Issue Type: New Feature
Reporter: Vaibhav Aggarwal


I want to submit a patch which allows users to run scripts located in S3.

This patch enables Hive to download Hive scripts located in S3 buckets and 
execute them. This saves users the effort of copying scripts to HDFS before 
executing them.

Thanks
Vaibhav





[jira] Updated: (HIVE-1607) Reinstate and deprecate IMetaStoreClient methods removed in HIVE-675

2010-09-08 Thread Carl Steinbach (JIRA)

 [ 
https://issues.apache.org/jira/browse/HIVE-1607?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Carl Steinbach updated HIVE-1607:
-

Fix Version/s: 0.6.0

> Reinstate and deprecate IMetaStoreClient methods removed in HIVE-675
> 
>
> Key: HIVE-1607
> URL: https://issues.apache.org/jira/browse/HIVE-1607
> Project: Hadoop Hive
>  Issue Type: Bug
>  Components: Metastore
>Reporter: Carl Steinbach
>Assignee: Carl Steinbach
> Fix For: 0.6.0, 0.7.0
>
> Attachments: HIVE-1607.1.patch.txt, HIVE-1607.2.patch.txt
>
>
> Several methods were removed from the IMetaStoreClient interface as part of 
> HIVE-675:
> {code}
>   /**
>    * Drop the table.
>    *
>    * @param tableName
>    *          The table to drop
>    * @param deleteData
>    *          Should we delete the underlying data
>    * @throws MetaException
>    *           Could not drop table properly.
>    * @throws UnknownTableException
>    *           The table wasn't found.
>    * @throws TException
>    *           A thrift communication error occurred
>    * @throws NoSuchObjectException
>    *           The table wasn't found.
>    */
>   public void dropTable(String tableName, boolean deleteData)
>       throws MetaException, UnknownTableException, TException,
>       NoSuchObjectException;
>
>   /**
>    * Get a table object.
>    *
>    * @param tableName
>    *          Name of the table to fetch.
>    * @return An object representing the table.
>    * @throws MetaException
>    *           Could not fetch the table
>    * @throws TException
>    *           A thrift communication error occurred
>    * @throws NoSuchObjectException
>    *           In case the table wasn't found.
>    */
>   public Table getTable(String tableName) throws MetaException, TException,
>       NoSuchObjectException;
>
>   public boolean tableExists(String databaseName, String tableName)
>       throws MetaException, TException, UnknownDBException;
> {code}
> These methods should be reinstated with a deprecation warning.
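
As a rough illustration of "reinstated with a deprecation warning", the sketch below keeps an old single-namespace method and has it delegate to a database-aware replacement. All types here are simplified, hypothetical stand-ins: the three-argument dropTable signature and the "default" database name are assumptions, and the real interface declares Thrift and metastore exceptions that are left out.

```java
import java.util.HashSet;
import java.util.Set;

// Hypothetical, simplified stand-in for the metastore client interface.
interface MetaStoreClientSketch {
    String DEFAULT_DATABASE = "default"; // assumed name of the default database

    boolean tableExists(String databaseName, String tableName);

    // Assumed database-aware replacement for the removed method.
    void dropTable(String databaseName, String tableName, boolean deleteData);

    /**
     * Drop a table in the default database.
     *
     * @deprecated use {@link #dropTable(String, String, boolean)} and pass
     *             the database name explicitly.
     */
    @Deprecated
    void dropTable(String tableName, boolean deleteData);
}

// Toy in-memory implementation, only here to make the sketch runnable.
class InMemoryClient implements MetaStoreClientSketch {
    private final Set<String> tables = new HashSet<>();

    void addTable(String databaseName, String tableName) {
        tables.add(databaseName + "." + tableName);
    }

    @Override
    public boolean tableExists(String databaseName, String tableName) {
        return tables.contains(databaseName + "." + tableName);
    }

    @Override
    public void dropTable(String databaseName, String tableName, boolean deleteData) {
        // deleteData is ignored here because there is no underlying data.
        tables.remove(databaseName + "." + tableName);
    }

    @Override
    @Deprecated
    public void dropTable(String tableName, boolean deleteData) {
        // The reinstated method simply forwards to the new one.
        dropTable(DEFAULT_DATABASE, tableName, deleteData);
    }
}
```

With this shape, existing callers keep compiling but get a deprecation warning from javac that points them at the replacement.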




[jira] Created: (HIVE-1623) Factor out Hadoop version check logic in bin/hive scripts

2010-09-08 Thread Carl Steinbach (JIRA)
Factor out Hadoop version check logic in bin/hive scripts
-

 Key: HIVE-1623
 URL: https://issues.apache.org/jira/browse/HIVE-1623
 Project: Hadoop Hive
  Issue Type: Improvement
  Components: Clients
Reporter: Carl Steinbach


The same Hadoop version check logic is repeated in each of the following files:

bin/ext/hiveserver.sh
bin/ext/hwi.sh
bin/ext/metastore.sh
bin/ext/util/execHiveCmd.sh

This code should be refactored into a version check function.




[jira] Reopened: (HIVE-1607) Reinstate and deprecate IMetaStoreClient methods removed in HIVE-675

2010-09-08 Thread Carl Steinbach (JIRA)

 [ 
https://issues.apache.org/jira/browse/HIVE-1607?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Carl Steinbach reopened HIVE-1607:
--


Backport to 0.6.0

> Reinstate and deprecate IMetaStoreClient methods removed in HIVE-675
> 
>
> Key: HIVE-1607
> URL: https://issues.apache.org/jira/browse/HIVE-1607
> Project: Hadoop Hive
>  Issue Type: Bug
>  Components: Metastore
>Reporter: Carl Steinbach
>Assignee: Carl Steinbach
> Fix For: 0.7.0
>
> Attachments: HIVE-1607.1.patch.txt, HIVE-1607.2.patch.txt
>
>
> Several methods were removed from the IMetaStoreClient interface as part of 
> HIVE-675:
> {code}
>   /**
>    * Drop the table.
>    *
>    * @param tableName
>    *          The table to drop
>    * @param deleteData
>    *          Should we delete the underlying data
>    * @throws MetaException
>    *           Could not drop table properly.
>    * @throws UnknownTableException
>    *           The table wasn't found.
>    * @throws TException
>    *           A thrift communication error occurred
>    * @throws NoSuchObjectException
>    *           The table wasn't found.
>    */
>   public void dropTable(String tableName, boolean deleteData)
>       throws MetaException, UnknownTableException, TException,
>       NoSuchObjectException;
>
>   /**
>    * Get a table object.
>    *
>    * @param tableName
>    *          Name of the table to fetch.
>    * @return An object representing the table.
>    * @throws MetaException
>    *           Could not fetch the table
>    * @throws TException
>    *           A thrift communication error occurred
>    * @throws NoSuchObjectException
>    *           In case the table wasn't found.
>    */
>   public Table getTable(String tableName) throws MetaException, TException,
>       NoSuchObjectException;
>
>   public boolean tableExists(String databaseName, String tableName)
>       throws MetaException, TException, UnknownDBException;
> {code}
> These methods should be reinstated with a deprecation warning.




[jira] Commented: (HIVE-1613) hive --service jar looks for hadoop version but was not defined

2010-09-08 Thread Carl Steinbach (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-1613?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12907402#action_12907402
 ] 

Carl Steinbach commented on HIVE-1613:
--

+1 Looks good.


> hive --service jar looks for hadoop version but was not defined
> ---
>
> Key: HIVE-1613
> URL: https://issues.apache.org/jira/browse/HIVE-1613
> Project: Hadoop Hive
>  Issue Type: Bug
>  Components: Clients
>Affects Versions: 0.5.1
>Reporter: Edward Capriolo
>Assignee: Edward Capriolo
>Priority: Blocker
> Fix For: 0.6.0, 0.7.0
>
> Attachments: hive-1613.patch.txt
>
>
> hive --service jar fails. I have to open another ticket to clean up the 
> scripts and unify functions like version detection.




[jira] Updated: (HIVE-1613) hive --service jar looks for hadoop version but was not defined

2010-09-08 Thread Carl Steinbach (JIRA)

 [ 
https://issues.apache.org/jira/browse/HIVE-1613?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Carl Steinbach updated HIVE-1613:
-

Fix Version/s: 0.7.0

> hive --service jar looks for hadoop version but was not defined
> ---
>
> Key: HIVE-1613
> URL: https://issues.apache.org/jira/browse/HIVE-1613
> Project: Hadoop Hive
>  Issue Type: Bug
>  Components: Clients
>Affects Versions: 0.5.1
>Reporter: Edward Capriolo
>Assignee: Edward Capriolo
>Priority: Blocker
> Fix For: 0.6.0, 0.7.0
>
> Attachments: hive-1613.patch.txt
>
>
> hive --service jar fails. I have to open another ticket to clean up the 
> scripts and unify functions like version detection.




[jira] Updated: (HIVE-1615) Web Interface JSP needs Refactoring for removed meta store methods

2010-09-08 Thread Edward Capriolo (JIRA)

 [ 
https://issues.apache.org/jira/browse/HIVE-1615?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Edward Capriolo updated HIVE-1615:
--

Fix Version/s: 0.7.0
   (was: 0.6.0)
Affects Version/s: 0.6.0
   (was: 0.5.1)

> Web Interface JSP needs Refactoring for removed meta store methods
> --
>
> Key: HIVE-1615
> URL: https://issues.apache.org/jira/browse/HIVE-1615
> Project: Hadoop Hive
>  Issue Type: Bug
>  Components: Web UI
>Affects Versions: 0.6.0
>Reporter: Edward Capriolo
>Assignee: Edward Capriolo
>Priority: Blocker
> Fix For: 0.7.0
>
> Attachments: hive-1615.patch.2.txt, hive-1615.patch.txt
>
>
> Some metastore methods called from the JSPs have been removed. We really 
> should prioritize compiling the JSPs into servlet code again.




[jira] Commented: (HIVE-1622) Use CombineHiveInputFormat for the merge job if hive.merge.mapredfiles=true

2010-09-08 Thread Namit Jain (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-1622?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12907367#action_12907367
 ] 

Namit Jain commented on HIVE-1622:
--

+1

looks good

> Use CombineHiveInputFormat for the merge job if hive.merge.mapredfiles=true
> ---
>
> Key: HIVE-1622
> URL: https://issues.apache.org/jira/browse/HIVE-1622
> Project: Hadoop Hive
>  Issue Type: Improvement
>Reporter: Ning Zhang
>Assignee: Ning Zhang
> Attachments: HIVE-1622.patch
>
>
> Currently map-only merge (using CombineHiveInputFormat) is only enabled for 
> merging files generated by mappers. It should be used for files generated by 
> reducers as well.




[jira] Updated: (HIVE-1622) Use CombineHiveInputFormat for the merge job if hive.merge.mapredfiles=true

2010-09-08 Thread Ning Zhang (JIRA)

 [ 
https://issues.apache.org/jira/browse/HIVE-1622?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Ning Zhang updated HIVE-1622:
-

Status: Patch Available  (was: Open)

> Use CombineHiveInputFormat for the merge job if hive.merge.mapredfiles=true
> ---
>
> Key: HIVE-1622
> URL: https://issues.apache.org/jira/browse/HIVE-1622
> Project: Hadoop Hive
>  Issue Type: Improvement
>Reporter: Ning Zhang
>Assignee: Ning Zhang
> Attachments: HIVE-1622.patch
>
>
> Currently map-only merge (using CombineHiveInputFormat) is only enabled for 
> merging files generated by mappers. It should be used for files generated by 
> reducers as well.




[jira] Updated: (HIVE-1622) Use CombineHiveInputFormat for the merge job if hive.merge.mapredfiles=true

2010-09-08 Thread Ning Zhang (JIRA)

 [ 
https://issues.apache.org/jira/browse/HIVE-1622?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Ning Zhang updated HIVE-1622:
-

Attachment: HIVE-1622.patch

> Use CombineHiveInputFormat for the merge job if hive.merge.mapredfiles=true
> ---
>
> Key: HIVE-1622
> URL: https://issues.apache.org/jira/browse/HIVE-1622
> Project: Hadoop Hive
>  Issue Type: Improvement
>Reporter: Ning Zhang
>Assignee: Ning Zhang
> Attachments: HIVE-1622.patch
>
>
> Currently map-only merge (using CombineHiveInputFormat) is only enabled for 
> merging files generated by mappers. It should be used for files generated by 
> reducers as well.




[jira] Commented: (HIVE-675) add database/schema support Hive QL

2010-09-08 Thread John Sichi (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-675?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12907350#action_12907350
 ] 

John Sichi commented on HIVE-675:
-

I think a DEFAULT clause is needed for DB_LOCATION_URI since there's an 
existing row which needs to be filled in.  

The .jdo has allows-null=false, so the column definition should be NOT NULL.


> add database/schema support Hive QL
> ---
>
> Key: HIVE-675
> URL: https://issues.apache.org/jira/browse/HIVE-675
> Project: Hadoop Hive
>  Issue Type: New Feature
>  Components: Metastore, Query Processor
>Reporter: Prasad Chakka
>Assignee: Carl Steinbach
> Fix For: 0.6.0, 0.7.0
>
> Attachments: hive-675-2009-9-16.patch, hive-675-2009-9-19.patch, 
> hive-675-2009-9-21.patch, hive-675-2009-9-23.patch, hive-675-2009-9-7.patch, 
> hive-675-2009-9-8.patch, HIVE-675-2010-08-16.patch.txt, 
> HIVE-675-2010-7-16.patch.txt, HIVE-675-2010-8-4.patch.txt, 
> HIVE-675.10.patch.txt, HIVE-675.11.patch.txt, HIVE-675.12.patch.txt, 
> HIVE-675.13.patch.txt
>
>
> Currently all Hive tables reside in a single namespace (default). Hive should 
> support multiple namespaces (databases or schemas) so that users can create 
> tables in their own namespaces. These namespaces can have different 
> warehouse directories (with a default naming scheme) and possibly different 
> properties.
> There is already some support for this in the metastore, but the Hive query 
> parser should have this feature as well.




[jira] Commented: (HIVE-1620) Patch to write directly to S3 from Hive

2010-09-08 Thread Vaibhav Aggarwal (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-1620?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12907349#action_12907349
 ] 

Vaibhav Aggarwal commented on HIVE-1620:


This is the reason the patch uses the task ID instead of the attempt ID when 
writing to S3. Each process for a given task writes to the same file. In the 
case of S3, the last process to commit the file wins. 
Hadoop tasks are supposed to be idempotent, hence this should work.
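
A minimal sketch of the idea, not the code from the attached patch: it assumes the usual Hadoop attempt ID layout, attempt_<jobtracker>_<job>_<m|r>_<task>_<attempt>, and derives an output name that is identical for every attempt of the same task. The class and method names are hypothetical.

```java
// Hypothetical helper; not taken from HIVE-1620.patch.
class TaskIdSketch {
    private static final String ATTEMPT_PREFIX = "attempt_";

    // Strip the trailing attempt counter and swap the prefix, so that
    // "attempt_X_0" and "attempt_X_1" both map to "task_X".
    static String taskIdOf(String attemptId) {
        int lastUnderscore = attemptId.lastIndexOf('_');
        if (!attemptId.startsWith(ATTEMPT_PREFIX) || lastUnderscore <= ATTEMPT_PREFIX.length()) {
            throw new IllegalArgumentException("not an attempt id: " + attemptId);
        }
        return "task_" + attemptId.substring(ATTEMPT_PREFIX.length(), lastUnderscore);
    }

    // Every attempt of a task targets the same key, so a retried or
    // speculative attempt overwrites the file instead of duplicating it.
    static String outputKey(String tableLocation, String attemptId) {
        return tableLocation + "/" + taskIdOf(attemptId);
    }

    public static void main(String[] args) {
        // The bucket and path below are made up for illustration.
        System.out.println(outputKey("s3://bucket/warehouse/t", "attempt_201009081200_0001_m_000003_0"));
        System.out.println(outputKey("s3://bucket/warehouse/t", "attempt_201009081200_0001_m_000003_1"));
    }
}
```

Both calls in main produce the same key, which is what makes "last commit wins" safe as long as the attempts really are idempotent.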

> Patch to write directly to S3 from Hive
> ---
>
> Key: HIVE-1620
> URL: https://issues.apache.org/jira/browse/HIVE-1620
> Project: Hadoop Hive
>  Issue Type: New Feature
>Reporter: Vaibhav Aggarwal
>Assignee: Vaibhav Aggarwal
> Attachments: HIVE-1620.patch
>
>
> We want to submit a patch to Hive which allows users to write files directly 
> to S3.
> This patch allows users to specify an S3 location as the table output location 
> and hence eliminates the need to copy data from HDFS to S3.
> Users can run Hive queries directly over the data stored in S3.
> This patch helps integrate Hive with S3 better and more quickly.




[jira] Created: (HIVE-1622) Use CombineHiveInputFormat for the merge job if hive.merge.mapredfiles=true

2010-09-08 Thread Ning Zhang (JIRA)
Use CombineHiveInputFormat for the merge job if hive.merge.mapredfiles=true
---

 Key: HIVE-1622
 URL: https://issues.apache.org/jira/browse/HIVE-1622
 Project: Hadoop Hive
  Issue Type: Improvement
Reporter: Ning Zhang
Assignee: Ning Zhang


Currently map-only merge (using CombineHiveInputFormat) is only enabled for 
merging files generated by mappers. It should be used for files generated by 
reducers as well.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.



[jira] Updated: (HIVE-1616) Add ProtocolBuffersStructObjectInspector

2010-09-08 Thread Johan Oskarsson (JIRA)

 [ 
https://issues.apache.org/jira/browse/HIVE-1616?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Johan Oskarsson updated HIVE-1616:
--

Status: Patch Available  (was: Open)

> Add ProtocolBuffersStructObjectInspector
> 
>
> Key: HIVE-1616
> URL: https://issues.apache.org/jira/browse/HIVE-1616
> Project: Hadoop Hive
>  Issue Type: Improvement
>  Components: Serializers/Deserializers
>Reporter: Johan Oskarsson
>Assignee: Johan Oskarsson
>Priority: Minor
> Attachments: HIVE-1616.patch
>
>
> Much like there is a ThriftStructObjectInspector that ignores the isset 
> booleans, there is a need for a ProtocolBuffersStructObjectInspector that 
> ignores has*. This can then be used together with Twitter's elephant-bird.
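
The filtering described above can be sketched with plain reflection. This is not the code from HIVE-1616.patch: PersonMessage is a made-up stand-in for a generated protocol buffers class, and it is an assumption that the generated code keeps a hasX boolean next to each value field, as the description implies.

```java
import java.lang.reflect.Field;
import java.lang.reflect.Modifier;
import java.util.ArrayList;
import java.util.List;

// Hypothetical sketch; none of these names come from Hive or protobuf.
class HasFieldFilterSketch {

    // Made-up stand-in for a generated message class.
    static class PersonMessage {
        private boolean hasName;
        private String name_ = "";
        private boolean hasAge;
        private int age_;
    }

    // Ignore presence flags such as "hasName", but keep ordinary fields
    // that merely start with the letters "has" (for example "hash").
    static boolean shouldIgnoreField(String name) {
        return name.length() > 3
            && name.startsWith("has")
            && Character.isUpperCase(name.charAt(3));
    }

    // Collect the instance fields that would be exposed as struct fields.
    static List<String> structFieldNames(Class<?> type) {
        List<String> names = new ArrayList<>();
        for (Field field : type.getDeclaredFields()) {
            if (Modifier.isStatic(field.getModifiers()) || field.isSynthetic()) {
                continue; // skip constants and compiler-generated fields
            }
            if (!shouldIgnoreField(field.getName())) {
                names.add(field.getName());
            }
        }
        return names;
    }
}
```

For PersonMessage this leaves only name_ and age_; the order of the returned names follows getDeclaredFields and should not be relied on.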




[jira] Updated: (HIVE-1616) Add ProtocolBuffersStructObjectInspector

2010-09-08 Thread Johan Oskarsson (JIRA)

 [ 
https://issues.apache.org/jira/browse/HIVE-1616?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Johan Oskarsson updated HIVE-1616:
--

Attachment: (was: HIVE-1616.patch)

> Add ProtocolBuffersStructObjectInspector
> 
>
> Key: HIVE-1616
> URL: https://issues.apache.org/jira/browse/HIVE-1616
> Project: Hadoop Hive
>  Issue Type: Improvement
>  Components: Serializers/Deserializers
>Reporter: Johan Oskarsson
>Assignee: Johan Oskarsson
>Priority: Minor
> Attachments: HIVE-1616.patch
>
>
> Much like there is a ThriftStructObjectInspector that ignores the isset 
> booleans, there is a need for a ProtocolBuffersStructObjectInspector that 
> ignores has*. This can then be used together with Twitter's elephant-bird.




[jira] Updated: (HIVE-1616) Add ProtocolBuffersStructObjectInspector

2010-09-08 Thread Johan Oskarsson (JIRA)

 [ 
https://issues.apache.org/jira/browse/HIVE-1616?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Johan Oskarsson updated HIVE-1616:
--

Attachment: HIVE-1616.patch

Updated to include a unit test, based on the Thrift one. I had to generate a 
file using protocol buffers for the test. To avoid having to include the 
protocol buffers Java library just for the test, I modified the generated code. 
The test is still valid, but if we would feel more comfortable including the pb 
jar, that is also doable; it's under a BSD license.

> Add ProtocolBuffersStructObjectInspector
> 
>
> Key: HIVE-1616
> URL: https://issues.apache.org/jira/browse/HIVE-1616
> Project: Hadoop Hive
>  Issue Type: Improvement
>  Components: Serializers/Deserializers
>Reporter: Johan Oskarsson
>Assignee: Johan Oskarsson
>Priority: Minor
> Attachments: HIVE-1616.patch
>
>
> Much like there is a ThriftStructObjectInspector that ignores the isset 
> booleans, there is a need for a ProtocolBuffersStructObjectInspector that 
> ignores has*. This can then be used together with Twitter's elephant-bird.
