date:20131112


 [ 
https://issues.apache.org/jira/browse/HIVE-5677?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Navis updated HIVE-5677:


Description: 
NO PRECOMMIT TESTS

This is similar to HIVE-5085.

Beeline complains about files not existing if HIVE_OPTS are set.

In the Beeline commandline sh as well, we should see if setting HIVE_OPTS to '' 
 makes sense.

  was:
This is similar to HIVE-5085.

Beeline complains about files not existing if HIVE_OPTS are set.

In the Beeline commandline sh as well, we should see if setting HIVE_OPTS to '' 
 makes sense.


 Beeline warns about unavailable files if HIVE_OPTS is set
 -

 Key: HIVE-5677
 URL: https://issues.apache.org/jira/browse/HIVE-5677
 Project: Hive
  Issue Type: Bug
  Components: CLI
Affects Versions: 0.12.0
Reporter: Sushanth Sowmyan
 Attachments: HIVE-5677.1.patch.txt


 NO PRECOMMIT TESTS
 This is similar to HIVE-5085.
 Beeline complains about files not existing if HIVE_OPTS are set.
 In the Beeline commandline sh as well, we should see if setting HIVE_OPTS to 
 ''  makes sense.



--
This message was sent by Atlassian JIRA
(v6.1#6144)

[jira] [Assigned] (HIVE-5677) Beeline warns about unavailable files if HIVE_OPTS is set


 [ 
https://issues.apache.org/jira/browse/HIVE-5677?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Navis reassigned HIVE-5677:
---

Assignee: Navis

 Beeline warns about unavailable files if HIVE_OPTS is set
 -

 Key: HIVE-5677
 URL: https://issues.apache.org/jira/browse/HIVE-5677
 Project: Hive
  Issue Type: Bug
  Components: CLI
Affects Versions: 0.12.0
Reporter: Sushanth Sowmyan
Assignee: Navis
 Attachments: HIVE-5677.1.patch.txt


 NO PRECOMMIT TESTS
 This is similar to HIVE-5085.
 Beeline complains about files not existing if HIVE_OPTS are set.
 In the Beeline commandline sh as well, we should see if setting HIVE_OPTS to 
 ''  makes sense.



--
This message was sent by Atlassian JIRA
(v6.1#6144)

[jira] [Updated] (HIVE-5677) Beeline warns about unavailable files if HIVE_OPTS is set


 [ 
https://issues.apache.org/jira/browse/HIVE-5677?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Navis updated HIVE-5677:


Attachment: HIVE-5677.1.patch.txt

 Beeline warns about unavailable files if HIVE_OPTS is set
 -

 Key: HIVE-5677
 URL: https://issues.apache.org/jira/browse/HIVE-5677
 Project: Hive
  Issue Type: Bug
  Components: CLI
Affects Versions: 0.12.0
Reporter: Sushanth Sowmyan
 Attachments: HIVE-5677.1.patch.txt


 NO PRECOMMIT TESTS
 This is similar to HIVE-5085.
 Beeline complains about files not existing if HIVE_OPTS are set.
 In the Beeline commandline sh as well, we should see if setting HIVE_OPTS to 
 ''  makes sense.



--
This message was sent by Atlassian JIRA
(v6.1#6144)

[jira] [Updated] (HIVE-5677) Beeline warns about unavailable files if HIVE_OPTS is set


 [ 
https://issues.apache.org/jira/browse/HIVE-5677?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Navis updated HIVE-5677:


Status: Patch Available  (was: Open)

 Beeline warns about unavailable files if HIVE_OPTS is set
 -

 Key: HIVE-5677
 URL: https://issues.apache.org/jira/browse/HIVE-5677
 Project: Hive
  Issue Type: Bug
  Components: CLI
Affects Versions: 0.12.0
Reporter: Sushanth Sowmyan
Assignee: Navis
 Attachments: HIVE-5677.1.patch.txt


 NO PRECOMMIT TESTS
 This is similar to HIVE-5085.
 Beeline complains about files not existing if HIVE_OPTS are set.
 In the Beeline commandline sh as well, we should see if setting HIVE_OPTS to 
 ''  makes sense.



--
This message was sent by Atlassian JIRA
(v6.1#6144)

[jira] [Commented] (HIVE-5729) Beeline displays version as ???? after mavenization


[ 
https://issues.apache.org/jira/browse/HIVE-5729?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13820147#comment-13820147
 ] 

Brock Noland commented on HIVE-5729:


+1

 Beeline displays version as  after mavenization
 ---

 Key: HIVE-5729
 URL: https://issues.apache.org/jira/browse/HIVE-5729
 Project: Hive
  Issue Type: Bug
  Components: CLI
Affects Versions: 0.13.0
Reporter: Szehon Ho
Assignee: Navis
 Attachments: HIVE-5729.1.patch.txt


 NO PRECOMMIT TESTS
 In Beeline.java, method getApplicationTitle(), it looks to the Beeline 
 class's package to find version information.  However, MANIFESTs are not 
 included in Beeline jar after mavenization.



--
This message was sent by Atlassian JIRA
(v6.1#6144)

[jira] [Commented] (HIVE-5732) HiveServer2: Duplicated new OperationManager in SessionManager


[ 
https://issues.apache.org/jira/browse/HIVE-5732?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13820150#comment-13820150
 ] 

Brock Noland commented on HIVE-5732:


+1

 HiveServer2: Duplicated new OperationManager in SessionManager
 --

 Key: HIVE-5732
 URL: https://issues.apache.org/jira/browse/HIVE-5732
 Project: Hive
  Issue Type: Bug
Affects Versions: 0.12.0
Reporter: Schubert Zhang
Assignee: Navis
Priority: Trivial
 Attachments: HIVE-5732.1.patch.txt


 {noformat}
 public class SessionManager extends CompositeService {
   private static final Log LOG = LogFactory.getLog(CompositeService.class);
   private HiveConf hiveConf;
   private final MapSessionHandle, HiveSession handleToSession = new 
 HashMapSessionHandle, HiveSession();
   private OperationManager operationManager = new OperationManager();
   private static final Object sessionMapLock = new Object();
   private ExecutorService backgroundOperationPool;
   public SessionManager() {
 super(SessionManager);
   }
   @Override
   public synchronized void init(HiveConf hiveConf) {
 this.hiveConf = hiveConf;
 operationManager = new OperationManager();
 int backgroundPoolSize = 
 hiveConf.getIntVar(ConfVars.HIVE_SERVER2_ASYNC_EXEC_THREADS);
 LOG.info(HiveServer2: Async execution pool size + backgroundPoolSize);
 backgroundOperationPool = 
 Executors.newFixedThreadPool(backgroundPoolSize);
 addService(operationManager);
 super.init(hiveConf);
   }
 {noformat}



--
This message was sent by Atlassian JIRA
(v6.1#6144)

[jira] [Updated] (HIVE-5732) HiveServer2: Duplicated new OperationManager in SessionManager


 [ 
https://issues.apache.org/jira/browse/HIVE-5732?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Brock Noland updated HIVE-5732:
---

Description: 
{noformat}
public class SessionManager extends CompositeService {
  private static final Log LOG = LogFactory.getLog(CompositeService.class);
  private HiveConf hiveConf;
  private final MapSessionHandle, HiveSession handleToSession = new 
HashMapSessionHandle, HiveSession();
  private OperationManager operationManager = new OperationManager();
  private static final Object sessionMapLock = new Object();
  private ExecutorService backgroundOperationPool;

  public SessionManager() {
super(SessionManager);
  }

  @Override
  public synchronized void init(HiveConf hiveConf) {
this.hiveConf = hiveConf;
operationManager = new OperationManager();
int backgroundPoolSize = 
hiveConf.getIntVar(ConfVars.HIVE_SERVER2_ASYNC_EXEC_THREADS);
LOG.info(HiveServer2: Async execution pool size + backgroundPoolSize);
backgroundOperationPool = Executors.newFixedThreadPool(backgroundPoolSize);
addService(operationManager);
super.init(hiveConf);
  }
{noformat}

  was:
public class SessionManager extends CompositeService {
  private static final Log LOG = LogFactory.getLog(CompositeService.class);
  private HiveConf hiveConf;
  private final MapSessionHandle, HiveSession handleToSession = new 
HashMapSessionHandle, HiveSession();
  private OperationManager operationManager = new OperationManager();
  private static final Object sessionMapLock = new Object();
  private ExecutorService backgroundOperationPool;

  public SessionManager() {
super(SessionManager);
  }

  @Override
  public synchronized void init(HiveConf hiveConf) {
this.hiveConf = hiveConf;
operationManager = new OperationManager();
int backgroundPoolSize = 
hiveConf.getIntVar(ConfVars.HIVE_SERVER2_ASYNC_EXEC_THREADS);
LOG.info(HiveServer2: Async execution pool size + backgroundPoolSize);
backgroundOperationPool = Executors.newFixedThreadPool(backgroundPoolSize);
addService(operationManager);
super.init(hiveConf);
  }


 HiveServer2: Duplicated new OperationManager in SessionManager
 --

 Key: HIVE-5732
 URL: https://issues.apache.org/jira/browse/HIVE-5732
 Project: Hive
  Issue Type: Bug
Affects Versions: 0.12.0
Reporter: Schubert Zhang
Assignee: Navis
Priority: Trivial
 Attachments: HIVE-5732.1.patch.txt


 {noformat}
 public class SessionManager extends CompositeService {
   private static final Log LOG = LogFactory.getLog(CompositeService.class);
   private HiveConf hiveConf;
   private final MapSessionHandle, HiveSession handleToSession = new 
 HashMapSessionHandle, HiveSession();
   private OperationManager operationManager = new OperationManager();
   private static final Object sessionMapLock = new Object();
   private ExecutorService backgroundOperationPool;
   public SessionManager() {
 super(SessionManager);
   }
   @Override
   public synchronized void init(HiveConf hiveConf) {
 this.hiveConf = hiveConf;
 operationManager = new OperationManager();
 int backgroundPoolSize = 
 hiveConf.getIntVar(ConfVars.HIVE_SERVER2_ASYNC_EXEC_THREADS);
 LOG.info(HiveServer2: Async execution pool size + backgroundPoolSize);
 backgroundOperationPool = 
 Executors.newFixedThreadPool(backgroundPoolSize);
 addService(operationManager);
 super.init(hiveConf);
   }
 {noformat}



--
This message was sent by Atlassian JIRA
(v6.1#6144)

[jira] [Assigned] (HIVE-5783) Native Parquet Support in Hive

2013-11-12 Thread Justin Coffey (JIRA)


 [ 
https://issues.apache.org/jira/browse/HIVE-5783?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Justin Coffey reassigned HIVE-5783:
---

Assignee: Justin Coffey

 Native Parquet Support in Hive
 --

 Key: HIVE-5783
 URL: https://issues.apache.org/jira/browse/HIVE-5783
 Project: Hive
  Issue Type: New Feature
Reporter: Justin Coffey
Assignee: Justin Coffey
Priority: Minor

 Problem Statement:
 Hive would be easier to use if it had native Parquet support. Our 
 organization, Criteo, uses Hive extensively. Therefore we built the Parquet 
 Hive integration and would like to now contribute that integration to Hive.
 About Parquet:
 Parquet is a columnar storage format for Hadoop and integrates with many 
 Hadoop ecosystem tools such as Thrift, Avro, Hadoop MapReduce, Cascading, 
 Pig, Drill, Crunch, and Hive. Pig, Crunch, and Drill all contain native 
 Parquet integration.
 Changes Details:
 Parquet was built with dependency management in mind and therefore only a 
 single Parquet jar will be added as a dependency.



--
This message was sent by Atlassian JIRA
(v6.1#6144)

[jira] [Commented] (HIVE-5783) Native Parquet Support in Hive

2013-11-12 Thread Justin Coffey (JIRA)


[ 
https://issues.apache.org/jira/browse/HIVE-5783?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13820168#comment-13820168
 ] 

Justin Coffey commented on HIVE-5783:
-

Thanks [~cwsteinbach] and [~ehans].  Regarding vectorization support the 
parquet team will review ASAP!

 Native Parquet Support in Hive
 --

 Key: HIVE-5783
 URL: https://issues.apache.org/jira/browse/HIVE-5783
 Project: Hive
  Issue Type: New Feature
Reporter: Justin Coffey
Assignee: Justin Coffey
Priority: Minor

 Problem Statement:
 Hive would be easier to use if it had native Parquet support. Our 
 organization, Criteo, uses Hive extensively. Therefore we built the Parquet 
 Hive integration and would like to now contribute that integration to Hive.
 About Parquet:
 Parquet is a columnar storage format for Hadoop and integrates with many 
 Hadoop ecosystem tools such as Thrift, Avro, Hadoop MapReduce, Cascading, 
 Pig, Drill, Crunch, and Hive. Pig, Crunch, and Drill all contain native 
 Parquet integration.
 Changes Details:
 Parquet was built with dependency management in mind and therefore only a 
 single Parquet jar will be added as a dependency.



--
This message was sent by Atlassian JIRA
(v6.1#6144)

[jira] [Commented] (HIVE-5677) Beeline warns about unavailable files if HIVE_OPTS is set


[ 
https://issues.apache.org/jira/browse/HIVE-5677?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13820188#comment-13820188
 ] 

Xuefu Zhang commented on HIVE-5677:
---

Just curious, with the changes in the patch, are we still able to debug beeline 
process remotely?

 Beeline warns about unavailable files if HIVE_OPTS is set
 -

 Key: HIVE-5677
 URL: https://issues.apache.org/jira/browse/HIVE-5677
 Project: Hive
  Issue Type: Bug
  Components: CLI
Affects Versions: 0.12.0
Reporter: Sushanth Sowmyan
Assignee: Navis
 Attachments: HIVE-5677.1.patch.txt


 NO PRECOMMIT TESTS
 This is similar to HIVE-5085.
 Beeline complains about files not existing if HIVE_OPTS are set.
 In the Beeline commandline sh as well, we should see if setting HIVE_OPTS to 
 ''  makes sense.



--
This message was sent by Atlassian JIRA
(v6.1#6144)

[jira] [Created] (HIVE-5801) Support for reader/writer of ORC format for R environment

2013-11-12 Thread Michael Hausenblas (JIRA)

Michael Hausenblas created HIVE-5801:


 Summary: Support for reader/writer of ORC format for R environment
 Key: HIVE-5801
 URL: https://issues.apache.org/jira/browse/HIVE-5801
 Project: Hive
  Issue Type: Improvement
Reporter: Michael Hausenblas
Priority: Minor


It would be great if the ORC format would directly be accessible from R [1], 
that is, providing reader/writer for it.

[1] http://www.r-project.org/



--
This message was sent by Atlassian JIRA
(v6.1#6144)

[jira] [Commented] (HIVE-5801) Support for reader/writer of ORC format for R environment

2013-11-12 Thread Carter Shanklin (JIRA)


[ 
https://issues.apache.org/jira/browse/HIVE-5801?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13820282#comment-13820282
 ] 

Carter Shanklin commented on HIVE-5801:
---

Hi [~mhausenblas], would it make sense to integrate at the HCatalog layer? It's 
not as optimized as reading directly but does abstract underlying format.

 Support for reader/writer of ORC format for R environment
 -

 Key: HIVE-5801
 URL: https://issues.apache.org/jira/browse/HIVE-5801
 Project: Hive
  Issue Type: Improvement
Reporter: Michael Hausenblas
Priority: Minor

 It would be great if the ORC format would directly be accessible from R [1], 
 that is, providing reader/writer for it.
 [1] http://www.r-project.org/



--
This message was sent by Atlassian JIRA
(v6.1#6144)

[jira] [Updated] (HIVE-5356) Move arithmatic UDFs to generic UDF implementations


 [ 
https://issues.apache.org/jira/browse/HIVE-5356?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Xuefu Zhang updated HIVE-5356:
--

Attachment: HIVE-5356.6.patch

Patch #6 is equivalent to #5 with some minor refactoring.

 Move arithmatic UDFs to generic UDF implementations
 ---

 Key: HIVE-5356
 URL: https://issues.apache.org/jira/browse/HIVE-5356
 Project: Hive
  Issue Type: Task
  Components: UDF
Affects Versions: 0.11.0
Reporter: Xuefu Zhang
Assignee: Xuefu Zhang
 Fix For: 0.13.0

 Attachments: HIVE-5356.1.patch, HIVE-5356.2.patch, HIVE-5356.3.patch, 
 HIVE-5356.4.patch, HIVE-5356.5.patch, HIVE-5356.6.patch, HIVE-5356.patch


 Currently, all of the arithmetic operators, such as add/sub/mult/div, are 
 implemented as old-style UDFs and java reflection is used to determine the 
 return type TypeInfos/ObjectInspectors, based on the return type of the 
 evaluate() method chosen for the expression. This works fine for types that 
 don't have type params.
 Hive decimal type participates in these operations just like int or double. 
 Different from double or int, however, decimal has precision and scale, which 
 cannot be determined by just looking at the return type (decimal) of the UDF 
 evaluate() method, even though the operands have certain precision/scale. 
 With the default of decimal without precision/scale, then (10, 0) will be 
 the type params. This is certainly not desirable.
 To solve this problem, all of the arithmetic operators would need to be 
 implemented as GenericUDFs, which allow returning ObjectInspector during the 
 initialize() method. The object inspectors returned can carry type params, 
 from which the exact return type can be determined.
 It's worth mentioning that, for user UDF implemented in non-generic way, if 
 the return type of the chosen evaluate() method is decimal, the return type 
 actually has (10,0) as precision/scale, which might not be desirable. This 
 needs to be documented.
 This JIRA will cover minus, plus, divide, multiply, mod, and pmod, to limit 
 the scope of review. The remaining ones will be covered under HIVE-5706.



--
This message was sent by Atlassian JIRA
(v6.1#6144)

[jira] [Updated] (HIVE-5632) Eliminate splits based on SARGs using stripe statistics in ORC


 [ 
https://issues.apache.org/jira/browse/HIVE-5632?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Gunther Hagleitner updated HIVE-5632:
-

Status: Open  (was: Patch Available)

 Eliminate splits based on SARGs using stripe statistics in ORC
 --

 Key: HIVE-5632
 URL: https://issues.apache.org/jira/browse/HIVE-5632
 Project: Hive
  Issue Type: Improvement
Affects Versions: 0.13.0
Reporter: Prasanth J
Assignee: Prasanth J
  Labels: orcfile
 Attachments: HIVE-5632.1.patch.txt, HIVE-5632.2.patch.txt, 
 HIVE-5632.3.patch.txt, HIVE-5632.4.patch, HIVE-5632.5.patch.txt, 
 orc_split_elim.orc


 HIVE-5562 provides stripe level statistics in ORC. Stripe level statistics 
 combined with predicate pushdown in ORC (HIVE-4246) can be used to eliminate 
 the stripes (thereby splits) that doesn't satisfy the predicate condition. 
 This can greatly reduce unnecessary reads.



--
This message was sent by Atlassian JIRA
(v6.1#6144)

[jira] [Updated] (HIVE-5632) Eliminate splits based on SARGs using stripe statistics in ORC


 [ 
https://issues.apache.org/jira/browse/HIVE-5632?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Gunther Hagleitner updated HIVE-5632:
-

Status: Patch Available  (was: Open)

 Eliminate splits based on SARGs using stripe statistics in ORC
 --

 Key: HIVE-5632
 URL: https://issues.apache.org/jira/browse/HIVE-5632
 Project: Hive
  Issue Type: Improvement
Affects Versions: 0.13.0
Reporter: Prasanth J
Assignee: Prasanth J
  Labels: orcfile
 Attachments: HIVE-5632.1.patch.txt, HIVE-5632.2.patch.txt, 
 HIVE-5632.3.patch.txt, HIVE-5632.4.patch, HIVE-5632.5.patch, 
 HIVE-5632.5.patch.txt, orc_split_elim.orc


 HIVE-5562 provides stripe level statistics in ORC. Stripe level statistics 
 combined with predicate pushdown in ORC (HIVE-4246) can be used to eliminate 
 the stripes (thereby splits) that doesn't satisfy the predicate condition. 
 This can greatly reduce unnecessary reads.



--
This message was sent by Atlassian JIRA
(v6.1#6144)

[jira] [Updated] (HIVE-5632) Eliminate splits based on SARGs using stripe statistics in ORC


 [ 
https://issues.apache.org/jira/browse/HIVE-5632?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Gunther Hagleitner updated HIVE-5632:
-

Attachment: HIVE-5632.5.patch

Reupload .5 for precommit test.

 Eliminate splits based on SARGs using stripe statistics in ORC
 --

 Key: HIVE-5632
 URL: https://issues.apache.org/jira/browse/HIVE-5632
 Project: Hive
  Issue Type: Improvement
Affects Versions: 0.13.0
Reporter: Prasanth J
Assignee: Prasanth J
  Labels: orcfile
 Attachments: HIVE-5632.1.patch.txt, HIVE-5632.2.patch.txt, 
 HIVE-5632.3.patch.txt, HIVE-5632.4.patch, HIVE-5632.5.patch, 
 HIVE-5632.5.patch.txt, orc_split_elim.orc


 HIVE-5562 provides stripe level statistics in ORC. Stripe level statistics 
 combined with predicate pushdown in ORC (HIVE-4246) can be used to eliminate 
 the stripes (thereby splits) that doesn't satisfy the predicate condition. 
 This can greatly reduce unnecessary reads.



--
This message was sent by Atlassian JIRA
(v6.1#6144)

[jira] [Commented] (HIVE-5217) Add long polling to asynchronous execution in HiveServer2

2013-11-12 Thread Carl Steinbach (JIRA)


[ 
https://issues.apache.org/jira/browse/HIVE-5217?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13820372#comment-13820372
 ] 

Carl Steinbach commented on HIVE-5217:
--

[~vaibhavgumashta] Is the patch ready for review?

 Add long polling to asynchronous execution in HiveServer2
 -

 Key: HIVE-5217
 URL: https://issues.apache.org/jira/browse/HIVE-5217
 Project: Hive
  Issue Type: Improvement
  Components: HiveServer2
Affects Versions: 0.13.0
Reporter: Vaibhav Gumashta
Assignee: Vaibhav Gumashta
 Fix For: 0.13.0

 Attachments: HIVE-5217.2.patch, HIVE-5217.3.patch, 
 HIVE-5217.D12801.2.patch, HIVE-5217.D12801.3.patch, HIVE-5217.D12801.4.patch, 
 HIVE-5217.D12801.5.patch, HIVE-5217.D12801.6.patch


 [HIVE-4617|https://issues.apache.org/jira/browse/HIVE-4617] provides support 
 for async execution in HS2. The client gets an operation handle which it can 
 poll to check on the operation status. However, the polling frequency is 
 entirely left to the client which can be resource inefficient. Long polling 
 will solve this, by blocking the client request to check the operation status 
 for a configurable amount of time (a new HS2 config) if the data is not 
 available, but responding immediately if the data is available.



--
This message was sent by Atlassian JIRA
(v6.1#6144)

[jira] [Commented] (HIVE-5732) HiveServer2: Duplicated new OperationManager in SessionManager


[ 
https://issues.apache.org/jira/browse/HIVE-5732?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13820380#comment-13820380
 ] 

Hive QA commented on HIVE-5732:
---



{color:green}Overall{color}: +1 all checks pass

Here are the results of testing the latest attachment:
https://issues.apache.org/jira/secure/attachment/12613329/HIVE-5732.1.patch.txt

{color:green}SUCCESS:{color} +1 4602 tests passed

Test results: 
http://bigtop01.cloudera.org:8080/job/PreCommit-HIVE-Build/244/testReport
Console output: 
http://bigtop01.cloudera.org:8080/job/PreCommit-HIVE-Build/244/console

Messages:
{noformat}
Executing org.apache.hive.ptest.execution.PrepPhase
Executing org.apache.hive.ptest.execution.ExecutionPhase
Executing org.apache.hive.ptest.execution.ReportingPhase
{noformat}

This message is automatically generated.

ATTACHMENT ID: 12613329

 HiveServer2: Duplicated new OperationManager in SessionManager
 --

 Key: HIVE-5732
 URL: https://issues.apache.org/jira/browse/HIVE-5732
 Project: Hive
  Issue Type: Bug
Affects Versions: 0.12.0
Reporter: Schubert Zhang
Assignee: Navis
Priority: Trivial
 Attachments: HIVE-5732.1.patch.txt


 {noformat}
 public class SessionManager extends CompositeService {
   private static final Log LOG = LogFactory.getLog(CompositeService.class);
   private HiveConf hiveConf;
   private final MapSessionHandle, HiveSession handleToSession = new 
 HashMapSessionHandle, HiveSession();
   private OperationManager operationManager = new OperationManager();
   private static final Object sessionMapLock = new Object();
   private ExecutorService backgroundOperationPool;
   public SessionManager() {
 super(SessionManager);
   }
   @Override
   public synchronized void init(HiveConf hiveConf) {
 this.hiveConf = hiveConf;
 operationManager = new OperationManager();
 int backgroundPoolSize = 
 hiveConf.getIntVar(ConfVars.HIVE_SERVER2_ASYNC_EXEC_THREADS);
 LOG.info(HiveServer2: Async execution pool size + backgroundPoolSize);
 backgroundOperationPool = 
 Executors.newFixedThreadPool(backgroundPoolSize);
 addService(operationManager);
 super.init(hiveConf);
   }
 {noformat}



--
This message was sent by Atlassian JIRA
(v6.1#6144)

[jira] [Updated] (HIVE-4723) DDLSemanticAnalyzer.addTablePartsOutputs eats several exceptions


 [ 
https://issues.apache.org/jira/browse/HIVE-4723?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Szehon Ho updated HIVE-4723:


Attachment: HIVE-4723.3.patch

Attaching the same patch again to trigger testing.  I believe the pre-commit 
test of the previous patch got lost in the queue the build moved to bigtop.

 DDLSemanticAnalyzer.addTablePartsOutputs eats several exceptions
 

 Key: HIVE-4723
 URL: https://issues.apache.org/jira/browse/HIVE-4723
 Project: Hive
  Issue Type: Bug
Affects Versions: 0.12.0
Reporter: Brock Noland
Assignee: Szehon Ho
 Attachments: HIVE-4723.1.patch, HIVE-4723.2.patch, HIVE-4723.3.patch, 
 HIVE-4723.patch


 I accidently tried to archive a partition on a non-partitioned table. The 
 error message was bad, hive ate an exception, and NPE'ed.
 {noformat}
 2013-06-09 16:36:12,628 ERROR parse.DDLSemanticAnalyzer 
 (DDLSemanticAnalyzer.java:addTablePartsOutputs(2899)) - Got HiveException 
 during obtaining list of partitions
 2013-06-09 16:36:12,628 ERROR ql.Driver (SessionState.java:printError(383)) - 
 FAILED: NullPointerException null
 java.lang.NullPointerException
   at 
 org.apache.hadoop.hive.ql.parse.DDLSemanticAnalyzer.addTablePartsOutputs(DDLSemanticAnalyzer.java:2912)
   at 
 org.apache.hadoop.hive.ql.parse.DDLSemanticAnalyzer.addTablePartsOutputs(DDLSemanticAnalyzer.java:2877)
   at 
 org.apache.hadoop.hive.ql.parse.DDLSemanticAnalyzer.analyzeAlterTableArchive(DDLSemanticAnalyzer.java:2730)
   at 
 org.apache.hadoop.hive.ql.parse.DDLSemanticAnalyzer.analyzeInternal(DDLSemanticAnalyzer.java:316)
   at 
 org.apache.hadoop.hive.ql.parse.BaseSemanticAnalyzer.analyze(BaseSemanticAnalyzer.java:277)
   at org.apache.hadoop.hive.ql.Driver.compile(Driver.java:433)
   at org.apache.hadoop.hive.ql.Driver.compile(Driver.java:337)
   at org.apache.hadoop.hive.ql.Driver.run(Driver.java:902)
   at 
 org.apache.hadoop.hive.cli.CliDriver.processLocalCmd(CliDriver.java:259)
   at org.apache.hadoop.hive.cli.CliDriver.processCmd(CliDriver.java:216)
   at org.apache.hadoop.hive.cli.CliDriver.processLine(CliDriver.java:413)
   at 
 org.apache.hadoop.hive.cli.CliDriver.executeDriver(CliDriver.java:782)
   at org.apache.hadoop.hive.cli.CliDriver.run(CliDriver.java:675)
   at org.apache.hadoop.hive.cli.CliDriver.main(CliDriver.java:614)
   at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
   at 
 sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:39)
   at 
 sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:25)
   at java.lang.reflect.Method.invoke(Method.java:597)
   at org.apache.hadoop.util.RunJar.main(RunJar.java:156)
 {noformat}



--
This message was sent by Atlassian JIRA
(v6.1#6144)

[jira] [Commented] (HIVE-4116) Can't use views using map datatype.


[ 
https://issues.apache.org/jira/browse/HIVE-4116?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13820405#comment-13820405
 ] 

Hive QA commented on HIVE-4116:
---



{color:green}Overall{color}: +1 all checks pass

Here are the results of testing the latest attachment:
https://issues.apache.org/jira/secure/attachment/12613330/HIVE-4116.2.patch.txt

{color:green}SUCCESS:{color} +1 4602 tests passed

Test results: 
http://bigtop01.cloudera.org:8080/job/PreCommit-HIVE-Build/246/testReport
Console output: 
http://bigtop01.cloudera.org:8080/job/PreCommit-HIVE-Build/246/console

Messages:
{noformat}
Executing org.apache.hive.ptest.execution.PrepPhase
Executing org.apache.hive.ptest.execution.ExecutionPhase
Executing org.apache.hive.ptest.execution.ReportingPhase
{noformat}

This message is automatically generated.

ATTACHMENT ID: 12613330

 Can't use views using map datatype.
 ---

 Key: HIVE-4116
 URL: https://issues.apache.org/jira/browse/HIVE-4116
 Project: Hive
  Issue Type: Bug
Affects Versions: 0.8.1, 0.10.0, 0.11.0
Reporter: Karel Vervaeke
Assignee: Navis
 Attachments: D12975.1.patch, HIVE-4116.2.patch.txt


 Executing the following 
 {noformat}
 DROP TABLE IF EXISTS `items`;
 CREATE TABLE IF NOT EXISTS `items` (id INT, name STRING, info 
 MAPSTRING,STRING) PARTITIONED BY (ds STRING);
 DROP VIEW IF EXISTS `priceview`;
 CREATE VIEW `priceview` AS
 SELECT
 `items`.`id`,
 `items`.info['price']
 FROM
 `items`
 ;
 select * from `priceview`;
 {noformat}
 Produces the following error:
 {noformat}
 karel@tomato:~/tmp$ $HIVE_HOME/bin/hive -f hivebug.sql
 WARNING: org.apache.hadoop.metrics.jvm.EventCounter is deprecated. Please use 
 org.apache.hadoop.log.metrics.EventCounter in all the log4j.properties files.
 Logging initialized using configuration in 
 jar:file:/home/karel/opt/hive-0.10.0-bin/lib/hive-common-0.10.0.jar!/hive-log4j.properties
 Hive history file=/tmp/karel/hive_job_log_karel_201303051117_945318761.txt
 SLF4J: Class path contains multiple SLF4J bindings.
 SLF4J: Found binding in 
 [jar:file:/home/karel/opt/hadoop-2.0.0-mr1-cdh4.0.0/lib/slf4j-log4j12-1.6.1.jar!/org/slf4j/impl/StaticLoggerBinder.class]
 SLF4J: Found binding in 
 [jar:file:/home/karel/opt/hive-0.10.0-bin/lib/slf4j-log4j12-1.6.1.jar!/org/slf4j/impl/StaticLoggerBinder.class]
 SLF4J: See http://www.slf4j.org/codes.html#multiple_bindings for an 
 explanation.
 OK
 Time taken: 5.449 seconds
 OK
 Time taken: 0.303 seconds
 OK
 Time taken: 0.131 seconds
 OK
 Time taken: 0.206 seconds
 FAILED: SemanticException line 3:22 mismatched input '.' expecting FROM near 
 '`items`' in from clause
  in definition of VIEW priceview [
 SELECT
 `items`.`id`,
 `items``items`.`info`info['price']
 FROM
 `default`.`items`
 ] used as priceview at Line 3:14
 {noformat}
 Unless I'm not using the right syntax, I would expect this simple example to 
 work. I have tried some variations (quotes, no quotes, ...), to no avail.



--
This message was sent by Atlassian JIRA
(v6.1#6144)

[jira] [Commented] (HIVE-4388) Upgrade HBase to 0.96


[ 
https://issues.apache.org/jira/browse/HIVE-4388?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13820414#comment-13820414
 ] 

Brock Noland commented on HIVE-4388:


This is blocking HIVE-5755 Fix hadoop2 execution environment. It'd be great if 
someone could review! Thanks!!

 Upgrade HBase to 0.96
 -

 Key: HIVE-4388
 URL: https://issues.apache.org/jira/browse/HIVE-4388
 Project: Hive
  Issue Type: Bug
  Components: HBase Handler
Reporter: Gunther Hagleitner
Assignee: Brock Noland
 Attachments: HIVE-4388-wip.txt, HIVE-4388.10.patch, 
 HIVE-4388.11.patch, HIVE-4388.12.patch, HIVE-4388.13.patch, 
 HIVE-4388.14.patch, HIVE-4388.15.patch, HIVE-4388.15.patch, 
 HIVE-4388.16.patch, HIVE-4388.17.patch, HIVE-4388.patch, HIVE-4388.patch, 
 HIVE-4388.patch, HIVE-4388.patch, HIVE-4388.patch, HIVE-4388.patch, 
 HIVE-4388.patch, HIVE-4388.patch


 Currently we're building by default against 0.92. When you run against hadoop 
 2 (-Dhadoop.mr.rev=23) builds fail because of: HBASE-5963.
 HIVE-3861 upgrades the version of hbase used. This will get you past the 
 problem in HBASE-5963 (which was fixed in 0.94.1) but fails with: HBASE-6396.



--
This message was sent by Atlassian JIRA
(v6.1#6144)

[jira] [Updated] (HIVE-5614) Subquery support: allow subquery expressions in having clause


 [ 
https://issues.apache.org/jira/browse/HIVE-5614?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Harish Butani updated HIVE-5614:


Attachment: HIVE-5614.1.patch

 Subquery support: allow subquery expressions in having clause
 -

 Key: HIVE-5614
 URL: https://issues.apache.org/jira/browse/HIVE-5614
 Project: Hive
  Issue Type: New Feature
  Components: Query Processor
Reporter: Harish Butani
Assignee: Harish Butani
 Attachments: HIVE-5614.1.patch






--
This message was sent by Atlassian JIRA
(v6.1#6144)

[jira] [Commented] (HIVE-5614) Subquery support: allow subquery expressions in having clause


[ 
https://issues.apache.org/jira/browse/HIVE-5614?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13820417#comment-13820417
 ] 

Harish Butani commented on HIVE-5614:
-

uploaded preliminary patch.
Includes changes for HIVE-3107

 Subquery support: allow subquery expressions in having clause
 -

 Key: HIVE-5614
 URL: https://issues.apache.org/jira/browse/HIVE-5614
 Project: Hive
  Issue Type: New Feature
  Components: Query Processor
Reporter: Harish Butani
Assignee: Harish Butani
 Attachments: HIVE-5614.1.patch






--
This message was sent by Atlassian JIRA
(v6.1#6144)

[jira] [Assigned] (HIVE-5717) Generate javadoc and source jars


 [ 
https://issues.apache.org/jira/browse/HIVE-5717?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Szehon Ho reassigned HIVE-5717:
---

Assignee: Szehon Ho

 Generate javadoc and source jars
 

 Key: HIVE-5717
 URL: https://issues.apache.org/jira/browse/HIVE-5717
 Project: Hive
  Issue Type: Sub-task
Reporter: Brock Noland
Assignee: Szehon Ho

 We should be generating both javadoc and source jars.



--
This message was sent by Atlassian JIRA
(v6.1#6144)

[jira] [Updated] (HIVE-4116) Can't use views using map datatype.


 [ 
https://issues.apache.org/jira/browse/HIVE-4116?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Ashutosh Chauhan updated HIVE-4116:
---

   Resolution: Fixed
Fix Version/s: 0.13.0
   Status: Resolved  (was: Patch Available)

Committed to trunk. Thanks, Navis!

 Can't use views using map datatype.
 ---

 Key: HIVE-4116
 URL: https://issues.apache.org/jira/browse/HIVE-4116
 Project: Hive
  Issue Type: Bug
Affects Versions: 0.8.1, 0.10.0, 0.11.0
Reporter: Karel Vervaeke
Assignee: Navis
 Fix For: 0.13.0

 Attachments: D12975.1.patch, HIVE-4116.2.patch.txt


 Executing the following 
 {noformat}
 DROP TABLE IF EXISTS `items`;
 CREATE TABLE IF NOT EXISTS `items` (id INT, name STRING, info 
 MAPSTRING,STRING) PARTITIONED BY (ds STRING);
 DROP VIEW IF EXISTS `priceview`;
 CREATE VIEW `priceview` AS
 SELECT
 `items`.`id`,
 `items`.info['price']
 FROM
 `items`
 ;
 select * from `priceview`;
 {noformat}
 Produces the following error:
 {noformat}
 karel@tomato:~/tmp$ $HIVE_HOME/bin/hive -f hivebug.sql
 WARNING: org.apache.hadoop.metrics.jvm.EventCounter is deprecated. Please use 
 org.apache.hadoop.log.metrics.EventCounter in all the log4j.properties files.
 Logging initialized using configuration in 
 jar:file:/home/karel/opt/hive-0.10.0-bin/lib/hive-common-0.10.0.jar!/hive-log4j.properties
 Hive history file=/tmp/karel/hive_job_log_karel_201303051117_945318761.txt
 SLF4J: Class path contains multiple SLF4J bindings.
 SLF4J: Found binding in 
 [jar:file:/home/karel/opt/hadoop-2.0.0-mr1-cdh4.0.0/lib/slf4j-log4j12-1.6.1.jar!/org/slf4j/impl/StaticLoggerBinder.class]
 SLF4J: Found binding in 
 [jar:file:/home/karel/opt/hive-0.10.0-bin/lib/slf4j-log4j12-1.6.1.jar!/org/slf4j/impl/StaticLoggerBinder.class]
 SLF4J: See http://www.slf4j.org/codes.html#multiple_bindings for an 
 explanation.
 OK
 Time taken: 5.449 seconds
 OK
 Time taken: 0.303 seconds
 OK
 Time taken: 0.131 seconds
 OK
 Time taken: 0.206 seconds
 FAILED: SemanticException line 3:22 mismatched input '.' expecting FROM near 
 '`items`' in from clause
  in definition of VIEW priceview [
 SELECT
 `items`.`id`,
 `items``items`.`info`info['price']
 FROM
 `default`.`items`
 ] used as priceview at Line 3:14
 {noformat}
 Unless I'm not using the right syntax, I would expect this simple example to 
 work. I have tried some variations (quotes, no quotes, ...), to no avail.



--
This message was sent by Atlassian JIRA
(v6.1#6144)

[jira] [Assigned] (HIVE-5741) Hcatalog needs to be added to the binary tar


 [ 
https://issues.apache.org/jira/browse/HIVE-5741?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Brock Noland reassigned HIVE-5741:
--

Assignee: Brock Noland

 Hcatalog needs to be added to the binary tar
 

 Key: HIVE-5741
 URL: https://issues.apache.org/jira/browse/HIVE-5741
 Project: Hive
  Issue Type: Sub-task
Reporter: Brock Noland
Assignee: Brock Noland





--
This message was sent by Atlassian JIRA
(v6.1#6144)

[jira] [Commented] (HIVE-5217) Add long polling to asynchronous execution in HiveServer2

2013-11-12 Thread Vaibhav Gumashta (JIRA)


[ 
https://issues.apache.org/jira/browse/HIVE-5217?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13820460#comment-13820460
 ] 

Vaibhav Gumashta commented on HIVE-5217:


[~cwsteinbach] Yes, you can take a look now. Thanks.

 Add long polling to asynchronous execution in HiveServer2
 -

 Key: HIVE-5217
 URL: https://issues.apache.org/jira/browse/HIVE-5217
 Project: Hive
  Issue Type: Improvement
  Components: HiveServer2
Affects Versions: 0.13.0
Reporter: Vaibhav Gumashta
Assignee: Vaibhav Gumashta
 Fix For: 0.13.0

 Attachments: HIVE-5217.2.patch, HIVE-5217.3.patch, 
 HIVE-5217.D12801.2.patch, HIVE-5217.D12801.3.patch, HIVE-5217.D12801.4.patch, 
 HIVE-5217.D12801.5.patch, HIVE-5217.D12801.6.patch


 [HIVE-4617|https://issues.apache.org/jira/browse/HIVE-4617] provides support 
 for async execution in HS2. The client gets an operation handle which it can 
 poll to check on the operation status. However, the polling frequency is 
 entirely left to the client which can be resource inefficient. Long polling 
 will solve this, by blocking the client request to check the operation status 
 for a configurable amount of time (a new HS2 config) if the data is not 
 available, but responding immediately if the data is available.



--
This message was sent by Atlassian JIRA
(v6.1#6144)

[jira] [Commented] (HIVE-5799) session/operation timeout for hiveserver2


[ 
https://issues.apache.org/jira/browse/HIVE-5799?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13820458#comment-13820458
 ] 

Hive QA commented on HIVE-5799:
---



{color:red}Overall{color}: -1 at least one tests failed

Here are the results of testing the latest attachment:
https://issues.apache.org/jira/secure/attachment/12613327/HIVE-5799.1.patch.txt

{color:red}ERROR:{color} -1 due to 18 failed/errored test(s), 4537 tests 
executed
*Failed tests:*
{noformat}
org.apache.hive.beeline.src.test.TestBeeLineWithArgs.org.apache.hive.beeline.src.test.TestBeeLineWithArgs
org.apache.hive.jdbc.TestJdbcDriver2.org.apache.hive.jdbc.TestJdbcDriver2
org.apache.hive.jdbc.TestJdbcWithMiniHS2.testConnection
org.apache.hive.jdbc.TestSSL.testConnectionMismatch
org.apache.hive.jdbc.TestSSL.testInvalidConfig
org.apache.hive.jdbc.TestSSL.testSSLConnectionWithProperty
org.apache.hive.jdbc.TestSSL.testSSLConnectionWithURL
org.apache.hive.jdbc.TestSSL.testSSLFetch
org.apache.hive.jdbc.miniHS2.TestHiveServer2.testConnection
org.apache.hive.service.auth.TestCustomAuthentication.org.apache.hive.service.auth.TestCustomAuthentication
org.apache.hive.service.auth.TestPlainSaslHelper.testDoAsSetting
org.apache.hive.service.cli.TestEmbeddedThriftBinaryCLIService.org.apache.hive.service.cli.TestEmbeddedThriftBinaryCLIService
org.apache.hive.service.cli.TestScratchDir.testLocalScratchDirs
org.apache.hive.service.cli.TestScratchDir.testResourceDirs
org.apache.hive.service.cli.TestScratchDir.testScratchDirs
org.apache.hive.service.cli.session.TestSessionHooks.testSessionHook
org.apache.hive.service.cli.thrift.TestThriftBinaryCLIService.org.apache.hive.service.cli.thrift.TestThriftBinaryCLIService
org.apache.hive.service.cli.thrift.TestThriftHttpCLIService.org.apache.hive.service.cli.thrift.TestThriftHttpCLIService
{noformat}

Test results: 
http://bigtop01.cloudera.org:8080/job/PreCommit-HIVE-Build/247/testReport
Console output: 
http://bigtop01.cloudera.org:8080/job/PreCommit-HIVE-Build/247/console

Messages:
{noformat}
Executing org.apache.hive.ptest.execution.PrepPhase
Executing org.apache.hive.ptest.execution.ExecutionPhase
Executing org.apache.hive.ptest.execution.ReportingPhase
Tests failed with: TestsFailedException: 18 tests failed
{noformat}

This message is automatically generated.

ATTACHMENT ID: 12613327

 session/operation timeout for hiveserver2
 -

 Key: HIVE-5799
 URL: https://issues.apache.org/jira/browse/HIVE-5799
 Project: Hive
  Issue Type: Improvement
  Components: HiveServer2
Reporter: Navis
Assignee: Navis
Priority: Minor
 Attachments: HIVE-5799.1.patch.txt


 Need some timeout facility for preventing resource leakages from instable  or 
 bad clients.



--
This message was sent by Atlassian JIRA
(v6.1#6144)

[jira] [Updated] (HIVE-5218) datanucleus does not work with MS SQLServer in Hive metastore


 [ 
https://issues.apache.org/jira/browse/HIVE-5218?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Thejas M Nair updated HIVE-5218:


Status: Open  (was: Patch Available)

 datanucleus does not work with MS SQLServer in Hive metastore
 -

 Key: HIVE-5218
 URL: https://issues.apache.org/jira/browse/HIVE-5218
 Project: Hive
  Issue Type: Bug
  Components: Metastore
Affects Versions: 0.12.0
Reporter: shanyu zhao
Assignee: shanyu zhao
 Fix For: 0.13.0

 Attachments: 
 0001-HIVE-5218-datanucleus-does-not-work-with-SQLServer-i.patch, 
 HIVE-5218-trunk.patch, HIVE-5218-v2.patch, HIVE-5218.2.patch, HIVE-5218.patch


 HIVE-3632 upgraded datanucleus version to 3.2.x, however, this version of 
 datanucleus doesn't work with SQLServer as the metastore. The problem is that 
 datanucleus tries to use fully qualified object name to find a table in the 
 database but couldn't find it.
 If I downgrade the version to HIVE-2084, SQLServer works fine.
 It could be a bug in datanucleus.
 This is the detailed exception I'm getting when using datanucleus 3.2.x with 
 SQL Server:
 {noformat}
 FAILED: Execution Error, return code 1 from 
 org.apache.hadoop.hive.ql.exec.DDLTa
 sk. MetaException(message:javax.jdo.JDOException: Exception thrown calling 
 table
 .exists() for a2ee36af45e9f46c19e995bfd2d9b5fd1hivemetastore..SEQUENCE_TABLE
 at 
 org.datanucleus.api.jdo.NucleusJDOHelper.getJDOExceptionForNucleusExc
 eption(NucleusJDOHelper.java:596)
 at 
 org.datanucleus.api.jdo.JDOPersistenceManager.jdoMakePersistent(JDOPe
 rsistenceManager.java:732)
 …
 at 
 org.apache.hadoop.hive.metastore.RetryingRawStore.invoke(RetryingRawS
 tore.java:111)
 at $Proxy0.createTable(Unknown Source)
 at 
 org.apache.hadoop.hive.metastore.HiveMetaStore$HMSHandler.create_tabl
 e_core(HiveMetaStore.java:1071)
 at 
 org.apache.hadoop.hive.metastore.HiveMetaStore$HMSHandler.create_tabl
 e_with_environment_context(HiveMetaStore.java:1104)
 …
 at $Proxy11.create_table_with_environment_context(Unknown Source)
 at 
 org.apache.hadoop.hive.metastore.api.ThriftHiveMetastore$Processor$cr
 eate_table_with_environment_context.getResult(ThriftHiveMetastore.java:6417)
 at 
 org.apache.hadoop.hive.metastore.api.ThriftHiveMetastore$Processor$cr
 eate_table_with_environment_context.getResult(ThriftHiveMetastore.java:6401)
 NestedThrowablesStackTrace:
 com.microsoft.sqlserver.jdbc.SQLServerException: There is already an object 
 name
 d 'SEQUENCE_TABLE' in the database.
 at 
 com.microsoft.sqlserver.jdbc.SQLServerException.makeFromDatabaseError
 (SQLServerException.java:197)
 at 
 com.microsoft.sqlserver.jdbc.SQLServerStatement.getNextResult(SQLServ
 erStatement.java:1493)
 at 
 com.microsoft.sqlserver.jdbc.SQLServerStatement.doExecuteStatement(SQ
 LServerStatement.java:775)
 at 
 com.microsoft.sqlserver.jdbc.SQLServerStatement$StmtExecCmd.doExecute
 (SQLServerStatement.java:676)
 at com.microsoft.sqlserver.jdbc.TDSCommand.execute(IOBuffer.java:4615)
 at 
 com.microsoft.sqlserver.jdbc.SQLServerConnection.executeCommand(SQLSe
 rverConnection.java:1400)
 at 
 com.microsoft.sqlserver.jdbc.SQLServerStatement.executeCommand(SQLSer
 verStatement.java:179)
 at 
 com.microsoft.sqlserver.jdbc.SQLServerStatement.executeStatement(SQLS
 erverStatement.java:154)
 at 
 com.microsoft.sqlserver.jdbc.SQLServerStatement.execute(SQLServerStat
 ement.java:649)
 at com.jolbox.bonecp.StatementHandle.execute(StatementHandle.java:300)
 at 
 org.datanucleus.store.rdbms.table.AbstractTable.executeDdlStatement(A
 bstractTable.java:760)
 at 
 org.datanucleus.store.rdbms.table.AbstractTable.executeDdlStatementLi
 st(AbstractTable.java:711)
 at 
 org.datanucleus.store.rdbms.table.AbstractTable.create(AbstractTable.
 java:425)
 at 
 org.datanucleus.store.rdbms.table.AbstractTable.exists(AbstractTable.
 java:488)
 at 
 org.datanucleus.store.rdbms.valuegenerator.TableGenerator.repositoryE
 xists(TableGenerator.java:242)
 at 
 org.datanucleus.store.rdbms.valuegenerator.AbstractRDBMSGenerator.obt
 ainGenerationBlock(AbstractRDBMSGenerator.java:86)
 at 
 org.datanucleus.store.valuegenerator.AbstractGenerator.obtainGenerati
 onBlock(AbstractGenerator.java:197)
 at 
 org.datanucleus.store.valuegenerator.AbstractGenerator.next(AbstractG
 enerator.java:105)
 at 
 org.datanucleus.store.rdbms.RDBMSStoreManager.getStrategyValueForGene
 rator(RDBMSStoreManager.java:2019)
 at 
 org.datanucleus.store.AbstractStoreManager.getStrategyValue(AbstractS
 toreManager.java:1385)
 at 
 org.datanucleus.ExecutionContextImpl.newObjectId(ExecutionContextImpl

[jira] [Updated] (HIVE-5218) datanucleus does not work with MS SQLServer in Hive metastore


 [ 
https://issues.apache.org/jira/browse/HIVE-5218?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Thejas M Nair updated HIVE-5218:


Status: Patch Available  (was: Open)

 datanucleus does not work with MS SQLServer in Hive metastore
 -

 Key: HIVE-5218
 URL: https://issues.apache.org/jira/browse/HIVE-5218
 Project: Hive
  Issue Type: Bug
  Components: Metastore
Affects Versions: 0.12.0
Reporter: shanyu zhao
Assignee: shanyu zhao
 Fix For: 0.13.0

 Attachments: 
 0001-HIVE-5218-datanucleus-does-not-work-with-SQLServer-i.patch, 
 HIVE-5218-trunk.patch, HIVE-5218-trunk.patch, HIVE-5218-v2.patch, 
 HIVE-5218.2.patch, HIVE-5218.patch


 HIVE-3632 upgraded datanucleus version to 3.2.x, however, this version of 
 datanucleus doesn't work with SQLServer as the metastore. The problem is that 
 datanucleus tries to use fully qualified object name to find a table in the 
 database but couldn't find it.
 If I downgrade the version to HIVE-2084, SQLServer works fine.
 It could be a bug in datanucleus.
 This is the detailed exception I'm getting when using datanucleus 3.2.x with 
 SQL Server:
 {noformat}
 FAILED: Execution Error, return code 1 from 
 org.apache.hadoop.hive.ql.exec.DDLTa
 sk. MetaException(message:javax.jdo.JDOException: Exception thrown calling 
 table
 .exists() for a2ee36af45e9f46c19e995bfd2d9b5fd1hivemetastore..SEQUENCE_TABLE
 at 
 org.datanucleus.api.jdo.NucleusJDOHelper.getJDOExceptionForNucleusExc
 eption(NucleusJDOHelper.java:596)
 at 
 org.datanucleus.api.jdo.JDOPersistenceManager.jdoMakePersistent(JDOPe
 rsistenceManager.java:732)
 …
 at 
 org.apache.hadoop.hive.metastore.RetryingRawStore.invoke(RetryingRawS
 tore.java:111)
 at $Proxy0.createTable(Unknown Source)
 at 
 org.apache.hadoop.hive.metastore.HiveMetaStore$HMSHandler.create_tabl
 e_core(HiveMetaStore.java:1071)
 at 
 org.apache.hadoop.hive.metastore.HiveMetaStore$HMSHandler.create_tabl
 e_with_environment_context(HiveMetaStore.java:1104)
 …
 at $Proxy11.create_table_with_environment_context(Unknown Source)
 at 
 org.apache.hadoop.hive.metastore.api.ThriftHiveMetastore$Processor$cr
 eate_table_with_environment_context.getResult(ThriftHiveMetastore.java:6417)
 at 
 org.apache.hadoop.hive.metastore.api.ThriftHiveMetastore$Processor$cr
 eate_table_with_environment_context.getResult(ThriftHiveMetastore.java:6401)
 NestedThrowablesStackTrace:
 com.microsoft.sqlserver.jdbc.SQLServerException: There is already an object 
 name
 d 'SEQUENCE_TABLE' in the database.
 at 
 com.microsoft.sqlserver.jdbc.SQLServerException.makeFromDatabaseError
 (SQLServerException.java:197)
 at 
 com.microsoft.sqlserver.jdbc.SQLServerStatement.getNextResult(SQLServ
 erStatement.java:1493)
 at 
 com.microsoft.sqlserver.jdbc.SQLServerStatement.doExecuteStatement(SQ
 LServerStatement.java:775)
 at 
 com.microsoft.sqlserver.jdbc.SQLServerStatement$StmtExecCmd.doExecute
 (SQLServerStatement.java:676)
 at com.microsoft.sqlserver.jdbc.TDSCommand.execute(IOBuffer.java:4615)
 at 
 com.microsoft.sqlserver.jdbc.SQLServerConnection.executeCommand(SQLSe
 rverConnection.java:1400)
 at 
 com.microsoft.sqlserver.jdbc.SQLServerStatement.executeCommand(SQLSer
 verStatement.java:179)
 at 
 com.microsoft.sqlserver.jdbc.SQLServerStatement.executeStatement(SQLS
 erverStatement.java:154)
 at 
 com.microsoft.sqlserver.jdbc.SQLServerStatement.execute(SQLServerStat
 ement.java:649)
 at com.jolbox.bonecp.StatementHandle.execute(StatementHandle.java:300)
 at 
 org.datanucleus.store.rdbms.table.AbstractTable.executeDdlStatement(A
 bstractTable.java:760)
 at 
 org.datanucleus.store.rdbms.table.AbstractTable.executeDdlStatementLi
 st(AbstractTable.java:711)
 at 
 org.datanucleus.store.rdbms.table.AbstractTable.create(AbstractTable.
 java:425)
 at 
 org.datanucleus.store.rdbms.table.AbstractTable.exists(AbstractTable.
 java:488)
 at 
 org.datanucleus.store.rdbms.valuegenerator.TableGenerator.repositoryE
 xists(TableGenerator.java:242)
 at 
 org.datanucleus.store.rdbms.valuegenerator.AbstractRDBMSGenerator.obt
 ainGenerationBlock(AbstractRDBMSGenerator.java:86)
 at 
 org.datanucleus.store.valuegenerator.AbstractGenerator.obtainGenerati
 onBlock(AbstractGenerator.java:197)
 at 
 org.datanucleus.store.valuegenerator.AbstractGenerator.next(AbstractG
 enerator.java:105)
 at 
 org.datanucleus.store.rdbms.RDBMSStoreManager.getStrategyValueForGene
 rator(RDBMSStoreManager.java:2019)
 at 
 org.datanucleus.store.AbstractStoreManager.getStrategyValue(AbstractS
 toreManager.java:1385)
 at

[jira] [Updated] (HIVE-5218) datanucleus does not work with MS SQLServer in Hive metastore


 [ 
https://issues.apache.org/jira/browse/HIVE-5218?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Thejas M Nair updated HIVE-5218:


Attachment: HIVE-5218-trunk.patch

reattaching patch to run tests

 datanucleus does not work with MS SQLServer in Hive metastore
 -

 Key: HIVE-5218
 URL: https://issues.apache.org/jira/browse/HIVE-5218
 Project: Hive
  Issue Type: Bug
  Components: Metastore
Affects Versions: 0.12.0
Reporter: shanyu zhao
Assignee: shanyu zhao
 Fix For: 0.13.0

 Attachments: 
 0001-HIVE-5218-datanucleus-does-not-work-with-SQLServer-i.patch, 
 HIVE-5218-trunk.patch, HIVE-5218-trunk.patch, HIVE-5218-v2.patch, 
 HIVE-5218.2.patch, HIVE-5218.patch


 HIVE-3632 upgraded datanucleus version to 3.2.x, however, this version of 
 datanucleus doesn't work with SQLServer as the metastore. The problem is that 
 datanucleus tries to use fully qualified object name to find a table in the 
 database but couldn't find it.
 If I downgrade the version to HIVE-2084, SQLServer works fine.
 It could be a bug in datanucleus.
 This is the detailed exception I'm getting when using datanucleus 3.2.x with 
 SQL Server:
 {noformat}
 FAILED: Execution Error, return code 1 from 
 org.apache.hadoop.hive.ql.exec.DDLTa
 sk. MetaException(message:javax.jdo.JDOException: Exception thrown calling 
 table
 .exists() for a2ee36af45e9f46c19e995bfd2d9b5fd1hivemetastore..SEQUENCE_TABLE
 at 
 org.datanucleus.api.jdo.NucleusJDOHelper.getJDOExceptionForNucleusExc
 eption(NucleusJDOHelper.java:596)
 at 
 org.datanucleus.api.jdo.JDOPersistenceManager.jdoMakePersistent(JDOPe
 rsistenceManager.java:732)
 …
 at 
 org.apache.hadoop.hive.metastore.RetryingRawStore.invoke(RetryingRawS
 tore.java:111)
 at $Proxy0.createTable(Unknown Source)
 at 
 org.apache.hadoop.hive.metastore.HiveMetaStore$HMSHandler.create_tabl
 e_core(HiveMetaStore.java:1071)
 at 
 org.apache.hadoop.hive.metastore.HiveMetaStore$HMSHandler.create_tabl
 e_with_environment_context(HiveMetaStore.java:1104)
 …
 at $Proxy11.create_table_with_environment_context(Unknown Source)
 at 
 org.apache.hadoop.hive.metastore.api.ThriftHiveMetastore$Processor$cr
 eate_table_with_environment_context.getResult(ThriftHiveMetastore.java:6417)
 at 
 org.apache.hadoop.hive.metastore.api.ThriftHiveMetastore$Processor$cr
 eate_table_with_environment_context.getResult(ThriftHiveMetastore.java:6401)
 NestedThrowablesStackTrace:
 com.microsoft.sqlserver.jdbc.SQLServerException: There is already an object 
 name
 d 'SEQUENCE_TABLE' in the database.
 at 
 com.microsoft.sqlserver.jdbc.SQLServerException.makeFromDatabaseError
 (SQLServerException.java:197)
 at 
 com.microsoft.sqlserver.jdbc.SQLServerStatement.getNextResult(SQLServ
 erStatement.java:1493)
 at 
 com.microsoft.sqlserver.jdbc.SQLServerStatement.doExecuteStatement(SQ
 LServerStatement.java:775)
 at 
 com.microsoft.sqlserver.jdbc.SQLServerStatement$StmtExecCmd.doExecute
 (SQLServerStatement.java:676)
 at com.microsoft.sqlserver.jdbc.TDSCommand.execute(IOBuffer.java:4615)
 at 
 com.microsoft.sqlserver.jdbc.SQLServerConnection.executeCommand(SQLSe
 rverConnection.java:1400)
 at 
 com.microsoft.sqlserver.jdbc.SQLServerStatement.executeCommand(SQLSer
 verStatement.java:179)
 at 
 com.microsoft.sqlserver.jdbc.SQLServerStatement.executeStatement(SQLS
 erverStatement.java:154)
 at 
 com.microsoft.sqlserver.jdbc.SQLServerStatement.execute(SQLServerStat
 ement.java:649)
 at com.jolbox.bonecp.StatementHandle.execute(StatementHandle.java:300)
 at 
 org.datanucleus.store.rdbms.table.AbstractTable.executeDdlStatement(A
 bstractTable.java:760)
 at 
 org.datanucleus.store.rdbms.table.AbstractTable.executeDdlStatementLi
 st(AbstractTable.java:711)
 at 
 org.datanucleus.store.rdbms.table.AbstractTable.create(AbstractTable.
 java:425)
 at 
 org.datanucleus.store.rdbms.table.AbstractTable.exists(AbstractTable.
 java:488)
 at 
 org.datanucleus.store.rdbms.valuegenerator.TableGenerator.repositoryE
 xists(TableGenerator.java:242)
 at 
 org.datanucleus.store.rdbms.valuegenerator.AbstractRDBMSGenerator.obt
 ainGenerationBlock(AbstractRDBMSGenerator.java:86)
 at 
 org.datanucleus.store.valuegenerator.AbstractGenerator.obtainGenerati
 onBlock(AbstractGenerator.java:197)
 at 
 org.datanucleus.store.valuegenerator.AbstractGenerator.next(AbstractG
 enerator.java:105)
 at 
 org.datanucleus.store.rdbms.RDBMSStoreManager.getStrategyValueForGene
 rator(RDBMSStoreManager.java:2019)
 at 
 org.datanucleus.store.AbstractStoreManager.getStrategyValue(AbstractS
 toreManager.java:1385)
 at

[jira] [Commented] (HIVE-5717) Generate javadoc and source jars


[ 
https://issues.apache.org/jira/browse/HIVE-5717?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13820467#comment-13820467
 ] 

Szehon Ho commented on HIVE-5717:
-

I understand we need to generate the javadocs and package them with the 
packaging/src.tar, but I am wondering what is the requirement on source jars?  
Where should they be bundled, if at all?  Thanks.

 Generate javadoc and source jars
 

 Key: HIVE-5717
 URL: https://issues.apache.org/jira/browse/HIVE-5717
 Project: Hive
  Issue Type: Sub-task
Reporter: Brock Noland
Assignee: Szehon Ho

 We should be generating both javadoc and source jars.



--
This message was sent by Atlassian JIRA
(v6.1#6144)

[jira] [Commented] (HIVE-5717) Generate javadoc and source jars


[ 
https://issues.apache.org/jira/browse/HIVE-5717?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13820496#comment-13820496
 ] 

Brock Noland commented on HIVE-5717:


I don't think this touches packaging at all. What we want to do is generate a 
-javadoc.jar and -sources.jar for each maven artifact. Those new artifect 
should be attached to the project so they publish to 
~/.m2/repository/org/apache/hive

For example, notice how mrunit 
(http://repo1.maven.org/maven2/org/apache/mrunit/mrunit/1.0.0/) has:

{noformat}
mrunit-1.0.0.jar
mrunit-1.0.0-sources.jar
mrunit-1.0.0-javadoc.jar
{noformat}

we want that exact same thing for every artifact in the project. Additionally 
generating the javadoc can be very slow so we want to do that in the dist 
profile.

 Generate javadoc and source jars
 

 Key: HIVE-5717
 URL: https://issues.apache.org/jira/browse/HIVE-5717
 Project: Hive
  Issue Type: Sub-task
Reporter: Brock Noland
Assignee: Szehon Ho

 We should be generating both javadoc and source jars.



--
This message was sent by Atlassian JIRA
(v6.1#6144)

[jira] [Commented] (HIVE-5717) Generate javadoc and source jars


[ 
https://issues.apache.org/jira/browse/HIVE-5717?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13820498#comment-13820498
 ] 

Brock Noland commented on HIVE-5717:


bq. I don't think this touches packaging at all.

Meaning do not worry about including them in packaging at this point. We'll do 
that later.

 Generate javadoc and source jars
 

 Key: HIVE-5717
 URL: https://issues.apache.org/jira/browse/HIVE-5717
 Project: Hive
  Issue Type: Sub-task
Reporter: Brock Noland
Assignee: Szehon Ho

 We should be generating both javadoc and source jars.



--
This message was sent by Atlassian JIRA
(v6.1#6144)

[jira] [Commented] (HIVE-5717) Generate javadoc and source jars


[ 
https://issues.apache.org/jira/browse/HIVE-5717?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13820500#comment-13820500
 ] 

Brock Noland commented on HIVE-5717:


mrunit is doing some funky hadoop1 and hadoop2 classifier stuff. Whirr is 
probably a better example:

http://repo1.maven.org/maven2/org/apache/whirr/whirr-core/0.8.2/

 Generate javadoc and source jars
 

 Key: HIVE-5717
 URL: https://issues.apache.org/jira/browse/HIVE-5717
 Project: Hive
  Issue Type: Sub-task
Reporter: Brock Noland
Assignee: Szehon Ho

 We should be generating both javadoc and source jars.



--
This message was sent by Atlassian JIRA
(v6.1#6144)

[jira] [Created] (HIVE-5802) Hive UDF pow limits the second input to integer when the first input is a decimal

Xuefu Zhang created HIVE-5802:
-

 Summary: Hive UDF pow limits the second input to integer when the 
first input is a decimal
 Key: HIVE-5802
 URL: https://issues.apache.org/jira/browse/HIVE-5802
 Project: Hive
  Issue Type: Improvement
  Components: Types, UDF
Affects Versions: 0.12.0
Reporter: Xuefu Zhang
Assignee: Xuefu Zhang


For example,
{code}
hive desc test;
OK
i   int None
b   boolean None
d   double  None
s   string  None
dec decimal(5,2)None
hive explain select pow(dec, 2.5) from test;
FAILED: SemanticException [Error 10014]: Line 1:15 Wrong arguments '2.5': No 
matching method for class org.apache.hadoop.hive.ql.udf.UDFPower with 
(decimal(5,2), double). Possible choices: _FUNC_(decimal(65,30), int)  
_FUNC_(double, double)  _FUNC_(double, int)  
{code}

This seems too restrictive. MySQL, on the other hand, supports fractional power 
as the second input.
{code}
mysql select pow(d, 2.5) from test;
mysql desc test;
+---+--+--+-+-+---+
| Field | Type | Null | Key | Default | Extra |
+---+--+--+-+-+---+
| i | int(11)  | YES  | | NULL|   |
| b | tinyint(1)   | YES  | | NULL|   |
| d | double   | YES  | | NULL|   |
| s | varchar(5)   | YES  | | NULL|   |
| dd| decimal(5,2) | YES  | | NULL|   |
+---+--+--+-+-+---+
5 rows in set (0.00 sec)

mysql select pow(dd, 2.5) from test;
+---+
| pow(dd, 2.5)  |
+---+
| 2570.215713318881 |
+---+
1 row in set (0.00 sec)
{code}

Hive should also support this.




--
This message was sent by Atlassian JIRA
(v6.1#6144)

[jira] [Commented] (HIVE-5356) Move arithmatic UDFs to generic UDF implementations


[ 
https://issues.apache.org/jira/browse/HIVE-5356?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13820521#comment-13820521
 ] 

Hive QA commented on HIVE-5356:
---



{color:red}Overall{color}: -1 at least one tests failed

Here are the results of testing the latest attachment:
https://issues.apache.org/jira/secure/attachment/12613404/HIVE-5356.6.patch

{color:red}ERROR:{color} -1 due to 10 failed/errored test(s), 4644 tests 
executed
*Failed tests:*
{noformat}
org.apache.hadoop.hive.cli.TestCliDriver.testCliDriver_input8
org.apache.hadoop.hive.cli.TestCliDriver.testCliDriver_num_op_type_conv
org.apache.hadoop.hive.cli.TestCliDriver.testCliDriver_ppd_constant_expr
org.apache.hadoop.hive.cli.TestNegativeCliDriver.testNegativeCliDriver_udf_assert_true2
org.apache.hadoop.hive.cli.TestNegativeCliDriver.testNegativeCliDriver_udf_coalesce
org.apache.hadoop.hive.cli.TestNegativeCliDriver.testNegativeCliDriver_udf_in
org.apache.hadoop.hive.ql.exec.TestFunctionRegistry.testCommonClassComparison
org.apache.hadoop.hive.ql.exec.TestFunctionRegistry.testGetMethodInternal
org.apache.hadoop.hive.ql.parse.TestParse.testParse_input8
org.apache.hadoop.hive.ql.udf.generic.TestGenericUDFPosMod.testDecimalPosModDecimal
{noformat}

Test results: 
http://bigtop01.cloudera.org:8080/job/PreCommit-HIVE-Build/249/testReport
Console output: 
http://bigtop01.cloudera.org:8080/job/PreCommit-HIVE-Build/249/console

Messages:
{noformat}
Executing org.apache.hive.ptest.execution.PrepPhase
Executing org.apache.hive.ptest.execution.ExecutionPhase
Executing org.apache.hive.ptest.execution.ReportingPhase
Tests failed with: TestsFailedException: 10 tests failed
{noformat}

This message is automatically generated.

ATTACHMENT ID: 12613404

 Move arithmatic UDFs to generic UDF implementations
 ---

 Key: HIVE-5356
 URL: https://issues.apache.org/jira/browse/HIVE-5356
 Project: Hive
  Issue Type: Task
  Components: UDF
Affects Versions: 0.11.0
Reporter: Xuefu Zhang
Assignee: Xuefu Zhang
 Fix For: 0.13.0

 Attachments: HIVE-5356.1.patch, HIVE-5356.2.patch, HIVE-5356.3.patch, 
 HIVE-5356.4.patch, HIVE-5356.5.patch, HIVE-5356.6.patch, HIVE-5356.patch


 Currently, all of the arithmetic operators, such as add/sub/mult/div, are 
 implemented as old-style UDFs and java reflection is used to determine the 
 return type TypeInfos/ObjectInspectors, based on the return type of the 
 evaluate() method chosen for the expression. This works fine for types that 
 don't have type params.
 Hive decimal type participates in these operations just like int or double. 
 Different from double or int, however, decimal has precision and scale, which 
 cannot be determined by just looking at the return type (decimal) of the UDF 
 evaluate() method, even though the operands have certain precision/scale. 
 With the default of decimal without precision/scale, then (10, 0) will be 
 the type params. This is certainly not desirable.
 To solve this problem, all of the arithmetic operators would need to be 
 implemented as GenericUDFs, which allow returning ObjectInspector during the 
 initialize() method. The object inspectors returned can carry type params, 
 from which the exact return type can be determined.
 It's worth mentioning that, for user UDF implemented in non-generic way, if 
 the return type of the chosen evaluate() method is decimal, the return type 
 actually has (10,0) as precision/scale, which might not be desirable. This 
 needs to be documented.
 This JIRA will cover minus, plus, divide, multiply, mod, and pmod, to limit 
 the scope of review. The remaining ones will be covered under HIVE-5706.



--
This message was sent by Atlassian JIRA
(v6.1#6144)

[jira] [Commented] (HIVE-2055) Hive HBase Integration issue

2013-11-12 Thread Nick Dimiduk (JIRA)

[
https://issues.apache.org/jira/browse/HIVE-2055?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13820558#comment-13820558
]

Nick Dimiduk commented on HIVE-2055:

My above link has turned stale. I'm referring to
FileUtil#createJarWithClassPath.

Calling this method to construct a classpath jar every time a script calls
$(hbase classpath-min) sounds time-consuming and will leave temp jars orphaned
on the FS.

Another idea is to use maven to generate a dependency list that omits Hadoop
and other runtime jars? Any friends at BigTop who could give advice on this
approach? (cc [~rvs], [~mackrorysd])

Yet another option is to maintain the list manually on the HBase side. We
already sort of do with in TableMapReduceUtils#addDependencyJars. This is
probably the simplest but most fragile and least future-proof option.

Hive HBase Integration issue

Key: HIVE-2055
URL: https://issues.apache.org/jira/browse/HIVE-2055
Project: Hive
Issue Type: Bug
Components: HBase Handler
Affects Versions: 0.10.0
Reporter: sajith v
Attachments:
0001-HIVE-2055-include-hbase-dependencies-in-launch-scrip.patch,
HIVE-2055.patch

Created an external table in hive , which points to the HBase table. When
tried to query a column using the column name in select clause got the
following exception : ( java.lang.ClassNotFoundException:
org.apache.hadoop.hive.hbase.HiveHBaseTableInputFormat), errorCode:12,
SQLState:42000)

--
This message was sent by Atlassian JIRA
(v6.1#6144)

[jira] [Updated] (HIVE-5557) Push down qualifying Where clause predicates as join conditions


 [ 
https://issues.apache.org/jira/browse/HIVE-5557?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Harish Butani updated HIVE-5557:


Attachment: HIVE-5557.5.patch

 Push down qualifying Where clause predicates as join conditions
 ---

 Key: HIVE-5557
 URL: https://issues.apache.org/jira/browse/HIVE-5557
 Project: Hive
  Issue Type: Sub-task
  Components: Query Processor
Reporter: Harish Butani
Assignee: Harish Butani
 Attachments: HIVE-5557.1.patch, HIVE-5557.2.patch, HIVE-5557.3.patch, 
 HIVE-5557.4.patch, HIVE-5557.5.patch


 See details in HIVE-



--
This message was sent by Atlassian JIRA
(v6.1#6144)

[jira] [Updated] (HIVE-5557) Push down qualifying Where clause predicates as join conditions


 [ 
https://issues.apache.org/jira/browse/HIVE-5557?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Harish Butani updated HIVE-5557:


Status: Open  (was: Patch Available)

 Push down qualifying Where clause predicates as join conditions
 ---

 Key: HIVE-5557
 URL: https://issues.apache.org/jira/browse/HIVE-5557
 Project: Hive
  Issue Type: Sub-task
  Components: Query Processor
Reporter: Harish Butani
Assignee: Harish Butani
 Attachments: HIVE-5557.1.patch, HIVE-5557.2.patch, HIVE-5557.3.patch, 
 HIVE-5557.4.patch, HIVE-5557.5.patch


 See details in HIVE-



--
This message was sent by Atlassian JIRA
(v6.1#6144)

[jira] [Updated] (HIVE-5557) Push down qualifying Where clause predicates as join conditions


 [ 
https://issues.apache.org/jira/browse/HIVE-5557?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Harish Butani updated HIVE-5557:


Status: Patch Available  (was: Open)

 Push down qualifying Where clause predicates as join conditions
 ---

 Key: HIVE-5557
 URL: https://issues.apache.org/jira/browse/HIVE-5557
 Project: Hive
  Issue Type: Sub-task
  Components: Query Processor
Reporter: Harish Butani
Assignee: Harish Butani
 Attachments: HIVE-5557.1.patch, HIVE-5557.2.patch, HIVE-5557.3.patch, 
 HIVE-5557.4.patch, HIVE-5557.5.patch


 See details in HIVE-



--
This message was sent by Atlassian JIRA
(v6.1#6144)

[jira] [Updated] (HIVE-2055) Hive should add HBase classpath dependencies when available

2013-11-12 Thread Nick Dimiduk (JIRA)


 [ 
https://issues.apache.org/jira/browse/HIVE-2055?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Nick Dimiduk updated HIVE-2055:
---

Summary: Hive should add HBase classpath dependencies when available  (was: 
Hive HBase Integration issue)

 Hive should add HBase classpath dependencies when available
 ---

 Key: HIVE-2055
 URL: https://issues.apache.org/jira/browse/HIVE-2055
 Project: Hive
  Issue Type: Bug
  Components: HBase Handler
Affects Versions: 0.10.0
Reporter: sajith v
 Attachments: 
 0001-HIVE-2055-include-hbase-dependencies-in-launch-scrip.patch, 
 HIVE-2055.patch


 Created an external table in hive , which points to the HBase table. When 
 tried to query a column using the column name in select clause got the 
 following exception : ( java.lang.ClassNotFoundException: 
 org.apache.hadoop.hive.hbase.HiveHBaseTableInputFormat), errorCode:12, 
 SQLState:42000)



--
This message was sent by Atlassian JIRA
(v6.1#6144)

Re: Review Request 15435: Add long polling to asynchronous execution in HiveServer2

2013-11-12 Thread Carl Steinbach


---
This is an automatically generated e-mail. To reply, visit:
https://reviews.apache.org/r/15435/#review28749
---



service/src/java/org/apache/hive/service/cli/operation/SQLOperation.java
https://reviews.apache.org/r/15435/#comment55768

Should probably use a long and HiveConf.getLongVar instead of ints.



service/src/test/org/apache/hive/service/cli/CLIServiceTest.java
https://reviews.apache.org/r/15435/#comment55770

Please try to avoid using concrete collection types on the LHS or 
assignments or in method parameter lists, i.e. use MapString, String instead 
of HashMapString, String.



service/src/test/org/apache/hive/service/cli/CLIServiceTest.java
https://reviews.apache.org/r/15435/#comment55742

Please remove TABs



service/src/test/org/apache/hive/service/cli/CLIServiceTest.java
https://reviews.apache.org/r/15435/#comment55767

Replace hive.server2.long.polling.timeout with 
HiveConf.ConfVars.HIVE_SERVER2_LONG_POLLING_TIMEOUT.varname.



service/src/test/org/apache/hive/service/cli/CLIServiceTest.java
https://reviews.apache.org/r/15435/#comment55764

Please use a switch statement here.



service/src/test/org/apache/hive/service/cli/CLIServiceTest.java
https://reviews.apache.org/r/15435/#comment55769

Formatting.


- Carl Steinbach


On Nov. 12, 2013, 2:16 a.m., Vaibhav Gumashta wrote:
 
 ---
 This is an automatically generated e-mail. To reply, visit:
 https://reviews.apache.org/r/15435/
 ---
 
 (Updated Nov. 12, 2013, 2:16 a.m.)
 
 
 Review request for hive, Carl Steinbach and Thejas Nair.
 
 
 Bugs: HIVE-5217
 https://issues.apache.org/jira/browse/HIVE-5217
 
 
 Repository: hive-git
 
 
 Description
 ---
 
 Add long polling to asynchronous execution in HiveServer2
 
 
 Diffs
 -
 
   common/src/java/org/apache/hadoop/hive/conf/HiveConf.java 4f32390 
   conf/hive-default.xml.template fe7141e 
   service/src/java/org/apache/hive/service/cli/operation/SQLOperation.java 
 4ee1b74 
   service/src/test/org/apache/hive/service/cli/CLIServiceTest.java cd9d99a 
 
 Diff: https://reviews.apache.org/r/15435/diff/
 
 
 Testing
 ---
 
 
 Thanks,
 
 Vaibhav Gumashta

[jira] [Updated] (HIVE-5564) Need to accomodate table decimal columns that were defined prior to HIVE-3976


 [ 
https://issues.apache.org/jira/browse/HIVE-5564?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Xuefu Zhang updated HIVE-5564:
--

Attachment: (was: HIVE-5564.4.patch)

 Need to accomodate table decimal columns that were defined prior to HIVE-3976
 -

 Key: HIVE-5564
 URL: https://issues.apache.org/jira/browse/HIVE-5564
 Project: Hive
  Issue Type: Task
  Components: Types
Affects Versions: 0.13.0
Reporter: Xuefu Zhang
Assignee: Xuefu Zhang
 Fix For: 0.13.0

 Attachments: HIVE-5564.1.patch, HIVE-5564.2.patch, HIVE-5564.3.patch, 
 HIVE-5564.patch


 With HIVE-3976, decimal columns are stored with precision/scale, such as 
 decimal(17,5), as the type name. However, such columns defined in hive prior 
 to HIVE-3976 have a name as decimal. Those columns need to continue to work 
 with a precision/scale as (10,0), per the functional doc. With patch in 
 HIVE-3976, we may get the following error message in such case:
 {code}
 0: jdbc:hive2://localhost:1 desc dec;
 Error: Error while processing statement: FAILED: RuntimeException Decimal 
 type is specified without length: decimal:int (state=42000,code=4)
 {code}
 This issue will be addressed in this JIRA as a follow-up task.



--
This message was sent by Atlassian JIRA
(v6.1#6144)

[jira] [Updated] (HIVE-5564) Need to accomodate table decimal columns that were defined prior to HIVE-3976


 [ 
https://issues.apache.org/jira/browse/HIVE-5564?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Xuefu Zhang updated HIVE-5564:
--

Attachment: HIVE-5564.4.patch

Reload the same patch to kick off test.

 Need to accomodate table decimal columns that were defined prior to HIVE-3976
 -

 Key: HIVE-5564
 URL: https://issues.apache.org/jira/browse/HIVE-5564
 Project: Hive
  Issue Type: Task
  Components: Types
Affects Versions: 0.13.0
Reporter: Xuefu Zhang
Assignee: Xuefu Zhang
 Fix For: 0.13.0

 Attachments: HIVE-5564.1.patch, HIVE-5564.2.patch, HIVE-5564.3.patch, 
 HIVE-5564.4.patch, HIVE-5564.patch


 With HIVE-3976, decimal columns are stored with precision/scale, such as 
 decimal(17,5), as the type name. However, such columns defined in hive prior 
 to HIVE-3976 have a name as decimal. Those columns need to continue to work 
 with a precision/scale as (10,0), per the functional doc. With patch in 
 HIVE-3976, we may get the following error message in such case:
 {code}
 0: jdbc:hive2://localhost:1 desc dec;
 Error: Error while processing statement: FAILED: RuntimeException Decimal 
 type is specified without length: decimal:int (state=42000,code=4)
 {code}
 This issue will be addressed in this JIRA as a follow-up task.



--
This message was sent by Atlassian JIRA
(v6.1#6144)

[jira] [Commented] (HIVE-5217) Add long polling to asynchronous execution in HiveServer2

2013-11-12 Thread Carl Steinbach (JIRA)


[ 
https://issues.apache.org/jira/browse/HIVE-5217?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13820574#comment-13820574
 ] 

Carl Steinbach commented on HIVE-5217:
--

I left some more comments on reviewboard.

Are you planning to move the long polling timeout logic from 
SQLOperation.getState() to CLIService?

 Add long polling to asynchronous execution in HiveServer2
 -

 Key: HIVE-5217
 URL: https://issues.apache.org/jira/browse/HIVE-5217
 Project: Hive
  Issue Type: Improvement
  Components: HiveServer2
Affects Versions: 0.13.0
Reporter: Vaibhav Gumashta
Assignee: Vaibhav Gumashta
 Fix For: 0.13.0

 Attachments: HIVE-5217.2.patch, HIVE-5217.3.patch, 
 HIVE-5217.D12801.2.patch, HIVE-5217.D12801.3.patch, HIVE-5217.D12801.4.patch, 
 HIVE-5217.D12801.5.patch, HIVE-5217.D12801.6.patch


 [HIVE-4617|https://issues.apache.org/jira/browse/HIVE-4617] provides support 
 for async execution in HS2. The client gets an operation handle which it can 
 poll to check on the operation status. However, the polling frequency is 
 entirely left to the client which can be resource inefficient. Long polling 
 will solve this, by blocking the client request to check the operation status 
 for a configurable amount of time (a new HS2 config) if the data is not 
 available, but responding immediately if the data is available.



--
This message was sent by Atlassian JIRA
(v6.1#6144)

[jira] [Updated] (HIVE-5717) Generate javadoc and source jars


 [ 
https://issues.apache.org/jira/browse/HIVE-5717?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Szehon Ho updated HIVE-5717:


Attachment: HIVE-5717.patch

Attaching first cut.

Any thoughts on a separate profile for this build step?  I feel dist profile 
is quite useful as a tool for developers to verify their changes in the final 
product, like what the old /build/dist library used to do in ant.  But whereas 
it used to bring total build time from 45s to just over 1 min before, now it 
takes it to almost 4 mins on my machine.

 Generate javadoc and source jars
 

 Key: HIVE-5717
 URL: https://issues.apache.org/jira/browse/HIVE-5717
 Project: Hive
  Issue Type: Sub-task
Reporter: Brock Noland
Assignee: Szehon Ho
 Attachments: HIVE-5717.patch


 We should be generating both javadoc and source jars.



--
This message was sent by Atlassian JIRA
(v6.1#6144)

Review Request 15462: HIVE-5717 Generate javadoc and source jars

2013-11-12 Thread Szehon Ho


---
This is an automatically generated e-mail. To reply, visit:
https://reviews.apache.org/r/15462/
---

Review request for hive.


Bugs: HIVE-5717
https://issues.apache.org/jira/browse/HIVE-5717


Repository: hive-git


Description
---

Add maven javadoc and sources goal to the hive build.


Diffs
-

  pom.xml 05d887f 

Diff: https://reviews.apache.org/r/15462/diff/


Testing
---


Thanks,

Szehon Ho

[jira] [Commented] (HIVE-5717) Generate javadoc and source jars


[ 
https://issues.apache.org/jira/browse/HIVE-5717?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13820595#comment-13820595
 ] 

Brock Noland commented on HIVE-5717:


perhaps we should have separate profiles called javadocs and sources

It's only important when we are doing a release or publishing a snapshot.

 Generate javadoc and source jars
 

 Key: HIVE-5717
 URL: https://issues.apache.org/jira/browse/HIVE-5717
 Project: Hive
  Issue Type: Sub-task
Reporter: Brock Noland
Assignee: Szehon Ho
 Attachments: HIVE-5717.patch


 We should be generating both javadoc and source jars.



--
This message was sent by Atlassian JIRA
(v6.1#6144)

[jira] [Assigned] (HIVE-5803) Support CTAS from a non-avro table to an avro table


 [ 
https://issues.apache.org/jira/browse/HIVE-5803?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Mohammad Kamrul Islam reassigned HIVE-5803:
---

Assignee: Carl Steinbach

 Support CTAS from a non-avro table to an avro table
 ---

 Key: HIVE-5803
 URL: https://issues.apache.org/jira/browse/HIVE-5803
 Project: Hive
  Issue Type: Task
Reporter: Mohammad Kamrul Islam
Assignee: Carl Steinbach

 Hive currently does not work with HQL like :
 CREATE TABLE AVRO-BASE-TABLE as SELECT * from NON_AVRO_TABLE;
 Actual it works successfully. But when I run SELECT * from 
 AVRO-BASED-TABLE .. it fails.
 This JIRA depends on HIVE-3159 that translates TypeInfo to Avro schema.
 Findings so far: CTAS uses internal column names (in place of using the 
 column names provided in select) when crating the AVRO data file. In other 
 words, avro data file has column names in this form  of: _col0, _col1 where 
 as table column names are different.
 I tested with the following test cases and it failed:
 - verify 1) can create table using create table as select from non-avro table 
 2) LOAD avro data into new table and read data from the new table
 CREATE TABLE simple_kv_txt (key STRING, value STRING) STORED AS TEXTFILE;
 DESCRIBE simple_kv_txt;
 LOAD DATA LOCAL INPATH '../data/files/kv1.txt' INTO TABLE simple_kv_txt;
 SELECT * FROM simple_kv_txt ORDER BY KEY;
 CREATE TABLE copy_doctors ROW FORMAT SERDE 
 'org.apache.hadoop.hive.serde2.avro.AvroSerDe' STORED AS INPUTFORMAT 
 'org.apache.hadoop.hive.ql.io.avro.AvroContainerInputFormat' OUTPUTFORMAT 
 'org.apache.hadoop.hive.ql.io.avro.AvroContainerOutputFormat' as SELECT key 
 as key, value as value FROM simple_kv_txt;
 DESCRIBE copy_doctors;
 SELECT * FROM copy_doctors;
  



--
This message was sent by Atlassian JIRA
(v6.1#6144)

[jira] [Created] (HIVE-5803) Support CTAS from a non-avro table to an avro table

Mohammad Kamrul Islam created HIVE-5803:
---

 Summary: Support CTAS from a non-avro table to an avro table
 Key: HIVE-5803
 URL: https://issues.apache.org/jira/browse/HIVE-5803
 Project: Hive
  Issue Type: Task
Reporter: Mohammad Kamrul Islam


Hive currently does not work with HQL like :
CREATE TABLE AVRO-BASE-TABLE as SELECT * from NON_AVRO_TABLE;
Actual it works successfully. But when I run SELECT * from AVRO-BASED-TABLE 
.. it fails.

This JIRA depends on HIVE-3159 that translates TypeInfo to Avro schema.
Findings so far: CTAS uses internal column names (in place of using the column 
names provided in select) when crating the AVRO data file. In other words, avro 
data file has column names in this form  of: _col0, _col1 where as table column 
names are different.

I tested with the following test cases and it failed:
- verify 1) can create table using create table as select from non-avro table 
2) LOAD avro data into new table and read data from the new table
CREATE TABLE simple_kv_txt (key STRING, value STRING) STORED AS TEXTFILE;
DESCRIBE simple_kv_txt;
LOAD DATA LOCAL INPATH '../data/files/kv1.txt' INTO TABLE simple_kv_txt;
SELECT * FROM simple_kv_txt ORDER BY KEY;

CREATE TABLE copy_doctors ROW FORMAT SERDE 
'org.apache.hadoop.hive.serde2.avro.AvroSerDe' STORED AS INPUTFORMAT 
'org.apache.hadoop.hive.ql.io.avro.AvroContainerInputFormat' OUTPUTFORMAT 
'org.apache.hadoop.hive.ql.io.avro.AvroContainerOutputFormat' as SELECT key as 
key, value as value FROM simple_kv_txt;
DESCRIBE copy_doctors;

SELECT * FROM copy_doctors;




 




--
This message was sent by Atlassian JIRA
(v6.1#6144)

Re: Review Request 15151: Better error reporting by async threads in HiveServer2

2013-11-12 Thread Vaibhav Gumashta



 On Nov. 11, 2013, 7:52 p.m., Carl Steinbach wrote:
  service/if/TCLIService.thrift, line 395
  https://reviews.apache.org/r/15151/diff/3/?file=381909#file381909line395
 
  Please reuse TStatus instead of adding a new struct.

My understanding is that TStatus encapsulates the status of an RPC request (as 
pointed out by Prasad earlier). It has a required TStatusCode field as well 
which corresponds to the status of an RPC call. Would it not be better to 
expand TGetOperationStatusResp with sqlState, errorCode and errorMessage?


 On Nov. 11, 2013, 7:52 p.m., Carl Steinbach wrote:
  itests/hive-unit/src/test/java/org/apache/hive/service/cli/CLIServiceTest.java,
   line 1
  https://reviews.apache.org/r/15151/diff/3/?file=381906#file381906line1
 
  Does this patch make any changes to CLIServiceTest or 
  ThriftCLIServiceTest, or does it just move these files from service/ to 
  itests/ ? If it does make changes can we move the files in a different 
  patch?

It moves these files, but also adds test for this feature + some minor 
refactoring.


 On Nov. 11, 2013, 7:52 p.m., Carl Steinbach wrote:
  service/if/TCLIService.thrift, line 50
  https://reviews.apache.org/r/15151/diff/3/?file=381909#file381909line50
 
  Need to add HIVE_CLI_SERVICE_PROTOCOL_V5 and update any references to 
  HIVE_CLI_SERVICE_PROTOCOL_V4.

Done


 On Nov. 11, 2013, 7:52 p.m., Carl Steinbach wrote:
  service/src/java/org/apache/hive/service/cli/CLIService.java, line 274
  https://reviews.apache.org/r/15151/diff/3/?file=381910#file381910line274
 
  Please push this logic into OperationManager.getOperationStatus()

Done


 On Nov. 11, 2013, 7:52 p.m., Carl Steinbach wrote:
  service/if/TCLIService.thrift, line 917
  https://reviews.apache.org/r/15151/diff/3/?file=381909#file381909line917
 
  This modification will break compatibility between 
  upversion/downversion clients and servers since it modifies the type of an 
  existing field. 
  
  It's possible to avoid this problem by instead adding a new TStatus 
  operationStatus field.

Sorry about the slip, will expand TGetOperationStatusResp with a new optional 
field(s). Would appreciate your thoughts on my reply in the previous comment 
regarding TStatus.


 On Nov. 11, 2013, 7:52 p.m., Carl Steinbach wrote:
  service/src/java/org/apache/hive/service/cli/operation/OperationManager.java,
   line 147
  https://reviews.apache.org/r/15151/diff/3/?file=381918#file381918line147
 
  s/getOperationRunException/getOperationException/

Done


 On Nov. 11, 2013, 7:52 p.m., Carl Steinbach wrote:
  service/src/java/org/apache/hive/service/cli/operation/SQLOperation.java, 
  line 71
  https://reviews.apache.org/r/15151/diff/3/?file=381919#file381919line71
 
  s/runException/operationException/

Done


 On Nov. 11, 2013, 7:52 p.m., Carl Steinbach wrote:
  service/src/java/org/apache/hive/service/cli/operation/Operation.java, line 
  73
  https://reviews.apache.org/r/15151/diff/3/?file=381917#file381917line73
 
  Would it make sense to replace getState() and getRunException() with a 
  getStatus() method that returns an object wrapping the operationState and 
  operationException?
  
  If not, please change the name of getRunException to getException(), 
  and add a comment explaining under what conditions this method will return 
  a non-null value.

Done


 On Nov. 11, 2013, 7:52 p.m., Carl Steinbach wrote:
  service/src/java/org/apache/hive/service/cli/operation/SQLOperation.java, 
  line 324
  https://reviews.apache.org/r/15151/diff/3/?file=381919#file381919line324
 
  May as well just push this into Operation.

Done


 On Nov. 11, 2013, 7:52 p.m., Carl Steinbach wrote:
  service/src/java/org/apache/hive/service/cli/thrift/ThriftCLIServiceClient.java,
   line 307
  https://reviews.apache.org/r/15151/diff/3/?file=381921#file381921line307
 
  Todo: set the status information.

Could you elaborate this a bit more?

Thanks for the feedback.


- Vaibhav


---
This is an automatically generated e-mail. To reply, visit:
https://reviews.apache.org/r/15151/#review28677
---


On Nov. 11, 2013, 7:23 p.m., Vaibhav Gumashta wrote:
 
 ---
 This is an automatically generated e-mail. To reply, visit:
 https://reviews.apache.org/r/15151/
 ---
 
 (Updated Nov. 11, 2013, 7:23 p.m.)
 
 
 Review request for hive, Prasad Mujumdar and Thejas Nair.
 
 
 Bugs: HIVE-5230
 https://issues.apache.org/jira/browse/HIVE-5230
 
 
 Repository: hive-git
 
 
 Description
 ---
 
 [HIVE-4617|https://issues.apache.org/jira/browse/HIVE-4617] provides support 
 for async execution in HS2. When a background thread gets an error, currently 
 the client can only poll for the operation state and also the error with its 
 stacktrace is logged. However, it

Re: Review Request 15151: Better error reporting by async threads in HiveServer2

2013-11-12 Thread Vaibhav Gumashta



 On Nov. 11, 2013, 7:54 p.m., Thejas Nair wrote:
  service/if/TCLIService.thrift, line 917
  https://reviews.apache.org/r/15151/diff/4/?file=382149#file382149line917
 
  This will not be backward compatible. It would be better to add a new 
  optional field in the response, that has the additional error information.
 

Fixed. Thanks for pointing out!


- Vaibhav


---
This is an automatically generated e-mail. To reply, visit:
https://reviews.apache.org/r/15151/#review28683
---


On Nov. 11, 2013, 7:23 p.m., Vaibhav Gumashta wrote:
 
 ---
 This is an automatically generated e-mail. To reply, visit:
 https://reviews.apache.org/r/15151/
 ---
 
 (Updated Nov. 11, 2013, 7:23 p.m.)
 
 
 Review request for hive, Prasad Mujumdar and Thejas Nair.
 
 
 Bugs: HIVE-5230
 https://issues.apache.org/jira/browse/HIVE-5230
 
 
 Repository: hive-git
 
 
 Description
 ---
 
 [HIVE-4617|https://issues.apache.org/jira/browse/HIVE-4617] provides support 
 for async execution in HS2. When a background thread gets an error, currently 
 the client can only poll for the operation state and also the error with its 
 stacktrace is logged. However, it will be useful to provide a richer error 
 response like thrift API does with TStatus (which is constructed while 
 building a Thrift response object). 
 
 
 Diffs
 -
 
   
 itests/hive-unit/src/test/java/org/apache/hive/service/cli/CLIServiceTest.java
  PRE-CREATION 
   
 itests/hive-unit/src/test/java/org/apache/hive/service/cli/thrift/ThriftCLIServiceTest.java
  PRE-CREATION 
   jdbc/src/java/org/apache/hive/jdbc/HiveStatement.java fce19bf 
   service/if/TCLIService.thrift 1f49445 
   service/src/java/org/apache/hive/service/cli/CLIService.java 8c85386 
   service/src/java/org/apache/hive/service/cli/CLIServiceClient.java 14ef54f 
   service/src/java/org/apache/hive/service/cli/EmbeddedCLIServiceClient.java 
 9dca874 
   service/src/java/org/apache/hive/service/cli/HiveSQLException.java 74e8b94 
   service/src/java/org/apache/hive/service/cli/ICLIService.java f647ce6 
   service/src/java/org/apache/hive/service/cli/OperationState.java 1ec6bd1 
   service/src/java/org/apache/hive/service/cli/OperationStatus.java 
 PRE-CREATION 
   service/src/java/org/apache/hive/service/cli/operation/Operation.java 
 6f4b8dc 
   
 service/src/java/org/apache/hive/service/cli/operation/OperationManager.java 
 bcdb67f 
   service/src/java/org/apache/hive/service/cli/operation/SQLOperation.java 
 4ee1b74 
   service/src/java/org/apache/hive/service/cli/thrift/ThriftCLIService.java 
 9df110e 
   
 service/src/java/org/apache/hive/service/cli/thrift/ThriftCLIServiceClient.java
  9bb2a0f 
   service/src/test/org/apache/hive/service/cli/CLIServiceTest.java cd9d99a 
   
 service/src/test/org/apache/hive/service/cli/thrift/ThriftCLIServiceTest.java 
 ff7166d 
 
 Diff: https://reviews.apache.org/r/15151/diff/
 
 
 Testing
 ---
 
 
 Thanks,
 
 Vaibhav Gumashta

Re: Review Request 15462: HIVE-5717 Generate javadoc and source jars

2013-11-12 Thread Szehon Ho


---
This is an automatically generated e-mail. To reply, visit:
https://reviews.apache.org/r/15462/
---

(Updated Nov. 12, 2013, 11:05 p.m.)


Review request for hive.


Changes
---

Using separate profiles for these build goals, to not affect dist profile's 
build time.


Bugs: HIVE-5717
https://issues.apache.org/jira/browse/HIVE-5717


Repository: hive-git


Description
---

Add maven javadoc and sources goal to the hive build.


Diffs (updated)
-

  pom.xml 05d887f 

Diff: https://reviews.apache.org/r/15462/diff/


Testing
---


Thanks,

Szehon Ho

Re: Review Request 11925: Hive-3159 Update AvroSerde to determine schema of new tables

2013-11-12 Thread Mohammad Islam



 On July 29, 2013, 5:41 p.m., Jakob Homan wrote:
  There is still no text covering a map-reduce job on an already existing, 
  non-Avro table into an avro table.  ie, create a text table, populate it, 
  run a CTAS to manipulate the data into an Avro table.
 
 Mohammad Islam wrote:
 In general, Hive creates internal column names such as col0, col1 etc. 
 Due to this, I didn't able to copy non-avro data to avro data and run select 
 SQL. Only option is to change the current behavior to reuse the provided 
 column names. Separate JIRA regarding this could be a choice.
 

 
 Jakob Homan wrote:
 Wouldn't select * or using the new column names (they're named 
 deterministically) work?  This is a major test since otherwise we're missing 
 the most important code path...
 ie
 have a text file c1, c2, c3
 create table t1
 load data into t1 from text file
 create table a1 as select c3, c2 where c2 = foo order by c3;
 select * from a1;
 describe extended a1;
 
 And verify in the q file's result that the table is avro and that the 
 correct rows and columns got converted.

I agree that we need to support this use case as well.
Current patch works on two other uses case.

Created a separate JIRA to handle this use case that I suspect is a slightly 
different issue.
New JIRA: https://issues.apache.org/jira/browse/HIVE-5803


- Mohammad


---
This is an automatically generated e-mail. To reply, visit:
https://reviews.apache.org/r/11925/#review24149
---


On Aug. 8, 2013, 12:24 a.m., Mohammad Islam wrote:
 
 ---
 This is an automatically generated e-mail. To reply, visit:
 https://reviews.apache.org/r/11925/
 ---
 
 (Updated Aug. 8, 2013, 12:24 a.m.)
 
 
 Review request for hive, Ashutosh Chauhan and Jakob Homan.
 
 
 Bugs: HIVE-3159
 https://issues.apache.org/jira/browse/HIVE-3159
 
 
 Repository: hive-git
 
 
 Description
 ---
 
 Problem:
 Hive doesn't support to create a Avro-based table using HQL create table 
 command. It currently requires to specify Avro schema literal or schema file 
 name.
 For multiple cases, it is very inconvenient for user.
 Some of the un-supported use cases:
 1. Create table ... Avro-SERDE etc. as SELECT ... from NON-AVRO FILE
 2. Create table ... Avro-SERDE etc. as SELECT from AVRO TABLE
 3. Create  table  without specifying Avro schema.
 
 
 Diffs
 -
 
   ql/src/test/queries/clientpositive/avro_create_as_select.q PRE-CREATION 
   ql/src/test/queries/clientpositive/avro_create_as_select2.q PRE-CREATION 
   ql/src/test/queries/clientpositive/avro_no_schema_test.q PRE-CREATION 
   ql/src/test/queries/clientpositive/avro_without_schema.q PRE-CREATION 
   ql/src/test/results/clientpositive/avro_create_as_select.q.out PRE-CREATION 
   ql/src/test/results/clientpositive/avro_create_as_select2.q.out 
 PRE-CREATION 
   ql/src/test/results/clientpositive/avro_no_schema_test.q.out PRE-CREATION 
   ql/src/test/results/clientpositive/avro_without_schema.q.out PRE-CREATION 
   serde/src/java/org/apache/hadoop/hive/serde2/avro/AvroSerdeUtils.java 
 13848b6 
   serde/src/java/org/apache/hadoop/hive/serde2/avro/TypeInfoToSchema.java 
 PRE-CREATION 
   serde/src/test/org/apache/hadoop/hive/serde2/avro/TestAvroSerdeUtils.java 
 010f614 
   serde/src/test/org/apache/hadoop/hive/serde2/avro/TestTypeInfoToSchema.java 
 PRE-CREATION 
 
 Diff: https://reviews.apache.org/r/11925/diff/
 
 
 Testing
 ---
 
 Wrote a new java Test class for a new Java class. Added a new test case into 
 existing java test class. In addition, there are 4 .q file for testing 
 multiple use-cases.
 
 
 Thanks,
 
 Mohammad Islam

Re: Review Request 15151: Better error reporting by async threads in HiveServer2

2013-11-12 Thread Vaibhav Gumashta


---
This is an automatically generated e-mail. To reply, visit:
https://reviews.apache.org/r/15151/
---

(Updated Nov. 12, 2013, 11:06 p.m.)


Review request for hive, Carl Steinbach, Prasad Mujumdar, and Thejas Nair.


Bugs: HIVE-5230
https://issues.apache.org/jira/browse/HIVE-5230


Repository: hive-git


Description
---

[HIVE-4617|https://issues.apache.org/jira/browse/HIVE-4617] provides support 
for async execution in HS2. When a background thread gets an error, currently 
the client can only poll for the operation state and also the error with its 
stacktrace is logged. However, it will be useful to provide a richer error 
response like thrift API does with TStatus (which is constructed while building 
a Thrift response object). 


Diffs
-

  
itests/hive-unit/src/test/java/org/apache/hive/service/cli/CLIServiceTest.java 
PRE-CREATION 
  
itests/hive-unit/src/test/java/org/apache/hive/service/cli/thrift/ThriftCLIServiceTest.java
 PRE-CREATION 
  jdbc/src/java/org/apache/hive/jdbc/HiveStatement.java fce19bf 
  service/if/TCLIService.thrift 1f49445 
  service/src/java/org/apache/hive/service/cli/CLIService.java 8c85386 
  service/src/java/org/apache/hive/service/cli/CLIServiceClient.java 14ef54f 
  service/src/java/org/apache/hive/service/cli/EmbeddedCLIServiceClient.java 
9dca874 
  service/src/java/org/apache/hive/service/cli/HiveSQLException.java 74e8b94 
  service/src/java/org/apache/hive/service/cli/ICLIService.java f647ce6 
  service/src/java/org/apache/hive/service/cli/OperationState.java 1ec6bd1 
  service/src/java/org/apache/hive/service/cli/OperationStatus.java 
PRE-CREATION 
  service/src/java/org/apache/hive/service/cli/operation/Operation.java 6f4b8dc 
  service/src/java/org/apache/hive/service/cli/operation/OperationManager.java 
bcdb67f 
  service/src/java/org/apache/hive/service/cli/operation/SQLOperation.java 
4ee1b74 
  service/src/java/org/apache/hive/service/cli/thrift/ThriftCLIService.java 
9df110e 
  
service/src/java/org/apache/hive/service/cli/thrift/ThriftCLIServiceClient.java 
9bb2a0f 
  service/src/test/org/apache/hive/service/cli/CLIServiceTest.java cd9d99a 
  service/src/test/org/apache/hive/service/cli/thrift/ThriftCLIServiceTest.java 
ff7166d 

Diff: https://reviews.apache.org/r/15151/diff/


Testing
---


Thanks,

Vaibhav Gumashta

[jira] [Updated] (HIVE-5717) Generate javadoc and source jars


 [ 
https://issues.apache.org/jira/browse/HIVE-5717?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Szehon Ho updated HIVE-5717:


Attachment: HIVE-5717.2.patch

Yep, that sounds better.

 Generate javadoc and source jars
 

 Key: HIVE-5717
 URL: https://issues.apache.org/jira/browse/HIVE-5717
 Project: Hive
  Issue Type: Sub-task
Reporter: Brock Noland
Assignee: Szehon Ho
 Attachments: HIVE-5717.2.patch, HIVE-5717.patch


 We should be generating both javadoc and source jars.



--
This message was sent by Atlassian JIRA
(v6.1#6144)

[jira] [Updated] (HIVE-5717) Generate javadoc and source jars


 [ 
https://issues.apache.org/jira/browse/HIVE-5717?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Szehon Ho updated HIVE-5717:


Status: Patch Available  (was: Open)

 Generate javadoc and source jars
 

 Key: HIVE-5717
 URL: https://issues.apache.org/jira/browse/HIVE-5717
 Project: Hive
  Issue Type: Sub-task
Reporter: Brock Noland
Assignee: Szehon Ho
 Attachments: HIVE-5717.2.patch, HIVE-5717.patch


 We should be generating both javadoc and source jars.



--
This message was sent by Atlassian JIRA
(v6.1#6144)

[jira] [Updated] (HIVE-3159) Update AvroSerde to determine schema of new tables


 [ 
https://issues.apache.org/jira/browse/HIVE-3159?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Mohammad Kamrul Islam updated HIVE-3159:


Attachment: HIVE-3159.4.patch

 Update AvroSerde to determine schema of new tables
 --

 Key: HIVE-3159
 URL: https://issues.apache.org/jira/browse/HIVE-3159
 Project: Hive
  Issue Type: Improvement
  Components: Serializers/Deserializers
Affects Versions: 0.10.0, 0.11.0
Reporter: Jakob Homan
Assignee: Mohammad Kamrul Islam
 Attachments: HIVE-3159.4.patch, HIVE-3159v1.patch


 Currently when writing tables to Avro one must manually provide an Avro 
 schema that matches what is being delivered by Hive. It'd be better to have 
 the serde infer this schema by converting the table's TypeInfo into an 
 appropriate AvroSchema.



--
This message was sent by Atlassian JIRA
(v6.1#6144)

[jira] [Updated] (HIVE-3159) Update AvroSerde to determine schema of new tables


 [ 
https://issues.apache.org/jira/browse/HIVE-3159?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Mohammad Kamrul Islam updated HIVE-3159:


Affects Version/s: (was: 0.11.0)
   (was: 0.10.0)
   Status: Patch Available  (was: Open)

 Update AvroSerde to determine schema of new tables
 --

 Key: HIVE-3159
 URL: https://issues.apache.org/jira/browse/HIVE-3159
 Project: Hive
  Issue Type: Improvement
  Components: Serializers/Deserializers
Reporter: Jakob Homan
Assignee: Mohammad Kamrul Islam
 Attachments: HIVE-3159.4.patch, HIVE-3159v1.patch


 Currently when writing tables to Avro one must manually provide an Avro 
 schema that matches what is being delivered by Hive. It'd be better to have 
 the serde infer this schema by converting the table's TypeInfo into an 
 appropriate AvroSchema.



--
This message was sent by Atlassian JIRA
(v6.1#6144)

[jira] [Updated] (HIVE-5745) TestHiveLogging is failing (at least on mac)


 [ 
https://issues.apache.org/jira/browse/HIVE-5745?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Ashutosh Chauhan updated HIVE-5745:
---

Fix Version/s: 0.13.0

 TestHiveLogging is failing (at least on mac)
 

 Key: HIVE-5745
 URL: https://issues.apache.org/jira/browse/HIVE-5745
 Project: Hive
  Issue Type: Bug
Reporter: Gunther Hagleitner
Assignee: Gunther Hagleitner
 Fix For: 0.13.0

 Attachments: HIVE-5745.1.patch


 The path for the log file on my mac contains two slashes. That causes mvn 
 install fail.



--
This message was sent by Atlassian JIRA
(v6.1#6144)

[jira] [Commented] (HIVE-5801) Support for reader/writer of ORC format for R environment

2013-11-12 Thread Andrew Psaltis (JIRA)


[ 
https://issues.apache.org/jira/browse/HIVE-5801?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13820640#comment-13820640
 ] 

Andrew Psaltis commented on HIVE-5801:
--

With our use case, being able to read and write ORC files outside of Hive is 
very desirable.  We need to read and or write ORC files from both Hadoop Map 
Reduce jobs and more often than not outside of Hadoop. Having no Hive 
installations or Hive Metastore, we really do not have a desire to take on the 
HCatalog layer integration and the overhead that comes with that.  To date we 
have maintained a patched version of the hive-exec jar that exposes the ORC 
related classes so that we can use it. Obviously this comes with it's own 
burden, and just recently we have been evaluating using Parquet over ORC, 
primarily for this very reason. However, we keep coming back to ORC, as some of 
the features better suit our use case.

I would be more than happy to attach a patch to this JIRA, if that is something 
is desired.



 Support for reader/writer of ORC format for R environment
 -

 Key: HIVE-5801
 URL: https://issues.apache.org/jira/browse/HIVE-5801
 Project: Hive
  Issue Type: Improvement
Reporter: Michael Hausenblas
Priority: Minor

 It would be great if the ORC format would directly be accessible from R [1], 
 that is, providing reader/writer for it.
 [1] http://www.r-project.org/



--
This message was sent by Atlassian JIRA
(v6.1#6144)

[jira] [Created] (HIVE-5804) Support expressions in GBY attributes, JOIN condition etc.

2013-11-12 Thread Prasanth J (JIRA)

Prasanth J created HIVE-5804:


 Summary: Support expressions in GBY attributes, JOIN condition etc.
 Key: HIVE-5804
 URL: https://issues.apache.org/jira/browse/HIVE-5804
 Project: Hive
  Issue Type: Sub-task
Reporter: Prasanth J
Assignee: Prasanth J


Support for expression in group by, join conditions are still not complete. 



--
This message was sent by Atlassian JIRA
(v6.1#6144)

[jira] [Updated] (HIVE-5369) Annotate hive operator tree with statistics from metastore

2013-11-12 Thread Prasanth J (JIRA)

[
https://issues.apache.org/jira/browse/HIVE-5369?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]

Prasanth J updated HIVE-5369:
-

Attachment: HIVE-5369.4.patch.txt

Refactored patch. Removed dependency from lineage index. Statistics (along with
column statistics) are now passed down the operator tree. Still there are few
open issues. Will create sub-tasks for open issues.

Annotate hive operator tree with statistics from metastore
--

Key: HIVE-5369
URL: https://issues.apache.org/jira/browse/HIVE-5369
Project: Hive
Issue Type: New Feature
Components: Query Processor, Statistics
Affects Versions: 0.13.0
Reporter: Prasanth J
Assignee: Prasanth J
Labels: statistics
Fix For: 0.13.0

Attachments: HIVE-5369.1.txt, HIVE-5369.2.WIP.txt,
HIVE-5369.2.patch.txt, HIVE-5369.3.patch.txt, HIVE-5369.4.patch.txt,
HIVE-5369.WIP.txt, HIVE-5369.refactor.WIP.txt

Currently the statistics gathered at table/partition level and column level
are not used during query planning stage. Statistics at table/partition and
column level can be used for optimizing the query plans. Basic statistics
like uncompressed data size can be used for better reducer estimation. Other
statistics like number of rows, distinct values of columns, average length of
columns etc. can be used by Cost Based Optimizer (CBO) for making better
query plan selection. As a first step in improving query planning the
statistics that are available in the metastore should be attached to hive
operator tree. The operator tree should be walked and annotated with
statistics information. The attached statistics will vary for each operator
depending on the operation it performs. For example, select operator will
change the average row size but doesn't affect the number of rows. Similarly
filter operator will change the number of rows but doesn't change the average
row size. Similar rules can be applied for other operators as well.
Rules for different operators are added as comments in the code. For more
detailed information, the reference book that I am using is Database
Systems: The Complete Book by Garcia-Molina et.al.

--
This message was sent by Atlassian JIRA
(v6.1#6144)

[jira] [Created] (HIVE-5805) Support for operators like PTF, Script, Extract etc. in statistics annotation.

2013-11-12 Thread Prasanth J (JIRA)

Prasanth J created HIVE-5805:


 Summary: Support for operators like PTF, Script, Extract etc. in 
statistics annotation.
 Key: HIVE-5805
 URL: https://issues.apache.org/jira/browse/HIVE-5805
 Project: Hive
  Issue Type: Sub-task
Reporter: Prasanth J


Statistics annotation (HIVE-5369) only supports table scan, select, filter, 
limit, union, groupby, join. This sub task is to add support for remaining 
operators.



--
This message was sent by Atlassian JIRA
(v6.1#6144)

[jira] [Commented] (HIVE-4388) Upgrade HBase to 0.96


[ 
https://issues.apache.org/jira/browse/HIVE-4388?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13820680#comment-13820680
 ] 

Gunther Hagleitner commented on HIVE-4388:
--

+1 It'd be great to get that in. Minor thing: The hamcrest dependency I added 
is probably no longer needed with your fix of adding junit as a dependency. We 
can address that in a follow up though.

 Upgrade HBase to 0.96
 -

 Key: HIVE-4388
 URL: https://issues.apache.org/jira/browse/HIVE-4388
 Project: Hive
  Issue Type: Bug
  Components: HBase Handler
Reporter: Gunther Hagleitner
Assignee: Brock Noland
 Attachments: HIVE-4388-wip.txt, HIVE-4388.10.patch, 
 HIVE-4388.11.patch, HIVE-4388.12.patch, HIVE-4388.13.patch, 
 HIVE-4388.14.patch, HIVE-4388.15.patch, HIVE-4388.15.patch, 
 HIVE-4388.16.patch, HIVE-4388.17.patch, HIVE-4388.patch, HIVE-4388.patch, 
 HIVE-4388.patch, HIVE-4388.patch, HIVE-4388.patch, HIVE-4388.patch, 
 HIVE-4388.patch, HIVE-4388.patch


 Currently we're building by default against 0.92. When you run against hadoop 
 2 (-Dhadoop.mr.rev=23) builds fail because of: HBASE-5963.
 HIVE-3861 upgrades the version of hbase used. This will get you past the 
 problem in HBASE-5963 (which was fixed in 0.94.1) but fails with: HBASE-6396.



--
This message was sent by Atlassian JIRA
(v6.1#6144)

[jira] [Updated] (HIVE-4518) Counter Strike: Operation Operator

2013-11-12 Thread Jason Dere (JIRA)

[
https://issues.apache.org/jira/browse/HIVE-4518?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]

Jason Dere updated HIVE-4518:
-

Attachment: HIVE-4518.8.patch

doesn't look like pre-commit tests ran from the last patch .. uploading patch
again to run to kick off another test run.

Counter Strike: Operation Operator
--

Key: HIVE-4518
URL: https://issues.apache.org/jira/browse/HIVE-4518
Project: Hive
Issue Type: Improvement
Reporter: Gunther Hagleitner
Assignee: Gunther Hagleitner
Attachments: HIVE-4518.1.patch, HIVE-4518.2.patch, HIVE-4518.3.patch,
HIVE-4518.4.patch, HIVE-4518.5.patch, HIVE-4518.6.patch.txt,
HIVE-4518.7.patch, HIVE-4518.8.patch

Queries of the form:
from foo
insert overwrite table bar partition (p) select ...
insert overwrite table bar partition (p) select ...
insert overwrite table bar partition (p) select ...
Generate a huge amount of counters. The reason is that task.progress is
turned on for dynamic partitioning queries.
The counters not only make queries slower than necessary (up to 50%) you will
also eventually run out. That's because we're wrapping them in enum values to
comply with hadoop 0.17.
The real reason we turn task.progress on is that we need CREATED_FILES and
FATAL counters to ensure dynamic partitioning queries don't go haywire.
The counters have counter-intuitive names like C1 through C1000 and don't
seem really useful by themselves.
With hadoop 20+ you don't need to wrap the counters anymore, each operator
can simply create and increment counters. That should simplify the code a lot.

--
This message was sent by Atlassian JIRA
(v6.1#6144)

[jira] [Commented] (HIVE-4388) Upgrade HBase to 0.96

2013-11-12 Thread Sushanth Sowmyan (JIRA)


[ 
https://issues.apache.org/jira/browse/HIVE-4388?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13820716#comment-13820716
 ] 

Sushanth Sowmyan commented on HIVE-4388:


+1 from me as well. Code changes look unchanged and okay. Build changes look 
okay, and builds okay. Tests are obviously reflected by the automated build. 
Looks great! :)

 Upgrade HBase to 0.96
 -

 Key: HIVE-4388
 URL: https://issues.apache.org/jira/browse/HIVE-4388
 Project: Hive
  Issue Type: Bug
  Components: HBase Handler
Reporter: Gunther Hagleitner
Assignee: Brock Noland
 Attachments: HIVE-4388-wip.txt, HIVE-4388.10.patch, 
 HIVE-4388.11.patch, HIVE-4388.12.patch, HIVE-4388.13.patch, 
 HIVE-4388.14.patch, HIVE-4388.15.patch, HIVE-4388.15.patch, 
 HIVE-4388.16.patch, HIVE-4388.17.patch, HIVE-4388.patch, HIVE-4388.patch, 
 HIVE-4388.patch, HIVE-4388.patch, HIVE-4388.patch, HIVE-4388.patch, 
 HIVE-4388.patch, HIVE-4388.patch


 Currently we're building by default against 0.92. When you run against hadoop 
 2 (-Dhadoop.mr.rev=23) builds fail because of: HBASE-5963.
 HIVE-3861 upgrades the version of hbase used. This will get you past the 
 problem in HBASE-5963 (which was fixed in 0.94.1) but fails with: HBASE-6396.



--
This message was sent by Atlassian JIRA
(v6.1#6144)

[jira] [Commented] (HIVE-5717) Generate javadoc and source jars

2013-11-12 Thread Lefty Leverenz (JIRA)


[ 
https://issues.apache.org/jira/browse/HIVE-5717?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13820734#comment-13820734
 ] 

Lefty Leverenz commented on HIVE-5717:
--

Will the javadocs include HCatalog?  In the Hive 0.12 javadocs, the only HCat 
docs I can find are for shims.

 Generate javadoc and source jars
 

 Key: HIVE-5717
 URL: https://issues.apache.org/jira/browse/HIVE-5717
 Project: Hive
  Issue Type: Sub-task
Reporter: Brock Noland
Assignee: Szehon Ho
 Attachments: HIVE-5717.2.patch, HIVE-5717.patch


 We should be generating both javadoc and source jars.



--
This message was sent by Atlassian JIRA
(v6.1#6144)

[jira] [Commented] (HIVE-5731) Use new GenericUDF instead of basic UDF for UDFDate* classes


[ 
https://issues.apache.org/jira/browse/HIVE-5731?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13820736#comment-13820736
 ] 

Mohammad Kamrul Islam commented on HIVE-5731:
-

RB Updated.

 Use new GenericUDF instead of basic UDF for UDFDate* classes 
 -

 Key: HIVE-5731
 URL: https://issues.apache.org/jira/browse/HIVE-5731
 Project: Hive
  Issue Type: Improvement
Reporter: Mohammad Kamrul Islam
Assignee: Mohammad Kamrul Islam
 Attachments: HIVE-5731.1.patch, HIVE-5731.2.patch, HIVE-5731.3.patch, 
 HIVE-5731.4.patch, HIVE-5731.5.patch


 GenericUDF class is the latest and recommended base class for any UDFs.
 This JIRA is to change the current UDFDate* classes extended from GenericUDF.
 The general benefit of GenericUDF is described in comments as
 * The GenericUDF are superior to normal UDFs in the following ways: 1. It can
 accept arguments of complex types, and return complex types. 2. It can 
 accept
 variable length of arguments. 3. It can accept an infinite number of 
 function
 signature - for example, it's easy to write a GenericUDF that accepts
 arrayint, arrayarrayint and so on (arbitrary levels of nesting). 4. 
 It
 can do short-circuit evaluations using DeferedObject.



--
This message was sent by Atlassian JIRA
(v6.1#6144)

[jira] [Commented] (HIVE-4388) Upgrade HBase to 0.96


[ 
https://issues.apache.org/jira/browse/HIVE-4388?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13820744#comment-13820744
 ] 

Brock Noland commented on HIVE-4388:


Thanks guys!

 Upgrade HBase to 0.96
 -

 Key: HIVE-4388
 URL: https://issues.apache.org/jira/browse/HIVE-4388
 Project: Hive
  Issue Type: Bug
  Components: HBase Handler
Reporter: Gunther Hagleitner
Assignee: Brock Noland
 Attachments: HIVE-4388-wip.txt, HIVE-4388.10.patch, 
 HIVE-4388.11.patch, HIVE-4388.12.patch, HIVE-4388.13.patch, 
 HIVE-4388.14.patch, HIVE-4388.15.patch, HIVE-4388.15.patch, 
 HIVE-4388.16.patch, HIVE-4388.17.patch, HIVE-4388.patch, HIVE-4388.patch, 
 HIVE-4388.patch, HIVE-4388.patch, HIVE-4388.patch, HIVE-4388.patch, 
 HIVE-4388.patch, HIVE-4388.patch


 Currently we're building by default against 0.92. When you run against hadoop 
 2 (-Dhadoop.mr.rev=23) builds fail because of: HBASE-5963.
 HIVE-3861 upgrades the version of hbase used. This will get you past the 
 problem in HBASE-5963 (which was fixed in 0.94.1) but fails with: HBASE-6396.



--
This message was sent by Atlassian JIRA
(v6.1#6144)

[jira] [Commented] (HIVE-5801) Support for reader/writer of ORC format for R environment

2013-11-12 Thread Owen O'Malley (JIRA)


[ 
https://issues.apache.org/jira/browse/HIVE-5801?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13820750#comment-13820750
 ] 

Owen O'Malley commented on HIVE-5801:
-

Have you seen HIVE-5728? It is making some changes to the ORC reader/writer so 
that Pig can add native load/store functions. What else do you require?

 Support for reader/writer of ORC format for R environment
 -

 Key: HIVE-5801
 URL: https://issues.apache.org/jira/browse/HIVE-5801
 Project: Hive
  Issue Type: Improvement
Reporter: Michael Hausenblas
Priority: Minor

 It would be great if the ORC format would directly be accessible from R [1], 
 that is, providing reader/writer for it.
 [1] http://www.r-project.org/



--
This message was sent by Atlassian JIRA
(v6.1#6144)

[jira] [Commented] (HIVE-5801) Support for reader/writer of ORC format for R environment

2013-11-12 Thread Carter Shanklin (JIRA)


[ 
https://issues.apache.org/jira/browse/HIVE-5801?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13820769#comment-13820769
 ] 

Carter Shanklin commented on HIVE-5801:
---

Haven't seen that, thanks.

Is there a way people can use that before Hive 13? Say on the CDH Hive 12
+ a billion patches thing or on older HDP?







-- 
CONFIDENTIALITY NOTICE
NOTICE: This message is intended for the use of the individual or entity to 
which it is addressed and may contain information that is confidential, 
privileged and exempt from disclosure under applicable law. If the reader 
of this message is not the intended recipient, you are hereby notified that 
any printing, copying, dissemination, distribution, disclosure or 
forwarding of this communication is strictly prohibited. If you have 
received this communication in error, please contact the sender immediately 
and delete it from your system. Thank You.


 Support for reader/writer of ORC format for R environment
 -

 Key: HIVE-5801
 URL: https://issues.apache.org/jira/browse/HIVE-5801
 Project: Hive
  Issue Type: Improvement
Reporter: Michael Hausenblas
Priority: Minor

 It would be great if the ORC format would directly be accessible from R [1], 
 that is, providing reader/writer for it.
 [1] http://www.r-project.org/



--
This message was sent by Atlassian JIRA
(v6.1#6144)

[jira] [Updated] (HIVE-5756) Implement vectorization support for IF conditional expression for long and double inputs


 [ 
https://issues.apache.org/jira/browse/HIVE-5756?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Eric Hanson updated HIVE-5756:
--

Summary: Implement vectorization support for IF conditional expression for 
long and double inputs  (was: Implement vectorization support for IF 
conditional expression)

 Implement vectorization support for IF conditional expression for long and 
 double inputs
 

 Key: HIVE-5756
 URL: https://issues.apache.org/jira/browse/HIVE-5756
 Project: Hive
  Issue Type: Sub-task
Reporter: Eric Hanson
Assignee: Eric Hanson

 Implement full, end-to-end support for IF in vectorized mode, including new 
 VectorExpression class(es), VectorizationContext translation to a 
 VectorExpression, and unit tests for these, as well as end-to-end ad hoc 
 testing. An end-to-end .q test is recommended but optional.
 This is high priority because IF is the most popular conditional expression.



--
This message was sent by Atlassian JIRA
(v6.1#6144)

[jira] [Updated] (HIVE-5799) session/operation timeout for hiveserver2


 [ 
https://issues.apache.org/jira/browse/HIVE-5799?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Navis updated HIVE-5799:


Status: Open  (was: Patch Available)

 session/operation timeout for hiveserver2
 -

 Key: HIVE-5799
 URL: https://issues.apache.org/jira/browse/HIVE-5799
 Project: Hive
  Issue Type: Improvement
  Components: HiveServer2
Reporter: Navis
Assignee: Navis
Priority: Minor
 Attachments: HIVE-5799.1.patch.txt


 Need some timeout facility for preventing resource leakages from instable  or 
 bad clients.



--
This message was sent by Atlassian JIRA
(v6.1#6144)

Re: Review Request 15449: session/operation timeout for hiveserver2

2013-11-12 Thread Navis Ryu


---
This is an automatically generated e-mail. To reply, visit:
https://reviews.apache.org/r/15449/
---

(Updated Nov. 13, 2013, 1:21 a.m.)


Review request for hive.


Bugs: HIVE-5799
https://issues.apache.org/jira/browse/HIVE-5799


Repository: hive-git


Description
---

Need some timeout facility for preventing resource leakages from instable or 
bad clients.


Diffs (updated)
-

  common/src/java/org/apache/hadoop/hive/conf/HiveConf.java 32ab3d8 
  conf/hive-default.xml.template c574ab5 
  service/src/java/org/apache/hive/service/cli/OperationState.java 1ec6bd1 
  service/src/java/org/apache/hive/service/cli/operation/Operation.java 6f4b8dc 
  service/src/java/org/apache/hive/service/cli/session/HiveSession.java 00058cc 
  service/src/java/org/apache/hive/service/cli/session/HiveSessionImpl.java 
cfda752 
  service/src/java/org/apache/hive/service/cli/session/SessionManager.java 
25c6f38 

Diff: https://reviews.apache.org/r/15449/diff/


Testing
---

Confirmed in the local environment.


Thanks,

Navis Ryu

[jira] [Commented] (HIVE-5756) Implement vectorization support for IF conditional expression for long and double inputs


[ 
https://issues.apache.org/jira/browse/HIVE-5756?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13820779#comment-13820779
 ] 

Eric Hanson commented on HIVE-5756:
---

This was getting big so I am going to do a separate JIRA or JIRAs for string, 
timestamp, and Boolean types.

 Implement vectorization support for IF conditional expression for long and 
 double inputs
 

 Key: HIVE-5756
 URL: https://issues.apache.org/jira/browse/HIVE-5756
 Project: Hive
  Issue Type: Sub-task
Reporter: Eric Hanson
Assignee: Eric Hanson
 Attachments: HIVE-5756.1.patch


 Implement full, end-to-end support for IF in vectorized mode, including new 
 VectorExpression class(es), VectorizationContext translation to a 
 VectorExpression, and unit tests for these, as well as end-to-end ad hoc 
 testing. An end-to-end .q test is recommended but optional.
 This is high priority because IF is the most popular conditional expression.



--
This message was sent by Atlassian JIRA
(v6.1#6144)

[jira] [Updated] (HIVE-5756) Implement vectorization support for IF conditional expression for long and double inputs


 [ 
https://issues.apache.org/jira/browse/HIVE-5756?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Eric Hanson updated HIVE-5756:
--

Attachment: HIVE-5756.1.patch

Includes unit tests for VectorExpressions and VectorizationContext. I also ran 
end-to-end ad hoc tests to verify IF works in vectorized mode from SQL.

 Implement vectorization support for IF conditional expression for long and 
 double inputs
 

 Key: HIVE-5756
 URL: https://issues.apache.org/jira/browse/HIVE-5756
 Project: Hive
  Issue Type: Sub-task
Reporter: Eric Hanson
Assignee: Eric Hanson
 Attachments: HIVE-5756.1.patch


 Implement full, end-to-end support for IF in vectorized mode, including new 
 VectorExpression class(es), VectorizationContext translation to a 
 VectorExpression, and unit tests for these, as well as end-to-end ad hoc 
 testing. An end-to-end .q test is recommended but optional.
 This is high priority because IF is the most popular conditional expression.



--
This message was sent by Atlassian JIRA
(v6.1#6144)

[jira] [Updated] (HIVE-5799) session/operation timeout for hiveserver2


 [ 
https://issues.apache.org/jira/browse/HIVE-5799?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Navis updated HIVE-5799:


Attachment: HIVE-5799.2.patch.txt

 session/operation timeout for hiveserver2
 -

 Key: HIVE-5799
 URL: https://issues.apache.org/jira/browse/HIVE-5799
 Project: Hive
  Issue Type: Improvement
  Components: HiveServer2
Reporter: Navis
Assignee: Navis
Priority: Minor
 Attachments: HIVE-5799.1.patch.txt, HIVE-5799.2.patch.txt


 Need some timeout facility for preventing resource leakages from instable  or 
 bad clients.



--
This message was sent by Atlassian JIRA
(v6.1#6144)

[jira] [Updated] (HIVE-5756) Implement vectorization support for IF conditional expression for long and double inputs


 [ 
https://issues.apache.org/jira/browse/HIVE-5756?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Eric Hanson updated HIVE-5756:
--

Status: Patch Available  (was: In Progress)

 Implement vectorization support for IF conditional expression for long and 
 double inputs
 

 Key: HIVE-5756
 URL: https://issues.apache.org/jira/browse/HIVE-5756
 Project: Hive
  Issue Type: Sub-task
Reporter: Eric Hanson
Assignee: Eric Hanson
 Attachments: HIVE-5756.1.patch


 Implement full, end-to-end support for IF in vectorized mode, including new 
 VectorExpression class(es), VectorizationContext translation to a 
 VectorExpression, and unit tests for these, as well as end-to-end ad hoc 
 testing. An end-to-end .q test is recommended but optional.
 This is high priority because IF is the most popular conditional expression.



--
This message was sent by Atlassian JIRA
(v6.1#6144)

[jira] [Updated] (HIVE-5799) session/operation timeout for hiveserver2


 [ 
https://issues.apache.org/jira/browse/HIVE-5799?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Navis updated HIVE-5799:


Status: Patch Available  (was: Open)

 session/operation timeout for hiveserver2
 -

 Key: HIVE-5799
 URL: https://issues.apache.org/jira/browse/HIVE-5799
 Project: Hive
  Issue Type: Improvement
  Components: HiveServer2
Reporter: Navis
Assignee: Navis
Priority: Minor
 Attachments: HIVE-5799.1.patch.txt, HIVE-5799.2.patch.txt


 Need some timeout facility for preventing resource leakages from instable  or 
 bad clients.



--
This message was sent by Atlassian JIRA
(v6.1#6144)

[jira] [Created] (HIVE-5807) Implement vectorization support for IF conditional expression for string inputs

Eric Hanson created HIVE-5807:
-

 Summary: Implement vectorization support for IF conditional 
expression for string inputs
 Key: HIVE-5807
 URL: https://issues.apache.org/jira/browse/HIVE-5807
 Project: Hive
  Issue Type: Sub-task
Reporter: Eric Hanson






--
This message was sent by Atlassian JIRA
(v6.1#6144)

[jira] [Created] (HIVE-5806) Implement vectorization support for IF conditional expression for boolean and timestamp inputs

Eric Hanson created HIVE-5806:
-

 Summary: Implement vectorization support for IF conditional 
expression for boolean and timestamp inputs
 Key: HIVE-5806
 URL: https://issues.apache.org/jira/browse/HIVE-5806
 Project: Hive
  Issue Type: Sub-task
Reporter: Eric Hanson






--
This message was sent by Atlassian JIRA
(v6.1#6144)

[jira] [Commented] (HIVE-5806) Implement vectorization support for IF conditional expression for boolean and timestamp inputs


[ 
https://issues.apache.org/jira/browse/HIVE-5806?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13820785#comment-13820785
 ] 

Eric Hanson commented on HIVE-5806:
---

Try to piggy back this on the work for HIVE-5756.

 Implement vectorization support for IF conditional expression for boolean and 
 timestamp inputs
 --

 Key: HIVE-5806
 URL: https://issues.apache.org/jira/browse/HIVE-5806
 Project: Hive
  Issue Type: Sub-task
Reporter: Eric Hanson
Assignee: Eric Hanson





--
This message was sent by Atlassian JIRA
(v6.1#6144)

[jira] [Assigned] (HIVE-5806) Implement vectorization support for IF conditional expression for boolean and timestamp inputs


 [ 
https://issues.apache.org/jira/browse/HIVE-5806?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Eric Hanson reassigned HIVE-5806:
-

Assignee: Eric Hanson

 Implement vectorization support for IF conditional expression for boolean and 
 timestamp inputs
 --

 Key: HIVE-5806
 URL: https://issues.apache.org/jira/browse/HIVE-5806
 Project: Hive
  Issue Type: Sub-task
Reporter: Eric Hanson
Assignee: Eric Hanson





--
This message was sent by Atlassian JIRA
(v6.1#6144)

[jira] [Work started] (HIVE-5806) Implement vectorization support for IF conditional expression for boolean and timestamp inputs


 [ 
https://issues.apache.org/jira/browse/HIVE-5806?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Work on HIVE-5806 started by Eric Hanson.

 Implement vectorization support for IF conditional expression for boolean and 
 timestamp inputs
 --

 Key: HIVE-5806
 URL: https://issues.apache.org/jira/browse/HIVE-5806
 Project: Hive
  Issue Type: Sub-task
Reporter: Eric Hanson
Assignee: Eric Hanson





--
This message was sent by Atlassian JIRA
(v6.1#6144)

[jira] [Commented] (HIVE-5581) Implement vectorized year/month/day... etc. for string arguments


[ 
https://issues.apache.org/jira/browse/HIVE-5581?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13820788#comment-13820788
 ] 

Eric Hanson commented on HIVE-5581:
---

Teddy, when do you think you can update the patch?

 Implement vectorized year/month/day... etc. for string arguments
 

 Key: HIVE-5581
 URL: https://issues.apache.org/jira/browse/HIVE-5581
 Project: Hive
  Issue Type: Sub-task
  Components: Query Processor
Affects Versions: 0.13.0
Reporter: Eric Hanson
Assignee: Teddy Choi
 Attachments: HIVE-5581.1.patch.txt, HIVE-5581.2.patch, 
 HIVE-5581.3.patch


 Functions year(), month(), day(), weekofyear(), hour(), minute(), second() 
 need to be implemented for string arguments in vectorized mode. 
 They already work for timestamp arguments.



--
This message was sent by Atlassian JIRA
(v6.1#6144)

[jira] [Commented] (HIVE-5717) Generate javadoc and source jars


[ 
https://issues.apache.org/jira/browse/HIVE-5717?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13820789#comment-13820789
 ] 

Szehon Ho commented on HIVE-5717:
-

Yes, with this patch we would get that for free, thanks to hcatalog now being 
part of the same central maven hive build.

 Generate javadoc and source jars
 

 Key: HIVE-5717
 URL: https://issues.apache.org/jira/browse/HIVE-5717
 Project: Hive
  Issue Type: Sub-task
Reporter: Brock Noland
Assignee: Szehon Ho
 Attachments: HIVE-5717.2.patch, HIVE-5717.patch


 We should be generating both javadoc and source jars.



--
This message was sent by Atlassian JIRA
(v6.1#6144)

[jira] [Created] (HIVE-5808) broadcast join in tez discards duplicate records from the broadcasted table

Thejas M Nair created HIVE-5808:
---

 Summary: broadcast join in tez discards duplicate records from the 
broadcasted table
 Key: HIVE-5808
 URL: https://issues.apache.org/jira/browse/HIVE-5808
 Project: Hive
  Issue Type: Bug
  Components: Tez
Reporter: Thejas M Nair
Assignee: Thejas M Nair
 Fix For: tez-branch


When the small(broadcasted table) in a join has duplicate records for a key, 
they are getting discarded, resulting in incorrect output.




--
This message was sent by Atlassian JIRA
(v6.1#6144)

[jira] [Updated] (HIVE-5808) broadcast join in tez discards duplicate records from the broadcasted table


 [ 
https://issues.apache.org/jira/browse/HIVE-5808?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Thejas M Nair updated HIVE-5808:


Attachment: HIVE-5808.1.patch

 broadcast join in tez discards duplicate records from the broadcasted table
 ---

 Key: HIVE-5808
 URL: https://issues.apache.org/jira/browse/HIVE-5808
 Project: Hive
  Issue Type: Bug
  Components: Tez
Reporter: Thejas M Nair
Assignee: Thejas M Nair
 Fix For: tez-branch

 Attachments: HIVE-5808.1.patch


 When the small(broadcasted table) in a join has duplicate records for a key, 
 they are getting discarded, resulting in incorrect output.



--
This message was sent by Atlassian JIRA
(v6.1#6144)

[jira] [Updated] (HIVE-5808) broadcast join in tez discards duplicate records from the broadcasted table


 [ 
https://issues.apache.org/jira/browse/HIVE-5808?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Thejas M Nair updated HIVE-5808:


Status: Patch Available  (was: Open)

 broadcast join in tez discards duplicate records from the broadcasted table
 ---

 Key: HIVE-5808
 URL: https://issues.apache.org/jira/browse/HIVE-5808
 Project: Hive
  Issue Type: Bug
  Components: Tez
Reporter: Thejas M Nair
Assignee: Thejas M Nair
 Fix For: tez-branch

 Attachments: HIVE-5808.1.patch


 When the small(broadcasted table) in a join has duplicate records for a key, 
 they are getting discarded, resulting in incorrect output.



--
This message was sent by Atlassian JIRA
(v6.1#6144)

[jira] [Created] (HIVE-5809) incorrect stats in some cases with hive.stats.autogather=true

Ashutosh Chauhan created HIVE-5809:
--

 Summary: incorrect stats in some cases with 
hive.stats.autogather=true   
 Key: HIVE-5809
 URL: https://issues.apache.org/jira/browse/HIVE-5809
 Project: Hive
  Issue Type: Bug
  Components: Statistics
Reporter: Ashutosh Chauhan
Assignee: Ashutosh Chauhan






--
This message was sent by Atlassian JIRA
(v6.1#6144)

[jira] [Updated] (HIVE-5809) incorrect stats in some cases with hive.stats.autogather=true


 [ 
https://issues.apache.org/jira/browse/HIVE-5809?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Ashutosh Chauhan updated HIVE-5809:
---

Attachment: HIVE-5809.patch

preliminary patch

 incorrect stats in some cases with hive.stats.autogather=true   
 

 Key: HIVE-5809
 URL: https://issues.apache.org/jira/browse/HIVE-5809
 Project: Hive
  Issue Type: Bug
  Components: Statistics
Reporter: Ashutosh Chauhan
Assignee: Ashutosh Chauhan
 Attachments: HIVE-5809.patch






--
This message was sent by Atlassian JIRA
(v6.1#6144)

[jira] [Updated] (HIVE-5809) incorrect stats in some cases with hive.stats.autogather=true


 [ 
https://issues.apache.org/jira/browse/HIVE-5809?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Ashutosh Chauhan updated HIVE-5809:
---

Status: Patch Available  (was: Open)

 incorrect stats in some cases with hive.stats.autogather=true   
 

 Key: HIVE-5809
 URL: https://issues.apache.org/jira/browse/HIVE-5809
 Project: Hive
  Issue Type: Bug
  Components: Statistics
Reporter: Ashutosh Chauhan
Assignee: Ashutosh Chauhan
 Attachments: HIVE-5809.patch






--
This message was sent by Atlassian JIRA
(v6.1#6144)

[jira] [Resolved] (HIVE-4621) describe formatted for table shows incorrect numRows data


 [ 
https://issues.apache.org/jira/browse/HIVE-4621?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Ashutosh Chauhan resolved HIVE-4621.


Resolution: Invalid

Resolving it as invalid, since I can't repro this. Feel free to reopen if you 
can reproduce it. Also, provide steps to repro in case you are able to repro.

 describe formatted for table shows incorrect numRows data
 ---

 Key: HIVE-4621
 URL: https://issues.apache.org/jira/browse/HIVE-4621
 Project: Hive
  Issue Type: Bug
  Components: CLI
Affects Versions: 0.10.0
Reporter: Gwen Shapira

 Created table (tab delimited), used load data local to get data into the 
 table.
 After creating the table, describe formatted showed numRow as 0.
 select count(*) from... showed  43205 rows.
 Running describe formatted after select count(*) still shows numRows as 0.



--
This message was sent by Atlassian JIRA
(v6.1#6144)

[jira] [Updated] (HIVE-4632) Use hadoop counter as a stat publisher


 [ 
https://issues.apache.org/jira/browse/HIVE-4632?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Ashutosh Chauhan updated HIVE-4632:
---

Status: Open  (was: Patch Available)

This looks useful, [~navis] would you like to rebase this on latest trunk ?

 Use hadoop counter as a stat publisher
 --

 Key: HIVE-4632
 URL: https://issues.apache.org/jira/browse/HIVE-4632
 Project: Hive
  Issue Type: Improvement
  Components: Statistics
Affects Versions: 0.12.0
Reporter: Navis
Assignee: Navis
Priority: Minor
 Attachments: HIVE-4632.D11001.1.patch


 Currently stats are all long/aggregation type and can be safely acquired by 
 hadoop counter without other db or hbase.



--
This message was sent by Atlassian JIRA
(v6.1#6144)

[jira] [Commented] (HIVE-4632) Use hadoop counter as a stat publisher


[ 
https://issues.apache.org/jira/browse/HIVE-4632?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13820852#comment-13820852
 ] 

Ashutosh Chauhan commented on HIVE-4632:


Also, I think counter should be default mechanism to gather stats instead of 
current defaults which never gather stats correctly.

 Use hadoop counter as a stat publisher
 --

 Key: HIVE-4632
 URL: https://issues.apache.org/jira/browse/HIVE-4632
 Project: Hive
  Issue Type: Improvement
  Components: Statistics
Affects Versions: 0.12.0
Reporter: Navis
Assignee: Navis
Priority: Minor
 Attachments: HIVE-4632.D11001.1.patch


 Currently stats are all long/aggregation type and can be safely acquired by 
 hadoop counter without other db or hbase.



--
This message was sent by Atlassian JIRA
(v6.1#6144)

[jira] [Resolved] (HIVE-3324) analyze command is not gathering num_rows present in the table.


 [ 
https://issues.apache.org/jira/browse/HIVE-3324?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Ashutosh Chauhan resolved HIVE-3324.


Resolution: Invalid

You can't use derby for this, since a remote daemon is required to be running 
which is accessible to all task nodes. Derby runs in embedded mode, so that 
won't work. You need mysql or orther RDBMS running as daemon and accessible 
from all task nodes.

 analyze command is not gathering num_rows present in the table.
 -

 Key: HIVE-3324
 URL: https://issues.apache.org/jira/browse/HIVE-3324
 Project: Hive
  Issue Type: Bug
  Components: Statistics
Affects Versions: 0.9.1, 0.10.0
Reporter: Rohith Sharma K S

 When analyze command is executed, collectableStats i.e num_rows and 
 raw_data_size is always zero even though table contains data.
 bq. [num_partitions: 0, num_files: 1, num_rows: 0, total_size: 5812, 
 raw_data_size: 0]



--
This message was sent by Atlassian JIRA
(v6.1#6144)

[jira] [Commented] (HIVE-3324) analyze command is not gathering num_rows present in the table.


[ 
https://issues.apache.org/jira/browse/HIVE-3324?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13820856#comment-13820856
 ] 

Ashutosh Chauhan commented on HIVE-3324:


Also, note HIVE-4632 will simplify stats collection mechanism, so that above is 
no longer a requirement.

 analyze command is not gathering num_rows present in the table.
 -

 Key: HIVE-3324
 URL: https://issues.apache.org/jira/browse/HIVE-3324
 Project: Hive
  Issue Type: Bug
  Components: Statistics
Affects Versions: 0.9.1, 0.10.0
Reporter: Rohith Sharma K S

 When analyze command is executed, collectableStats i.e num_rows and 
 raw_data_size is always zero even though table contains data.
 bq. [num_partitions: 0, num_files: 1, num_rows: 0, total_size: 5812, 
 raw_data_size: 0]



--
This message was sent by Atlassian JIRA
(v6.1#6144)

[jira] [Commented] (HIVE-4621) describe formatted for table shows incorrect numRows data