from:"\"Jitendra Nath Pandey \\\(JIRA\\\)\""

[jira] [Commented] (HIVE-3098) Memory leak from large number of FileSystem instances in FileSystem.CACHE. (Must cache UGIs.)

2012-06-28 Thread Jitendra Nath Pandey (JIRA)


[ 
https://issues.apache.org/jira/browse/HIVE-3098?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13403290#comment-13403290
 ] 

Jitendra Nath Pandey commented on HIVE-3098:


bq. Problem stems from the fact that there is no expiration policy either in fs 
or ugi cache. We need to design for UGI cache eviction policy. There, when we 
are expiring stale ugi's from ugi-cache we can do closeAllForUGI for evicting 
ugi to evict cached FS objects from fs-cache.

+1. It may be more tractable to have a cache expiration policy in ugi-cache 
based on the semantics of this particular use case. In FS-cache it gets 
trickier because of the general purpose nature of the file system.

> Memory leak from large number of FileSystem instances in FileSystem.CACHE. 
> (Must cache UGIs.)
> -
>
> Key: HIVE-3098
> URL: https://issues.apache.org/jira/browse/HIVE-3098
> Project: Hive
>  Issue Type: Bug
>  Components: Shims
>Affects Versions: 0.9.0
> Environment: Running with Hadoop 20.205.0.3+ / 1.0.x with security 
> turned on.
>Reporter: Mithun Radhakrishnan
>Assignee: Mithun Radhakrishnan
> Attachments: HIVE-3098.patch
>
>
> The problem manifested from stress-testing HCatalog 0.4.1 (as part of testing 
> the Oracle backend).
> The HCatalog server ran out of memory (-Xmx2048m) when pounded by 60-threads, 
> in under 24 hours. The heap-dump indicates that hadoop::FileSystem.CACHE had 
> 100 instances of FileSystem, whose combined retained-mem consumed the 
> entire heap.
> It boiled down to hadoop::UserGroupInformation::equals() being implemented 
> such that the "Subject" member is compared for equality ("=="), and not 
> equivalence (".equals()"). This causes equivalent UGI instances to compare as 
> unequal, and causes a new FileSystem instance to be created and cached.
> The UGI.equals() is so implemented, incidentally, as a fix for yet another 
> problem (HADOOP-6670); so it is unlikely that that implementation can be 
> modified.
> The solution for this is to check for UGI equivalence in HCatalog (i.e. in 
> the Hive metastore), using an cache for UGI instances in the shims.
> I have a patch to fix this. I'll upload it shortly. I just ran an overnight 
> test to confirm that the memory-leak has been arrested.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

[jira] [Commented] (HIVE-8095) Tez and Vectorized GROUP BY: ClassCastException: ...HiveDecimal cannot be cast to ...HiveDecimalWritable

2014-09-14 Thread Jitendra Nath Pandey (JIRA)


[ 
https://issues.apache.org/jira/browse/HIVE-8095?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14133595#comment-14133595
 ] 

Jitendra Nath Pandey commented on HIVE-8095:


+1

> Tez and Vectorized GROUP BY: ClassCastException: ...HiveDecimal cannot be 
> cast to ...HiveDecimalWritable
> 
>
> Key: HIVE-8095
> URL: https://issues.apache.org/jira/browse/HIVE-8095
> Project: Hive
>  Issue Type: Bug
>Reporter: Matt McCline
>Assignee: Matt McCline
>Priority: Critical
> Attachments: HIVE-8095.01.patch, HIVE-8095.02.patch
>
>
> {code}
> Caused by: java.lang.ClassCastException: 
> org.apache.hadoop.hive.common.type.HiveDecimal cannot be cast to 
> org.apache.hadoop.hive.serde2.io.HiveDecimalWritable
>   at 
> org.apache.hadoop.hive.ql.exec.vector.VectorColumnAssignFactory$17.assignObjectValue(VectorColumnAssignFactory.java:431)
>   at 
> org.apache.hadoop.hive.ql.exec.vector.VectorGroupByOperator.writeSingleRow(VectorGroupByOperator.java:886)
>   at 
> org.apache.hadoop.hive.ql.exec.vector.VectorGroupByOperator.access$400(VectorGroupByOperator.java:63)
>   at 
> org.apache.hadoop.hive.ql.exec.vector.VectorGroupByOperator$ProcessingModeHashAggregate.flush(VectorGroupByOperator.java:463)
>   at 
> org.apache.hadoop.hive.ql.exec.vector.VectorGroupByOperator$ProcessingModeHashAggregate.close(VectorGroupByOperator.java:369)
>   at 
> org.apache.hadoop.hive.ql.exec.vector.VectorGroupByOperator.closeOp(VectorGroupByOperator.java:924)
>   at org.apache.hadoop.hive.ql.exec.Operator.close(Operator.java:583)
>   at org.apache.hadoop.hive.ql.exec.Operator.close(Operator.java:595)
>   at org.apache.hadoop.hive.ql.exec.Operator.close(Operator.java:595)
>   at org.apache.hadoop.hive.ql.exec.Operator.close(Operator.java:595)
>   at 
> org.apache.hadoop.hive.ql.exec.tez.MapRecordProcessor.close(MapRecordProcessor.java:224)
>   ... 13 more
> {code}



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Commented] (HIVE-8171) Tez and Vectorized Reduce doesn't create scratch columns

2014-09-23 Thread Jitendra Nath Pandey (JIRA)


[ 
https://issues.apache.org/jira/browse/HIVE-8171?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14145666#comment-14145666
 ] 

Jitendra Nath Pandey commented on HIVE-8171:


# The vectorized row batch is being constructed for different tags, but all are 
using same set of scratch columns. I am not sure if we really need multiple 
tags here. 
# The indentation can be improved in a few places, particularly log statements.



> Tez and Vectorized Reduce doesn't create scratch columns
> 
>
> Key: HIVE-8171
> URL: https://issues.apache.org/jira/browse/HIVE-8171
> Project: Hive
>  Issue Type: Bug
>Reporter: Matt McCline
>Assignee: Matt McCline
>Priority: Critical
> Attachments: HIVE-8171.01.patch
>
>
> This query fails with ArrayIndexOutofBound exception in the reducer.
> {code}
> create table varchar_3 (
>   field varchar(25)
> ) stored as orc;
> insert into table varchar_3 select cint from alltypesorc limit 10;
> {code}



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Commented] (HIVE-8171) Tez and Vectorized Reduce doesn't create scratch columns

2014-09-23 Thread Jitendra Nath Pandey (JIRA)


[ 
https://issues.apache.org/jira/browse/HIVE-8171?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14145743#comment-14145743
 ] 

Jitendra Nath Pandey commented on HIVE-8171:


bq. ..LOG indentations...
I meant the log lines are longer than 100 chars.

+1 for the patch. LGTM

> Tez and Vectorized Reduce doesn't create scratch columns
> 
>
> Key: HIVE-8171
> URL: https://issues.apache.org/jira/browse/HIVE-8171
> Project: Hive
>  Issue Type: Bug
>Reporter: Matt McCline
>Assignee: Matt McCline
>Priority: Critical
> Attachments: HIVE-8171.01.patch, HIVE-8171.02.patch
>
>
> This query fails with ArrayIndexOutofBound exception in the reducer.
> {code}
> create table varchar_3 (
>   field varchar(25)
> ) stored as orc;
> insert into table varchar_3 select cint from alltypesorc limit 10;
> {code}



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Commented] (HIVE-5877) Implement vectorized support for IN as boolean-valued expression

2013-12-02 Thread Jitendra Nath Pandey (JIRA)


[ 
https://issues.apache.org/jira/browse/HIVE-5877?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13837089#comment-13837089
 ] 

Jitendra Nath Pandey commented on HIVE-5877:


lgtm, +1

> Implement vectorized support for IN as boolean-valued expression
> 
>
> Key: HIVE-5877
> URL: https://issues.apache.org/jira/browse/HIVE-5877
> Project: Hive
>  Issue Type: Sub-task
>  Components: Query Processor
>Reporter: Eric Hanson
>Assignee: Eric Hanson
> Attachments: HIVE-5877.1.patch, HIVE-5877.2.patch, HIVE-5877.3.patch, 
> HIVE-5877.4.patch
>
>
> Implement support for IN as a Boolean-valued expression, e..g.
> select col1 IN (1, 2, 3) from T;
> or 
> select col1
> from T
> where NOT (col1 IN (1, 2, 3));
> This will also automatically add support for NOT IN because NOT IN is 
> automatically transformed into NOT ( ... IN ... ) by the parser.
>  



--
This message was sent by Atlassian JIRA
(v6.1#6144)

[jira] [Commented] (HIVE-5756) Implement vectorization support for IF conditional expression for long, double, timestamp and boolean inputs

2013-12-04 Thread Jitendra Nath Pandey (JIRA)


[ 
https://issues.apache.org/jira/browse/HIVE-5756?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13839475#comment-13839475
 ] 

Jitendra Nath Pandey commented on HIVE-5756:


The patch looks good, a few issues:
# IfExprColumnColumn.txt assumes that the types of the last two expressions are 
same which may not be true. The IF statement in hive allows different datatypes 
for the if-part vs else-part.
# 'copySelected' method in LongColumnVector and DoubleColumnVector : The 
'isRepeating' is always being set to false in the output even if input has it 
repeating.
# 'fill' method: If same value is being filled and selectedInUse is false, 
should we set isRepeating flag to true?
# Vectorizer.java has only indentation changes? I am fine with that but just 
wanted to confirm so that we don't miss something.

> Implement vectorization support for IF conditional expression for long, 
> double, timestamp and boolean inputs
> 
>
> Key: HIVE-5756
> URL: https://issues.apache.org/jira/browse/HIVE-5756
> Project: Hive
>  Issue Type: Sub-task
>Reporter: Eric Hanson
>Assignee: Eric Hanson
> Attachments: HIVE-5756.1.patch, HIVE-5756.2.patch, HIVE-5756.3.patch, 
> HIVE-5756.4.patch, HIVE-5756.5.patch
>
>
> Implement full, end-to-end support for IF in vectorized mode, including new 
> VectorExpression class(es), VectorizationContext translation to a 
> VectorExpression, and unit tests for these, as well as end-to-end ad hoc 
> testing. An end-to-end .q test is recommended but optional.
> This is high priority because IF is the most popular conditional expression.



--
This message was sent by Atlassian JIRA
(v6.1#6144)

[jira] [Updated] (HIVE-5978) Rollups not supported in vector mode.

2013-12-06 Thread Jitendra Nath Pandey (JIRA)


 [ 
https://issues.apache.org/jira/browse/HIVE-5978?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Jitendra Nath Pandey updated HIVE-5978:
---

Issue Type: Sub-task  (was: Bug)
Parent: HIVE-4160

> Rollups not supported in vector mode.
> -
>
> Key: HIVE-5978
> URL: https://issues.apache.org/jira/browse/HIVE-5978
> Project: Hive
>  Issue Type: Sub-task
>Reporter: Jitendra Nath Pandey
>Assignee: Jitendra Nath Pandey
>
> Rollups are not supported in vector mode, the query should fail to vectorize. 
> A separate jira will be filed to implement rollups in vector mode.



--
This message was sent by Atlassian JIRA
(v6.1#6144)

[jira] [Created] (HIVE-5978) Rollups not supported in vector mode.

2013-12-06 Thread Jitendra Nath Pandey (JIRA)

Jitendra Nath Pandey created HIVE-5978:
--

 Summary: Rollups not supported in vector mode.
 Key: HIVE-5978
 URL: https://issues.apache.org/jira/browse/HIVE-5978
 Project: Hive
  Issue Type: Bug
Reporter: Jitendra Nath Pandey
Assignee: Jitendra Nath Pandey


Rollups are not supported in vector mode, the query should fail to vectorize. A 
separate jira will be filed to implement rollups in vector mode.



--
This message was sent by Atlassian JIRA
(v6.1#6144)

[jira] [Updated] (HIVE-5978) Rollups not supported in vector mode.

2013-12-06 Thread Jitendra Nath Pandey (JIRA)


 [ 
https://issues.apache.org/jira/browse/HIVE-5978?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Jitendra Nath Pandey updated HIVE-5978:
---

Status: Patch Available  (was: Open)

> Rollups not supported in vector mode.
> -
>
> Key: HIVE-5978
> URL: https://issues.apache.org/jira/browse/HIVE-5978
> Project: Hive
>  Issue Type: Sub-task
>Reporter: Jitendra Nath Pandey
>Assignee: Jitendra Nath Pandey
> Attachments: HIVE-5978.1.patch
>
>
> Rollups are not supported in vector mode, the query should fail to vectorize. 
> A separate jira will be filed to implement rollups in vector mode.



--
This message was sent by Atlassian JIRA
(v6.1#6144)

[jira] [Updated] (HIVE-5978) Rollups not supported in vector mode.

2013-12-06 Thread Jitendra Nath Pandey (JIRA)


 [ 
https://issues.apache.org/jira/browse/HIVE-5978?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Jitendra Nath Pandey updated HIVE-5978:
---

Attachment: HIVE-5978.1.patch

> Rollups not supported in vector mode.
> -
>
> Key: HIVE-5978
> URL: https://issues.apache.org/jira/browse/HIVE-5978
> Project: Hive
>  Issue Type: Sub-task
>Reporter: Jitendra Nath Pandey
>Assignee: Jitendra Nath Pandey
> Attachments: HIVE-5978.1.patch
>
>
> Rollups are not supported in vector mode, the query should fail to vectorize. 
> A separate jira will be filed to implement rollups in vector mode.



--
This message was sent by Atlassian JIRA
(v6.1#6144)

[jira] [Updated] (HIVE-5979) Failure in cast to timestamps.

2013-12-06 Thread Jitendra Nath Pandey (JIRA)


 [ 
https://issues.apache.org/jira/browse/HIVE-5979?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Jitendra Nath Pandey updated HIVE-5979:
---

Description: 
Query ran:
{code}
select cast(t as timestamp), cast(si as timestamp),
   cast(i as timestamp), cast(b as timestamp),
   cast(f as string), cast(d as timestamp),
   cast(bo as timestamp), cast(b * 0 as timestamp),
   cast(ts as timestamp), cast(s as timestamp),
   cast(substr(s, 1, 1) as timestamp)
from Table1;
{code}
Running this query with hive.vectorized.execution.enabled=true fails with the 
following exception:
{noformat}
13/12/05 07:56:36 ERROR tez.TezJobMonitor: Status: Failed
Vertex failed, vertexName=Map 1, vertexId=vertex_1386227234886_0482_1_00, 
diagnostics=[Task failed, taskId=task_1386227234886_0482_1_00_00, 
diagnostics=[AttemptID:attempt_1386227234886_0482_1_00_00_0 Info:Error: 
java.lang.RuntimeException: org.apache.hadoop.hive.ql.metadata.HiveException: 
Hive Runtime Error while processing row
at 
org.apache.hadoop.hive.ql.exec.tez.MapRecordProcessor.processRow(MapRecordProcessor.java:205)
at 
org.apache.hadoop.hive.ql.exec.tez.MapRecordProcessor.run(MapRecordProcessor.java:171)
at 
org.apache.hadoop.hive.ql.exec.tez.TezProcessor.run(TezProcessor.java:112)
at 
org.apache.tez.runtime.LogicalIOProcessorRuntimeTask.run(LogicalIOProcessorRuntimeTask.java:201)
at 
org.apache.hadoop.mapred.YarnTezDagChild$4.run(YarnTezDagChild.java:484)
at java.security.AccessController.doPrivileged(Native Method)
at javax.security.auth.Subject.doAs(Subject.java:396)
at 
org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1491)
at 
org.apache.hadoop.mapred.YarnTezDagChild.main(YarnTezDagChild.java:474)
Caused by: org.apache.hadoop.hive.ql.metadata.HiveException: Hive Runtime Error 
while processing row
at 
org.apache.hadoop.hive.ql.exec.vector.VectorMapOperator.process(VectorMapOperator.java:45)
at 
org.apache.hadoop.hive.ql.exec.tez.MapRecordProcessor.processRow(MapRecordProcessor.java:193)
... 8 more
Caused by: java.lang.IllegalArgumentException: nanos > 9 or < 0
at java.sql.Timestamp.setNanos(Timestamp.java:383)
at 
org.apache.hadoop.hive.ql.exec.vector.TimestampUtils.assignTimeInNanoSec(TimestampUtils.java:27)
at 
org.apache.hadoop.hive.ql.exec.vector.expressions.VectorExpressionWriterFactory$1.writeValue(VectorExpressionWriterFactory.java:412)
at 
org.apache.hadoop.hive.ql.exec.vector.expressions.VectorExpressionWriterFactory$VectorExpressionWriterLong.writeValue(VectorExpressionWriterFactory.java:162)
at 
org.apache.hadoop.hive.ql.exec.vector.VectorizedRowBatch.toString(VectorizedRowBatch.java:152)
at 
org.apache.hadoop.hive.ql.exec.vector.VectorFileSinkOperator.processOp(VectorFileSinkOperator.java:85)
at org.apache.hadoop.hive.ql.exec.Operator.forward(Operator.java:786)
at 
org.apache.hadoop.hive.ql.exec.vector.VectorSelectOperator.processOp(VectorSelectOperator.java:129)
at org.apache.hadoop.hive.ql.exec.Operator.forward(Operator.java:786)
at 
org.apache.hadoop.hive.ql.exec.TableScanOperator.processOp(TableScanOperator.java:93)
at org.apache.hadoop.hive.ql.exec.Operator.forward(Operator.java:786)
at 
org.apache.hadoop.hive.ql.exec.vector.VectorMapOperator.process(VectorMapOperator.java:43)
... 9 more
{noformat}
Full log is attached.
Schema for the table is as follows:
{code}
hive> desc Table1;
OK
t   tinyint from deserializer
si  smallintfrom deserializer
i   int from deserializer
b   bigint  from deserializer
f   float   from deserializer
d   double  from deserializer
bo  boolean from deserializer
s   string  from deserializer
s2  string  from deserializer
ts  timestamp   from deserializer
Time taken: 0.521 seconds, Fetched: 10 row(s)
{code}

  was:
Query ran:
{code}
select cast(t as timestamp), cast(si as timestamp),
   cast(i as timestamp), cast(b as timestamp),
   cast(f as string), cast(d as timestamp),
   cast(bo as timestamp), cast(b * 0 as timestamp),
   cast(ts as timestamp), cast(s as timestamp),
   cast(substr(s, 1, 1) as timestamp)
from vectortab10korc;
{code}
Running this query with hive.vectorized.execution.enabled=true fails with the 
following exception:
{noformat}
13/12/05 07:56:36 ERROR tez.TezJobMonitor: Status: Failed
Vertex failed, vertexName=Map 1, vertexId=vertex_1386227234886_0482_1_00, 
diagnostics=[Task failed, taskId=task_13

[jira] [Created] (HIVE-5979) Failure in cast to timestamps.

2013-12-06 Thread Jitendra Nath Pandey (JIRA)

Jitendra Nath Pandey created HIVE-5979:
--

 Summary: Failure in cast to timestamps.
 Key: HIVE-5979
 URL: https://issues.apache.org/jira/browse/HIVE-5979
 Project: Hive
  Issue Type: Sub-task
Reporter: Jitendra Nath Pandey
Assignee: Jitendra Nath Pandey


Query ran:
{code}
select cast(t as timestamp), cast(si as timestamp),
   cast(i as timestamp), cast(b as timestamp),
   cast(f as string), cast(d as timestamp),
   cast(bo as timestamp), cast(b * 0 as timestamp),
   cast(ts as timestamp), cast(s as timestamp),
   cast(substr(s, 1, 1) as timestamp)
from vectortab10korc;
{code}
Running this query with hive.vectorized.execution.enabled=true fails with the 
following exception:
{noformat}
13/12/05 07:56:36 ERROR tez.TezJobMonitor: Status: Failed
Vertex failed, vertexName=Map 1, vertexId=vertex_1386227234886_0482_1_00, 
diagnostics=[Task failed, taskId=task_1386227234886_0482_1_00_00, 
diagnostics=[AttemptID:attempt_1386227234886_0482_1_00_00_0 Info:Error: 
java.lang.RuntimeException: org.apache.hadoop.hive.ql.metadata.HiveException: 
Hive Runtime Error while processing row
at 
org.apache.hadoop.hive.ql.exec.tez.MapRecordProcessor.processRow(MapRecordProcessor.java:205)
at 
org.apache.hadoop.hive.ql.exec.tez.MapRecordProcessor.run(MapRecordProcessor.java:171)
at 
org.apache.hadoop.hive.ql.exec.tez.TezProcessor.run(TezProcessor.java:112)
at 
org.apache.tez.runtime.LogicalIOProcessorRuntimeTask.run(LogicalIOProcessorRuntimeTask.java:201)
at 
org.apache.hadoop.mapred.YarnTezDagChild$4.run(YarnTezDagChild.java:484)
at java.security.AccessController.doPrivileged(Native Method)
at javax.security.auth.Subject.doAs(Subject.java:396)
at 
org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1491)
at 
org.apache.hadoop.mapred.YarnTezDagChild.main(YarnTezDagChild.java:474)
Caused by: org.apache.hadoop.hive.ql.metadata.HiveException: Hive Runtime Error 
while processing row
at 
org.apache.hadoop.hive.ql.exec.vector.VectorMapOperator.process(VectorMapOperator.java:45)
at 
org.apache.hadoop.hive.ql.exec.tez.MapRecordProcessor.processRow(MapRecordProcessor.java:193)
... 8 more
Caused by: java.lang.IllegalArgumentException: nanos > 9 or < 0
at java.sql.Timestamp.setNanos(Timestamp.java:383)
at 
org.apache.hadoop.hive.ql.exec.vector.TimestampUtils.assignTimeInNanoSec(TimestampUtils.java:27)
at 
org.apache.hadoop.hive.ql.exec.vector.expressions.VectorExpressionWriterFactory$1.writeValue(VectorExpressionWriterFactory.java:412)
at 
org.apache.hadoop.hive.ql.exec.vector.expressions.VectorExpressionWriterFactory$VectorExpressionWriterLong.writeValue(VectorExpressionWriterFactory.java:162)
at 
org.apache.hadoop.hive.ql.exec.vector.VectorizedRowBatch.toString(VectorizedRowBatch.java:152)
at 
org.apache.hadoop.hive.ql.exec.vector.VectorFileSinkOperator.processOp(VectorFileSinkOperator.java:85)
at org.apache.hadoop.hive.ql.exec.Operator.forward(Operator.java:786)
at 
org.apache.hadoop.hive.ql.exec.vector.VectorSelectOperator.processOp(VectorSelectOperator.java:129)
at org.apache.hadoop.hive.ql.exec.Operator.forward(Operator.java:786)
at 
org.apache.hadoop.hive.ql.exec.TableScanOperator.processOp(TableScanOperator.java:93)
at org.apache.hadoop.hive.ql.exec.Operator.forward(Operator.java:786)
at 
org.apache.hadoop.hive.ql.exec.vector.VectorMapOperator.process(VectorMapOperator.java:43)
... 9 more
{noformat}
Full log is attached.
Schema for the table is as follows:
{code}
hive> desc vectortab10korc;
OK
t   tinyint from deserializer
si  smallintfrom deserializer
i   int from deserializer
b   bigint  from deserializer
f   float   from deserializer
d   double  from deserializer
bo  boolean from deserializer
s   string  from deserializer
s2  string  from deserializer
ts  timestamp   from deserializer
Time taken: 0.521 seconds, Fetched: 10 row(s)
{code}



--
This message was sent by Atlassian JIRA
(v6.1#6144)

[jira] [Commented] (HIVE-5756) Implement vectorization support for IF conditional expression for long, double, timestamp, boolean and string inputs

2013-12-06 Thread Jitendra Nath Pandey (JIRA)


[ 
https://issues.apache.org/jira/browse/HIVE-5756?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13841996#comment-13841996
 ] 

Jitendra Nath Pandey commented on HIVE-5756:


In IfExprStringColumnStringColumn.java :
{code}
   outputColVector.setVal(i, arg2ColVector.vector[i], arg2ColVector.start[i], 
arg2ColVector.length[i]);
{code}

This could run into trouble if the source byte array is null.

Similar problem exists in other string templates.

> Implement vectorization support for IF conditional expression for long, 
> double, timestamp, boolean and string inputs
> 
>
> Key: HIVE-5756
> URL: https://issues.apache.org/jira/browse/HIVE-5756
> Project: Hive
>  Issue Type: Sub-task
>Reporter: Eric Hanson
>Assignee: Eric Hanson
> Attachments: HIVE-5756.1.patch, HIVE-5756.2.patch, HIVE-5756.3.patch, 
> HIVE-5756.4.patch, HIVE-5756.5.patch, HIVE-5756.6.patch.txt, HIVE-5756.7.patch
>
>
> Implement full, end-to-end support for IF in vectorized mode, including new 
> VectorExpression class(es), VectorizationContext translation to a 
> VectorExpression, and unit tests for these, as well as end-to-end ad hoc 
> testing. An end-to-end .q test is recommended but optional.
> This is high priority because IF is the most popular conditional expression.



--
This message was sent by Atlassian JIRA
(v6.1#6144)

[jira] [Commented] (HIVE-5979) Failure in cast to timestamps.

2013-12-09 Thread Jitendra Nath Pandey (JIRA)


[ 
https://issues.apache.org/jira/browse/HIVE-5979?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13843878#comment-13843878
 ] 

Jitendra Nath Pandey commented on HIVE-5979:


The reason for error is an edge case with negative times in nanosecond. In the 
following code:
{code}
   public static void assignTimeInNanoSec(long timeInNanoSec, Timestamp t) {
 t.setTime((timeInNanoSec)/100);
 t.setNanos((int)((t.getNanos()) + (timeInNanoSec % 100)));
   }
{code}

The "t.setTime()" call sets the nanos part as well from 
(timeInNanoSec)/100. The remaining nano seconds (timeInNanoSec % 100) 
are added in "t.setNanos()". For negative values of timeInNanoSec it is 
expected that t.getNanos() will be positive and much larger than (timeInNanoSec 
% 100) so that "t.setNanos()" call works fine. But, when 
(timeInNanoSec)/100 yields zero nanoseconds the above logic fails. An 
example number that fails above code is "-179475023828416L".


> Failure in cast to timestamps.
> --
>
> Key: HIVE-5979
> URL: https://issues.apache.org/jira/browse/HIVE-5979
> Project: Hive
>  Issue Type: Sub-task
>Reporter: Jitendra Nath Pandey
>Assignee: Jitendra Nath Pandey
>
> Query ran:
> {code}
> select cast(t as timestamp), cast(si as timestamp),
>cast(i as timestamp), cast(b as timestamp),
>cast(f as string), cast(d as timestamp),
>cast(bo as timestamp), cast(b * 0 as timestamp),
>cast(ts as timestamp), cast(s as timestamp),
>cast(substr(s, 1, 1) as timestamp)
> from Table1;
> {code}
> Running this query with hive.vectorized.execution.enabled=true fails with the 
> following exception:
> {noformat}
> 13/12/05 07:56:36 ERROR tez.TezJobMonitor: Status: Failed
> Vertex failed, vertexName=Map 1, vertexId=vertex_1386227234886_0482_1_00, 
> diagnostics=[Task failed, taskId=task_1386227234886_0482_1_00_00, 
> diagnostics=[AttemptID:attempt_1386227234886_0482_1_00_00_0 Info:Error: 
> java.lang.RuntimeException: org.apache.hadoop.hive.ql.metadata.HiveException: 
> Hive Runtime Error while processing row
> at 
> org.apache.hadoop.hive.ql.exec.tez.MapRecordProcessor.processRow(MapRecordProcessor.java:205)
> at 
> org.apache.hadoop.hive.ql.exec.tez.MapRecordProcessor.run(MapRecordProcessor.java:171)
> at 
> org.apache.hadoop.hive.ql.exec.tez.TezProcessor.run(TezProcessor.java:112)
> at 
> org.apache.tez.runtime.LogicalIOProcessorRuntimeTask.run(LogicalIOProcessorRuntimeTask.java:201)
> at 
> org.apache.hadoop.mapred.YarnTezDagChild$4.run(YarnTezDagChild.java:484)
> at java.security.AccessController.doPrivileged(Native Method)
> at javax.security.auth.Subject.doAs(Subject.java:396)
> at 
> org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1491)
> at 
> org.apache.hadoop.mapred.YarnTezDagChild.main(YarnTezDagChild.java:474)
> Caused by: org.apache.hadoop.hive.ql.metadata.HiveException: Hive Runtime 
> Error while processing row
> at 
> org.apache.hadoop.hive.ql.exec.vector.VectorMapOperator.process(VectorMapOperator.java:45)
> at 
> org.apache.hadoop.hive.ql.exec.tez.MapRecordProcessor.processRow(MapRecordProcessor.java:193)
> ... 8 more
> Caused by: java.lang.IllegalArgumentException: nanos > 9 or < 0
> at java.sql.Timestamp.setNanos(Timestamp.java:383)
> at 
> org.apache.hadoop.hive.ql.exec.vector.TimestampUtils.assignTimeInNanoSec(TimestampUtils.java:27)
> at 
> org.apache.hadoop.hive.ql.exec.vector.expressions.VectorExpressionWriterFactory$1.writeValue(VectorExpressionWriterFactory.java:412)
> at 
> org.apache.hadoop.hive.ql.exec.vector.expressions.VectorExpressionWriterFactory$VectorExpressionWriterLong.writeValue(VectorExpressionWriterFactory.java:162)
> at 
> org.apache.hadoop.hive.ql.exec.vector.VectorizedRowBatch.toString(VectorizedRowBatch.java:152)
> at 
> org.apache.hadoop.hive.ql.exec.vector.VectorFileSinkOperator.processOp(VectorFileSinkOperator.java:85)
> at org.apache.hadoop.hive.ql.exec.Operator.forward(Operator.java:786)
> at 
> org.apache.hadoop.hive.ql.exec.vector.VectorSelectOperator.processOp(VectorSelectOperator.java:129)
> at org.apache.hadoop.hive.ql.exec.Operator.forward(Operator.java:786)
> at 
> org.apache.hadoop.hive.ql.exec.TableScanOperator.processOp(TableScanOperator.java:93)
> at org.apache.hadoop.hive.ql.exec.Operator.forward(Operator.java:786)
> at 
> org.apache.hadoop.hive.ql.exec.vector.VectorMapOperator.process(VectorMapOperator.java:43)
> ... 9 more
> {noformat}
> Full log is attached.
> Schema for the table is as follows:
> {code}
> hive> desc Table1;
> OK
> t tinyint from des

[jira] [Updated] (HIVE-5979) Failure in cast to timestamps.

2013-12-09 Thread Jitendra Nath Pandey (JIRA)


 [ 
https://issues.apache.org/jira/browse/HIVE-5979?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Jitendra Nath Pandey updated HIVE-5979:
---

Status: Patch Available  (was: Open)

> Failure in cast to timestamps.
> --
>
> Key: HIVE-5979
> URL: https://issues.apache.org/jira/browse/HIVE-5979
> Project: Hive
>  Issue Type: Sub-task
>Reporter: Jitendra Nath Pandey
>Assignee: Jitendra Nath Pandey
> Attachments: HIVE-5979.1.patch
>
>
> Query ran:
> {code}
> select cast(t as timestamp), cast(si as timestamp),
>cast(i as timestamp), cast(b as timestamp),
>cast(f as string), cast(d as timestamp),
>cast(bo as timestamp), cast(b * 0 as timestamp),
>cast(ts as timestamp), cast(s as timestamp),
>cast(substr(s, 1, 1) as timestamp)
> from Table1;
> {code}
> Running this query with hive.vectorized.execution.enabled=true fails with the 
> following exception:
> {noformat}
> 13/12/05 07:56:36 ERROR tez.TezJobMonitor: Status: Failed
> Vertex failed, vertexName=Map 1, vertexId=vertex_1386227234886_0482_1_00, 
> diagnostics=[Task failed, taskId=task_1386227234886_0482_1_00_00, 
> diagnostics=[AttemptID:attempt_1386227234886_0482_1_00_00_0 Info:Error: 
> java.lang.RuntimeException: org.apache.hadoop.hive.ql.metadata.HiveException: 
> Hive Runtime Error while processing row
> at 
> org.apache.hadoop.hive.ql.exec.tez.MapRecordProcessor.processRow(MapRecordProcessor.java:205)
> at 
> org.apache.hadoop.hive.ql.exec.tez.MapRecordProcessor.run(MapRecordProcessor.java:171)
> at 
> org.apache.hadoop.hive.ql.exec.tez.TezProcessor.run(TezProcessor.java:112)
> at 
> org.apache.tez.runtime.LogicalIOProcessorRuntimeTask.run(LogicalIOProcessorRuntimeTask.java:201)
> at 
> org.apache.hadoop.mapred.YarnTezDagChild$4.run(YarnTezDagChild.java:484)
> at java.security.AccessController.doPrivileged(Native Method)
> at javax.security.auth.Subject.doAs(Subject.java:396)
> at 
> org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1491)
> at 
> org.apache.hadoop.mapred.YarnTezDagChild.main(YarnTezDagChild.java:474)
> Caused by: org.apache.hadoop.hive.ql.metadata.HiveException: Hive Runtime 
> Error while processing row
> at 
> org.apache.hadoop.hive.ql.exec.vector.VectorMapOperator.process(VectorMapOperator.java:45)
> at 
> org.apache.hadoop.hive.ql.exec.tez.MapRecordProcessor.processRow(MapRecordProcessor.java:193)
> ... 8 more
> Caused by: java.lang.IllegalArgumentException: nanos > 9 or < 0
> at java.sql.Timestamp.setNanos(Timestamp.java:383)
> at 
> org.apache.hadoop.hive.ql.exec.vector.TimestampUtils.assignTimeInNanoSec(TimestampUtils.java:27)
> at 
> org.apache.hadoop.hive.ql.exec.vector.expressions.VectorExpressionWriterFactory$1.writeValue(VectorExpressionWriterFactory.java:412)
> at 
> org.apache.hadoop.hive.ql.exec.vector.expressions.VectorExpressionWriterFactory$VectorExpressionWriterLong.writeValue(VectorExpressionWriterFactory.java:162)
> at 
> org.apache.hadoop.hive.ql.exec.vector.VectorizedRowBatch.toString(VectorizedRowBatch.java:152)
> at 
> org.apache.hadoop.hive.ql.exec.vector.VectorFileSinkOperator.processOp(VectorFileSinkOperator.java:85)
> at org.apache.hadoop.hive.ql.exec.Operator.forward(Operator.java:786)
> at 
> org.apache.hadoop.hive.ql.exec.vector.VectorSelectOperator.processOp(VectorSelectOperator.java:129)
> at org.apache.hadoop.hive.ql.exec.Operator.forward(Operator.java:786)
> at 
> org.apache.hadoop.hive.ql.exec.TableScanOperator.processOp(TableScanOperator.java:93)
> at org.apache.hadoop.hive.ql.exec.Operator.forward(Operator.java:786)
> at 
> org.apache.hadoop.hive.ql.exec.vector.VectorMapOperator.process(VectorMapOperator.java:43)
> ... 9 more
> {noformat}
> Full log is attached.
> Schema for the table is as follows:
> {code}
> hive> desc Table1;
> OK
> t tinyint from deserializer
> sismallintfrom deserializer
> i int from deserializer
> b bigint  from deserializer
> f float   from deserializer
> d double  from deserializer
> boboolean from deserializer
> s string  from deserializer
> s2string  from deserializer
> tstimestamp   from deserializer
> Time taken: 0.521 seconds, Fetched: 10 row(s)
> {code}



--
This message was sent by Atlassian JIRA
(v6.1.4#6159)

[jira] [Updated] (HIVE-5979) Failure in cast to timestamps.

2013-12-09 Thread Jitendra Nath Pandey (JIRA)


 [ 
https://issues.apache.org/jira/browse/HIVE-5979?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Jitendra Nath Pandey updated HIVE-5979:
---

Attachment: HIVE-5979.1.patch

The attached patch fixes the above issue. A test case is also added that 
reproduces the issue without this patch. 

> Failure in cast to timestamps.
> --
>
> Key: HIVE-5979
> URL: https://issues.apache.org/jira/browse/HIVE-5979
> Project: Hive
>  Issue Type: Sub-task
>Reporter: Jitendra Nath Pandey
>Assignee: Jitendra Nath Pandey
> Attachments: HIVE-5979.1.patch
>
>
> Query ran:
> {code}
> select cast(t as timestamp), cast(si as timestamp),
>cast(i as timestamp), cast(b as timestamp),
>cast(f as string), cast(d as timestamp),
>cast(bo as timestamp), cast(b * 0 as timestamp),
>cast(ts as timestamp), cast(s as timestamp),
>cast(substr(s, 1, 1) as timestamp)
> from Table1;
> {code}
> Running this query with hive.vectorized.execution.enabled=true fails with the 
> following exception:
> {noformat}
> 13/12/05 07:56:36 ERROR tez.TezJobMonitor: Status: Failed
> Vertex failed, vertexName=Map 1, vertexId=vertex_1386227234886_0482_1_00, 
> diagnostics=[Task failed, taskId=task_1386227234886_0482_1_00_00, 
> diagnostics=[AttemptID:attempt_1386227234886_0482_1_00_00_0 Info:Error: 
> java.lang.RuntimeException: org.apache.hadoop.hive.ql.metadata.HiveException: 
> Hive Runtime Error while processing row
> at 
> org.apache.hadoop.hive.ql.exec.tez.MapRecordProcessor.processRow(MapRecordProcessor.java:205)
> at 
> org.apache.hadoop.hive.ql.exec.tez.MapRecordProcessor.run(MapRecordProcessor.java:171)
> at 
> org.apache.hadoop.hive.ql.exec.tez.TezProcessor.run(TezProcessor.java:112)
> at 
> org.apache.tez.runtime.LogicalIOProcessorRuntimeTask.run(LogicalIOProcessorRuntimeTask.java:201)
> at 
> org.apache.hadoop.mapred.YarnTezDagChild$4.run(YarnTezDagChild.java:484)
> at java.security.AccessController.doPrivileged(Native Method)
> at javax.security.auth.Subject.doAs(Subject.java:396)
> at 
> org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1491)
> at 
> org.apache.hadoop.mapred.YarnTezDagChild.main(YarnTezDagChild.java:474)
> Caused by: org.apache.hadoop.hive.ql.metadata.HiveException: Hive Runtime 
> Error while processing row
> at 
> org.apache.hadoop.hive.ql.exec.vector.VectorMapOperator.process(VectorMapOperator.java:45)
> at 
> org.apache.hadoop.hive.ql.exec.tez.MapRecordProcessor.processRow(MapRecordProcessor.java:193)
> ... 8 more
> Caused by: java.lang.IllegalArgumentException: nanos > 9 or < 0
> at java.sql.Timestamp.setNanos(Timestamp.java:383)
> at 
> org.apache.hadoop.hive.ql.exec.vector.TimestampUtils.assignTimeInNanoSec(TimestampUtils.java:27)
> at 
> org.apache.hadoop.hive.ql.exec.vector.expressions.VectorExpressionWriterFactory$1.writeValue(VectorExpressionWriterFactory.java:412)
> at 
> org.apache.hadoop.hive.ql.exec.vector.expressions.VectorExpressionWriterFactory$VectorExpressionWriterLong.writeValue(VectorExpressionWriterFactory.java:162)
> at 
> org.apache.hadoop.hive.ql.exec.vector.VectorizedRowBatch.toString(VectorizedRowBatch.java:152)
> at 
> org.apache.hadoop.hive.ql.exec.vector.VectorFileSinkOperator.processOp(VectorFileSinkOperator.java:85)
> at org.apache.hadoop.hive.ql.exec.Operator.forward(Operator.java:786)
> at 
> org.apache.hadoop.hive.ql.exec.vector.VectorSelectOperator.processOp(VectorSelectOperator.java:129)
> at org.apache.hadoop.hive.ql.exec.Operator.forward(Operator.java:786)
> at 
> org.apache.hadoop.hive.ql.exec.TableScanOperator.processOp(TableScanOperator.java:93)
> at org.apache.hadoop.hive.ql.exec.Operator.forward(Operator.java:786)
> at 
> org.apache.hadoop.hive.ql.exec.vector.VectorMapOperator.process(VectorMapOperator.java:43)
> ... 9 more
> {noformat}
> Full log is attached.
> Schema for the table is as follows:
> {code}
> hive> desc Table1;
> OK
> t tinyint from deserializer
> sismallintfrom deserializer
> i int from deserializer
> b bigint  from deserializer
> f float   from deserializer
> d double  from deserializer
> boboolean from deserializer
> s string  from deserializer
> s2string  from deserializer
> tstimestamp   from deserializer
> Time taken: 0.521 seconds, Fetched: 10 row(s)
> {cod

[jira] [Updated] (HIVE-5979) Failure in cast to timestamps.

2013-12-10 Thread Jitendra Nath Pandey (JIRA)


 [ 
https://issues.apache.org/jira/browse/HIVE-5979?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Jitendra Nath Pandey updated HIVE-5979:
---

Attachment: HIVE-5979.2.patch

Updated the patch with comments.

> Failure in cast to timestamps.
> --
>
> Key: HIVE-5979
> URL: https://issues.apache.org/jira/browse/HIVE-5979
> Project: Hive
>  Issue Type: Sub-task
>Reporter: Jitendra Nath Pandey
>Assignee: Jitendra Nath Pandey
> Attachments: HIVE-5979.1.patch, HIVE-5979.2.patch
>
>
> Query ran:
> {code}
> select cast(t as timestamp), cast(si as timestamp),
>cast(i as timestamp), cast(b as timestamp),
>cast(f as string), cast(d as timestamp),
>cast(bo as timestamp), cast(b * 0 as timestamp),
>cast(ts as timestamp), cast(s as timestamp),
>cast(substr(s, 1, 1) as timestamp)
> from Table1;
> {code}
> Running this query with hive.vectorized.execution.enabled=true fails with the 
> following exception:
> {noformat}
> 13/12/05 07:56:36 ERROR tez.TezJobMonitor: Status: Failed
> Vertex failed, vertexName=Map 1, vertexId=vertex_1386227234886_0482_1_00, 
> diagnostics=[Task failed, taskId=task_1386227234886_0482_1_00_00, 
> diagnostics=[AttemptID:attempt_1386227234886_0482_1_00_00_0 Info:Error: 
> java.lang.RuntimeException: org.apache.hadoop.hive.ql.metadata.HiveException: 
> Hive Runtime Error while processing row
> at 
> org.apache.hadoop.hive.ql.exec.tez.MapRecordProcessor.processRow(MapRecordProcessor.java:205)
> at 
> org.apache.hadoop.hive.ql.exec.tez.MapRecordProcessor.run(MapRecordProcessor.java:171)
> at 
> org.apache.hadoop.hive.ql.exec.tez.TezProcessor.run(TezProcessor.java:112)
> at 
> org.apache.tez.runtime.LogicalIOProcessorRuntimeTask.run(LogicalIOProcessorRuntimeTask.java:201)
> at 
> org.apache.hadoop.mapred.YarnTezDagChild$4.run(YarnTezDagChild.java:484)
> at java.security.AccessController.doPrivileged(Native Method)
> at javax.security.auth.Subject.doAs(Subject.java:396)
> at 
> org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1491)
> at 
> org.apache.hadoop.mapred.YarnTezDagChild.main(YarnTezDagChild.java:474)
> Caused by: org.apache.hadoop.hive.ql.metadata.HiveException: Hive Runtime 
> Error while processing row
> at 
> org.apache.hadoop.hive.ql.exec.vector.VectorMapOperator.process(VectorMapOperator.java:45)
> at 
> org.apache.hadoop.hive.ql.exec.tez.MapRecordProcessor.processRow(MapRecordProcessor.java:193)
> ... 8 more
> Caused by: java.lang.IllegalArgumentException: nanos > 9 or < 0
> at java.sql.Timestamp.setNanos(Timestamp.java:383)
> at 
> org.apache.hadoop.hive.ql.exec.vector.TimestampUtils.assignTimeInNanoSec(TimestampUtils.java:27)
> at 
> org.apache.hadoop.hive.ql.exec.vector.expressions.VectorExpressionWriterFactory$1.writeValue(VectorExpressionWriterFactory.java:412)
> at 
> org.apache.hadoop.hive.ql.exec.vector.expressions.VectorExpressionWriterFactory$VectorExpressionWriterLong.writeValue(VectorExpressionWriterFactory.java:162)
> at 
> org.apache.hadoop.hive.ql.exec.vector.VectorizedRowBatch.toString(VectorizedRowBatch.java:152)
> at 
> org.apache.hadoop.hive.ql.exec.vector.VectorFileSinkOperator.processOp(VectorFileSinkOperator.java:85)
> at org.apache.hadoop.hive.ql.exec.Operator.forward(Operator.java:786)
> at 
> org.apache.hadoop.hive.ql.exec.vector.VectorSelectOperator.processOp(VectorSelectOperator.java:129)
> at org.apache.hadoop.hive.ql.exec.Operator.forward(Operator.java:786)
> at 
> org.apache.hadoop.hive.ql.exec.TableScanOperator.processOp(TableScanOperator.java:93)
> at org.apache.hadoop.hive.ql.exec.Operator.forward(Operator.java:786)
> at 
> org.apache.hadoop.hive.ql.exec.vector.VectorMapOperator.process(VectorMapOperator.java:43)
> ... 9 more
> {noformat}
> Full log is attached.
> Schema for the table is as follows:
> {code}
> hive> desc Table1;
> OK
> t tinyint from deserializer
> sismallintfrom deserializer
> i int from deserializer
> b bigint  from deserializer
> f float   from deserializer
> d double  from deserializer
> boboolean from deserializer
> s string  from deserializer
> s2string  from deserializer
> tstimestamp   from deserializer
> Time taken: 0.521 seconds, Fetched: 10 row(s)
> {code}



--
This message was sent by Atlassian JIRA
(v6.1.4#6159)

[jira] [Updated] (HIVE-5979) Failure in cast to timestamps.

2013-12-10 Thread Jitendra Nath Pandey (JIRA)


 [ 
https://issues.apache.org/jira/browse/HIVE-5979?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Jitendra Nath Pandey updated HIVE-5979:
---

Status: Open  (was: Patch Available)

> Failure in cast to timestamps.
> --
>
> Key: HIVE-5979
> URL: https://issues.apache.org/jira/browse/HIVE-5979
> Project: Hive
>  Issue Type: Sub-task
>Reporter: Jitendra Nath Pandey
>Assignee: Jitendra Nath Pandey
> Attachments: HIVE-5979.1.patch, HIVE-5979.2.patch
>
>
> Query ran:
> {code}
> select cast(t as timestamp), cast(si as timestamp),
>cast(i as timestamp), cast(b as timestamp),
>cast(f as string), cast(d as timestamp),
>cast(bo as timestamp), cast(b * 0 as timestamp),
>cast(ts as timestamp), cast(s as timestamp),
>cast(substr(s, 1, 1) as timestamp)
> from Table1;
> {code}
> Running this query with hive.vectorized.execution.enabled=true fails with the 
> following exception:
> {noformat}
> 13/12/05 07:56:36 ERROR tez.TezJobMonitor: Status: Failed
> Vertex failed, vertexName=Map 1, vertexId=vertex_1386227234886_0482_1_00, 
> diagnostics=[Task failed, taskId=task_1386227234886_0482_1_00_00, 
> diagnostics=[AttemptID:attempt_1386227234886_0482_1_00_00_0 Info:Error: 
> java.lang.RuntimeException: org.apache.hadoop.hive.ql.metadata.HiveException: 
> Hive Runtime Error while processing row
> at 
> org.apache.hadoop.hive.ql.exec.tez.MapRecordProcessor.processRow(MapRecordProcessor.java:205)
> at 
> org.apache.hadoop.hive.ql.exec.tez.MapRecordProcessor.run(MapRecordProcessor.java:171)
> at 
> org.apache.hadoop.hive.ql.exec.tez.TezProcessor.run(TezProcessor.java:112)
> at 
> org.apache.tez.runtime.LogicalIOProcessorRuntimeTask.run(LogicalIOProcessorRuntimeTask.java:201)
> at 
> org.apache.hadoop.mapred.YarnTezDagChild$4.run(YarnTezDagChild.java:484)
> at java.security.AccessController.doPrivileged(Native Method)
> at javax.security.auth.Subject.doAs(Subject.java:396)
> at 
> org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1491)
> at 
> org.apache.hadoop.mapred.YarnTezDagChild.main(YarnTezDagChild.java:474)
> Caused by: org.apache.hadoop.hive.ql.metadata.HiveException: Hive Runtime 
> Error while processing row
> at 
> org.apache.hadoop.hive.ql.exec.vector.VectorMapOperator.process(VectorMapOperator.java:45)
> at 
> org.apache.hadoop.hive.ql.exec.tez.MapRecordProcessor.processRow(MapRecordProcessor.java:193)
> ... 8 more
> Caused by: java.lang.IllegalArgumentException: nanos > 9 or < 0
> at java.sql.Timestamp.setNanos(Timestamp.java:383)
> at 
> org.apache.hadoop.hive.ql.exec.vector.TimestampUtils.assignTimeInNanoSec(TimestampUtils.java:27)
> at 
> org.apache.hadoop.hive.ql.exec.vector.expressions.VectorExpressionWriterFactory$1.writeValue(VectorExpressionWriterFactory.java:412)
> at 
> org.apache.hadoop.hive.ql.exec.vector.expressions.VectorExpressionWriterFactory$VectorExpressionWriterLong.writeValue(VectorExpressionWriterFactory.java:162)
> at 
> org.apache.hadoop.hive.ql.exec.vector.VectorizedRowBatch.toString(VectorizedRowBatch.java:152)
> at 
> org.apache.hadoop.hive.ql.exec.vector.VectorFileSinkOperator.processOp(VectorFileSinkOperator.java:85)
> at org.apache.hadoop.hive.ql.exec.Operator.forward(Operator.java:786)
> at 
> org.apache.hadoop.hive.ql.exec.vector.VectorSelectOperator.processOp(VectorSelectOperator.java:129)
> at org.apache.hadoop.hive.ql.exec.Operator.forward(Operator.java:786)
> at 
> org.apache.hadoop.hive.ql.exec.TableScanOperator.processOp(TableScanOperator.java:93)
> at org.apache.hadoop.hive.ql.exec.Operator.forward(Operator.java:786)
> at 
> org.apache.hadoop.hive.ql.exec.vector.VectorMapOperator.process(VectorMapOperator.java:43)
> ... 9 more
> {noformat}
> Full log is attached.
> Schema for the table is as follows:
> {code}
> hive> desc Table1;
> OK
> t tinyint from deserializer
> sismallintfrom deserializer
> i int from deserializer
> b bigint  from deserializer
> f float   from deserializer
> d double  from deserializer
> boboolean from deserializer
> s string  from deserializer
> s2string  from deserializer
> tstimestamp   from deserializer
> Time taken: 0.521 seconds, Fetched: 10 row(s)
> {code}



--
This message was sent by Atlassian JIRA
(v6.1.4#6159)

[jira] [Updated] (HIVE-5979) Failure in cast to timestamps.

2013-12-10 Thread Jitendra Nath Pandey (JIRA)


 [ 
https://issues.apache.org/jira/browse/HIVE-5979?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Jitendra Nath Pandey updated HIVE-5979:
---

Status: Patch Available  (was: Open)

> Failure in cast to timestamps.
> --
>
> Key: HIVE-5979
> URL: https://issues.apache.org/jira/browse/HIVE-5979
> Project: Hive
>  Issue Type: Sub-task
>Reporter: Jitendra Nath Pandey
>Assignee: Jitendra Nath Pandey
> Attachments: HIVE-5979.1.patch, HIVE-5979.2.patch
>
>
> Query ran:
> {code}
> select cast(t as timestamp), cast(si as timestamp),
>cast(i as timestamp), cast(b as timestamp),
>cast(f as string), cast(d as timestamp),
>cast(bo as timestamp), cast(b * 0 as timestamp),
>cast(ts as timestamp), cast(s as timestamp),
>cast(substr(s, 1, 1) as timestamp)
> from Table1;
> {code}
> Running this query with hive.vectorized.execution.enabled=true fails with the 
> following exception:
> {noformat}
> 13/12/05 07:56:36 ERROR tez.TezJobMonitor: Status: Failed
> Vertex failed, vertexName=Map 1, vertexId=vertex_1386227234886_0482_1_00, 
> diagnostics=[Task failed, taskId=task_1386227234886_0482_1_00_00, 
> diagnostics=[AttemptID:attempt_1386227234886_0482_1_00_00_0 Info:Error: 
> java.lang.RuntimeException: org.apache.hadoop.hive.ql.metadata.HiveException: 
> Hive Runtime Error while processing row
> at 
> org.apache.hadoop.hive.ql.exec.tez.MapRecordProcessor.processRow(MapRecordProcessor.java:205)
> at 
> org.apache.hadoop.hive.ql.exec.tez.MapRecordProcessor.run(MapRecordProcessor.java:171)
> at 
> org.apache.hadoop.hive.ql.exec.tez.TezProcessor.run(TezProcessor.java:112)
> at 
> org.apache.tez.runtime.LogicalIOProcessorRuntimeTask.run(LogicalIOProcessorRuntimeTask.java:201)
> at 
> org.apache.hadoop.mapred.YarnTezDagChild$4.run(YarnTezDagChild.java:484)
> at java.security.AccessController.doPrivileged(Native Method)
> at javax.security.auth.Subject.doAs(Subject.java:396)
> at 
> org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1491)
> at 
> org.apache.hadoop.mapred.YarnTezDagChild.main(YarnTezDagChild.java:474)
> Caused by: org.apache.hadoop.hive.ql.metadata.HiveException: Hive Runtime 
> Error while processing row
> at 
> org.apache.hadoop.hive.ql.exec.vector.VectorMapOperator.process(VectorMapOperator.java:45)
> at 
> org.apache.hadoop.hive.ql.exec.tez.MapRecordProcessor.processRow(MapRecordProcessor.java:193)
> ... 8 more
> Caused by: java.lang.IllegalArgumentException: nanos > 9 or < 0
> at java.sql.Timestamp.setNanos(Timestamp.java:383)
> at 
> org.apache.hadoop.hive.ql.exec.vector.TimestampUtils.assignTimeInNanoSec(TimestampUtils.java:27)
> at 
> org.apache.hadoop.hive.ql.exec.vector.expressions.VectorExpressionWriterFactory$1.writeValue(VectorExpressionWriterFactory.java:412)
> at 
> org.apache.hadoop.hive.ql.exec.vector.expressions.VectorExpressionWriterFactory$VectorExpressionWriterLong.writeValue(VectorExpressionWriterFactory.java:162)
> at 
> org.apache.hadoop.hive.ql.exec.vector.VectorizedRowBatch.toString(VectorizedRowBatch.java:152)
> at 
> org.apache.hadoop.hive.ql.exec.vector.VectorFileSinkOperator.processOp(VectorFileSinkOperator.java:85)
> at org.apache.hadoop.hive.ql.exec.Operator.forward(Operator.java:786)
> at 
> org.apache.hadoop.hive.ql.exec.vector.VectorSelectOperator.processOp(VectorSelectOperator.java:129)
> at org.apache.hadoop.hive.ql.exec.Operator.forward(Operator.java:786)
> at 
> org.apache.hadoop.hive.ql.exec.TableScanOperator.processOp(TableScanOperator.java:93)
> at org.apache.hadoop.hive.ql.exec.Operator.forward(Operator.java:786)
> at 
> org.apache.hadoop.hive.ql.exec.vector.VectorMapOperator.process(VectorMapOperator.java:43)
> ... 9 more
> {noformat}
> Full log is attached.
> Schema for the table is as follows:
> {code}
> hive> desc Table1;
> OK
> t tinyint from deserializer
> sismallintfrom deserializer
> i int from deserializer
> b bigint  from deserializer
> f float   from deserializer
> d double  from deserializer
> boboolean from deserializer
> s string  from deserializer
> s2string  from deserializer
> tstimestamp   from deserializer
> Time taken: 0.521 seconds, Fetched: 10 row(s)
> {code}



--
This message was sent by Atlassian JIRA
(v6.1.4#6159)

[jira] [Commented] (HIVE-5756) Implement vectorization support for IF conditional expression for long, double, timestamp, boolean and string inputs

2013-12-10 Thread Jitendra Nath Pandey (JIRA)


[ 
https://issues.apache.org/jira/browse/HIVE-5756?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13844821#comment-13844821
 ] 

Jitendra Nath Pandey commented on HIVE-5756:


Posted a few comments on the review board. A few more null checks are required 
particularly for the case when first argument is not null but other two 
arguments could still have nulls.

> Implement vectorization support for IF conditional expression for long, 
> double, timestamp, boolean and string inputs
> 
>
> Key: HIVE-5756
> URL: https://issues.apache.org/jira/browse/HIVE-5756
> Project: Hive
>  Issue Type: Sub-task
>Reporter: Eric Hanson
>Assignee: Eric Hanson
> Attachments: HIVE-5756.1.patch, HIVE-5756.2.patch, HIVE-5756.3.patch, 
> HIVE-5756.4.patch, HIVE-5756.5.patch, HIVE-5756.6.patch.txt, 
> HIVE-5756.7.patch, HIVE-5756.8.patch
>
>
> Implement full, end-to-end support for IF in vectorized mode, including new 
> VectorExpression class(es), VectorizationContext translation to a 
> VectorExpression, and unit tests for these, as well as end-to-end ad hoc 
> testing. An end-to-end .q test is recommended but optional.
> This is high priority because IF is the most popular conditional expression.



--
This message was sent by Atlassian JIRA
(v6.1.4#6159)

[jira] [Commented] (HIVE-5756) Implement vectorization support for IF conditional expression for long, double, timestamp, boolean and string inputs

2013-12-10 Thread Jitendra Nath Pandey (JIRA)


[ 
https://issues.apache.org/jira/browse/HIVE-5756?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13844843#comment-13844843
 ] 

Jitendra Nath Pandey commented on HIVE-5756:


Oh yeah, I looked at the older patch, sorry about that. 
The latest patch looks fine to me. +1

> Implement vectorization support for IF conditional expression for long, 
> double, timestamp, boolean and string inputs
> 
>
> Key: HIVE-5756
> URL: https://issues.apache.org/jira/browse/HIVE-5756
> Project: Hive
>  Issue Type: Sub-task
>Reporter: Eric Hanson
>Assignee: Eric Hanson
> Attachments: HIVE-5756.1.patch, HIVE-5756.2.patch, HIVE-5756.3.patch, 
> HIVE-5756.4.patch, HIVE-5756.5.patch, HIVE-5756.6.patch.txt, 
> HIVE-5756.7.patch, HIVE-5756.8.patch
>
>
> Implement full, end-to-end support for IF in vectorized mode, including new 
> VectorExpression class(es), VectorizationContext translation to a 
> VectorExpression, and unit tests for these, as well as end-to-end ad hoc 
> testing. An end-to-end .q test is recommended but optional.
> This is high priority because IF is the most popular conditional expression.



--
This message was sent by Atlassian JIRA
(v6.1.4#6159)

[jira] [Commented] (HIVE-5979) Failure in cast to timestamps.

2013-12-11 Thread Jitendra Nath Pandey (JIRA)


[ 
https://issues.apache.org/jira/browse/HIVE-5979?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13845745#comment-13845745
 ] 

Jitendra Nath Pandey commented on HIVE-5979:


Committed to trunk.

> Failure in cast to timestamps.
> --
>
> Key: HIVE-5979
> URL: https://issues.apache.org/jira/browse/HIVE-5979
> Project: Hive
>  Issue Type: Sub-task
>Reporter: Jitendra Nath Pandey
>Assignee: Jitendra Nath Pandey
> Fix For: 0.13.0
>
> Attachments: HIVE-5979.1.patch, HIVE-5979.2.patch
>
>
> Query ran:
> {code}
> select cast(t as timestamp), cast(si as timestamp),
>cast(i as timestamp), cast(b as timestamp),
>cast(f as string), cast(d as timestamp),
>cast(bo as timestamp), cast(b * 0 as timestamp),
>cast(ts as timestamp), cast(s as timestamp),
>cast(substr(s, 1, 1) as timestamp)
> from Table1;
> {code}
> Running this query with hive.vectorized.execution.enabled=true fails with the 
> following exception:
> {noformat}
> 13/12/05 07:56:36 ERROR tez.TezJobMonitor: Status: Failed
> Vertex failed, vertexName=Map 1, vertexId=vertex_1386227234886_0482_1_00, 
> diagnostics=[Task failed, taskId=task_1386227234886_0482_1_00_00, 
> diagnostics=[AttemptID:attempt_1386227234886_0482_1_00_00_0 Info:Error: 
> java.lang.RuntimeException: org.apache.hadoop.hive.ql.metadata.HiveException: 
> Hive Runtime Error while processing row
> at 
> org.apache.hadoop.hive.ql.exec.tez.MapRecordProcessor.processRow(MapRecordProcessor.java:205)
> at 
> org.apache.hadoop.hive.ql.exec.tez.MapRecordProcessor.run(MapRecordProcessor.java:171)
> at 
> org.apache.hadoop.hive.ql.exec.tez.TezProcessor.run(TezProcessor.java:112)
> at 
> org.apache.tez.runtime.LogicalIOProcessorRuntimeTask.run(LogicalIOProcessorRuntimeTask.java:201)
> at 
> org.apache.hadoop.mapred.YarnTezDagChild$4.run(YarnTezDagChild.java:484)
> at java.security.AccessController.doPrivileged(Native Method)
> at javax.security.auth.Subject.doAs(Subject.java:396)
> at 
> org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1491)
> at 
> org.apache.hadoop.mapred.YarnTezDagChild.main(YarnTezDagChild.java:474)
> Caused by: org.apache.hadoop.hive.ql.metadata.HiveException: Hive Runtime 
> Error while processing row
> at 
> org.apache.hadoop.hive.ql.exec.vector.VectorMapOperator.process(VectorMapOperator.java:45)
> at 
> org.apache.hadoop.hive.ql.exec.tez.MapRecordProcessor.processRow(MapRecordProcessor.java:193)
> ... 8 more
> Caused by: java.lang.IllegalArgumentException: nanos > 9 or < 0
> at java.sql.Timestamp.setNanos(Timestamp.java:383)
> at 
> org.apache.hadoop.hive.ql.exec.vector.TimestampUtils.assignTimeInNanoSec(TimestampUtils.java:27)
> at 
> org.apache.hadoop.hive.ql.exec.vector.expressions.VectorExpressionWriterFactory$1.writeValue(VectorExpressionWriterFactory.java:412)
> at 
> org.apache.hadoop.hive.ql.exec.vector.expressions.VectorExpressionWriterFactory$VectorExpressionWriterLong.writeValue(VectorExpressionWriterFactory.java:162)
> at 
> org.apache.hadoop.hive.ql.exec.vector.VectorizedRowBatch.toString(VectorizedRowBatch.java:152)
> at 
> org.apache.hadoop.hive.ql.exec.vector.VectorFileSinkOperator.processOp(VectorFileSinkOperator.java:85)
> at org.apache.hadoop.hive.ql.exec.Operator.forward(Operator.java:786)
> at 
> org.apache.hadoop.hive.ql.exec.vector.VectorSelectOperator.processOp(VectorSelectOperator.java:129)
> at org.apache.hadoop.hive.ql.exec.Operator.forward(Operator.java:786)
> at 
> org.apache.hadoop.hive.ql.exec.TableScanOperator.processOp(TableScanOperator.java:93)
> at org.apache.hadoop.hive.ql.exec.Operator.forward(Operator.java:786)
> at 
> org.apache.hadoop.hive.ql.exec.vector.VectorMapOperator.process(VectorMapOperator.java:43)
> ... 9 more
> {noformat}
> Full log is attached.
> Schema for the table is as follows:
> {code}
> hive> desc Table1;
> OK
> t tinyint from deserializer
> sismallintfrom deserializer
> i int from deserializer
> b bigint  from deserializer
> f float   from deserializer
> d double  from deserializer
> boboolean from deserializer
> s string  from deserializer
> s2string  from deserializer
> tstimestamp   from deserializer
> Time taken: 0.521 seconds, Fetched: 10 row(s)
> {code}



--
This message w

[jira] [Updated] (HIVE-5979) Failure in cast to timestamps.

2013-12-11 Thread Jitendra Nath Pandey (JIRA)


 [ 
https://issues.apache.org/jira/browse/HIVE-5979?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Jitendra Nath Pandey updated HIVE-5979:
---

Release Note:   (was: Committed to trunk.)

> Failure in cast to timestamps.
> --
>
> Key: HIVE-5979
> URL: https://issues.apache.org/jira/browse/HIVE-5979
> Project: Hive
>  Issue Type: Sub-task
>Reporter: Jitendra Nath Pandey
>Assignee: Jitendra Nath Pandey
> Fix For: 0.13.0
>
> Attachments: HIVE-5979.1.patch, HIVE-5979.2.patch
>
>
> Query ran:
> {code}
> select cast(t as timestamp), cast(si as timestamp),
>cast(i as timestamp), cast(b as timestamp),
>cast(f as string), cast(d as timestamp),
>cast(bo as timestamp), cast(b * 0 as timestamp),
>cast(ts as timestamp), cast(s as timestamp),
>cast(substr(s, 1, 1) as timestamp)
> from Table1;
> {code}
> Running this query with hive.vectorized.execution.enabled=true fails with the 
> following exception:
> {noformat}
> 13/12/05 07:56:36 ERROR tez.TezJobMonitor: Status: Failed
> Vertex failed, vertexName=Map 1, vertexId=vertex_1386227234886_0482_1_00, 
> diagnostics=[Task failed, taskId=task_1386227234886_0482_1_00_00, 
> diagnostics=[AttemptID:attempt_1386227234886_0482_1_00_00_0 Info:Error: 
> java.lang.RuntimeException: org.apache.hadoop.hive.ql.metadata.HiveException: 
> Hive Runtime Error while processing row
> at 
> org.apache.hadoop.hive.ql.exec.tez.MapRecordProcessor.processRow(MapRecordProcessor.java:205)
> at 
> org.apache.hadoop.hive.ql.exec.tez.MapRecordProcessor.run(MapRecordProcessor.java:171)
> at 
> org.apache.hadoop.hive.ql.exec.tez.TezProcessor.run(TezProcessor.java:112)
> at 
> org.apache.tez.runtime.LogicalIOProcessorRuntimeTask.run(LogicalIOProcessorRuntimeTask.java:201)
> at 
> org.apache.hadoop.mapred.YarnTezDagChild$4.run(YarnTezDagChild.java:484)
> at java.security.AccessController.doPrivileged(Native Method)
> at javax.security.auth.Subject.doAs(Subject.java:396)
> at 
> org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1491)
> at 
> org.apache.hadoop.mapred.YarnTezDagChild.main(YarnTezDagChild.java:474)
> Caused by: org.apache.hadoop.hive.ql.metadata.HiveException: Hive Runtime 
> Error while processing row
> at 
> org.apache.hadoop.hive.ql.exec.vector.VectorMapOperator.process(VectorMapOperator.java:45)
> at 
> org.apache.hadoop.hive.ql.exec.tez.MapRecordProcessor.processRow(MapRecordProcessor.java:193)
> ... 8 more
> Caused by: java.lang.IllegalArgumentException: nanos > 9 or < 0
> at java.sql.Timestamp.setNanos(Timestamp.java:383)
> at 
> org.apache.hadoop.hive.ql.exec.vector.TimestampUtils.assignTimeInNanoSec(TimestampUtils.java:27)
> at 
> org.apache.hadoop.hive.ql.exec.vector.expressions.VectorExpressionWriterFactory$1.writeValue(VectorExpressionWriterFactory.java:412)
> at 
> org.apache.hadoop.hive.ql.exec.vector.expressions.VectorExpressionWriterFactory$VectorExpressionWriterLong.writeValue(VectorExpressionWriterFactory.java:162)
> at 
> org.apache.hadoop.hive.ql.exec.vector.VectorizedRowBatch.toString(VectorizedRowBatch.java:152)
> at 
> org.apache.hadoop.hive.ql.exec.vector.VectorFileSinkOperator.processOp(VectorFileSinkOperator.java:85)
> at org.apache.hadoop.hive.ql.exec.Operator.forward(Operator.java:786)
> at 
> org.apache.hadoop.hive.ql.exec.vector.VectorSelectOperator.processOp(VectorSelectOperator.java:129)
> at org.apache.hadoop.hive.ql.exec.Operator.forward(Operator.java:786)
> at 
> org.apache.hadoop.hive.ql.exec.TableScanOperator.processOp(TableScanOperator.java:93)
> at org.apache.hadoop.hive.ql.exec.Operator.forward(Operator.java:786)
> at 
> org.apache.hadoop.hive.ql.exec.vector.VectorMapOperator.process(VectorMapOperator.java:43)
> ... 9 more
> {noformat}
> Full log is attached.
> Schema for the table is as follows:
> {code}
> hive> desc Table1;
> OK
> t tinyint from deserializer
> sismallintfrom deserializer
> i int from deserializer
> b bigint  from deserializer
> f float   from deserializer
> d double  from deserializer
> boboolean from deserializer
> s string  from deserializer
> s2string  from deserializer
> tstimestamp   from deserializer
> Time taken: 0.521 seconds, Fetched: 10 row(s)
> {code}



--
This message was sent by Atlassian JIRA
(v6.

[jira] [Updated] (HIVE-5979) Failure in cast to timestamps.

2013-12-11 Thread Jitendra Nath Pandey (JIRA)


 [ 
https://issues.apache.org/jira/browse/HIVE-5979?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Jitendra Nath Pandey updated HIVE-5979:
---

   Resolution: Fixed
Fix Version/s: 0.13.0
 Release Note: Committed to trunk.
   Status: Resolved  (was: Patch Available)

> Failure in cast to timestamps.
> --
>
> Key: HIVE-5979
> URL: https://issues.apache.org/jira/browse/HIVE-5979
> Project: Hive
>  Issue Type: Sub-task
>Reporter: Jitendra Nath Pandey
>Assignee: Jitendra Nath Pandey
> Fix For: 0.13.0
>
> Attachments: HIVE-5979.1.patch, HIVE-5979.2.patch
>
>
> Query ran:
> {code}
> select cast(t as timestamp), cast(si as timestamp),
>cast(i as timestamp), cast(b as timestamp),
>cast(f as string), cast(d as timestamp),
>cast(bo as timestamp), cast(b * 0 as timestamp),
>cast(ts as timestamp), cast(s as timestamp),
>cast(substr(s, 1, 1) as timestamp)
> from Table1;
> {code}
> Running this query with hive.vectorized.execution.enabled=true fails with the 
> following exception:
> {noformat}
> 13/12/05 07:56:36 ERROR tez.TezJobMonitor: Status: Failed
> Vertex failed, vertexName=Map 1, vertexId=vertex_1386227234886_0482_1_00, 
> diagnostics=[Task failed, taskId=task_1386227234886_0482_1_00_00, 
> diagnostics=[AttemptID:attempt_1386227234886_0482_1_00_00_0 Info:Error: 
> java.lang.RuntimeException: org.apache.hadoop.hive.ql.metadata.HiveException: 
> Hive Runtime Error while processing row
> at 
> org.apache.hadoop.hive.ql.exec.tez.MapRecordProcessor.processRow(MapRecordProcessor.java:205)
> at 
> org.apache.hadoop.hive.ql.exec.tez.MapRecordProcessor.run(MapRecordProcessor.java:171)
> at 
> org.apache.hadoop.hive.ql.exec.tez.TezProcessor.run(TezProcessor.java:112)
> at 
> org.apache.tez.runtime.LogicalIOProcessorRuntimeTask.run(LogicalIOProcessorRuntimeTask.java:201)
> at 
> org.apache.hadoop.mapred.YarnTezDagChild$4.run(YarnTezDagChild.java:484)
> at java.security.AccessController.doPrivileged(Native Method)
> at javax.security.auth.Subject.doAs(Subject.java:396)
> at 
> org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1491)
> at 
> org.apache.hadoop.mapred.YarnTezDagChild.main(YarnTezDagChild.java:474)
> Caused by: org.apache.hadoop.hive.ql.metadata.HiveException: Hive Runtime 
> Error while processing row
> at 
> org.apache.hadoop.hive.ql.exec.vector.VectorMapOperator.process(VectorMapOperator.java:45)
> at 
> org.apache.hadoop.hive.ql.exec.tez.MapRecordProcessor.processRow(MapRecordProcessor.java:193)
> ... 8 more
> Caused by: java.lang.IllegalArgumentException: nanos > 9 or < 0
> at java.sql.Timestamp.setNanos(Timestamp.java:383)
> at 
> org.apache.hadoop.hive.ql.exec.vector.TimestampUtils.assignTimeInNanoSec(TimestampUtils.java:27)
> at 
> org.apache.hadoop.hive.ql.exec.vector.expressions.VectorExpressionWriterFactory$1.writeValue(VectorExpressionWriterFactory.java:412)
> at 
> org.apache.hadoop.hive.ql.exec.vector.expressions.VectorExpressionWriterFactory$VectorExpressionWriterLong.writeValue(VectorExpressionWriterFactory.java:162)
> at 
> org.apache.hadoop.hive.ql.exec.vector.VectorizedRowBatch.toString(VectorizedRowBatch.java:152)
> at 
> org.apache.hadoop.hive.ql.exec.vector.VectorFileSinkOperator.processOp(VectorFileSinkOperator.java:85)
> at org.apache.hadoop.hive.ql.exec.Operator.forward(Operator.java:786)
> at 
> org.apache.hadoop.hive.ql.exec.vector.VectorSelectOperator.processOp(VectorSelectOperator.java:129)
> at org.apache.hadoop.hive.ql.exec.Operator.forward(Operator.java:786)
> at 
> org.apache.hadoop.hive.ql.exec.TableScanOperator.processOp(TableScanOperator.java:93)
> at org.apache.hadoop.hive.ql.exec.Operator.forward(Operator.java:786)
> at 
> org.apache.hadoop.hive.ql.exec.vector.VectorMapOperator.process(VectorMapOperator.java:43)
> ... 9 more
> {noformat}
> Full log is attached.
> Schema for the table is as follows:
> {code}
> hive> desc Table1;
> OK
> t tinyint from deserializer
> sismallintfrom deserializer
> i int from deserializer
> b bigint  from deserializer
> f float   from deserializer
> d double  from deserializer
> boboolean from deserializer
> s string  from deserializer
> s2string  from deserializer
> tstimestamp   from deserializer
> Time taken

[jira] [Commented] (HIVE-5521) Remove CommonRCFileInputFormat

2013-12-11 Thread Jitendra Nath Pandey (JIRA)


[ 
https://issues.apache.org/jira/browse/HIVE-5521?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13845825#comment-13845825
 ] 

Jitendra Nath Pandey commented on HIVE-5521:


+1

> Remove CommonRCFileInputFormat
> --
>
> Key: HIVE-5521
> URL: https://issues.apache.org/jira/browse/HIVE-5521
> Project: Hive
>  Issue Type: Bug
>  Components: File Formats, Vectorization
>Affects Versions: 0.13.0
>Reporter: Ashutosh Chauhan
>Assignee: Ashutosh Chauhan
> Attachments: HIVE-5521.patch
>
>




--
This message was sent by Atlassian JIRA
(v6.1.4#6159)

[jira] [Commented] (HIVE-6010) create a test that would ensure vectorization produces same results as non-vectorized execution

2013-12-16 Thread Jitendra Nath Pandey (JIRA)


[ 
https://issues.apache.org/jira/browse/HIVE-6010?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13849926#comment-13849926
 ] 

Jitendra Nath Pandey commented on HIVE-6010:


+1

> create a test that would ensure vectorization produces same results as 
> non-vectorized execution
> ---
>
> Key: HIVE-6010
> URL: https://issues.apache.org/jira/browse/HIVE-6010
> Project: Hive
>  Issue Type: Test
>  Components: Tests, Vectorization
>Reporter: Sergey Shelukhin
>Assignee: Sergey Shelukhin
> Attachments: HIVE-6010.01.patch, HIVE-6010.02.patch, HIVE-6010.patch
>
>
> So as to ensure that vectorization is not forgotten when changes are made to 
> things. Obviously it would not be viable to have a bulletproof test, but at 
> least a subset of operations can be verified.



--
This message was sent by Atlassian JIRA
(v6.1.4#6159)

[jira] [Updated] (HIVE-5521) Remove CommonRCFileInputFormat

2013-12-16 Thread Jitendra Nath Pandey (JIRA)


 [ 
https://issues.apache.org/jira/browse/HIVE-5521?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Jitendra Nath Pandey updated HIVE-5521:
---

Resolution: Fixed
Status: Resolved  (was: Patch Available)

Committed to trunk. Thanks to Ashutosh!

> Remove CommonRCFileInputFormat
> --
>
> Key: HIVE-5521
> URL: https://issues.apache.org/jira/browse/HIVE-5521
> Project: Hive
>  Issue Type: Bug
>  Components: File Formats, Vectorization
>Affects Versions: 0.13.0
>Reporter: Ashutosh Chauhan
>Assignee: Ashutosh Chauhan
> Fix For: 0.13.0
>
> Attachments: HIVE-5521.patch
>
>




--
This message was sent by Atlassian JIRA
(v6.1.4#6159)

[jira] [Updated] (HIVE-5521) Remove CommonRCFileInputFormat

2013-12-16 Thread Jitendra Nath Pandey (JIRA)


 [ 
https://issues.apache.org/jira/browse/HIVE-5521?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Jitendra Nath Pandey updated HIVE-5521:
---

Fix Version/s: 0.13.0

> Remove CommonRCFileInputFormat
> --
>
> Key: HIVE-5521
> URL: https://issues.apache.org/jira/browse/HIVE-5521
> Project: Hive
>  Issue Type: Bug
>  Components: File Formats, Vectorization
>Affects Versions: 0.13.0
>Reporter: Ashutosh Chauhan
>Assignee: Ashutosh Chauhan
> Fix For: 0.13.0
>
> Attachments: HIVE-5521.patch
>
>




--
This message was sent by Atlassian JIRA
(v6.1.4#6159)

[jira] [Commented] (HIVE-6017) Contribute Decimal128 high-performance decimal(p, s) package from Microsoft to Hive

2013-12-30 Thread Jitendra Nath Pandey (JIRA)


[ 
https://issues.apache.org/jira/browse/HIVE-6017?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13859172#comment-13859172
 ] 

Jitendra Nath Pandey commented on HIVE-6017:


The code looks good to me. +1
It seems the copywrite needs to be mentioned in the NOTICE file as well, 
although I am not an expert on these rules. Please also refer to 
http://www.apache.org/licenses/ to comply with the guidelines when submitting 
code with employer copywrite or third-party code. Does it require a Software 
Grant Agreement (SGA) with PMC?


> Contribute Decimal128 high-performance decimal(p, s) package from Microsoft 
> to Hive
> ---
>
> Key: HIVE-6017
> URL: https://issues.apache.org/jira/browse/HIVE-6017
> Project: Hive
>  Issue Type: Sub-task
>Affects Versions: 0.13.0
>Reporter: Eric Hanson
>Assignee: Eric Hanson
> Attachments: HIVE-6017.01.patch, HIVE-6017.02.patch, 
> HIVE-6017.03.patch, HIVE-6017.04.patch
>
>
> Contribute the Decimal128 high-performance decimal package developed by 
> Microsoft to Hive. This was originally written for Microsoft PolyBase by 
> Hideaki Kimura.
> This code is about 8X more efficient than Java BigDecimal for typical 
> operations. It uses a finite (128 bit) precision and can handle up to 
> decimal(38, X). It is also "mutable" so you can change the contents of an 
> existing object. This helps reduce the cost of new() and garbage collection.



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)

[jira] [Commented] (HIVE-5757) Implement vectorized support for CASE

2014-01-02 Thread Jitendra Nath Pandey (JIRA)


[ 
https://issues.apache.org/jira/browse/HIVE-5757?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13860497#comment-13860497
 ] 

Jitendra Nath Pandey commented on HIVE-5757:


+1, looks ok to me as well.

> Implement vectorized support for CASE
> -
>
> Key: HIVE-5757
> URL: https://issues.apache.org/jira/browse/HIVE-5757
> Project: Hive
>  Issue Type: Sub-task
>Affects Versions: 0.13.0
>Reporter: Eric Hanson
>Assignee: Eric Hanson
> Attachments: HIVE-5757.1.patch, HIVE-5757.2.patch, HIVE-5757.3.patch, 
> HIVE-5757.4.patch
>
>
> Implement support for CASE in vectorized mode. The approach is to use the 
> vectorized UDF adaptor internally. A higher-performance version that used 
> VectorExpression subclasses was considered but not done due to complexity. 
> Such a version potentially could be done in the future if it's important 
> enough.
> This is high priority because CASE is a fairly popular expression.



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)

[jira] [Updated] (HIVE-6010) create a test that would ensure vectorization produces same results as non-vectorized execution

2014-01-02 Thread Jitendra Nath Pandey (JIRA)


 [ 
https://issues.apache.org/jira/browse/HIVE-6010?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Jitendra Nath Pandey updated HIVE-6010:
---

   Resolution: Fixed
Fix Version/s: 0.13.0
 Release Note: Committed to trunk. Thanks Sergey!
   Status: Resolved  (was: Patch Available)

> create a test that would ensure vectorization produces same results as 
> non-vectorized execution
> ---
>
> Key: HIVE-6010
> URL: https://issues.apache.org/jira/browse/HIVE-6010
> Project: Hive
>  Issue Type: Test
>  Components: Tests, Vectorization
>Reporter: Sergey Shelukhin
>Assignee: Sergey Shelukhin
> Fix For: 0.13.0
>
> Attachments: HIVE-6010.01.patch, HIVE-6010.02.patch, 
> HIVE-6010.03.patch, HIVE-6010.04.patch, HIVE-6010.patch
>
>
> So as to ensure that vectorization is not forgotten when changes are made to 
> things. Obviously it would not be viable to have a bulletproof test, but at 
> least a subset of operations can be verified.



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)

[jira] [Commented] (HIVE-6051) Create DecimalColumnVector and a representative VectorExpression for decimal

2014-01-06 Thread Jitendra Nath Pandey (JIRA)


[ 
https://issues.apache.org/jira/browse/HIVE-6051?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13863794#comment-13863794
 ] 

Jitendra Nath Pandey commented on HIVE-6051:


I suppose DecimalColAddDecimalColumn will be a generated code from template. 
Will it be deleted when the template is added? I am fine with that, however.

+1 lgtm.

> Create DecimalColumnVector and a representative VectorExpression for decimal
> 
>
> Key: HIVE-6051
> URL: https://issues.apache.org/jira/browse/HIVE-6051
> Project: Hive
>  Issue Type: Sub-task
>Affects Versions: 0.13.0
>Reporter: Eric Hanson
>Assignee: Eric Hanson
> Fix For: 0.13.0
>
> Attachments: HIVE-6051.01.patch, HIVE-6051.02.patch
>
>
> Create a DecimalColumnVector to use as a basis for vectorized decimal 
> operations. Include a representative VectorExpression on decimal (e.g. 
> column-column addition) to demonstrate it's use.



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)

[jira] [Created] (HIVE-6156) Implement vectorized reader for Date datatype for ORC format.

2014-01-07 Thread Jitendra Nath Pandey (JIRA)

Jitendra Nath Pandey created HIVE-6156:
--

 Summary: Implement vectorized reader for Date datatype for ORC 
format.
 Key: HIVE-6156
 URL: https://issues.apache.org/jira/browse/HIVE-6156
 Project: Hive
  Issue Type: Sub-task
Reporter: Jitendra Nath Pandey
Assignee: Jitendra Nath Pandey


We need to implement vectorized reader for Date datatype for ORC format.



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)

[jira] [Updated] (HIVE-5595) Implement vectorized SMB JOIN

2014-01-07 Thread Jitendra Nath Pandey (JIRA)


 [ 
https://issues.apache.org/jira/browse/HIVE-5595?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Jitendra Nath Pandey updated HIVE-5595:
---

Description: Vectorized implementation of SMB Map Join.

> Implement vectorized SMB JOIN
> -
>
> Key: HIVE-5595
> URL: https://issues.apache.org/jira/browse/HIVE-5595
> Project: Hive
>  Issue Type: Sub-task
>Reporter: Remus Rusanu
>Assignee: Remus Rusanu
>Priority: Critical
> Attachments: HIVE-5595.1.patch, HIVE-5595.2.patch
>
>   Original Estimate: 168h
>  Remaining Estimate: 168h
>
> Vectorized implementation of SMB Map Join.



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)

[jira] [Updated] (HIVE-6156) Implement vectorized reader for Date datatype for ORC format.

2014-01-09 Thread Jitendra Nath Pandey (JIRA)


 [ 
https://issues.apache.org/jira/browse/HIVE-6156?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Jitendra Nath Pandey updated HIVE-6156:
---

Attachment: HIVE-6156.1.patch

> Implement vectorized reader for Date datatype for ORC format.
> -
>
> Key: HIVE-6156
> URL: https://issues.apache.org/jira/browse/HIVE-6156
> Project: Hive
>  Issue Type: Sub-task
>Reporter: Jitendra Nath Pandey
>Assignee: Jitendra Nath Pandey
> Attachments: HIVE-6156.1.patch
>
>
> We need to implement vectorized reader for Date datatype for ORC format.



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)

[jira] [Updated] (HIVE-6156) Implement vectorized reader for Date datatype for ORC format.

2014-01-09 Thread Jitendra Nath Pandey (JIRA)


 [ 
https://issues.apache.org/jira/browse/HIVE-6156?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Jitendra Nath Pandey updated HIVE-6156:
---

Status: Patch Available  (was: Open)

> Implement vectorized reader for Date datatype for ORC format.
> -
>
> Key: HIVE-6156
> URL: https://issues.apache.org/jira/browse/HIVE-6156
> Project: Hive
>  Issue Type: Sub-task
>Reporter: Jitendra Nath Pandey
>Assignee: Jitendra Nath Pandey
> Attachments: HIVE-6156.1.patch
>
>
> We need to implement vectorized reader for Date datatype for ORC format.



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)

[jira] [Commented] (HIVE-6156) Implement vectorized reader for Date datatype for ORC format.

2014-01-09 Thread Jitendra Nath Pandey (JIRA)


[ 
https://issues.apache.org/jira/browse/HIVE-6156?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13867046#comment-13867046
 ] 

Jitendra Nath Pandey commented on HIVE-6156:


Review board entry for the patch: https://reviews.apache.org/r/16760/

> Implement vectorized reader for Date datatype for ORC format.
> -
>
> Key: HIVE-6156
> URL: https://issues.apache.org/jira/browse/HIVE-6156
> Project: Hive
>  Issue Type: Sub-task
>Reporter: Jitendra Nath Pandey
>Assignee: Jitendra Nath Pandey
> Attachments: HIVE-6156.1.patch
>
>
> We need to implement vectorized reader for Date datatype for ORC format.



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)

[jira] [Created] (HIVE-6178) Implement vectorized reader for DECIMAL datatype for ORC format.

2014-01-09 Thread Jitendra Nath Pandey (JIRA)

Jitendra Nath Pandey created HIVE-6178:
--

 Summary: Implement vectorized reader for DECIMAL datatype for ORC 
format.
 Key: HIVE-6178
 URL: https://issues.apache.org/jira/browse/HIVE-6178
 Project: Hive
  Issue Type: Sub-task
Reporter: Jitendra Nath Pandey
Assignee: Jitendra Nath Pandey


Implement vectorized reader for DECIMAL datatype for ORC format.



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)

[jira] [Commented] (HIVE-6067) Implement vectorized decimal comparison filters

2014-01-09 Thread Jitendra Nath Pandey (JIRA)


[ 
https://issues.apache.org/jira/browse/HIVE-6067?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13867452#comment-13867452
 ] 

Jitendra Nath Pandey commented on HIVE-6067:


I have posted a comment on review board. 
+1 for the patch conditional on the comment being addressed.

> Implement vectorized decimal comparison filters
> ---
>
> Key: HIVE-6067
> URL: https://issues.apache.org/jira/browse/HIVE-6067
> Project: Hive
>  Issue Type: Sub-task
>Affects Versions: 0.13.0
>Reporter: Eric Hanson
>Assignee: Eric Hanson
> Attachments: HIVE-6067.01.patch, HIVE-6067.02.patch, 
> HIVE-6067.03.patch, HIVE-6067.03.patch
>
>
> Using the new DecimalColumnVector type, implement templates to generate 
> VectorExpression subclasses for Decimal comparison filters (<, <=, >, >=, =, 
> !=). Include scalar-column, column-scalar, and column-column filter cases. 
> Include unit tests.



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)

[jira] [Commented] (HIVE-6067) Implement vectorized decimal comparison filters

2014-01-10 Thread Jitendra Nath Pandey (JIRA)


[ 
https://issues.apache.org/jira/browse/HIVE-6067?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13868343#comment-13868343
 ] 

Jitendra Nath Pandey commented on HIVE-6067:


+1

> Implement vectorized decimal comparison filters
> ---
>
> Key: HIVE-6067
> URL: https://issues.apache.org/jira/browse/HIVE-6067
> Project: Hive
>  Issue Type: Sub-task
>Affects Versions: 0.13.0
>Reporter: Eric Hanson
>Assignee: Eric Hanson
> Attachments: HIVE-6067.01.patch, HIVE-6067.02.patch, 
> HIVE-6067.03.patch, HIVE-6067.03.patch, HIVE-6067.04.patch
>
>
> Using the new DecimalColumnVector type, implement templates to generate 
> VectorExpression subclasses for Decimal comparison filters (<, <=, >, >=, =, 
> !=). Include scalar-column, column-scalar, and column-column filter cases. 
> Include unit tests.



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)

[jira] [Updated] (HIVE-6156) Implement vectorized reader for Date datatype for ORC format.

2014-01-10 Thread Jitendra Nath Pandey (JIRA)


 [ 
https://issues.apache.org/jira/browse/HIVE-6156?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Jitendra Nath Pandey updated HIVE-6156:
---

Attachment: HIVE-6156.2.patch

Updated patch addressing the comment.

> Implement vectorized reader for Date datatype for ORC format.
> -
>
> Key: HIVE-6156
> URL: https://issues.apache.org/jira/browse/HIVE-6156
> Project: Hive
>  Issue Type: Sub-task
>Reporter: Jitendra Nath Pandey
>Assignee: Jitendra Nath Pandey
> Attachments: HIVE-6156.1.patch, HIVE-6156.2.patch
>
>
> We need to implement vectorized reader for Date datatype for ORC format.



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)

[jira] [Updated] (HIVE-6156) Implement vectorized reader for Date datatype for ORC format.

2014-01-10 Thread Jitendra Nath Pandey (JIRA)


 [ 
https://issues.apache.org/jira/browse/HIVE-6156?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Jitendra Nath Pandey updated HIVE-6156:
---

Status: Open  (was: Patch Available)

> Implement vectorized reader for Date datatype for ORC format.
> -
>
> Key: HIVE-6156
> URL: https://issues.apache.org/jira/browse/HIVE-6156
> Project: Hive
>  Issue Type: Sub-task
>Reporter: Jitendra Nath Pandey
>Assignee: Jitendra Nath Pandey
> Attachments: HIVE-6156.1.patch, HIVE-6156.2.patch
>
>
> We need to implement vectorized reader for Date datatype for ORC format.



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)

[jira] [Resolved] (HIVE-6156) Implement vectorized reader for Date datatype for ORC format.

2014-01-13 Thread Jitendra Nath Pandey (JIRA)


 [ 
https://issues.apache.org/jira/browse/HIVE-6156?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Jitendra Nath Pandey resolved HIVE-6156.


Resolution: Fixed

Committed to trunk.

> Implement vectorized reader for Date datatype for ORC format.
> -
>
> Key: HIVE-6156
> URL: https://issues.apache.org/jira/browse/HIVE-6156
> Project: Hive
>  Issue Type: Sub-task
>Reporter: Jitendra Nath Pandey
>Assignee: Jitendra Nath Pandey
> Attachments: HIVE-6156.1.patch, HIVE-6156.2.patch
>
>
> We need to implement vectorized reader for Date datatype for ORC format.



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)

[jira] [Commented] (HIVE-6124) Support basic Decimal arithmetic in vector mode (+, -, *)

2014-01-15 Thread Jitendra Nath Pandey (JIRA)


[ 
https://issues.apache.org/jira/browse/HIVE-6124?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13872491#comment-13872491
 ] 

Jitendra Nath Pandey commented on HIVE-6124:


Posted a comment on review board.
+1 otherwise.

> Support basic Decimal arithmetic in vector mode (+, -, *)
> -
>
> Key: HIVE-6124
> URL: https://issues.apache.org/jira/browse/HIVE-6124
> Project: Hive
>  Issue Type: Sub-task
>Affects Versions: 0.13.0
>Reporter: Eric Hanson
>Assignee: Eric Hanson
> Attachments: HIVE-6124.01.patch, HIVE-6124.02.patch, 
> HIVE-6124.03.patch
>
>
> Create support for basic decimal arithmetic (+, -, * but not /, %) based on 
> templates for column-scalar, scalar-column, and column-column operations.



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)

[jira] [Commented] (HIVE-6124) Support basic Decimal arithmetic in vector mode (+, -, *)

2014-01-15 Thread Jitendra Nath Pandey (JIRA)


[ 
https://issues.apache.org/jira/browse/HIVE-6124?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13872687#comment-13872687
 ] 

Jitendra Nath Pandey commented on HIVE-6124:


+1

> Support basic Decimal arithmetic in vector mode (+, -, *)
> -
>
> Key: HIVE-6124
> URL: https://issues.apache.org/jira/browse/HIVE-6124
> Project: Hive
>  Issue Type: Sub-task
>Affects Versions: 0.13.0
>Reporter: Eric Hanson
>Assignee: Eric Hanson
> Attachments: HIVE-6124.01.patch, HIVE-6124.02.patch, 
> HIVE-6124.03.patch, HIVE-6124.04.patch
>
>
> Create support for basic decimal arithmetic (+, -, * but not /, %) based on 
> templates for column-scalar, scalar-column, and column-column operations.



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)

[jira] [Commented] (HIVE-6186) error in vectorized Column-Column comparison filter for repeating case

2014-01-16 Thread Jitendra Nath Pandey (JIRA)


[ 
https://issues.apache.org/jira/browse/HIVE-6186?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13874234#comment-13874234
 ] 

Jitendra Nath Pandey commented on HIVE-6186:


+1

> error in vectorized Column-Column comparison filter for repeating case
> --
>
> Key: HIVE-6186
> URL: https://issues.apache.org/jira/browse/HIVE-6186
> Project: Hive
>  Issue Type: Sub-task
>  Components: Query Processor
>Affects Versions: 0.13.0
>Reporter: Eric Hanson
>Assignee: Eric Hanson
> Attachments: HIVE-6186.01.patch
>
>
> FilterColumnCompareColumn.txt  contains this code:
> } else if (inputColVector1.isRepeating && inputColVector2.isRepeating) {
>   if (nullPos1[0] || nullPos2[0]) {
> batch.size = 0; 
>   } 
> This code examines nullPos1[0] and nullPos2[0] without checking that 
> noNulls==false for the appropriate vector. This could lead to incorrect 
> results.
> We should review the case analysis for this template more generally to make 
> sure that all combinations are covered:
> left.noNullsright.noNulls
> T T
> F T
> T F
> F F 



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)

[jira] [Updated] (HIVE-6178) Implement vectorized reader for DECIMAL datatype for ORC format.

2014-01-16 Thread Jitendra Nath Pandey (JIRA)


 [ 
https://issues.apache.org/jira/browse/HIVE-6178?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Jitendra Nath Pandey updated HIVE-6178:
---

Attachment: HIVE-6178.1.patch

> Implement vectorized reader for DECIMAL datatype for ORC format.
> 
>
> Key: HIVE-6178
> URL: https://issues.apache.org/jira/browse/HIVE-6178
> Project: Hive
>  Issue Type: Sub-task
>Reporter: Jitendra Nath Pandey
>Assignee: Jitendra Nath Pandey
> Attachments: HIVE-6178.1.patch
>
>
> Implement vectorized reader for DECIMAL datatype for ORC format.



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)

[jira] [Commented] (HIVE-6178) Implement vectorized reader for DECIMAL datatype for ORC format.

2014-01-16 Thread Jitendra Nath Pandey (JIRA)


[ 
https://issues.apache.org/jira/browse/HIVE-6178?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13874242#comment-13874242
 ] 

Jitendra Nath Pandey commented on HIVE-6178:


Review board entry:
https://reviews.apache.org/r/17005/

> Implement vectorized reader for DECIMAL datatype for ORC format.
> 
>
> Key: HIVE-6178
> URL: https://issues.apache.org/jira/browse/HIVE-6178
> Project: Hive
>  Issue Type: Sub-task
>Reporter: Jitendra Nath Pandey
>Assignee: Jitendra Nath Pandey
> Attachments: HIVE-6178.1.patch
>
>
> Implement vectorized reader for DECIMAL datatype for ORC format.



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)

[jira] [Updated] (HIVE-6178) Implement vectorized reader for DECIMAL datatype for ORC format.

2014-01-16 Thread Jitendra Nath Pandey (JIRA)


 [ 
https://issues.apache.org/jira/browse/HIVE-6178?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Jitendra Nath Pandey updated HIVE-6178:
---

Status: Patch Available  (was: Open)

> Implement vectorized reader for DECIMAL datatype for ORC format.
> 
>
> Key: HIVE-6178
> URL: https://issues.apache.org/jira/browse/HIVE-6178
> Project: Hive
>  Issue Type: Sub-task
>Reporter: Jitendra Nath Pandey
>Assignee: Jitendra Nath Pandey
> Attachments: HIVE-6178.1.patch
>
>
> Implement vectorized reader for DECIMAL datatype for ORC format.



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)

[jira] [Commented] (HIVE-6183) Implement vectorized type cast from/to decimal(p, s)

2014-01-21 Thread Jitendra Nath Pandey (JIRA)


[ 
https://issues.apache.org/jira/browse/HIVE-6183?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13878343#comment-13878343
 ] 

Jitendra Nath Pandey commented on HIVE-6183:


The code looks good. +1
Some minor comments.
* DecimalColumnVector#checkPrecisionOverflow should catch a more specific 
exception for overflow.
* CastDecimalToDecimal#func is too generic a name. We could rename it to 
something indicating what it does.
* CastDecimalToTimestamp#initialize: tmp and tmpE9 can be initialized 
statically in the class. It seems we can get rid of the initialize method from 
the base classes as well.





> Implement vectorized type cast from/to decimal(p, s)
> 
>
> Key: HIVE-6183
> URL: https://issues.apache.org/jira/browse/HIVE-6183
> Project: Hive
>  Issue Type: Sub-task
>Affects Versions: 0.13.0
>Reporter: Eric Hanson
>Assignee: Eric Hanson
> Attachments: HIVE-6183.07.patch, HIVE-6183.08.patch
>
>
> Add support for all the type supported type casts to/from decimal(p,s) in 
> vectorized mode.



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)

[jira] [Commented] (HIVE-6183) Implement vectorized type cast from/to decimal(p, s)

2014-01-23 Thread Jitendra Nath Pandey (JIRA)


[ 
https://issues.apache.org/jira/browse/HIVE-6183?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13880611#comment-13880611
 ] 

Jitendra Nath Pandey commented on HIVE-6183:


+1

> Implement vectorized type cast from/to decimal(p, s)
> 
>
> Key: HIVE-6183
> URL: https://issues.apache.org/jira/browse/HIVE-6183
> Project: Hive
>  Issue Type: Sub-task
>Affects Versions: 0.13.0
>Reporter: Eric Hanson
>Assignee: Eric Hanson
> Attachments: HIVE-6183.07.patch, HIVE-6183.08.patch, 
> HIVE-6183.09.patch, HIVE-6183.09.patch, HIVE-6183.10.patch
>
>
> Add support for all the type supported type casts to/from decimal(p,s) in 
> vectorized mode.



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)

[jira] [Created] (HIVE-6303) VectorExpressionWriter for date and decimal datatypes.

2014-01-23 Thread Jitendra Nath Pandey (JIRA)

Jitendra Nath Pandey created HIVE-6303:
--

 Summary: VectorExpressionWriter for date and decimal datatypes.
 Key: HIVE-6303
 URL: https://issues.apache.org/jira/browse/HIVE-6303
 Project: Hive
  Issue Type: Sub-task
Reporter: Jitendra Nath Pandey
Assignee: Jitendra Nath Pandey


VectorExpressionWriter for date and decimal datatypes.



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)

[jira] [Commented] (HIVE-6243) error in high-precision division for Decimal128

2014-01-24 Thread Jitendra Nath Pandey (JIRA)


[ 
https://issues.apache.org/jira/browse/HIVE-6243?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13881360#comment-13881360
 ] 

Jitendra Nath Pandey commented on HIVE-6243:


+1

> error in high-precision division for Decimal128
> ---
>
> Key: HIVE-6243
> URL: https://issues.apache.org/jira/browse/HIVE-6243
> Project: Hive
>  Issue Type: Sub-task
>Reporter: Eric Hanson
>Assignee: Eric Hanson
> Attachments: HIVE-6243.01.patch, HIVE-6243.02.patch, 
> divide-error.01.patch
>
>
> a = 213474114411690
> b = 5062120663
> a * b = 1080631725579042037750470
> (a * b) / b ==> 
>   actual:   251599050984618
>   expected: 213474114411690



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)

[jira] [Updated] (HIVE-6178) Implement vectorized reader for DECIMAL datatype for ORC format.

2014-01-24 Thread Jitendra Nath Pandey (JIRA)


 [ 
https://issues.apache.org/jira/browse/HIVE-6178?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Jitendra Nath Pandey updated HIVE-6178:
---

Attachment: HIVE-6178.2.patch

> Implement vectorized reader for DECIMAL datatype for ORC format.
> 
>
> Key: HIVE-6178
> URL: https://issues.apache.org/jira/browse/HIVE-6178
> Project: Hive
>  Issue Type: Sub-task
>Reporter: Jitendra Nath Pandey
>Assignee: Jitendra Nath Pandey
> Attachments: HIVE-6178.1.patch, HIVE-6178.2.patch
>
>
> Implement vectorized reader for DECIMAL datatype for ORC format.



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)

[jira] [Commented] (HIVE-6178) Implement vectorized reader for DECIMAL datatype for ORC format.

2014-01-24 Thread Jitendra Nath Pandey (JIRA)


[ 
https://issues.apache.org/jira/browse/HIVE-6178?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13881492#comment-13881492
 ] 

Jitendra Nath Pandey commented on HIVE-6178:


Uploaded a new patch addressing a few comments. I have also posted an 
explanation for handling variable scales.

> Implement vectorized reader for DECIMAL datatype for ORC format.
> 
>
> Key: HIVE-6178
> URL: https://issues.apache.org/jira/browse/HIVE-6178
> Project: Hive
>  Issue Type: Sub-task
>Reporter: Jitendra Nath Pandey
>Assignee: Jitendra Nath Pandey
> Attachments: HIVE-6178.1.patch, HIVE-6178.2.patch
>
>
> Implement vectorized reader for DECIMAL datatype for ORC format.



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)

[jira] [Updated] (HIVE-6178) Implement vectorized reader for DECIMAL datatype for ORC format.

2014-01-24 Thread Jitendra Nath Pandey (JIRA)


 [ 
https://issues.apache.org/jira/browse/HIVE-6178?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Jitendra Nath Pandey updated HIVE-6178:
---

Status: Open  (was: Patch Available)

> Implement vectorized reader for DECIMAL datatype for ORC format.
> 
>
> Key: HIVE-6178
> URL: https://issues.apache.org/jira/browse/HIVE-6178
> Project: Hive
>  Issue Type: Sub-task
>Reporter: Jitendra Nath Pandey
>Assignee: Jitendra Nath Pandey
> Attachments: HIVE-6178.1.patch, HIVE-6178.2.patch
>
>
> Implement vectorized reader for DECIMAL datatype for ORC format.



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)

[jira] [Updated] (HIVE-6178) Implement vectorized reader for DECIMAL datatype for ORC format.

2014-01-27 Thread Jitendra Nath Pandey (JIRA)


 [ 
https://issues.apache.org/jira/browse/HIVE-6178?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Jitendra Nath Pandey updated HIVE-6178:
---

Status: Patch Available  (was: Open)

Updated patch to fix the scale if the data has a different scale than the one 
in schema.

> Implement vectorized reader for DECIMAL datatype for ORC format.
> 
>
> Key: HIVE-6178
> URL: https://issues.apache.org/jira/browse/HIVE-6178
> Project: Hive
>  Issue Type: Sub-task
>Reporter: Jitendra Nath Pandey
>Assignee: Jitendra Nath Pandey
> Attachments: HIVE-6178.1.patch, HIVE-6178.2.patch, HIVE-6178.3.patch
>
>
> Implement vectorized reader for DECIMAL datatype for ORC format.



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)

[jira] [Updated] (HIVE-6178) Implement vectorized reader for DECIMAL datatype for ORC format.

2014-01-27 Thread Jitendra Nath Pandey (JIRA)


 [ 
https://issues.apache.org/jira/browse/HIVE-6178?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Jitendra Nath Pandey updated HIVE-6178:
---

Attachment: HIVE-6178.3.patch

> Implement vectorized reader for DECIMAL datatype for ORC format.
> 
>
> Key: HIVE-6178
> URL: https://issues.apache.org/jira/browse/HIVE-6178
> Project: Hive
>  Issue Type: Sub-task
>Reporter: Jitendra Nath Pandey
>Assignee: Jitendra Nath Pandey
> Attachments: HIVE-6178.1.patch, HIVE-6178.2.patch, HIVE-6178.3.patch
>
>
> Implement vectorized reader for DECIMAL datatype for ORC format.



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)

[jira] [Updated] (HIVE-6178) Implement vectorized reader for DECIMAL datatype for ORC format.

2014-01-28 Thread Jitendra Nath Pandey (JIRA)


 [ 
https://issues.apache.org/jira/browse/HIVE-6178?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Jitendra Nath Pandey updated HIVE-6178:
---

Attachment: HIVE-6178.4.patch

> Implement vectorized reader for DECIMAL datatype for ORC format.
> 
>
> Key: HIVE-6178
> URL: https://issues.apache.org/jira/browse/HIVE-6178
> Project: Hive
>  Issue Type: Sub-task
>Reporter: Jitendra Nath Pandey
>Assignee: Jitendra Nath Pandey
> Attachments: HIVE-6178.1.patch, HIVE-6178.2.patch, HIVE-6178.3.patch, 
> HIVE-6178.4.patch
>
>
> Implement vectorized reader for DECIMAL datatype for ORC format.



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)

[jira] [Updated] (HIVE-6178) Implement vectorized reader for DECIMAL datatype for ORC format.

2014-01-28 Thread Jitendra Nath Pandey (JIRA)


 [ 
https://issues.apache.org/jira/browse/HIVE-6178?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Jitendra Nath Pandey updated HIVE-6178:
---

Status: Open  (was: Patch Available)

> Implement vectorized reader for DECIMAL datatype for ORC format.
> 
>
> Key: HIVE-6178
> URL: https://issues.apache.org/jira/browse/HIVE-6178
> Project: Hive
>  Issue Type: Sub-task
>Reporter: Jitendra Nath Pandey
>Assignee: Jitendra Nath Pandey
> Attachments: HIVE-6178.1.patch, HIVE-6178.2.patch, HIVE-6178.3.patch, 
> HIVE-6178.4.patch
>
>
> Implement vectorized reader for DECIMAL datatype for ORC format.



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)

[jira] [Updated] (HIVE-6178) Implement vectorized reader for DECIMAL datatype for ORC format.

2014-01-28 Thread Jitendra Nath Pandey (JIRA)


 [ 
https://issues.apache.org/jira/browse/HIVE-6178?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Jitendra Nath Pandey updated HIVE-6178:
---

Status: Patch Available  (was: Open)

> Implement vectorized reader for DECIMAL datatype for ORC format.
> 
>
> Key: HIVE-6178
> URL: https://issues.apache.org/jira/browse/HIVE-6178
> Project: Hive
>  Issue Type: Sub-task
>Reporter: Jitendra Nath Pandey
>Assignee: Jitendra Nath Pandey
> Attachments: HIVE-6178.1.patch, HIVE-6178.2.patch, HIVE-6178.3.patch, 
> HIVE-6178.4.patch
>
>
> Implement vectorized reader for DECIMAL datatype for ORC format.



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)

[jira] [Commented] (HIVE-6178) Implement vectorized reader for DECIMAL datatype for ORC format.

2014-01-28 Thread Jitendra Nath Pandey (JIRA)


[ 
https://issues.apache.org/jira/browse/HIVE-6178?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13884690#comment-13884690
 ] 

Jitendra Nath Pandey commented on HIVE-6178:


Uploaded another patch fixing the style issues. Updated review board as well.
The failed test is not related to the patch and passes in my environment.

> Implement vectorized reader for DECIMAL datatype for ORC format.
> 
>
> Key: HIVE-6178
> URL: https://issues.apache.org/jira/browse/HIVE-6178
> Project: Hive
>  Issue Type: Sub-task
>Reporter: Jitendra Nath Pandey
>Assignee: Jitendra Nath Pandey
> Attachments: HIVE-6178.1.patch, HIVE-6178.2.patch, HIVE-6178.3.patch, 
> HIVE-6178.4.patch
>
>
> Implement vectorized reader for DECIMAL datatype for ORC format.



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)

[jira] [Commented] (HIVE-6139) Implement vectorized decimal division and modulo

2014-01-28 Thread Jitendra Nath Pandey (JIRA)


[ 
https://issues.apache.org/jira/browse/HIVE-6139?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13885024#comment-13885024
 ] 

Jitendra Nath Pandey commented on HIVE-6139:


Posted comments on review board.

> Implement vectorized decimal division and modulo
> 
>
> Key: HIVE-6139
> URL: https://issues.apache.org/jira/browse/HIVE-6139
> Project: Hive
>  Issue Type: Sub-task
>Affects Versions: 0.13.0
>Reporter: Eric Hanson
>Assignee: Eric Hanson
> Attachments: HIVE-6139.01.patch, HIVE-6139.02.patch, 
> HIVE-6139.07.patch, HIVE-6139.07.patch, HIVE-6139.08.patch, HIVE-6139.09.patch
>
>
> Support column-scalar, scalar-column, and column-column versions for division 
> and modulo. Include unit tests.



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)

[jira] [Updated] (HIVE-6303) VectorExpressionWriter for date and decimal datatypes.

2014-01-28 Thread Jitendra Nath Pandey (JIRA)


 [ 
https://issues.apache.org/jira/browse/HIVE-6303?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Jitendra Nath Pandey updated HIVE-6303:
---

Attachment: HIVE-6303.1.patch

> VectorExpressionWriter for date and decimal datatypes.
> --
>
> Key: HIVE-6303
> URL: https://issues.apache.org/jira/browse/HIVE-6303
> Project: Hive
>  Issue Type: Sub-task
>Reporter: Jitendra Nath Pandey
>Assignee: Jitendra Nath Pandey
> Attachments: HIVE-6303.1.patch
>
>
> VectorExpressionWriter for date and decimal datatypes.



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)

[jira] [Commented] (HIVE-6139) Implement vectorized decimal division and modulo

2014-01-29 Thread Jitendra Nath Pandey (JIRA)


[ 
https://issues.apache.org/jira/browse/HIVE-6139?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13885816#comment-13885816
 ] 

Jitendra Nath Pandey commented on HIVE-6139:


+1

> Implement vectorized decimal division and modulo
> 
>
> Key: HIVE-6139
> URL: https://issues.apache.org/jira/browse/HIVE-6139
> Project: Hive
>  Issue Type: Sub-task
>Affects Versions: 0.13.0
>Reporter: Eric Hanson
>Assignee: Eric Hanson
> Attachments: HIVE-6139.01.patch, HIVE-6139.02.patch, 
> HIVE-6139.07.patch, HIVE-6139.07.patch, HIVE-6139.08.patch, 
> HIVE-6139.09.patch, HIVE-6139.10.patch
>
>
> Support column-scalar, scalar-column, and column-column versions for division 
> and modulo. Include unit tests.



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)

[jira] [Created] (HIVE-6333) Generate vectorized plan for decimal expressions.

2014-01-29 Thread Jitendra Nath Pandey (JIRA)

Jitendra Nath Pandey created HIVE-6333:
--

 Summary: Generate vectorized plan for decimal expressions.
 Key: HIVE-6333
 URL: https://issues.apache.org/jira/browse/HIVE-6333
 Project: Hive
  Issue Type: Sub-task
Reporter: Jitendra Nath Pandey
Assignee: Jitendra Nath Pandey






--
This message was sent by Atlassian JIRA
(v6.1.5#6160)

[jira] [Commented] (HIVE-5761) Implement vectorized support for the DATE data type

2014-01-29 Thread Jitendra Nath Pandey (JIRA)


[ 
https://issues.apache.org/jira/browse/HIVE-5761?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13885867#comment-13885867
 ] 

Jitendra Nath Pandey commented on HIVE-5761:


The patch needs to be rebased to latest trunk. HIVE-6178 and HIVE-6303 address 
the vectorized reader and writer for DATE, so this patch can just focus on DATE 
expressions.

> Implement vectorized support for the DATE data type
> ---
>
> Key: HIVE-5761
> URL: https://issues.apache.org/jira/browse/HIVE-5761
> Project: Hive
>  Issue Type: Sub-task
>Reporter: Eric Hanson
>Assignee: Teddy Choi
> Attachments: HIVE-5761.1.patch
>
>
> Add support to allow queries referencing DATE columns and expression results 
> to run efficiently in vectorized mode. This should re-use the code for the 
> the integer/timestamp types to the extent possible and beneficial. Include 
> unit tests and end-to-end tests. Consider re-using or extending existing 
> end-to-end tests for vectorized integer and/or timestamp operations.



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)

[jira] [Updated] (HIVE-6178) Implement vectorized reader for DECIMAL datatype for ORC format.

2014-01-30 Thread Jitendra Nath Pandey (JIRA)


 [ 
https://issues.apache.org/jira/browse/HIVE-6178?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Jitendra Nath Pandey updated HIVE-6178:
---

Status: Patch Available  (was: Open)

> Implement vectorized reader for DECIMAL datatype for ORC format.
> 
>
> Key: HIVE-6178
> URL: https://issues.apache.org/jira/browse/HIVE-6178
> Project: Hive
>  Issue Type: Sub-task
>Reporter: Jitendra Nath Pandey
>Assignee: Jitendra Nath Pandey
> Attachments: HIVE-6178.1.patch, HIVE-6178.2.patch, HIVE-6178.3.patch, 
> HIVE-6178.4.patch, HIVE-6178.5.patch
>
>
> Implement vectorized reader for DECIMAL datatype for ORC format.



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)

[jira] [Updated] (HIVE-6178) Implement vectorized reader for DECIMAL datatype for ORC format.

2014-01-30 Thread Jitendra Nath Pandey (JIRA)


 [ 
https://issues.apache.org/jira/browse/HIVE-6178?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Jitendra Nath Pandey updated HIVE-6178:
---

Status: Open  (was: Patch Available)

> Implement vectorized reader for DECIMAL datatype for ORC format.
> 
>
> Key: HIVE-6178
> URL: https://issues.apache.org/jira/browse/HIVE-6178
> Project: Hive
>  Issue Type: Sub-task
>Reporter: Jitendra Nath Pandey
>Assignee: Jitendra Nath Pandey
> Attachments: HIVE-6178.1.patch, HIVE-6178.2.patch, HIVE-6178.3.patch, 
> HIVE-6178.4.patch, HIVE-6178.5.patch
>
>
> Implement vectorized reader for DECIMAL datatype for ORC format.



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)

[jira] [Updated] (HIVE-6178) Implement vectorized reader for DECIMAL datatype for ORC format.

2014-01-30 Thread Jitendra Nath Pandey (JIRA)


 [ 
https://issues.apache.org/jira/browse/HIVE-6178?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Jitendra Nath Pandey updated HIVE-6178:
---

Attachment: HIVE-6178.5.patch

Uploading/Submitting same patch again to trigger jenkins.

> Implement vectorized reader for DECIMAL datatype for ORC format.
> 
>
> Key: HIVE-6178
> URL: https://issues.apache.org/jira/browse/HIVE-6178
> Project: Hive
>  Issue Type: Sub-task
>Reporter: Jitendra Nath Pandey
>Assignee: Jitendra Nath Pandey
> Attachments: HIVE-6178.1.patch, HIVE-6178.2.patch, HIVE-6178.3.patch, 
> HIVE-6178.4.patch, HIVE-6178.5.patch
>
>
> Implement vectorized reader for DECIMAL datatype for ORC format.



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)

[jira] [Commented] (HIVE-6257) Add more unit tests for high-precision Decimal128 arithmetic

2014-01-30 Thread Jitendra Nath Pandey (JIRA)


[ 
https://issues.apache.org/jira/browse/HIVE-6257?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13887403#comment-13887403
 ] 

Jitendra Nath Pandey commented on HIVE-6257:


+1

> Add more unit tests for high-precision Decimal128 arithmetic
> 
>
> Key: HIVE-6257
> URL: https://issues.apache.org/jira/browse/HIVE-6257
> Project: Hive
>  Issue Type: Sub-task
>Affects Versions: 0.13.0
>Reporter: Eric Hanson
>Assignee: Eric Hanson
>Priority: Minor
> Attachments: HIVE-6257.02.patch, HIVE-6257.03.patch, 
> HIVE-6257.04.patch
>
>
> Add more unit tests for high-precision Decimal128 arithmetic, with arguments 
> close to or at 38 digit limit. Consider some random stress tests for broader 
> coverage. Coverage is pretty good now (after HIVE-6243) for precision up to 
> about 18. This is to go beyond that.



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)

[jira] [Updated] (HIVE-6178) Implement vectorized reader for DECIMAL datatype for ORC format.

2014-01-31 Thread Jitendra Nath Pandey (JIRA)


 [ 
https://issues.apache.org/jira/browse/HIVE-6178?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Jitendra Nath Pandey updated HIVE-6178:
---

   Resolution: Fixed
Fix Version/s: 0.13.0
   Status: Resolved  (was: Patch Available)

Committed to trunk.

> Implement vectorized reader for DECIMAL datatype for ORC format.
> 
>
> Key: HIVE-6178
> URL: https://issues.apache.org/jira/browse/HIVE-6178
> Project: Hive
>  Issue Type: Sub-task
>Reporter: Jitendra Nath Pandey
>Assignee: Jitendra Nath Pandey
> Fix For: 0.13.0
>
> Attachments: HIVE-6178.1.patch, HIVE-6178.2.patch, HIVE-6178.3.patch, 
> HIVE-6178.4.patch, HIVE-6178.5.patch
>
>
> Implement vectorized reader for DECIMAL datatype for ORC format.



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)

[jira] [Updated] (HIVE-6303) VectorExpressionWriter for date and decimal datatypes.

2014-01-31 Thread Jitendra Nath Pandey (JIRA)


 [ 
https://issues.apache.org/jira/browse/HIVE-6303?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Jitendra Nath Pandey updated HIVE-6303:
---

Status: Patch Available  (was: Open)

> VectorExpressionWriter for date and decimal datatypes.
> --
>
> Key: HIVE-6303
> URL: https://issues.apache.org/jira/browse/HIVE-6303
> Project: Hive
>  Issue Type: Sub-task
>Reporter: Jitendra Nath Pandey
>Assignee: Jitendra Nath Pandey
> Attachments: HIVE-6303.1.patch
>
>
> VectorExpressionWriter for date and decimal datatypes.



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)

[jira] [Commented] (HIVE-6303) VectorExpressionWriter for date and decimal datatypes.

2014-01-31 Thread Jitendra Nath Pandey (JIRA)


[ 
https://issues.apache.org/jira/browse/HIVE-6303?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13888331#comment-13888331
 ] 

Jitendra Nath Pandey commented on HIVE-6303:


RB entry:
https://reviews.apache.org/r/17622/

> VectorExpressionWriter for date and decimal datatypes.
> --
>
> Key: HIVE-6303
> URL: https://issues.apache.org/jira/browse/HIVE-6303
> Project: Hive
>  Issue Type: Sub-task
>Reporter: Jitendra Nath Pandey
>Assignee: Jitendra Nath Pandey
> Attachments: HIVE-6303.1.patch
>
>
> VectorExpressionWriter for date and decimal datatypes.



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)

[jira] [Created] (HIVE-6349) Column name map is broken

2014-01-31 Thread Jitendra Nath Pandey (JIRA)

Jitendra Nath Pandey created HIVE-6349:
--

 Summary: Column name map is broken 
 Key: HIVE-6349
 URL: https://issues.apache.org/jira/browse/HIVE-6349
 Project: Hive
  Issue Type: Sub-task
Reporter: Jitendra Nath Pandey
Assignee: Jitendra Nath Pandey


Following query results in exception at run time in vector mode.

{code}
explain select n_name from supplier_orc s join ( select n_name, n_nationkey 
from nation_orc n join region_orc r on n.n_regionkey = r.r_regionkey and 
r.r_name = 'XYZ') n1 on s.s_nationkey = n1.n_nationkey;
{code}

Here n_name is a string and all other fields are int.

The stack trace:
{code}
java.lang.RuntimeException: Hive Runtime Error while closing operators
at 
org.apache.hadoop.hive.ql.exec.mr.ExecMapper.close(ExecMapper.java:260)
at org.apache.hadoop.mapred.MapRunner.run(MapRunner.java:57)
at org.apache.hadoop.mapred.MapTask.runOldMapper(MapTask.java:429)
at org.apache.hadoop.mapred.MapTask.run(MapTask.java:365)
at org.apache.hadoop.mapred.Child$4.run(Child.java:255)
at java.security.AccessController.doPrivileged(Native Method)
at javax.security.auth.Subject.doAs(Subject.java:396)
at 
org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1190)
at org.apache.hadoop.mapred.Child.main(Child.java:249)
Caused by: java.lang.ClassCastException: 
org.apache.hadoop.hive.ql.exec.vector.BytesColumnVector cannot be cast to 
org.apache.hadoop.hive.ql.exec.vector.LongColumnVector
at 
org.apache.hadoop.hive.ql.exec.vector.VectorHashKeyWrapperBatch.evaluateBatch(VectorHashKeyWrapperBatch.java:116)
at 
org.apache.hadoop.hive.ql.exec.vector.VectorMapJoinOperator.processOp(VectorMapJoinOperator.java:280)
at org.apache.hadoop.hive.ql.exec.Operator.forward(Operator.java:790)
at 
org.apache.hadoop.hive.ql.exec.vector.VectorSelectOperator.processOp(VectorSelectOperator.java:133)
at org.apache.hadoop.hive.ql.exec.Operator.forward(Operator.java:790)
at 
org.apache.hadoop.hive.ql.exec.vector.VectorMapJoinOperator.flushOutput(VectorMapJoinOperator.java:246)
at 
org.apache.hadoop.hive.ql.exec.vector.VectorMapJoinOperator.closeOp(VectorMapJoinOperator.java:253)
at org.apache.hadoop.hive.ql.exec.Operator.close(Operator.java:574)
at org.apache.hadoop.hive.ql.exec.Operator.close(Operator.java:585)
at 
org.apache.hadoop.hive.ql.exec.mr.ExecMapper.close(ExecMapper.java:234)
... 8 more
{code}








--
This message was sent by Atlassian JIRA
(v6.1.5#6160)

[jira] [Updated] (HIVE-6333) Generate vectorized plan for decimal expressions.

2014-02-03 Thread Jitendra Nath Pandey (JIRA)


 [ 
https://issues.apache.org/jira/browse/HIVE-6333?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Jitendra Nath Pandey updated HIVE-6333:
---

Description: Transform non-vector plan to vectorized plan for supported 
decimal expressions. 

> Generate vectorized plan for decimal expressions.
> -
>
> Key: HIVE-6333
> URL: https://issues.apache.org/jira/browse/HIVE-6333
> Project: Hive
>  Issue Type: Sub-task
>Reporter: Jitendra Nath Pandey
>Assignee: Jitendra Nath Pandey
> Attachments: HIVE-6333.1.patch
>
>
> Transform non-vector plan to vectorized plan for supported decimal 
> expressions. 



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)

[jira] [Updated] (HIVE-6333) Generate vectorized plan for decimal expressions.

2014-02-03 Thread Jitendra Nath Pandey (JIRA)


 [ 
https://issues.apache.org/jira/browse/HIVE-6333?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Jitendra Nath Pandey updated HIVE-6333:
---

Attachment: HIVE-6333.1.patch

An early version of the patch. 

> Generate vectorized plan for decimal expressions.
> -
>
> Key: HIVE-6333
> URL: https://issues.apache.org/jira/browse/HIVE-6333
> Project: Hive
>  Issue Type: Sub-task
>Reporter: Jitendra Nath Pandey
>Assignee: Jitendra Nath Pandey
> Attachments: HIVE-6333.1.patch
>
>




--
This message was sent by Atlassian JIRA
(v6.1.5#6160)

[jira] [Updated] (HIVE-6333) Generate vectorized plan for decimal expressions.

2014-02-05 Thread Jitendra Nath Pandey (JIRA)


 [ 
https://issues.apache.org/jira/browse/HIVE-6333?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Jitendra Nath Pandey updated HIVE-6333:
---

Attachment: HIVE-6333.2.patch

Another iteration of patch.

> Generate vectorized plan for decimal expressions.
> -
>
> Key: HIVE-6333
> URL: https://issues.apache.org/jira/browse/HIVE-6333
> Project: Hive
>  Issue Type: Sub-task
>Reporter: Jitendra Nath Pandey
>Assignee: Jitendra Nath Pandey
> Attachments: HIVE-6333.1.patch, HIVE-6333.2.patch
>
>
> Transform non-vector plan to vectorized plan for supported decimal 
> expressions. 



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)

[jira] [Updated] (HIVE-6333) Generate vectorized plan for decimal expressions.

2014-02-05 Thread Jitendra Nath Pandey (JIRA)


 [ 
https://issues.apache.org/jira/browse/HIVE-6333?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Jitendra Nath Pandey updated HIVE-6333:
---

Attachment: HIVE-6333.3.patch

> Generate vectorized plan for decimal expressions.
> -
>
> Key: HIVE-6333
> URL: https://issues.apache.org/jira/browse/HIVE-6333
> Project: Hive
>  Issue Type: Sub-task
>Reporter: Jitendra Nath Pandey
>Assignee: Jitendra Nath Pandey
> Attachments: HIVE-6333.1.patch, HIVE-6333.2.patch, HIVE-6333.3.patch
>
>
> Transform non-vector plan to vectorized plan for supported decimal 
> expressions. 



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)

[jira] [Commented] (HIVE-6333) Generate vectorized plan for decimal expressions.

2014-02-05 Thread Jitendra Nath Pandey (JIRA)


[ 
https://issues.apache.org/jira/browse/HIVE-6333?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13892843#comment-13892843
 ] 

Jitendra Nath Pandey commented on HIVE-6333:


Another version of the patch is uploaded. This patch is ready for review and is 
submitted for jenkins.

> Generate vectorized plan for decimal expressions.
> -
>
> Key: HIVE-6333
> URL: https://issues.apache.org/jira/browse/HIVE-6333
> Project: Hive
>  Issue Type: Sub-task
>Reporter: Jitendra Nath Pandey
>Assignee: Jitendra Nath Pandey
> Attachments: HIVE-6333.1.patch, HIVE-6333.2.patch, HIVE-6333.3.patch
>
>
> Transform non-vector plan to vectorized plan for supported decimal 
> expressions. 



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)

[jira] [Updated] (HIVE-6333) Generate vectorized plan for decimal expressions.

2014-02-05 Thread Jitendra Nath Pandey (JIRA)


 [ 
https://issues.apache.org/jira/browse/HIVE-6333?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Jitendra Nath Pandey updated HIVE-6333:
---

Status: Patch Available  (was: Open)

> Generate vectorized plan for decimal expressions.
> -
>
> Key: HIVE-6333
> URL: https://issues.apache.org/jira/browse/HIVE-6333
> Project: Hive
>  Issue Type: Sub-task
>Reporter: Jitendra Nath Pandey
>Assignee: Jitendra Nath Pandey
> Attachments: HIVE-6333.1.patch, HIVE-6333.2.patch, HIVE-6333.3.patch
>
>
> Transform non-vector plan to vectorized plan for supported decimal 
> expressions. 



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)

[jira] [Commented] (HIVE-6303) VectorExpressionWriter for date and decimal datatypes.

2014-02-05 Thread Jitendra Nath Pandey (JIRA)


[ 
https://issues.apache.org/jira/browse/HIVE-6303?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13892846#comment-13892846
 ] 

Jitendra Nath Pandey commented on HIVE-6303:


This patch is incorporated in HIVE-6333 because that patch effectively fixes 
the test failure.

> VectorExpressionWriter for date and decimal datatypes.
> --
>
> Key: HIVE-6303
> URL: https://issues.apache.org/jira/browse/HIVE-6303
> Project: Hive
>  Issue Type: Sub-task
>Reporter: Jitendra Nath Pandey
>Assignee: Jitendra Nath Pandey
> Attachments: HIVE-6303.1.patch
>
>
> VectorExpressionWriter for date and decimal datatypes.



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)

[jira] [Updated] (HIVE-6303) VectorExpressionWriter for date and decimal datatypes.

2014-02-05 Thread Jitendra Nath Pandey (JIRA)


 [ 
https://issues.apache.org/jira/browse/HIVE-6303?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Jitendra Nath Pandey updated HIVE-6303:
---

Resolution: Later
Status: Resolved  (was: Patch Available)

> VectorExpressionWriter for date and decimal datatypes.
> --
>
> Key: HIVE-6303
> URL: https://issues.apache.org/jira/browse/HIVE-6303
> Project: Hive
>  Issue Type: Sub-task
>Reporter: Jitendra Nath Pandey
>Assignee: Jitendra Nath Pandey
> Attachments: HIVE-6303.1.patch
>
>
> VectorExpressionWriter for date and decimal datatypes.



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)

[jira] [Commented] (HIVE-6333) Generate vectorized plan for decimal expressions.

2014-02-05 Thread Jitendra Nath Pandey (JIRA)


[ 
https://issues.apache.org/jira/browse/HIVE-6333?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13892849#comment-13892849
 ] 

Jitendra Nath Pandey commented on HIVE-6333:


Review board entry: https://reviews.apache.org/r/17769/

> Generate vectorized plan for decimal expressions.
> -
>
> Key: HIVE-6333
> URL: https://issues.apache.org/jira/browse/HIVE-6333
> Project: Hive
>  Issue Type: Sub-task
>Reporter: Jitendra Nath Pandey
>Assignee: Jitendra Nath Pandey
> Attachments: HIVE-6333.1.patch, HIVE-6333.2.patch, HIVE-6333.3.patch
>
>
> Transform non-vector plan to vectorized plan for supported decimal 
> expressions. 



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)

[jira] [Assigned] (HIVE-5759) Implement vectorized support for COALESCE conditional expression

2014-02-06 Thread Jitendra Nath Pandey (JIRA)


 [ 
https://issues.apache.org/jira/browse/HIVE-5759?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Jitendra Nath Pandey reassigned HIVE-5759:
--

Assignee: Jitendra Nath Pandey

> Implement vectorized support for COALESCE conditional expression
> 
>
> Key: HIVE-5759
> URL: https://issues.apache.org/jira/browse/HIVE-5759
> Project: Hive
>  Issue Type: Sub-task
>Reporter: Eric Hanson
>Assignee: Jitendra Nath Pandey
>
> Implement full, end-to-end support for COALESCE in vectorized mode, including 
> new VectorExpression class(es), VectorizationContext translation to a 
> VectorExpression, and unit tests for these, as well as end-to-end ad hoc 
> testing. An end-to-end .q test is recommended.
> This is lower priority than IF and CASE but it is still a fairly popular 
> expression.



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)

[jira] [Created] (HIVE-4160) Vectorized Query Execution in Hive

2013-03-13 Thread Jitendra Nath Pandey (JIRA)

Jitendra Nath Pandey created HIVE-4160:
--

 Summary: Vectorized Query Execution in Hive
 Key: HIVE-4160
 URL: https://issues.apache.org/jira/browse/HIVE-4160
 Project: Hive
  Issue Type: New Feature
Reporter: Jitendra Nath Pandey
Assignee: Jitendra Nath Pandey


  Hive query execution engine currently processes one row at a time. A single 
row of data goes through all the operators before next row can be processed. 
This mode of processing is very inefficient in terms of CPU usage. Research has 
demonstrated that this yields very low instructions per cycle [MonetDB]. Also 
currently hive heavily relies on lazy deserialization and data columns go 
through a layer of object inspectors that identify column type, de-serialize 
data and determine appropriate expression routines in the inner loop. These 
layers of virtual method calls further slow down the processing.

Reference: http://www-db.cs.wisc.edu/cidr/cidr2005/papers/P19.pdf



--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

[jira] [Commented] (HIVE-4160) Vectorized Query Execution in Hive

2013-03-13 Thread Jitendra Nath Pandey (JIRA)


[ 
https://issues.apache.org/jira/browse/HIVE-4160?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13601558#comment-13601558
 ] 

Jitendra Nath Pandey commented on HIVE-4160:


This will be an incremental work in multiple phases with no regression on 
current system. We will publish a design/scope document very soon.
The main idea behind the proposal is to transform the execution engine to 
process a row batch at a time instead of a single row. The row batch will 
consist of column vectors and each operator will process the whole column 
vector at a time. The column vector will consist of array(s) of primitive types 
as far as possible.
The expressions will be implemented for various data types using 
pre-compiled templates. The appropriate expressions will be added to the 
operators based on data types.
A vectorized iterator interface will be implemented by the file formats to 
provide vectorized input to the operator tree. 


> Vectorized Query Execution in Hive
> --
>
> Key: HIVE-4160
> URL: https://issues.apache.org/jira/browse/HIVE-4160
> Project: Hive
>  Issue Type: New Feature
>Reporter: Jitendra Nath Pandey
>Assignee: Jitendra Nath Pandey
>
>   Hive query execution engine currently processes one row at a time. A single 
> row of data goes through all the operators before next row can be processed. 
> This mode of processing is very inefficient in terms of CPU usage. Research 
> has demonstrated that this yields very low instructions per cycle [MonetDB]. 
> Also currently hive heavily relies on lazy deserialization and data columns 
> go through a layer of object inspectors that identify column type, 
> de-serialize data and determine appropriate expression routines in the inner 
> loop. These layers of virtual method calls further slow down the processing.
> Reference: http://www-db.cs.wisc.edu/cidr/cidr2005/papers/P19.pdf

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

[jira] [Updated] (HIVE-4160) Vectorized Query Execution in Hive

2013-03-13 Thread Jitendra Nath Pandey (JIRA)


 [ 
https://issues.apache.org/jira/browse/HIVE-4160?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Jitendra Nath Pandey updated HIVE-4160:
---

Description: 
  Hive query execution engine currently processes one row at a time. A single 
row of data goes through all the operators before next row can be processed. 
This mode of processing is very inefficient in terms of CPU usage. Research has 
demonstrated that this yields very low instructions per cycle [MonetDB]. Also 
currently hive heavily relies on lazy deserialization and data columns go 
through a layer of object inspectors that identify column type, de-serialize 
data and determine appropriate expression routines in the inner loop. These 
layers of virtual method calls further slow down the processing.




  was:
  Hive query execution engine currently processes one row at a time. A single 
row of data goes through all the operators before next row can be processed. 
This mode of processing is very inefficient in terms of CPU usage. Research has 
demonstrated that this yields very low instructions per cycle [MonetDB]. Also 
currently hive heavily relies on lazy deserialization and data columns go 
through a layer of object inspectors that identify column type, de-serialize 
data and determine appropriate expression routines in the inner loop. These 
layers of virtual method calls further slow down the processing.

Reference: http://www-db.cs.wisc.edu/cidr/cidr2005/papers/P19.pdf




> Vectorized Query Execution in Hive
> --
>
> Key: HIVE-4160
> URL: https://issues.apache.org/jira/browse/HIVE-4160
> Project: Hive
>  Issue Type: New Feature
>Reporter: Jitendra Nath Pandey
>Assignee: Jitendra Nath Pandey
>
>   Hive query execution engine currently processes one row at a time. A single 
> row of data goes through all the operators before next row can be processed. 
> This mode of processing is very inefficient in terms of CPU usage. Research 
> has demonstrated that this yields very low instructions per cycle [MonetDB]. 
> Also currently hive heavily relies on lazy deserialization and data columns 
> go through a layer of object inspectors that identify column type, 
> de-serialize data and determine appropriate expression routines in the inner 
> loop. These layers of virtual method calls further slow down the processing.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

[jira] [Commented] (HIVE-4160) Vectorized Query Execution in Hive

2013-03-13 Thread Jitendra Nath Pandey (JIRA)


[ 
https://issues.apache.org/jira/browse/HIVE-4160?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13601560#comment-13601560
 ] 

Jitendra Nath Pandey commented on HIVE-4160:


Reference on MonetDB: http://www-db.cs.wisc.edu/cidr/cidr2005/papers/P19.pdf

> Vectorized Query Execution in Hive
> --
>
> Key: HIVE-4160
> URL: https://issues.apache.org/jira/browse/HIVE-4160
> Project: Hive
>  Issue Type: New Feature
>Reporter: Jitendra Nath Pandey
>Assignee: Jitendra Nath Pandey
>
>   Hive query execution engine currently processes one row at a time. A single 
> row of data goes through all the operators before next row can be processed. 
> This mode of processing is very inefficient in terms of CPU usage. Research 
> has demonstrated that this yields very low instructions per cycle [MonetDB]. 
> Also currently hive heavily relies on lazy deserialization and data columns 
> go through a layer of object inspectors that identify column type, 
> de-serialize data and determine appropriate expression routines in the inner 
> loop. These layers of virtual method calls further slow down the processing.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

[jira] [Updated] (HIVE-4160) Vectorized Query Execution in Hive

2013-03-18 Thread Jitendra Nath Pandey (JIRA)


 [ 
https://issues.apache.org/jira/browse/HIVE-4160?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Jitendra Nath Pandey updated HIVE-4160:
---

Attachment: Hive-Vectorized-Query-Execution-Design.docx

The attached document covers the outline of the design. Any comments/feedback 
are welcome. We will keep updating the document with more details as we include 
more data types, operators and expressions. We will also include the vectorized 
iterator design into the document.

> Vectorized Query Execution in Hive
> --
>
> Key: HIVE-4160
> URL: https://issues.apache.org/jira/browse/HIVE-4160
> Project: Hive
>  Issue Type: New Feature
>Reporter: Jitendra Nath Pandey
>Assignee: Jitendra Nath Pandey
> Attachments: Hive-Vectorized-Query-Execution-Design.docx
>
>
>   Hive query execution engine currently processes one row at a time. A single 
> row of data goes through all the operators before next row can be processed. 
> This mode of processing is very inefficient in terms of CPU usage. Research 
> has demonstrated that this yields very low instructions per cycle [MonetDB]. 
> Also currently hive heavily relies on lazy deserialization and data columns 
> go through a layer of object inspectors that identify column type, 
> de-serialize data and determine appropriate expression routines in the inner 
> loop. These layers of virtual method calls further slow down the processing.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

[jira] [Created] (HIVE-4282) Implement vectorized arithmetic expressions.

2013-04-02 Thread Jitendra Nath Pandey (JIRA)

Jitendra Nath Pandey created HIVE-4282:
--

 Summary: Implement vectorized arithmetic expressions.
 Key: HIVE-4282
 URL: https://issues.apache.org/jira/browse/HIVE-4282
 Project: Hive
  Issue Type: Sub-task
Reporter: Jitendra Nath Pandey
Assignee: Jitendra Nath Pandey


Implement arithmetic expressions that operate on vectors of columns.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

[jira] [Created] (HIVE-4283) Implement vectorized logical expressions.

2013-04-02 Thread Jitendra Nath Pandey (JIRA)

Jitendra Nath Pandey created HIVE-4283:
--

 Summary: Implement vectorized logical expressions.
 Key: HIVE-4283
 URL: https://issues.apache.org/jira/browse/HIVE-4283
 Project: Hive
  Issue Type: Sub-task
Reporter: Jitendra Nath Pandey


Implement logical expressions that operate on column vectors.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

[jira] [Assigned] (HIVE-4283) Implement vectorized logical expressions.

2013-04-02 Thread Jitendra Nath Pandey (JIRA)


 [ 
https://issues.apache.org/jira/browse/HIVE-4283?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Jitendra Nath Pandey reassigned HIVE-4283:
--

Assignee: Jitendra Nath Pandey

> Implement vectorized logical expressions.
> -
>
> Key: HIVE-4283
> URL: https://issues.apache.org/jira/browse/HIVE-4283
> Project: Hive
>  Issue Type: Sub-task
>Reporter: Jitendra Nath Pandey
>Assignee: Jitendra Nath Pandey
>
> Implement logical expressions that operate on column vectors.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

[jira] [Created] (HIVE-4285) Implement classes for column vectors.

2013-04-02 Thread Jitendra Nath Pandey (JIRA)

Jitendra Nath Pandey created HIVE-4285:
--

 Summary: Implement classes for column vectors.
 Key: HIVE-4285
 URL: https://issues.apache.org/jira/browse/HIVE-4285
 Project: Hive
  Issue Type: Sub-task
Reporter: Jitendra Nath Pandey
Assignee: Eric Hanson


Implement classes for column vectors.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

[jira] [Created] (HIVE-4284) Implement class for vectorized row groups.

2013-04-02 Thread Jitendra Nath Pandey (JIRA)

Jitendra Nath Pandey created HIVE-4284:
--

 Summary: Implement class for vectorized row groups.
 Key: HIVE-4284
 URL: https://issues.apache.org/jira/browse/HIVE-4284
 Project: Hive
  Issue Type: Sub-task
Reporter: Jitendra Nath Pandey
Assignee: Eric Hanson




--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

[jira] [Updated] (HIVE-4284) Implement class for vectorized row group.

2013-04-02 Thread Jitendra Nath Pandey (JIRA)


 [ 
https://issues.apache.org/jira/browse/HIVE-4284?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Jitendra Nath Pandey updated HIVE-4284:
---

Summary: Implement class for vectorized row group.  (was: Implement class 
for vectorized row groups.)

> Implement class for vectorized row group.
> -
>
> Key: HIVE-4284
> URL: https://issues.apache.org/jira/browse/HIVE-4284
> Project: Hive
>  Issue Type: Sub-task
>Reporter: Jitendra Nath Pandey
>Assignee: Eric Hanson
>


--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

[jira] [Updated] (HIVE-4284) Implement class for vectorized row group.

2013-04-02 Thread Jitendra Nath Pandey (JIRA)


 [ 
https://issues.apache.org/jira/browse/HIVE-4284?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Jitendra Nath Pandey updated HIVE-4284:
---

Description: Vectorized row group object will represent the row group that 
vectorized operators will work on.

> Implement class for vectorized row group.
> -
>
> Key: HIVE-4284
> URL: https://issues.apache.org/jira/browse/HIVE-4284
> Project: Hive
>  Issue Type: Sub-task
>Reporter: Jitendra Nath Pandey
>Assignee: Eric Hanson
>
> Vectorized row group object will represent the row group that vectorized 
> operators will work on.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

[jira] [Commented] (HIVE-4284) Implement class for vectorized row batch

2013-04-15 Thread Jitendra Nath Pandey (JIRA)


[ 
https://issues.apache.org/jira/browse/HIVE-4284?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13631971#comment-13631971
 ] 

Jitendra Nath Pandey commented on HIVE-4284:


A few comments on the patch.
- Some files seem to have tab characters, the indentation is longer than two 
spaces.
- Please limit the length of a line to 80 chars.
- Javadocs are not correctly formatted in many places, and please make sure all 
public methods have a javadoc.
- There are some test fields in BytesColumnVector, can that be avoided?
- There is code to load random data into VectorizedRowBatch for testing. Since 
these are public classes, its better to provide test utilities to load random 
data instead of putting code in the classes themselves.

> Implement class for vectorized row batch
> 
>
> Key: HIVE-4284
> URL: https://issues.apache.org/jira/browse/HIVE-4284
> Project: Hive
>  Issue Type: Sub-task
>Reporter: Jitendra Nath Pandey
>Assignee: Eric Hanson
> Attachments: HIVE-4284.1.patch
>
>
> Vectorized row batch object will represent the row batch that vectorized 
> operators will work on. Refer to design spec attached to HIVE-4160 for 
> details.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

[jira] [Assigned] (HIVE-4370) Change ORC tree readers to return batches of rows instead of a row

2013-04-16 Thread Jitendra Nath Pandey (JIRA)


 [ 
https://issues.apache.org/jira/browse/HIVE-4370?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Jitendra Nath Pandey reassigned HIVE-4370:
--

Assignee: Sarvesh Sakalanaga

> Change ORC tree readers to return batches of rows instead of a row 
> ---
>
> Key: HIVE-4370
> URL: https://issues.apache.org/jira/browse/HIVE-4370
> Project: Hive
>  Issue Type: Sub-task
>Reporter: Sarvesh Sakalanaga
>Assignee: Sarvesh Sakalanaga
>
> Change ORC Record reader and Tree readers to return a set of Rows instead of 
> a row. 

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

[jira] [Updated] (HIVE-4282) Implement vectorized column-scalar expressions

2013-04-18 Thread Jitendra Nath Pandey (JIRA)


 [ 
https://issues.apache.org/jira/browse/HIVE-4282?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Jitendra Nath Pandey updated HIVE-4282:
---

Summary: Implement vectorized column-scalar expressions  (was: Implement 
vectorized arithmetic expressions.)

> Implement vectorized column-scalar expressions
> --
>
> Key: HIVE-4282
> URL: https://issues.apache.org/jira/browse/HIVE-4282
> Project: Hive
>  Issue Type: Sub-task
>Reporter: Jitendra Nath Pandey
>Assignee: Jitendra Nath Pandey
>
> Implement arithmetic expressions that operate on vectors of columns.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

1 2 3 4 5 6 7 8 9 >

1 - 100 of 829 matches

Mail list logo