[ 
https://issues.apache.org/jira/browse/DRILL-6453?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16531980#comment-16531980
 ] 

Khurram Faraaz commented on DRILL-6453:
---------------------------------------

[~ben-zvi] [~priteshm]

Apache Drill 1.14.0 on a 4 node cluster, TPC-DS query 72 fails (in Canceled 
state) after running for 2hrs and 11 mins 
and we see the same Exception as before towards the end of the drillbit.log 
file.


git.commit.id.abbrev=f481a7c

{noformat}
message: "SYSTEM ERROR: IllegalStateException: Record count not set for this 
vector container\n\nFragment 4:87\n\n[Error Id: 
ed305d45-742f-48df-b1ad-6813bb5fdfc4 on qa102-48.qa.lab:31010]"
 exception {
 exception_class: "java.lang.IllegalStateException"
 message: "Record count not set for this vector container"
 stack_trace {
 class_name: "com.google.common.base.Preconditions"
 file_name: "Preconditions.java"
 line_number: 173
 method_name: "checkState"
 is_native_method: false
 }
 stack_trace {
 class_name: "org.apache.drill.exec.record.VectorContainer"
 file_name: "VectorContainer.java"
 line_number: 394
 method_name: "getRecordCount"
 is_native_method: false
 }
 stack_trace {
 class_name: "org.apache.drill.exec.physical.impl.svremover.RemovingRecordBatch"
 file_name: "RemovingRecordBatch.java"
 line_number: 49
 method_name: "getRecordCount"
 is_native_method: false
 }
 stack_trace {
 class_name: "org.apache.drill.exec.record.RecordBatchSizer"
 file_name: "RecordBatchSizer.java"
 line_number: 714
 method_name: "<init>"
 is_native_method: false
 }
 stack_trace {
 class_name: "org.apache.drill.exec.record.RecordBatchSizer"
 file_name: "RecordBatchSizer.java"
 line_number: 686
 method_name: "<init>"
 is_native_method: false
 }
 stack_trace {
 class_name{ "org.apache.drill.exec.record.JoinBatchMemoryManager"
 file_name: "JoinBatchMemoryManager.java"
 line_number: 74
 method_name: "update"

{noformat}

> TPC-DS query 72 has regressed
> -----------------------------
>
>                 Key: DRILL-6453
>                 URL: https://issues.apache.org/jira/browse/DRILL-6453
>             Project: Apache Drill
>          Issue Type: Bug
>          Components: Execution - Flow
>    Affects Versions: 1.14.0
>            Reporter: Khurram Faraaz
>            Assignee: Boaz Ben-Zvi
>            Priority: Blocker
>             Fix For: 1.14.0
>
>         Attachments: 24f75b18-014a-fb58-21d2-baeab5c3352c.sys.drill
>
>
> TPC-DS query 72 seems to have regressed, query profile for the case where it 
> Canceled after 2 hours on Drill 1.14.0 is attached here.
> {noformat}
> On, Drill 1.14.0-SNAPSHOT 
> commit : 931b43e (TPC-DS query 72 executed successfully on this commit, took 
> around 55 seconds to execute)
> SF1 parquet data on 4 nodes; 
> planner.memory.max_query_memory_per_node = 10737418240. 
> drill.exec.hashagg.fallback.enabled = true
> TPC-DS query 72 executed successfully & took 47 seconds to complete execution.
> {noformat}
> {noformat}
> TPC-DS data in the below run has date values stored as DATE datatype and not 
> VARCHAR type
> On, Drill 1.14.0-SNAPSHOT
> commit : 82e1a12
> SF1 parquet data on 4 nodes; 
> planner.memory.max_query_memory_per_node = 10737418240. 
> drill.exec.hashagg.fallback.enabled = true
> and
> alter system set `exec.hashjoin.num_partitions` = 1;
> TPC-DS query 72 executed for 2 hrs and 11 mins and did not complete, I had to 
> Cancel it by stopping the Foreman drillbit.
> As a result several minor fragments are reported to be in 
> CANCELLATION_REQUESTED state on UI.
> {noformat}



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

Reply via email to