[ 
https://issues.apache.org/jira/browse/DRILL-6737?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16833051#comment-16833051
 ] 

mehran commented on DRILL-6737:
-------------------------------

I checked the problem in the 1.16 release, and it is not a Parquet writer issue.

 

There are two steps at the end of the plan that each take about 2 minutes to finish: 
PROJECT_ALLOW_DUP and PROJECT.
{panel:bgColor=#ffffff}
{panel:bgColor=#f5f5f5}
[Overview|http://10.233.50.111:8047/profiles/23329968-0346-e658-b1c9-92fd7dc60d2a#operator-overview]{panel}
{panel}
||Operator ID||Type||Avg Setup Time||Max Setup Time||Avg Process Time||Max Process Time||Min Wait Time||Avg Wait Time||Max Wait Time||% Fragment Time||% Query Time||Rows||Avg Peak Memory||Max Peak Memory||
|00-xx-00|SCREEN|0.000s|0.000s|1.262s|2.510s|0.004s|0.075s|0.145s|0.82%|0.82%|111,491|10MB|20MB|
|00-xx-01|PROJECT|0.002s|0.002s|0.001s|0.001s|0.000s|0.000s|0.000s|0.00%|0.00%|1|-|-|
|00-xx-02|PARQUET_WRITER|0.293s|0.293s|50.750s|50.750s|0.000s|0.000s|0.000s|16.44%|16.44%|111,490|-|-|
|00-xx-03|PROJECT_ALLOW_DUP|0.032s|0.032s|2m0s|2m0s|0.000s|0.000s|0.000s|39.01%|39.01%|111,490|13MB|13MB|
|00-xx-04|PROJECT|16.092s|16.092s|2m15s|2m15s|0.000s|0.000s|0.000s|43.73%|43.73%|111,490|13MB|13MB|
{panel}
{panel}
 

I do not know what these steps do after the Parquet writer has finished, but they take a strangely long time to run.
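
To put the operator table in perspective, here is a minimal Python sketch (not part of Drill) that converts the Avg Process Time values copied from the profile into seconds and computes each operator's share of the summed process time. The helper names `to_seconds` and `process_time_shares` are hypothetical, and the duration pattern only assumes the `NmNs` and `N.NNNs` forms that appear in this profile. Because the profile shows rounded times, the shares may differ slightly from the % Query Time column.

```python
import re

# Avg Process Time values copied from the operator profile in this comment.
PROCESS_TIMES = {
    "SCREEN": "1.262s",
    "PROJECT (00-xx-01)": "0.001s",
    "PARQUET_WRITER": "50.750s",
    "PROJECT_ALLOW_DUP": "2m0s",
    "PROJECT (00-xx-04)": "2m15s",
}


def to_seconds(text):
    """Convert a duration such as '2m15s' or '50.750s' to seconds.

    Assumption: only the minute/second forms seen in this profile are handled.
    """
    match = re.fullmatch(r"(?:(\d+)m)?(\d+(?:\.\d+)?)s", text)
    if match is None:
        raise ValueError(f"unrecognized duration: {text!r}")
    minutes = int(match.group(1) or 0)
    return minutes * 60 + float(match.group(2))


def process_time_shares(times):
    """Return each operator's percentage of the summed process time."""
    seconds = {name: to_seconds(value) for name, value in times.items()}
    total = sum(seconds.values())
    return {name: 100.0 * value / total for name, value in seconds.items()}


if __name__ == "__main__":
    shares = process_time_shares(PROCESS_TIMES)
    # Print operators from the largest share to the smallest.
    for name, share in sorted(shares.items(), key=lambda item: -item[1]):
        print(f"{name:<20} {share:5.1f}%")
```

With the values above, PROJECT_ALLOW_DUP and PROJECT (00-xx-04) together account for more than 80% of the summed process time, while PARQUET_WRITER accounts for well under a quarter.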


> Ctas json to Parquet is very very slow
> --------------------------------------
>
>                 Key: DRILL-6737
>                 URL: https://issues.apache.org/jira/browse/DRILL-6737
>             Project: Apache Drill
>          Issue Type: Bug
>          Components: Query Planning & Optimization
>    Affects Versions: 1.14.0
>            Reporter: mehran
>            Assignee: salim achouche
>            Priority: Critical
>         Attachments: drill.bmp
>
>
> It takes 5 minutes to convert a JSON file to Parquet, whereas in 1.13 it took 10 
> seconds. It seems to be a blocker bug.
> In the plan, it is the Parquet writer that takes this long.
>  



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)
