[ https://issues.apache.org/jira/browse/IMPALA-12737?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Michael Smith updated IMPALA-12737: ----------------------------------- Description: In the Impala query log table where completed queries are stored, add lists of columns that were referenced in the query. The purpose behind this functionality is to know which columns are part of * Select clause * Where clause * Join clause * Aggegrate clause * Order by clause There should be a column for each type of clause, so that decisions can be made based on specific usage or on the union of those clauses. With this information, we will feed into compute stats command to collect stats only on the required columns that are using in joins / filters and aggegrates and not on all the table columns. The information can be collected as an array of [db1.table1.column1,db1.table1.column2] was: In the Impala query log table where completed queries are stored, add lists of columns that were referenced in the query. The purpose behind this functionality is to know which columns are part of * Select clause * Where clause * Join clause * Aggegrate clause There should be a column for each type of clause, so that decisions can be made based on specific usage or on the union of those clauses. With this information, we will feed into compute stats command to collect stats only on the required columns that are using in joins / filters and aggegrates and not on all the table columns. The information can be collected as an array of [db1.table1.column1,db1.table1.column2] > Include List of Referenced Columns in Query Log Table > ----------------------------------------------------- > > Key: IMPALA-12737 > URL: https://issues.apache.org/jira/browse/IMPALA-12737 > Project: IMPALA > Issue Type: Bug > Reporter: Manish Maheshwari > Assignee: Michael Smith > Priority: Major > Labels: workload-management > > In the Impala query log table where completed queries are stored, add lists > of columns that were referenced in the query. The purpose behind this > functionality is to know which columns are part of > * Select clause > * Where clause > * Join clause > * Aggegrate clause > * Order by clause > There should be a column for each type of clause, so that decisions can be > made based on specific usage or on the union of those clauses. > With this information, we will feed into compute stats command to collect > stats only on the required columns that are using in joins / filters and > aggegrates and not on all the table columns. > The information can be collected as an array of > [db1.table1.column1,db1.table1.column2] > -- This message was sent by Atlassian Jira (v8.20.10#820010) --------------------------------------------------------------------- To unsubscribe, e-mail: issues-all-unsubscr...@impala.apache.org For additional commands, e-mail: issues-all-h...@impala.apache.org