[jira] [Updated] (HIVE-3917) Support noscan operation for analyze command

2013-02-03 Thread Namit Jain (JIRA)

 [ 
https://issues.apache.org/jira/browse/HIVE-3917?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Namit Jain updated HIVE-3917:
-

   Resolution: Fixed
Fix Version/s: 0.11.0
 Hadoop Flags: Reviewed
   Status: Resolved  (was: Patch Available)

Committed. Thanks Tim

 Support noscan operation for analyze command
 

 Key: HIVE-3917
 URL: https://issues.apache.org/jira/browse/HIVE-3917
 Project: Hive
  Issue Type: Improvement
  Components: Statistics
Affects Versions: 0.11.0
Reporter: Gang Tim Liu
Assignee: Gang Tim Liu
 Fix For: 0.11.0

 Attachments: HIVE-3917.patch.1, HIVE-3917.patch.2, HIVE-3917.patch.3, 
 HIVE-3917.patch.4


 Hive supports the ANALYZE command to gather statistics from existing 
 tables and partitions: 
 https://cwiki.apache.org/confluence/display/Hive/StatsDev#StatsDev-ExistingTables
 It collects:
 1. Number of rows
 2. Number of files
 3. Size in bytes
 If a table or partition is big, the operation takes time because it opens 
 all files and scans all data.
 It would be nice to support a fast operation that gathers only the statistics 
 which do not require opening all files:
 1. Number of files
 2. Size in bytes
 Potential syntax is 
 ANALYZE TABLE tablename [PARTITION(partcol1[=val1], partcol2[=val2], ...)] 
 COMPUTE STATISTICS [noscan];
 In the future, all statistics that can be gathered without a scan could be 
 retrieved via this optional parameter.
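
 The idea behind noscan can be illustrated with a minimal sketch. This is not 
 the code from the attached patches; it is a Python illustration that assumes 
 a local directory stands in for a table or partition location, and the 
 function name noscan_stats is hypothetical. It shows that the number of files 
 and the size in bytes can be derived from file system metadata alone, while 
 the number of rows cannot.

```python
import os


def noscan_stats(table_dir):
    """Return (num_files, total_size_bytes) for everything under table_dir.

    Hypothetical helper for illustration only: a local directory stands in
    for a table or partition location. Only file metadata is consulted, so
    no file is ever opened and no row is ever read.
    """
    num_files = 0
    total_size = 0
    for root, _dirs, files in os.walk(table_dir):
        for name in files:
            # os.stat returns metadata (including size) without reading data.
            total_size += os.stat(os.path.join(root, name)).st_size
            num_files += 1
    return num_files, total_size
```

 A row count, by contrast, would require reading every file, which is the 
 cost the noscan option is meant to avoid.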

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Updated] (HIVE-3917) Support noscan operation for analyze command

2013-01-31 Thread Namit Jain (JIRA)

 [ 
https://issues.apache.org/jira/browse/HIVE-3917?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Namit Jain updated HIVE-3917:
-

Status: Open  (was: Patch Available)

comments

 Support noscan operation for analyze command
 

 Key: HIVE-3917
 URL: https://issues.apache.org/jira/browse/HIVE-3917
 Project: Hive
  Issue Type: Improvement
  Components: Statistics
Affects Versions: 0.11.0
Reporter: Gang Tim Liu
Assignee: Gang Tim Liu
 Attachments: HIVE-3917.patch.1, HIVE-3917.patch.2, HIVE-3917.patch.3




[jira] [Updated] (HIVE-3917) Support noscan operation for analyze command

2013-01-31 Thread Gang Tim Liu (JIRA)

 [ 
https://issues.apache.org/jira/browse/HIVE-3917?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Gang Tim Liu updated HIVE-3917:
---

Attachment: HIVE-3917.patch.4

 Support noscan operation for analyze command
 

 Key: HIVE-3917
 URL: https://issues.apache.org/jira/browse/HIVE-3917
 Project: Hive
  Issue Type: Improvement
  Components: Statistics
Affects Versions: 0.11.0
Reporter: Gang Tim Liu
Assignee: Gang Tim Liu
 Attachments: HIVE-3917.patch.1, HIVE-3917.patch.2, HIVE-3917.patch.3, 
 HIVE-3917.patch.4




[jira] [Updated] (HIVE-3917) Support noscan operation for analyze command

2013-01-31 Thread Namit Jain (JIRA)

 [ 
https://issues.apache.org/jira/browse/HIVE-3917?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Namit Jain updated HIVE-3917:
-

Status: Open  (was: Patch Available)

comments

Let us throw an error for external tables.

 Support noscan operation for analyze command
 

 Key: HIVE-3917
 URL: https://issues.apache.org/jira/browse/HIVE-3917
 Project: Hive
  Issue Type: Improvement
  Components: Statistics
Affects Versions: 0.11.0
Reporter: Gang Tim Liu
Assignee: Gang Tim Liu
 Attachments: HIVE-3917.patch.1, HIVE-3917.patch.2, HIVE-3917.patch.3, 
 HIVE-3917.patch.4




[jira] [Updated] (HIVE-3917) Support noscan operation for analyze command

2013-01-31 Thread Gang Tim Liu (JIRA)

 [ 
https://issues.apache.org/jira/browse/HIVE-3917?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Gang Tim Liu updated HIVE-3917:
---

Status: Patch Available  (was: Open)

 Support noscan operation for analyze command
 

 Key: HIVE-3917
 URL: https://issues.apache.org/jira/browse/HIVE-3917
 Project: Hive
  Issue Type: Improvement
  Components: Statistics
Affects Versions: 0.11.0
Reporter: Gang Tim Liu
Assignee: Gang Tim Liu
 Attachments: HIVE-3917.patch.1, HIVE-3917.patch.2, HIVE-3917.patch.3, 
 HIVE-3917.patch.4




[jira] [Updated] (HIVE-3917) Support noscan operation for analyze command

2013-01-30 Thread Gang Tim Liu (JIRA)

 [ 
https://issues.apache.org/jira/browse/HIVE-3917?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Gang Tim Liu updated HIVE-3917:
---

Attachment: HIVE-3917.patch.2

 Support noscan operation for analyze command
 

 Key: HIVE-3917
 URL: https://issues.apache.org/jira/browse/HIVE-3917
 Project: Hive
  Issue Type: Improvement
  Components: Statistics
Affects Versions: 0.11.0
Reporter: Gang Tim Liu
Assignee: Gang Tim Liu
 Attachments: HIVE-3917.patch.1, HIVE-3917.patch.2




[jira] [Updated] (HIVE-3917) Support noscan operation for analyze command

2013-01-30 Thread Gang Tim Liu (JIRA)

 [ 
https://issues.apache.org/jira/browse/HIVE-3917?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Gang Tim Liu updated HIVE-3917:
---

Status: Patch Available  (was: In Progress)

Patch is available.

 Support noscan operation for analyze command
 

 Key: HIVE-3917
 URL: https://issues.apache.org/jira/browse/HIVE-3917
 Project: Hive
  Issue Type: Improvement
  Components: Statistics
Affects Versions: 0.11.0
Reporter: Gang Tim Liu
Assignee: Gang Tim Liu
 Attachments: HIVE-3917.patch.1, HIVE-3917.patch.2




[jira] [Updated] (HIVE-3917) Support noscan operation for analyze command

2013-01-30 Thread Gang Tim Liu (JIRA)

 [ 
https://issues.apache.org/jira/browse/HIVE-3917?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Gang Tim Liu updated HIVE-3917:
---

Attachment: (was: HIVE-3917.patch.2)

 Support noscan operation for analyze command
 

 Key: HIVE-3917
 URL: https://issues.apache.org/jira/browse/HIVE-3917
 Project: Hive
  Issue Type: Improvement
  Components: Statistics
Affects Versions: 0.11.0
Reporter: Gang Tim Liu
Assignee: Gang Tim Liu
 Attachments: HIVE-3917.patch.1




[jira] [Updated] (HIVE-3917) Support noscan operation for analyze command

2013-01-30 Thread Gang Tim Liu (JIRA)

 [ 
https://issues.apache.org/jira/browse/HIVE-3917?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Gang Tim Liu updated HIVE-3917:
---

Attachment: HIVE-3917.patch.3

 Support noscan operation for analyze command
 

 Key: HIVE-3917
 URL: https://issues.apache.org/jira/browse/HIVE-3917
 Project: Hive
  Issue Type: Improvement
  Components: Statistics
Affects Versions: 0.11.0
Reporter: Gang Tim Liu
Assignee: Gang Tim Liu
 Attachments: HIVE-3917.patch.1, HIVE-3917.patch.2, HIVE-3917.patch.3




[jira] [Updated] (HIVE-3917) Support noscan operation for analyze command

2013-01-29 Thread Gang Tim Liu (JIRA)

 [ 
https://issues.apache.org/jira/browse/HIVE-3917?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Gang Tim Liu updated HIVE-3917:
---

Summary: Support noscan operation for analyze command  (was: Support fast 
operation for analyze command)

 Support noscan operation for analyze command
 

 Key: HIVE-3917
 URL: https://issues.apache.org/jira/browse/HIVE-3917
 Project: Hive
  Issue Type: Improvement
  Components: Statistics
Affects Versions: 0.11.0
Reporter: Gang Tim Liu
Assignee: Gang Tim Liu
 Attachments: HIVE-3917.patch.1

