[jira] [Updated] (HIVE-2471) Add timestamp column to the partition stats table.

2012-04-13 Thread Ashutosh Chauhan (Updated) (JIRA)

 [ 
https://issues.apache.org/jira/browse/HIVE-2471?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Ashutosh Chauhan updated HIVE-2471:
---

Fix Version/s: 0.9.0

> Add timestamp column to the partition stats table.
> --
>
> Key: HIVE-2471
> URL: https://issues.apache.org/jira/browse/HIVE-2471
> Project: Hive
> Issue Type: Improvement
> Reporter: Kevin Wilfong
> Assignee: Kevin Wilfong
> Fix For: 0.9.0
>
> Attachments: HIVE-2471.1.patch.txt, HIVE-2471.D2367.1.patch, 
> HIVE-2471.D2367.2.patch, HIVE-2471.D2367.3.patch
>
>
> Occasionally, when entries are added to the partition stats table the program 
> is halted before it can delete those entries, by an exception, keyboard 
> interrupt, etc.  These build up to the point where the table gets very large, 
> and it hurts the performance of the update statement which is often called.  
> In order to fix this, I am adding a column to the table which is 
> auto-populated with the current timestamp.  This will allow us to create 
> scripts that go through periodically and clean out old entries from the table.
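
The description above can be illustrated with a minimal sketch. This is not the 
committed patch: the table name, column names, and helper methods below are 
hypothetical, and the DDL assumes a database that accepts 
TIMESTAMP DEFAULT CURRENT_TIMESTAMP. It only shows how an auto-populated 
timestamp column lets a periodic script delete rows older than a retention 
period.

```java
import java.time.Duration;
import java.time.Instant;

final class StatsCleanupSketch {
    // Hypothetical names for illustration; the real constants live in
    // JDBCStatsSetupConstants.java and may differ.
    static final String TABLE = "PARTITION_STATS_SKETCH";
    static final String TS_COLUMN = "TS";

    /** DDL with a timestamp column that the database fills in on insert. */
    public static String createTableSql() {
        return "CREATE TABLE " + TABLE + " ("
            + "ID VARCHAR(255) PRIMARY KEY, "
            + "NUM_ROWS BIGINT, "
            + TS_COLUMN + " TIMESTAMP DEFAULT CURRENT_TIMESTAMP)";
    }

    /** Parameterized delete; the cutoff is bound when the script runs. */
    public static String cleanupSql() {
        return "DELETE FROM " + TABLE + " WHERE " + TS_COLUMN + " < ?";
    }

    /** Rows with a timestamp before this instant count as stale. */
    public static Instant cutoff(Instant now, Duration retention) {
        return now.minus(retention);
    }

    public static void main(String[] args) {
        System.out.println(createTableSql());
        System.out.println(cleanupSql());
        System.out.println(cutoff(Instant.now(), Duration.ofDays(7)));
    }
}
```

A cleanup script would bind the value from cutoff(...) to the single parameter 
of cleanupSql() through a JDBC PreparedStatement. The seven-day retention 
period is only an example value.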

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Updated] (HIVE-2471) Add timestamp column to the partition stats table.

2012-03-19 Thread Namit Jain (Updated) (JIRA)

 [ 
https://issues.apache.org/jira/browse/HIVE-2471?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Namit Jain updated HIVE-2471:
-

Resolution: Fixed
Hadoop Flags: Reviewed
Status: Resolved  (was: Patch Available)

Committed. Thanks Kevin

> Add timestamp column to the partition stats table.
> --
>
> Key: HIVE-2471
> URL: https://issues.apache.org/jira/browse/HIVE-2471
> Project: Hive
> Issue Type: Improvement
> Reporter: Kevin Wilfong
> Assignee: Kevin Wilfong
> Attachments: HIVE-2471.1.patch.txt, HIVE-2471.D2367.1.patch, 
> HIVE-2471.D2367.2.patch, HIVE-2471.D2367.3.patch
>
>
> Occasionally, when entries are added to the partition stats table the program 
> is halted before it can delete those entries, by an exception, keyboard 
> interrupt, etc.  These build up to the point where the table gets very large, 
> and it hurts the performance of the update statement which is often called.  
> In order to fix this, I am adding a column to the table which is 
> auto-populated with the current timestamp.  This will allow us to create 
> scripts that go through periodically and clean out old entries from the table.





[jira] [Updated] (HIVE-2471) Add timestamp column to the partition stats table.

2012-03-16 Thread Phabricator (Updated) (JIRA)

 [ 
https://issues.apache.org/jira/browse/HIVE-2471?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Phabricator updated HIVE-2471:
--

Attachment: HIVE-2471.D2367.3.patch

kevinwilfong updated the revision "HIVE-2471 [jira] Add timestamp column with 
index to the partition stats table.".
Reviewers: JIRA, njain

  Added a big comment saying it is up to the Hive administrator to drop old 
partition stats tables.

REVISION DETAIL
  https://reviews.facebook.net/D2367

AFFECTED FILES
  ql/src/java/org/apache/hadoop/hive/ql/stats/jdbc/JDBCStatsSetupConstants.java
  ql/src/java/org/apache/hadoop/hive/ql/stats/jdbc/JDBCStatsUtils.java


> Add timestamp column to the partition stats table.
> --
>
> Key: HIVE-2471
> URL: https://issues.apache.org/jira/browse/HIVE-2471
> Project: Hive
> Issue Type: Improvement
> Reporter: Kevin Wilfong
> Assignee: Kevin Wilfong
> Attachments: HIVE-2471.1.patch.txt, HIVE-2471.D2367.1.patch, 
> HIVE-2471.D2367.2.patch, HIVE-2471.D2367.3.patch
>
>
> Occasionally, when entries are added to the partition stats table the program 
> is halted before it can delete those entries, by an exception, keyboard 
> interrupt, etc.  These build up to the point where the table gets very large, 
> and it hurts the performance of the update statement which is often called.  
> In order to fix this, I am adding a column to the table which is 
> auto-populated with the current timestamp.  This will allow us to create 
> scripts that go through periodically and clean out old entries from the table.





[jira] [Updated] (HIVE-2471) Add timestamp column to the partition stats table.

2012-03-16 Thread Phabricator (Updated) (JIRA)

 [ 
https://issues.apache.org/jira/browse/HIVE-2471?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Phabricator updated HIVE-2471:
--

Attachment: HIVE-2471.D2367.2.patch

kevinwilfong updated the revision "HIVE-2471 [jira] Add timestamp column with 
index to the partition stats table.".
Reviewers: JIRA, njain

  Changed the name of the stats table so that this update will apply 
automatically and immediately; otherwise the update command will fail on old 
schemas.  Also introduced versioning to the name, which I think is better than 
the old method of coming up with a new combination of (PARTITION, PART) and 
(STATISTICS, STATS).
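
The versioned naming mentioned above could look roughly like the following 
sketch. The base name, the "_V" suffix format, and the helper method are 
assumptions made for illustration; the actual constant is defined in 
JDBCStatsSetupConstants.java and may be spelled differently.

```java
final class VersionedStatsTableName {
    // Hypothetical base name; the real value may differ.
    static final String BASE_NAME = "PARTITION_STATS";

    /**
     * Appends a schema version suffix so that each schema change gets a
     * fresh table name instead of a new PARTITION/PART, STATISTICS/STATS
     * spelling.
     */
    public static String forSchemaVersion(int version) {
        if (version < 1) {
            throw new IllegalArgumentException("version must be >= 1, got " + version);
        }
        return BASE_NAME + "_V" + version;
    }

    public static void main(String[] args) {
        System.out.println(forSchemaVersion(1));
        System.out.println(forSchemaVersion(2));
    }
}
```

Because a new schema version yields a new table name, code that expects the 
new column never runs against a table created with the old schema. The old 
table is simply left behind until someone drops it.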

REVISION DETAIL
  https://reviews.facebook.net/D2367

AFFECTED FILES
  ql/src/java/org/apache/hadoop/hive/ql/stats/jdbc/JDBCStatsSetupConstants.java
  ql/src/java/org/apache/hadoop/hive/ql/stats/jdbc/JDBCStatsUtils.java


> Add timestamp column to the partition stats table.
> --
>
> Key: HIVE-2471
> URL: https://issues.apache.org/jira/browse/HIVE-2471
> Project: Hive
> Issue Type: Improvement
> Reporter: Kevin Wilfong
> Assignee: Kevin Wilfong
> Attachments: HIVE-2471.1.patch.txt, HIVE-2471.D2367.1.patch, 
> HIVE-2471.D2367.2.patch
>
>
> Occasionally, when entries are added to the partition stats table the program 
> is halted before it can delete those entries, by an exception, keyboard 
> interrupt, etc.  These build up to the point where the table gets very large, 
> and it hurts the performance of the update statement which is often called.  
> In order to fix this, I am adding a column to the table which is 
> auto-populated with the current timestamp.  This will allow us to create 
> scripts that go through periodically and clean out old entries from the table.





[jira] [Updated] (HIVE-2471) Add timestamp column to the partition stats table.

2012-03-16 Thread Kevin Wilfong (Updated) (JIRA)

 [ 
https://issues.apache.org/jira/browse/HIVE-2471?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Kevin Wilfong updated HIVE-2471:


Status: Patch Available  (was: Open)

> Add timestamp column to the partition stats table.
> --
>
> Key: HIVE-2471
> URL: https://issues.apache.org/jira/browse/HIVE-2471
> Project: Hive
> Issue Type: Improvement
> Reporter: Kevin Wilfong
> Assignee: Kevin Wilfong
> Attachments: HIVE-2471.1.patch.txt, HIVE-2471.D2367.1.patch, 
> HIVE-2471.D2367.2.patch
>
>
> Occasionally, when entries are added to the partition stats table the program 
> is halted before it can delete those entries, by an exception, keyboard 
> interrupt, etc.  These build up to the point where the table gets very large, 
> and it hurts the performance of the update statement which is often called.  
> In order to fix this, I am adding a column to the table which is 
> auto-populated with the current timestamp.  This will allow us to create 
> scripts that go through periodically and clean out old entries from the table.





[jira] [Updated] (HIVE-2471) Add timestamp column to the partition stats table.

2012-03-16 Thread Kevin Wilfong (Updated) (JIRA)

 [ 
https://issues.apache.org/jira/browse/HIVE-2471?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Kevin Wilfong updated HIVE-2471:


Status: Open  (was: Patch Available)

> Add timestamp column to the partition stats table.
> --
>
> Key: HIVE-2471
> URL: https://issues.apache.org/jira/browse/HIVE-2471
> Project: Hive
> Issue Type: Improvement
> Reporter: Kevin Wilfong
> Assignee: Kevin Wilfong
> Attachments: HIVE-2471.1.patch.txt, HIVE-2471.D2367.1.patch
>
>
> Occasionally, when entries are added to the partition stats table the program 
> is halted before it can delete those entries, by an exception, keyboard 
> interrupt, etc.  These build up to the point where the table gets very large, 
> and it hurts the performance of the update statement which is often called.  
> In order to fix this, I am adding a column to the table which is 
> auto-populated with the current timestamp.  This will allow us to create 
> scripts that go through periodically and clean out old entries from the table.





[jira] [Updated] (HIVE-2471) Add timestamp column to the partition stats table.

2012-03-16 Thread Kevin Wilfong (Updated) (JIRA)

 [ 
https://issues.apache.org/jira/browse/HIVE-2471?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Kevin Wilfong updated HIVE-2471:


Description: Occasionally, when entries are added to the partition stats 
table the program is halted before it can delete those entries, by an 
exception, keyboard interrupt, etc.  These build up to the point where the 
table gets very large, and it hurts the performance of the update statement 
which is often called.  In order to fix this, I am adding a column to the table 
which is auto-populated with the current timestamp.  This will allow us to 
create scripts that go through periodically and clean out old entries from the 
table.  (was: Occasionally, when entries are added to the partition stats table 
the program is halted before it can delete those entries, by an exception, 
keyboard interrupt, etc.  These build up to the point where the table gets very 
large, and it hurts the performance of the update statement which is often 
called.  In order to fix this, I am adding a column to the table which is 
auto-populated with the current timestamp.  I am also adding an index on this 
column.  This will allow us to create scripts that go through periodically and 
clean out old entries from the table.)

> Add timestamp column to the partition stats table.
> --
>
> Key: HIVE-2471
> URL: https://issues.apache.org/jira/browse/HIVE-2471
> Project: Hive
> Issue Type: Improvement
> Reporter: Kevin Wilfong
> Assignee: Kevin Wilfong
> Attachments: HIVE-2471.1.patch.txt, HIVE-2471.D2367.1.patch
>
>
> Occasionally, when entries are added to the partition stats table the program 
> is halted before it can delete those entries, by an exception, keyboard 
> interrupt, etc.  These build up to the point where the table gets very large, 
> and it hurts the performance of the update statement which is often called.  
> In order to fix this, I am adding a column to the table which is 
> auto-populated with the current timestamp.  This will allow us to create 
> scripts that go through periodically and clean out old entries from the table.





[jira] [Updated] (HIVE-2471) Add timestamp column to the partition stats table.

2012-03-16 Thread Kevin Wilfong (Updated) (JIRA)

 [ 
https://issues.apache.org/jira/browse/HIVE-2471?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Kevin Wilfong updated HIVE-2471:


Status: Patch Available  (was: Open)

> Add timestamp column to the partition stats table.
> --
>
> Key: HIVE-2471
> URL: https://issues.apache.org/jira/browse/HIVE-2471
> Project: Hive
> Issue Type: Improvement
> Reporter: Kevin Wilfong
> Assignee: Kevin Wilfong
> Attachments: HIVE-2471.1.patch.txt, HIVE-2471.D2367.1.patch
>
>
> Occasionally, when entries are added to the partition stats table the program 
> is halted before it can delete those entries, by an exception, keyboard 
> interrupt, etc.  These build up to the point where the table gets very large, 
> and it hurts the performance of the update statement which is often called.  
> In order to fix this, I am adding a column to the table which is 
> auto-populated with the current timestamp.  This will allow us to create 
> scripts that go through periodically and clean out old entries from the table.





[jira] [Updated] (HIVE-2471) Add timestamp column to the partition stats table.

2012-03-16 Thread Kevin Wilfong (Updated) (JIRA)

 [ 
https://issues.apache.org/jira/browse/HIVE-2471?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Kevin Wilfong updated HIVE-2471:


Description: Occasionally, when entries are added to the partition stats 
table the program is halted before it can delete those entries, by an 
exception, keyboard interrupt, etc.  These build up to the point where the 
table gets very large, and it hurts the performance of the update statement 
which is often called.  In order to fix this, I am adding a column to the table 
which is auto-populated with the current timestamp.  I am also adding an index 
on this column.  This will allow us to create scripts that go through 
periodically and clean out old entries from the table.  (was: Occasionally, 
when entries are added to the partition stats table the program is halted 
before it can delete those entries, by an exception, keyboard interrupt, etc.  
These build up to the point where the table gets very large, and it hurts the 
performance of the update statement which is often called.  In order to fix 
this, I am adding a column to the table which is auto-populated with the 
current timestamp.  I am also adding an index on this column.  This will allow 
us to create scripts that go through periodically and clean out old entries 
from the table.  The index will help to keep the runtime of these scripts 
short, and hence reduce the amount of time they need to lock the table/indexes 
for.)
Summary: Add timestamp column to the partition stats table.  (was: Add 
timestamp column with index to the partition stats table.)

> Add timestamp column to the partition stats table.
> --
>
> Key: HIVE-2471
> URL: https://issues.apache.org/jira/browse/HIVE-2471
> Project: Hive
> Issue Type: Improvement
> Reporter: Kevin Wilfong
> Assignee: Kevin Wilfong
> Attachments: HIVE-2471.1.patch.txt, HIVE-2471.D2367.1.patch
>
>
> Occasionally, when entries are added to the partition stats table the program 
> is halted before it can delete those entries, by an exception, keyboard 
> interrupt, etc.  These build up to the point where the table gets very large, 
> and it hurts the performance of the update statement which is often called.  
> In order to fix this, I am adding a column to the table which is 
> auto-populated with the current timestamp.  I am also adding an index on this 
> column.  This will allow us to create scripts that go through periodically 
> and clean out old entries from the table.
