RE: how to treat an existing partition data file as a table?
You need to specify a table partition from which you want to sample. Olga From: Yang [mailto:tedd...@gmail.com] Sent: Sunday, September 29, 2013 1:39 PM To: hive-u...@hadoop.apache.org Subject: how to treat an existing partition data file as a table? we have a huge table, including browsing data for the past 5 years, let's say. now I want to take a few samples to play around with it. so I did select * from mytable limit 10; but it actually went full out and tried to scan the entire table. is there a way to kind of create a view pointing to only one of the data files used by the original table mytable ? this way the total files to be scanned is much smaller. thanks! yang
Re: how to treat an existing partition data file as a table?
thanks guys, I found that the table is not partitioned, so I guess no way out... On Mon, Sep 30, 2013 at 9:31 AM, Olga L. Natkovich ol...@yahoo-inc.comwrote: You need to specify a table partition from which you want to sample. ** ** Olga ** ** *From:* Yang [mailto:tedd...@gmail.com] *Sent:* Sunday, September 29, 2013 1:39 PM *To:* hive-u...@hadoop.apache.org *Subject:* how to treat an existing partition data file as a table? ** ** we have a huge table, including browsing data for the past 5 years, let's say. ** ** now I want to take a few samples to play around with it. so I did select * from mytable limit 10; but it actually went full out and tried to scan the entire table. is there a way to kind of create a view pointing to only one of the data files used by the original table mytable ? this way the total files to be scanned is much smaller. ** ** ** ** thanks! yang
how to treat an existing partition data file as a table?
we have a huge table, including browsing data for the past 5 years, let's say. now I want to take a few samples to play around with it. so I did select * from mytable limit 10; but it actually went full out and tried to scan the entire table. is there a way to kind of create a view pointing to only one of the data files used by the original table mytable ? this way the total files to be scanned is much smaller. thanks! yang
Re: how to treat an existing partition data file as a table?
Is your table partitioned ? Sent from my iPad On Sep 30, 2013, at 2:09 AM, Yang tedd...@gmail.com wrote: we have a huge table, including browsing data for the past 5 years, let's say. now I want to take a few samples to play around with it. so I did select * from mytable limit 10; but it actually went full out and tried to scan the entire table. is there a way to kind of create a view pointing to only one of the data files used by the original table mytable ? this way the total files to be scanned is much smaller. thanks! yang
Re: how to treat an existing partition data file as a table?
Bucket the table and use TABLESAMPLE clause? york From: Yang tedd...@gmail.commailto:tedd...@gmail.com Reply-To: user@hive.apache.orgmailto:user@hive.apache.org user@hive.apache.orgmailto:user@hive.apache.org Date: Sunday, September 29, 2013 3:39 PM To: hive-u...@hadoop.apache.orgmailto:hive-u...@hadoop.apache.org hive-u...@hadoop.apache.orgmailto:hive-u...@hadoop.apache.org Subject: how to treat an existing partition data file as a table? we have a huge table, including browsing data for the past 5 years, let's say. now I want to take a few samples to play around with it. so I did select * from mytable limit 10; but it actually went full out and tried to scan the entire table. is there a way to kind of create a view pointing to only one of the data files used by the original table mytable ? this way the total files to be scanned is much smaller. thanks! yang This transmission may contain information that is privileged, confidential, and/or exempt from disclosure under applicable law. If you are not the intended recipient, you are hereby notified that any disclosure, copying, distribution, or use of the information contained herein (including any reliance thereon) is strictly prohibited. If you received this transmission in error, please immediately contact the sender and destroy the material in its entirety, whether in electronic or hard copy format.