Re: [ANN] Hivemall: Hive scalable machine learning library

2013-10-03 Thread Dean Wampler
Hivemall within 5 minutes or so. Hope you enjoy the release! Feedback (and pull request) is always welcome. Thank you, Makoto -- Dean Wampler, Ph.D. @deanwampler http://polyglotprogramming.com

Re: [ANNOUNCE]: Apache Sentry 1.2.0 released

2013-09-26 Thread Dean Wampler
, Sentry team -- Dean Wampler, Ph.D. @deanwampler http://polyglotprogramming.com

Re: question about partition table in hive

2013-09-13 Thread Dean Wampler
and put it in hdfs ,i want to use hive to do some caculate, query based on timerange,i want to use parttion table , but the data file in hdfs is a big file ,how can i put it into pratition table in hive? -- Dean Wampler, Ph.D. @deanwampler http://polyglotprogramming.com

Re: Inner Map key and value separators

2013-09-13 Thread Dean Wampler
Unfortunately, I believe there's no way to do this. Sent from my rotary phone. On Sep 13, 2013, at 6:42 PM, Sanjay Subramanian sanjay.subraman...@wizecommerce.com wrote: Hi guys I have to load data into the following data type in hive map string,mapstring,string Is there a way

Re: Interesting claims that seem untrue

2013-09-12 Thread Dean Wampler
message along with any attachments, from your computer system. If you are the intended recipient, please be advised that the content of this message is subject to access, review and disclosure by the sender's Email System Administrator. -- Dean Wampler, Ph.D. @deanwampler http

Re: Hive - external (dynamically) partitioned table

2013-07-26 Thread Dean Wampler
-mail and using or disseminating the information, and must notify the sender and delete it from their system. LT Infotech will not accept responsibility or liability for the accuracy or completeness of, or the presence of any virus or disabling code in this e-mail -- Dean Wampler, Ph.D

Re: Hive Query

2013-07-12 Thread Dean Wampler
that hive does not support IN clause. Then what is the effective replacement for this? i need to execute around 250 inputs. I'm using hive 0.9.0 version. Please guide me. Thanks, Manickam P -- Dean Wampler, Ph.D. @deanwampler http://polyglotprogramming.com

Re: Hive - UDF

2013-07-09 Thread Dean Wampler
for each? Please let me know. Thanks, Manickam P -- Dean Wampler, Ph.D. @deanwampler http://polyglotprogramming.com

Re: Partition performance

2013-07-03 Thread Dean Wampler
recipient, please be advised that the content of this message is subject to access, review and disclosure by the sender's Email System Administrator.* *** ** ** ** ** -- Dean Wampler, Ph.D. @deanwampler http://polyglotprogramming.com

Re: Performance difference between tuning reducer num and partition table

2013-06-29 Thread Dean Wampler
performance difference between them(non-partition VS partition+concurrent) besides the job parameter issues? Thanks! -- Dean Wampler, Ph.D. @deanwampler http://polyglotprogramming.com

Re: Hive built-in functions not working

2013-06-21 Thread Dean Wampler
is the same over there. i need to work with hadoop 0.20, so unfortunately i can't try hadoop 1.x.x thanks in advance cheers Wolli -- Dean Wampler, Ph.D. @deanwampler http://polyglotprogramming.com

Re: Question regarding nested complex data type

2013-06-20 Thread Dean Wampler
with another process. Hive used ^A for the field separator, ^B for the collection separator, in this case, to separate structs in the array, and ^C to separate the elements in each struct, e.g.,: Dean Wampler^Afirst^C1^Bsecond^C2^Bthird^C3 In other words, the structure you would expect for this table

Re: Create table like with partitions

2013-06-11 Thread Dean Wampler
I confirmed it is a pirate site. Sent from my rotary phone. On Jun 11, 2013, at 10:33 AM, Edward Capriolo edlinuxg...@gmail.com wrote: For reference, any that puts the entire book online like this is likely pirated. On Tue, Jun 11, 2013 at 8:34 AM, Richa Sharma

Re: Difference between like %A% and %a%

2013-05-24 Thread Dean Wampler
not work. I've added this comment to https://issues.apache.org/jira/browse/HIVE-4070#comment-13666278 for fun. :) Please? :) On Fri, May 24, 2013 at 7:53 AM, Dean Wampler deanwamp...@gmail.comwrote: Your where clause looks at the abbreviation, requiring 'A', not the state name. You got

Re: Difference between like %A% and %a%

2013-05-24 Thread Dean Wampler
am thinking that the rlike is based on regex and can be told to do case insensitive matching. On Fri, May 24, 2013 at 9:16 AM, Dean Wampler deanwamp...@gmail.comwrote: Hortonworks has announced plans to make Hive more SQL compliant. I suspect bugs like this will be addressed sooner or later

Re: Hive skipping first line

2013-05-23 Thread Dean Wampler
. If you are the intended recipient, please be advised that the content of this message is subject to access, review and disclosure by the sender's Email System Administrator.* *** -- Dean Wampler, Ph.D. @deanwampler http://polyglotprogramming.com

Re: hive.metastore.warehouse.dir - Should it point to a physical directory

2013-05-21 Thread Dean Wampler
of default database for the warehouse/description /property Where should I create the HDFS directory ? *From:* Sanjay Subramanian sanjay.subraman...@wizecommerce.com *To:* user@hive.apache.org user@hive.apache.org; Raj Hadoop hadoop...@yahoo.com; Dean Wampler deanwamp...@gmail.com *Cc

Re: Hive query problem on S3 table

2013-04-18 Thread Dean Wampler
7198034 Web: www.innoplexia.com Sitz: 69123 Heidelberg, Mannheimer Str. 175 - Steuernummer 32494/62606 - USt. IdNr.: DE 272 871 728 - Geschäftsführer: Prof. Dr. Herbert Schuster * *** ** ** -- Dean Wampler, Ph.D. @deanwampler http://polyglotprogramming.com image001.png

Re: Partition performance

2013-04-04 Thread Dean Wampler
and destroy all copies of the original message along with any attachments, from your computer system. If you are the intended recipient, please be advised that the content of this message is subject to access, review and disclosure by the sender's Email System Administrator. -- *Dean Wampler, Ph.D

Re: Metastore question

2013-04-03 Thread Dean Wampler
: Trying to make sure I understand this correctly. All databases and tables are stored in hive.metastore.warehouse.dir but the actual metadata for the database and tables (columns, types, partitions, etc) are stored in the hive database (ie.. mysql)? Is that correct? -- *Dean Wampler, Ph.D

Re: Bucketing external tables

2013-03-30 Thread Dean Wampler
to be managed tables; not external tables, right? . Thank again for your time and help. Sadu On Fri, Mar 29, 2013 at 5:57 PM, Dean Wampler dean.wamp...@thinkbiganalytics.com wrote: I don't know of any way to avoid creating new tables and moving the data. In fact, that's the official

Re: Noob question on creating tables

2013-03-29 Thread Dean Wampler
a compression scheme supported by Hadoop. Thanks -- *Dean Wampler, Ph.D.* thinkbiganalytics.com +1-312-339-1330

Re: Using TABLESAMPLE on inner queries

2013-03-20 Thread Dean Wampler
but it always returns some sort of syntax error and thus not allowing the query to run. Any help is appreciated. Robert ** -- *Dean Wampler, Ph.D.* thinkbiganalytics.com +1-312-339-1330

Re: Hive 0.10.0 metastore thrift server installation error

2013-03-14 Thread Dean Wampler
) at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43) at java.lang.reflect.Method.invoke(Method.java:601) at org.apache.hadoop.util.RunJar.main(RunJar.java:156) Any clue what's going on here? Thanks, Shanzhong -- *Dean Wampler, Ph.D.* thinkbiganalytics.com +1-312-339-1330

Re: Error while table creation

2013-03-10 Thread Dean Wampler
) ... 43 more Caused by: com.mysql.jdbc.exceptions.jdbc4.MySQLIntegrityConstraintViolationException: Duplicate entry 'X' for key 'PRIMARY' Thanks, Abhishek -- *Dean Wampler, Ph.D.* thinkbiganalytics.com +1-312-339-1330

Re: UDFs and Thread Safety?

2013-03-10 Thread Dean Wampler
...@atlassian.comwrote: Hi All, Could anyone describe what the required thread safety for a UDF is? I understand that one is instantiated for each use of the function in an expression, but can there be multiple threads executing the methods of a single UDF object at once? Thanks, Shaun -- *Dean Wampler

Re: Accessing sub column in hive

2013-03-08 Thread Dean Wampler
] from employees; ** ** I get an error. ** ** Please help. ** ** Thanks Sai -- *Dean Wampler, Ph.D.* thinkbiganalytics.com +1-312-339-1330

Re: java.lang.NoClassDefFoundError: com/jayway/jsonpath/PathUtil

2013-03-08 Thread Dean Wampler
org.apache.hadoop.hive.ql.exec.DDLTask Any help would be really appreciated. Thanks Sai -- *Dean Wampler, Ph.D.* thinkbiganalytics.com +1-312-339-1330

Re: Find current db we r using in Hive

2013-03-08 Thread Dean Wampler
the current db we r using similar to pwd in Unix. Thanks Sai -- *Dean Wampler, Ph.D.* thinkbiganalytics.com +1-312-339-1330

Re: difference between add jar in hive session and hive --auxpath

2013-03-08 Thread Dean Wampler
in the HUE, as it only supports add jar, but not --auxpath option. Any suggestions? Thanks Yong -- *Dean Wampler, Ph.D.* thinkbiganalytics.com +1-312-339-1330

Re: Hive query started map task being killed during execution

2013-03-08 Thread Dean Wampler
o.apache.hadoop.hive.ql.exec.FileSinkOperator: StatsPublishing error: cannot connect to database Please suggest if I need to set anything in Hive when I invoke this query. The query that runs successfully has lot less rows compared to on that fails. Thanks, DK -- *Dean Wampler, Ph.D

Re: Rename external table, including HDFS directory

2013-03-07 Thread Dean Wampler
-- *Dean Wampler, Ph.D.* thinkbiganalytics.com +1-312-339-1330

Re: Best table storage for analytical use case

2013-03-06 Thread Dean Wampler
, my number of mappers =1, so I could try to configure my setup with additional mappers. Does this make sense ? Thank you for your help ! Sekine 2013/3/4 Dean Wampler dean.wamp...@thinkbiganalytics.com RCFile won't help much (and apparently not all in this case ;) unless you have a lot

Re: Where is the location of hive queries

2013-03-06 Thread Dean Wampler
. Thanks Sai -- Nitin Pawar -- *Dean Wampler, Ph.D.* thinkbiganalytics.com +1-312-339-1330

Re: Combine two overlapping schema?

2013-03-06 Thread Dean Wampler
-- *Dean Wampler, Ph.D.* thinkbiganalytics.com +1-312-339-1330

Re: Variable Substitution

2013-03-06 Thread Dean Wampler
. Is there something that I've overlooked? Thanks! -- *Dean Wampler, Ph.D.* thinkbiganalytics.com +1-312-339-1330

Re: Error while exporting table data from hive to Oracle through Sqoop

2013-03-05 Thread Dean Wampler
. -- *Dean Wampler, Ph.D.* thinkbiganalytics.com +1-312-339-1330

Re: Location of external table in hdfs

2013-03-05 Thread Dean Wampler
BY '\t' LOCATION '/tmp/states' ; Any help is really appreciated. Thanks Sai -- *Dean Wampler, Ph.D.* thinkbiganalytics.com +1-312-339-1330

Re: Hive sample test

2013-03-05 Thread Dean Wampler
by something=something to select really_expensive_select_clause from ( select * from really_big_table limit 100 )t where something=something group by something=something On Tue, Mar 5, 2013 at 10:57 AM, Dean Wampler dean.wamp...@thinkbiganalytics.com wrote: Unfortunately

Re: Best table storage for analytical use case

2013-03-04 Thread Dean Wampler
compression help (not sure whether RCFiles are compressed or not) ? Thank you ! -- *Dean Wampler, Ph.D.* thinkbiganalytics.com +1-312-339-1330

Re: doubt with LEFT OUTER JOIN

2013-03-04 Thread Dean Wampler
are of different format types. Regards, Kumar -Original Message- From: Dean Wampler dean.wamp...@thinkbiganalytics.com To: user user@hive.apache.org Sent: Fri, Mar 1, 2013 12:23 pm Subject: Re: doubt with LEFT OUTER JOIN I just tried an experiment where the right-hand table

Re: regexp_replace with unicode chars

2013-03-01 Thread Dean Wampler
#Basic_Multilingual_Plane [2] http://grokbase.com/t/hive/dev/131a4n562y/unicode-character-as-delimiter -- *Dean Wampler, Ph.D.* thinkbiganalytics.com +1-312-339-1330

Re: regexp_replace with unicode chars

2013-03-01 Thread Dean Wampler
] http://en.wikipedia.org/wiki/Plane_%28Unicode%29#Basic_Multilingual_Plane [2] http://grokbase.com/t/hive/dev/131a4n562y/unicode-character-as-delimiter -- *Dean Wampler, Ph.D.* thinkbiganalytics.com +1-312-339-1330 -- *Dean Wampler, Ph.D.* thinkbiganalytics.com +1-312-339-1330

Re: doubt with LEFT OUTER JOIN

2013-03-01 Thread Dean Wampler
RIGHT SIDE table doesn't have at least one record that matches JOIN condition in Hive? Regards, Kumar -- *Dean Wampler, Ph.D.* thinkbiganalytics.com +1-312-339-1330

Re: Books and good starting point for Hive

2013-02-24 Thread Dean Wampler
Wow! You guys are my new best friends! Seriously, I'm grateful you've found my participation in the list and the book helpful. I'm sure Ed and Jason would agree (at least about the book ;) Yours, dean -- *Dean Wampler, Ph.D.* thinkbiganalytics.com +1-312-339-1330 On Sun, Feb 24, 2013 at 12

Re: Loading json files into hive table is giving NULL as output(data is in s3 bucket)

2013-02-18 Thread Dean Wampler
} {_u:test_name3,_ts:2012-01-13,_ip:IP3} When I query :- select uname from table_test; Output :- NULL 13Feb2012 NULL 13Feb2012 NULL 13Feb2012 Please help me and let me know how to add json data in a table. Thanks, Chunky. -- *Dean Wampler, Ph.D.* thinkbiganalytics.com +1-312-339-1330

Re: Loading json files into hive table is giving NULL as output(data is in s3 bucket)

2013-02-18 Thread Dean Wampler
as I mentioned in last mail. There are values not in double quotes, some are null and some keys are having multiple values. Dean, is this json file correct for HIVE to handle it ? Thanks, Chunky. On Mon, Feb 18, 2013 at 6:23 PM, Dean Wampler dean.wamp...@thinkbiganalytics.com wrote

Re: Generate Hive DDL

2013-02-15 Thread Dean Wampler
metastore would we just want 4 tables out of the 100 we already have in place. Thanks, murtaza -- *Dean Wampler, Ph.D.* thinkbiganalytics.com +1-312-339-1330

Re: CREATE EXTERNAL TABLE Fails on Some Directories

2013-02-15 Thread Dean Wampler
except: org.apache.hadoop.fs.FileAlreadyExistsException Parent Path is not a directory: /715 715... Like I mentioned it works for all of the other directories, except 715 and 716. Thoughts on troubleshooting path? Thanks Joey D'Antoni -- *Dean Wampler, Ph.D.* thinkbiganalytics.com +1-312

Re: CREATE EXTERNAL TABLE Fails on Some Directories

2013-02-15 Thread Dean Wampler
-- *From:* Dean Wampler dean.wamp...@thinkbiganalytics.com *To:* user@hive.apache.org; Joseph D Antoni jdant...@yahoo.com *Sent:* Friday, February 15, 2013 11:37 AM *Subject:* Re: CREATE EXTERNAL TABLE Fails on Some Directories You confirmed that 715 is an actual directory? It didn't become

Re: Install / Download of Hive 0.7.0 or 0.7.1

2013-02-15 Thread Dean Wampler
...@compositesw.comwrote: Hi, ** ** Where can I get a install /download of Hive 0.7.0 or 0.7.1? ** ** Thx… ** ** Regards, ** ** Vince George Composite Software Mobile: 201-519-3777 ** ** ** ** -- *Dean Wampler, Ph.D.* thinkbiganalytics.com +1-312-339-1330

Re:

2013-02-14 Thread Dean Wampler
at '192.168.0.151' turned 'flaky' 12 more lines.. Tried different function GenericUDFMax.. same error. Any pointers/advise, what could be wrong? -- *Dean Wampler, Ph.D.* thinkbiganalytics.com +1-312-339-1330

Re: Change timestamp format in hive

2013-02-13 Thread Dean Wampler
for this issue ? Thanks, Chunky. -- Alexander Alten-Lorenz http://mapredit.blogspot.com German Hadoop LinkedIn Group: http://goo.gl/N8pCF -- *Dean Wampler, Ph.D.* thinkbiganalytics.com +1-312-339-1330

Re: INSERT INTO table with STRUCT, SELECT FROM

2013-02-13 Thread Dean Wampler
[Error 10044]: Line 1:18 Cannot insert into target table because column number/types are different 'oc': Cannot convert column 0 from structcol1:int,col2:int to structa:int,b:int. -- *Dean Wampler, Ph.D.* thinkbiganalytics.com +1-312-339-1330

Re: How to load hive metadata from conf dir

2013-02-12 Thread Dean Wampler
/org.apache.hive/hive-metastore [2] http://hive.apache.org/docs/r0.7.1/api/org/apache/hadoop/hive/metastore/Hiv eMetaStoreClient.html [3] http://hive.apache.org/docs/r0.7.1/api/org/apache/hadoop/hive/conf/HiveConf .html Thanks in advance, Parag -- *Dean Wampler, Ph.D.* thinkbiganalytics.com +1

Re: Transfer Data to new location

2013-02-12 Thread Dean Wampler
solution for safely data transfer?? -- *Muhammad Hamza Asad * +923457261988 -- *Dean Wampler, Ph.D.* thinkbiganalytics.com +1-312-339-1330

Re: How to load hive metadata from conf dir

2013-02-12 Thread Dean Wampler
/docs/r0.7.0/api/org/apache/hadoop/hive/metastore/api/ThriftHiveMetastore.Client.html#get_table(java.lang.String , java.lang.String) Table t = t.getSd().getLocation() On Tue, Feb 12, 2013 at 9:41 AM, Dean Wampler dean.wamp...@thinkbiganalytics.com wrote: I'll mention another bash hack

Re: ALTER TABLE CHANGE COLUMN issue

2013-01-30 Thread Dean Wampler
, b:int, c:string. -- Nitin Pawar -- *Dean Wampler, Ph.D.* thinkbiganalytics.com +1-312-339-1330

Re: The dreaded Heap Space Issue on a Transform

2013-01-30 Thread Dean Wampler
org.apache.hadoop.hive.ql.exec.ScriptOperator: Script failed with code 1 -- *Dean Wampler, Ph.D.* thinkbiganalytics.com +1-312-339-1330

Re: A few JIRAs closed in v0.10.0 that don't actually appear to be working

2013-01-29 Thread Dean Wampler
Thanks! On Tue, Jan 29, 2013 at 5:34 AM, Navis류승우 navis@nexr.com wrote: HIVE-446 - Implement TRUNCATE : is on trunk (v0.11.0) HIVE-887 - Allow SELECT col without a mapreduce job : It needs set hive.fetch.task.conversion=more 2013/1/29 Dean Wampler dean.wamp...@thinkbiganalytics.com

Re: Automating the partition creation process

2013-01-29 Thread Dean Wampler
-- *Dean Wampler, Ph.D.* thinkbiganalytics.com +1-312-339-1330

Re: Automating the partition creation process

2013-01-29 Thread Dean Wampler
be appreciated. Regards, Sadu -- *Dean Wampler, Ph.D.* thinkbiganalytics.com +1-312-339-1330

A few JIRAs closed in v0.10.0 that don't actually appear to be working

2013-01-28 Thread Dean Wampler
misread the JIRA items? dean -- *Dean Wampler, Ph.D.* thinkbiganalytics.com +1-312-339-1330

Re: A few JIRAs closed in v0.10.0 that don't actually appear to be working

2013-01-28 Thread Dean Wampler
Oh, another one is https://issues.apache.org/jira/browse/HIVE-446 - Implement TRUNCATE. The CLI doesn't recognize it. dean On Mon, Jan 28, 2013 at 11:44 AM, Dean Wampler dean.wamp...@thinkbiganalytics.com wrote: I've noticed a few JIRA items for new features that are supposed to work

Re: Whatever happened to the MACRO facility, Hive-2655

2013-01-26 Thread Dean Wampler
Wampler dean.wamp...@thinkbiganalytics.com wrote: We mentioned it in our book and now I realize it's not actually implemented, even in 0.10.0. OOPS!! https://issues.apache.org/jira/browse/HIVE-2655 dean -- *Dean Wampler, Ph.D.* thinkbiganalytics.com +1-312-339-1330 -- *Dean Wampler

Re: Real-life experience of forcing smaller input splits?

2013-01-25 Thread Dean Wampler
... That will be my approach for now, or disabling compression altogether for these files. The only problem I have is that compression is so efficient that any operation in the mapper (so on the uncompressed data) just makes the mapper throw an OOM exception, no matter how much memory I give

Re: querying objects and list fields

2013-01-25 Thread Dean Wampler
; as if it were select messageId,lastmodifiedDate.contexts.contextId but also still respond correctly to select messageId. lastmodifiedDate.contexts.conceptId ? Thanks for the help, Lauren -- *Dean Wampler, Ph.D.* thinkbiganalytics.com +1-312-339-1330

Re: Loading a Hive table simultaneously from 2 different sources

2013-01-24 Thread Dean Wampler
to handle this ? Please give your insights. Regards, Krishnan -- *Dean Wampler, Ph.D.* thinkbiganalytics.com +1-312-339-1330

Re: serde jar causing problems in loading other jars.

2013-01-23 Thread Dean Wampler
. Any idea what might be wrong? /Ehsan -- *Dean Wampler, Ph.D.* thinkbiganalytics.com +1-312-339-1330

Re: Problem with using Postgres as hive meta store DB.

2013-01-23 Thread Dean Wampler
in postgresql.conf to avoid this. -- *Muhammad Ehsan ul Haque* Klarna AB Norra Stationsgatan 61 SE-113 43 Stockholm Tel: +46 (0)8- 120 120 00 Fax: +46 (0)8- 120 120 99 Web: www.klarna.com -- *Dean Wampler, Ph.D.* thinkbiganalytics.com +1-312-339-1330

Re: serde jar causing problems in loading other jars.

2013-01-23 Thread Dean Wampler
, 2013 at 2:57 PM, Dean Wampler dean.wamp...@thinkbiganalytics.com wrote: Is there anything in the logs about problems loading the jar, etc.? The jar files in $HVE_HOME are added to the CLASSPATH in alphabetical order. As an experiment, rename your jar with a name that will go last, something

Re: Missing tables!

2013-01-22 Thread Dean Wampler
operations on hive suddenly i lost all tables. But all tables are showing in warehouse. It seems some configuration error but exact solution is yet to know. Any idea? Regards, Ashish -- *Dean Wampler, Ph.D.* thinkbiganalytics.com +1-312-339-1330

Re: Execution of udf

2013-01-18 Thread Dean Wampler
, Nagarjuna -- Sent from iPhone -- *Dean Wampler, Ph.D.* thinkbiganalytics.com +1-312-339-1330

Re: Execution of udf

2013-01-18 Thread Dean Wampler
be there On Friday, January 18, 2013, Dean Wampler wrote: There is no reduce phase needed in this query. On Fri, Jan 18, 2013 at 6:59 AM, nagarjuna kanamarlapudi nagarjuna.kanamarlap...@gmail.com wrote: Hi, Select col1,myudf(col2,col3) from table1; In what phase if map reduce an udf

Re: HWI use on AWS/EMR

2013-01-18 Thread Dean Wampler
and SPAM -- *Dean Wampler, Ph.D.* thinkbiganalytics.com +1-312-339-1330

Re: HWI use on AWS/EMR

2013-01-18 Thread Dean Wampler
, Suite 2800, Chicago, IL 60601 Cell: 314-827-4356 ** ** On Fri, Jan 18, 2013 at 11:09 AM, Dean Wampler dean.wamp...@thinkbiganalytics.com wrote: Oops, I overlooked that you have the public domain name in your message. Can you surf to http://ec2-54-247-61-206.eu-west-1

Re: Interaction between Java and Transform Scripts on Hive

2013-01-16 Thread Dean Wampler
java around? I am curious on what steps I can take to trouble shoot or eliminate this problem. -- *Dean Wampler, Ph.D.* thinkbiganalytics.com +1-312-339-1330

Re: Re: create a hive table: always a tab space before each line

2013-01-14 Thread Dean Wampler
. If this output is continously used by hive, it is fine. The problem is that I may use a self-define map-reduce job to read these files. Does that mean I have to take care of this \t by myself? is there any option that I can disable this \t in hive? At 2013-01-09 22:38:11,Dean Wampler dean.wamp

Re: Stack function in Hive : how to specify multiple aliases?

2013-01-10 Thread Dean Wampler
multiple aliases ? Thanks, Mathieu -- Nitin Pawar -- Nitin Pawar -- *Dean Wampler, Ph.D.* thinkbiganalytics.com +1-312-339-1330

Re: Best practice for automating jobs

2013-01-10 Thread Dean Wampler
partitions? Thanks in advance! --Tom -- *Dean Wampler, Ph.D.* thinkbiganalytics.com +1-312-339-1330

Re: Map Reduce Local Task

2013-01-08 Thread Dean Wampler
. Good wishes,always ! Santosh -- *Dean Wampler, Ph.D.* thinkbiganalytics.com +1-312-339-1330

Re: hive regular expression

2012-12-26 Thread Dean Wampler
=321grgrdgolfll3onsl' or '?MovieTitle=949303sjkskldsososodn' how to extract 'MovieTitle=321grgrdg' or 'MovieTitle=949303sjkskld' using Hive reg expression functions thanks and happy holidays -- *Dean Wampler, Ph.D.* thinkbiganalytics.com +1-312-339-1330

Re: Reflect MySQL updates into Hive

2012-12-24 Thread Dean Wampler
way to reflect MySQL updates into Hive with minimal resources? 2. Is sqoop the right tool to do the ETL? 3. Is Hive the right tool to do this kind of queries or we should search for alternatives? Any hint will be useful, thanks in advanced. -- Ibrahim -- *Dean Wampler, Ph.D

Re: Reflect MySQL updates into Hive

2012-12-24 Thread Dean Wampler
/Day) --- Load in Hive as External Table ---(Transform Data Join Tables)-- Save it in Hive tables for reporting. Correct? Appreciated. -- Ibrahim On Mon, Dec 24, 2012 at 5:51 PM, Dean Wampler dean.wamp...@thinkbiganalytics.com wrote: This is not as hard as it sounds. The hardest

Re: Creating external table poiting to s3 folder with files not loading data

2012-12-17 Thread Dean Wampler
into a non-partitioned table and then copy the information using hive and dynamic partitioning the data in the same query... is that right? Thanks again! On Fri, Dec 14, 2012 at 1:22 PM, Dean Wampler dean.wamp...@thinkbiganalytics.com wrote: A couple of clarifying questions and suggestions

Re: Running commands at hive cli or hive thirft startup

2012-12-14 Thread Dean Wampler
server connection? If not, could we build it in that we can add UDFs to hive without doing a recompile that stay in permanently? I would welcome discussion on this! -- *Dean Wampler, Ph.D.* thinkbiganalytics.com +1-312-339-1330

Re: Creating external table poiting to s3 folder with files not loading data

2012-12-14 Thread Dean Wampler
a custom SerDe that I did not write...but I do have the code for it. I'm quite new to hive, so I appreciate any kind of pointers you can throw at me. Thanks! Fernando Doglio -- *Dean Wampler, Ph.D.* thinkbiganalytics.com +1-312-339-1330

Re: Storing data in TSV with changing headers

2012-11-30 Thread Dean Wampler
for different days/hours have different headers. Is it possible to do this with Hive? Thanks! -- *Dean Wampler, Ph.D.* thinkbiganalytics.com +1-312-339-1330

Re: Alter table is giving error

2012-11-27 Thread Dean Wampler
way for different file systems. Edward On Mon, Nov 5, 2012 at 9:33 AM, Dean Wampler dean.wamp...@thinkbiganalytics.com wrote: Writing a script to add the external partitions individually is the only way I know of. Sent from my rotary phone. On Nov 5, 2012, at 8:19 AM, Chunky Gupta

Re: Effecient partitions usage in join

2012-11-23 Thread Dean Wampler
/*+ MAPJOIN*/ it ran faster sparing reduce operation, but still read the whole A table. Is there a more efficient way to perform the query w/o reading the whole A content? Thanks Dima -- *Dean Wampler, Ph.D.* thinkbiganalytics.com +1-312-339-1330

Re: Multiuser setup on Hive

2012-11-22 Thread Dean Wampler
-- *Dean Wampler, Ph.D.* thinkbiganalytics.com +1-312-339-1330

Re: Hive JOINs not working as expected (returns 0 rows)

2012-11-20 Thread Dean Wampler
DELIMITED FIELDS TERMINATED BY '~' LINES TERMINATED BY '\n' STORED AS TEXTFILE LOCATION '/Work/Hive/tables/order_1' ; ** ** ** ** ** ** -- *Dean Wampler, Ph.D.* thinkbiganalytics.com +1-312-339-1330

Re:

2012-11-14 Thread Dean Wampler
a solution to run hive under cygwin. Although hadoop works very well, the command hive starts to turn as infinite Thank you in advance for your answers -- Russell Jurney twitter.com/rjurney russell.jur...@gmail.com datasyndrome. com -- *Dean Wampler, Ph.D.* thinkbiganalytics.com +1-312

Run Shark on Windows?

2012-11-14 Thread Dean Wampler
the bullet and install a virtual machine image of Hadoop on Linux running in VMWare or VirtualBox, anyway. dean -- Dean Wampler, Ph.D. thinkbiganalytics.com +1-312-339-1330

Re: Alter table is giving error

2012-11-05 Thread Dean Wampler
for this problem on internet. Please help me. Thanks, Chunky. -- *Dean Wampler, Ph.D.* thinkbiganalytics.com +1-312-339-1330

Re: Alter table is giving error

2012-11-05 Thread Dean Wampler
partitioning all dates, so I have to write script for that to do so for all dates. Is there any easiest way other than this ? Thanks, Chunky On Mon, Nov 5, 2012 at 6:28 PM, Dean Wampler dean.wamp...@thinkbiganalytics.com wrote: The RECOVER PARTITIONS is an enhancement added

Re: Creating Indexes

2012-11-02 Thread Dean Wampler
get exactly the same problem as before. Is there some particular place that I should put the derby.jar to make this problem go away? Is there anything else that I can try? ** ** Peter Marron ** ** *From:* Dean Wampler [mailto:dean.wamp...@thinkbiganalytics.com] *Sent

Re: Creating Indexes

2012-11-02 Thread Dean Wampler
/Reduce errors? ** ** Regards, ** ** Peter Marron ** ** *From:* Dean Wampler [mailto:dean.wamp...@thinkbiganalytics.com] *Sent:* 02 November 2012 14:03 *To:* user@hive.apache.org *Subject:* Re: Creating Indexes ** ** Oh, I

Re: Creating Indexes

2012-11-02 Thread Dean Wampler
, but it appears to be an immature feature, unfortunately. dean On Fri, Nov 2, 2012 at 7:02 PM, Dean Wampler dean.wamp...@thinkbiganalytics.com wrote: Wow. Lots of quirks. I'm glad the ADD JAR On Fri, Nov 2, 2012 at 6:59 PM, Peter Marron peter.mar...@trilliumsoftware.com wrote: Hi Dean

Re: Creating Indexes

2012-11-01 Thread Dean Wampler
. Now I’ve been told that I can create and use indexes with the current release of Hive _*without*_ writing and developing any Java code of my own. Is this true? If so, how? Any help appreciated. Peter Marron. ** ** -- *Dean

  1   2   >