Re: Re: hive 0.14 on some platform return some not NULL value as NULL

2015-04-02 Thread r7raul1...@163.com
DDL is CREATE TABLE dw.fct_traffic_navpage_path_detl( date_id string, chanl_id bigint, sessn_id string, gu_id string, prov_id string, city_id string, landing_page_type_id string, landing_track_time string, landing_url string, nav_refer_tracker_id string, nav_refer_page_type_id string,

[jira] [Created] (HIVE-10191) ORC: Cleanup writer per-row synchronization

2015-04-02 Thread Gopal V (JIRA)
Gopal V created HIVE-10191: -- Summary: ORC: Cleanup writer per-row synchronization Key: HIVE-10191 URL: https://issues.apache.org/jira/browse/HIVE-10191 Project: Hive Issue Type: Bug

Dataset for Hive

2015-04-02 Thread xiaohe lan
Hi All, I am new to Hive. I just set up a 5-node Hadoop environment and want to try HiveQL. Is there any dataset I can download to play with HiveQL? The dataset should have several tables so I can write some complex joins. About 100 GB should be fine. Thanks, Xiaohe

Re: Re: hive 0.14 on some platform return some not NULL value as NULL

2015-04-02 Thread Thejas Nair
I am unable to reproduce this issue using the sample data. For this query, using 1.1.0, I get the following result: 87FQEZT1UEDXJHJQPFFX7G7ET8S2DVPM2357378283356 9150119100048 7326356 2015-01-19 10:44:442015-01-19 On Thu, Apr 2, 2015 at 12:36 AM, r7raul1...@163.com

Re: Re: hive 0.14 on some platform return some not NULL value as NULL

2015-04-02 Thread r7raul1...@163.com
I downloaded the full data from HDFS and loaded it into my table. In my test environment, everything is OK. My production environment is hadoop 2.3.0-cdh 5.0.2, REDHAT 5.8, java version 1.6.0_35. r7raul1...@163.com From: r7raul1...@163.com Date: 2015-04-02 16:57 To: dev Subject: Re: Re: hive 0.14 on

Re: hive 0.14 on some platform return some not NULL value as NULL

2015-04-02 Thread Thejas Nair
Can you give more details: the query you are running, the schema of the table, the serialization format of the table, and sample records if possible? On Wed, Apr 1, 2015 at 6:32 PM, r7raul1...@163.com r7raul1...@163.com wrote: I use hive 1.1.0 cli on computer A (linux) the result is

Re: Is it necessary to update beelinepositive q.out files?

2015-04-02 Thread Thejas Nair
Beeline tests have been disabled for a while and I believe the q.out files are already outdated. You don't have to update them. On Wed, Apr 1, 2015 at 12:39 PM, Alexander Pivovarov apivova...@gmail.com wrote: Hello Everyone I'm working on fixing groupby3_map.q query

Re: Re: hive 0.14 on some platform return some not NULL value as NULL

2015-04-02 Thread r7raul1...@163.com
In my test environment, using Hive 0.14 and Hive 1.1.0, the result is OK. But in the production environment, the result is not correct. r7raul1...@163.com From: Thejas Nair Date: 2015-04-02 16:41 To: r7raul1...@163.com CC: dev Subject: Re: Re: hive 0.14 on some platform return some not NULL value

Re: Re: hive 0.14 on some platform return some not NULL value as NULL

2015-04-02 Thread r7raul1...@163.com
Sorry, I checked: my production JDK is java version 1.7.0_45, not java version 1.6.0_35. r7raul1...@163.com From: r7raul1...@163.com Date: 2015-04-02 17:01 To: dev Subject: Re: Re: hive 0.14 on some platform return some not NULL value as NULL I download full data from hdfs. Then load data

Re: Vectorized group-by on strings is super slow in hive 0.13

2015-04-02 Thread Lefty Leverenz
Thanks for the tip, Gopal. I documented hive.limit.pushdown.memory.usage https://cwiki.apache.org/confluence/display/Hive/Configuration+Properties#ConfigurationProperties-hive.limit.pushdown.memory.usage in the Configuration Properties wiki but had a couple of questions about the description (see

Re: Dataset for Hive

2015-04-02 Thread Chao Sun
Hi Xiaohe, You can try TPC-DS from https://github.com/hortonworks/hive-testbench. It contains a large number of queries with complex joins. Chao On Wed, Apr 1, 2015 at 9:30 PM, xiaohe lan zombiexco...@gmail.com wrote: Hi All, I am new to Hive. Just set up a 5 node Hadoop environment and want

[jira] [Created] (HIVE-10199) Add lockId to all ACID log statements

2015-04-02 Thread Eugene Koifman (JIRA)
Eugene Koifman created HIVE-10199: - Summary: Add lockId to all ACID log statements Key: HIVE-10199 URL: https://issues.apache.org/jira/browse/HIVE-10199 Project: Hive Issue Type: Bug

[jira] [Created] (HIVE-10203) Move fragment execution onto a thread pool

2015-04-02 Thread Gunther Hagleitner (JIRA)
Gunther Hagleitner created HIVE-10203: - Summary: Move fragment execution onto a thread pool Key: HIVE-10203 URL: https://issues.apache.org/jira/browse/HIVE-10203 Project: Hive Issue

Re: Review Request 32406: Add another level of explain for RDBMS audience

2015-04-02 Thread pengcheng xiong
--- This is an automatically generated e-mail. To reply, visit: https://reviews.apache.org/r/32406/ --- (Updated April 2, 2015, 8:07 p.m.) Review request for hive, Ashutosh Chauhan

[jira] [Created] (HIVE-10201) Hive LLAP needs refactoring of the configuration class

2015-04-02 Thread Vikram Dixit K (JIRA)
Vikram Dixit K created HIVE-10201: - Summary: Hive LLAP needs refactoring of the configuration class Key: HIVE-10201 URL: https://issues.apache.org/jira/browse/HIVE-10201 Project: Hive Issue

[jira] [Created] (HIVE-10200) Add lockId to all ACID log statements

2015-04-02 Thread Eugene Koifman (JIRA)
Eugene Koifman created HIVE-10200: - Summary: Add lockId to all ACID log statements Key: HIVE-10200 URL: https://issues.apache.org/jira/browse/HIVE-10200 Project: Hive Issue Type: Bug

[jira] [Created] (HIVE-10204) vectorMode flags in BaseWork and AbstractOperatorDesc are duplicated

2015-04-02 Thread Sergey Shelukhin (JIRA)
Sergey Shelukhin created HIVE-10204: --- Summary: vectorMode flags in BaseWork and AbstractOperatorDesc are duplicated Key: HIVE-10204 URL: https://issues.apache.org/jira/browse/HIVE-10204 Project:

[jira] [Created] (HIVE-10207) Fix test failure in TestAggregateStatsCache

2015-04-02 Thread Vaibhav Gumashta (JIRA)
Vaibhav Gumashta created HIVE-10207: --- Summary: Fix test failure in TestAggregateStatsCache Key: HIVE-10207 URL: https://issues.apache.org/jira/browse/HIVE-10207 Project: Hive Issue Type:

[jira] [Created] (HIVE-10205) A few lines accidentally removed from HBaseReadWrite

2015-04-02 Thread Alan Gates (JIRA)
Alan Gates created HIVE-10205: - Summary: A few lines accidentally removed from HBaseReadWrite Key: HIVE-10205 URL: https://issues.apache.org/jira/browse/HIVE-10205 Project: Hive Issue Type: Bug

[jira] [Created] (HIVE-10208) templeton.hive.extra.files should be commented out in webhcat-default.xml

2015-04-02 Thread Eugene Koifman (JIRA)
Eugene Koifman created HIVE-10208: - Summary: templeton.hive.extra.files should be commented out in webhcat-default.xml Key: HIVE-10208 URL: https://issues.apache.org/jira/browse/HIVE-10208 Project:

[jira] [Created] (HIVE-10206) Cannot modify Avro table properties if existing properties are invalid

2015-04-02 Thread Szehon Ho (JIRA)
Szehon Ho created HIVE-10206: Summary: Cannot modify Avro table properties if existing properties are invalid Key: HIVE-10206 URL: https://issues.apache.org/jira/browse/HIVE-10206 Project: Hive

RE: Request for feedback on work intent for non-equijoin support

2015-04-02 Thread Andres.Quiroz
This is a great pointer, Szehon and Brock, thank you. I will catch up with the material on theta joins and circle back. Andrés -Original Message- From: Brock Noland [mailto:br...@apache.org] Sent: Thursday, April 02, 2015 1:31 AM To: dev@hive.apache.org Subject: Re: Request for

Re: ORC separate project

2015-04-02 Thread Edward Capriolo
To reiterate, one thing I want to avoid is having Hive rely on code that sits in several tiny silos across Apache projects, or in Apache-licensed but non-ASF projects. Hive is a mature TLP with a large number of committers, and it would not be a good situation if work often gets bottlenecked because

Review Request 32807: HIVE-2327 Optimize REGEX UDFs with constant parameter information

2015-04-02 Thread Alexander Pivovarov
--- This is an automatically generated e-mail. To reply, visit: https://reviews.apache.org/r/32807/ --- Review request for hive and Jason Dere. Bugs: HIVE-2327

[jira] [Created] (HIVE-10196) LLAP: start fetching earlier

2015-04-02 Thread Sergey Shelukhin (JIRA)
Sergey Shelukhin created HIVE-10196: --- Summary: LLAP: start fetching earlier Key: HIVE-10196 URL: https://issues.apache.org/jira/browse/HIVE-10196 Project: Hive Issue Type: Sub-task

Review Request 32775: HIVE-9917 After HIVE-3454 is done, make int to timestamp conversion configurable

2015-04-02 Thread Aihua Xu
--- This is an automatically generated e-mail. To reply, visit: https://reviews.apache.org/r/32775/ --- Review request for hive. Repository: hive-git Description --- HIVE-9917

[jira] [Created] (HIVE-10192) insert into table failed for partitioned table.

2015-04-02 Thread Ganesh Sathish (JIRA)
Ganesh Sathish created HIVE-10192: - Summary: insert into table failed for partitioned table. Key: HIVE-10192 URL: https://issues.apache.org/jira/browse/HIVE-10192 Project: Hive Issue Type:

[jira] [Created] (HIVE-10193) CBO (Calcite Return Path): Q94 generates cross product

2015-04-02 Thread Mostafa Mokhtar (JIRA)
Mostafa Mokhtar created HIVE-10193: -- Summary: CBO (Calcite Return Path): Q94 generates cross product Key: HIVE-10193 URL: https://issues.apache.org/jira/browse/HIVE-10193 Project: Hive

[jira] [Created] (HIVE-10194) CBO (Calcite Return Path): Q94 generates cross product

2015-04-02 Thread Mostafa Mokhtar (JIRA)
Mostafa Mokhtar created HIVE-10194: -- Summary: CBO (Calcite Return Path): Q94 generates cross product Key: HIVE-10194 URL: https://issues.apache.org/jira/browse/HIVE-10194 Project: Hive

[jira] [Created] (HIVE-10195) CBO (Calcite Return Path): More details for Join operator in CBO cost based algorithm selection [CBO branch]

2015-04-02 Thread Jesus Camacho Rodriguez (JIRA)
Jesus Camacho Rodriguez created HIVE-10195: -- Summary: CBO (Calcite Return Path): More details for Join operator in CBO cost based algorithm selection [CBO branch] Key: HIVE-10195 URL:

[jira] [Created] (HIVE-10197) Add lockId to all ACID log statements

2015-04-02 Thread Eugene Koifman (JIRA)
Eugene Koifman created HIVE-10197: - Summary: Add lockId to all ACID log statements Key: HIVE-10197 URL: https://issues.apache.org/jira/browse/HIVE-10197 Project: Hive Issue Type: Bug

[jira] [Created] (HIVE-10198) Add lockId to all ACID log statements

2015-04-02 Thread Eugene Koifman (JIRA)
Eugene Koifman created HIVE-10198: - Summary: Add lockId to all ACID log statements Key: HIVE-10198 URL: https://issues.apache.org/jira/browse/HIVE-10198 Project: Hive Issue Type: Bug

[jira] [Created] (HIVE-10202) Beeline outputs prompt+query on standard output when used in non-interactive mode

2015-04-02 Thread Sergio Peña (JIRA)
Sergio Peña created HIVE-10202: -- Summary: Beeline outputs prompt+query on standard output when used in non-interactive mode Key: HIVE-10202 URL: https://issues.apache.org/jira/browse/HIVE-10202 Project: