[jira] Commented: (HIVE-629) concat files needed for map-reduce jobs also

2009-07-15 Thread Zheng Shao (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-629?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12731297#action_12731297 ] Zheng Shao commented on HIVE-629: - Three more questions: 1. How do we determine the number

[jira] Created: (HIVE-639) Create a contrib directory for people to contribute file formats, serdes, and UDFs

2009-07-15 Thread Zheng Shao (JIRA)
Create a contrib directory for people to contribute file formats, serdes, and UDFs -- Key: HIVE-639 URL: https://issues.apache.org/jira/browse/HIVE-639 Project: Hadoop

[jira] Commented: (HIVE-592) renaming internal table should rename HDFS and also change path of the table and partitions accordingly.

2009-07-15 Thread Zheng Shao (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-592?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12731311#action_12731311 ] Zheng Shao commented on HIVE-592: - Can you remove MIndex.java? I think it belongs to a

[jira] Commented: (HIVE-634) ctrl-A is the only output delimiter used, regardless of the Hive table structure

2009-07-15 Thread Zheng Shao (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-634?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12731319#action_12731319 ] Zheng Shao commented on HIVE-634: - The format of the local file has nothing to do with the

[jira] Commented: (HIVE-639) Create a contrib directory for people to contribute file formats, serdes, and UDFs

2009-07-15 Thread Zheng Shao (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-639?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12731329#action_12731329 ] Zheng Shao commented on HIVE-639: - There are 6 sub categories: fileformat, serde, udf,

Build failed in Hudson: Hive-trunk-h0.17 #155

2009-07-15 Thread Apache Hudson Server
See http://hudson.zones.apache.org/hudson/job/Hive-trunk-h0.17/155/changes Changes: [zshao] Removing empty files for HIVE-522. Extend GenericUDAF to support complex types. [namit] HIVE-160. Bug in sampling for subqueries. (Raghu Murthy via namit) [namit] HIVE-522. Extend GenericUDAF to

Error on Load into multiple Partitions

2009-07-15 Thread Matt Pestritto
Hi All. Are there are existing test cases that load into multiple partitions using a single from query? This query worked in an older revision but the mappers fails when I run on trunk: java.lang.RuntimeException: Map operator initialization failed at

Build failed in Hudson: Hive-trunk-h0.18 #157

2009-07-15 Thread Apache Hudson Server
See http://hudson.zones.apache.org/hudson/job/Hive-trunk-h0.18/157/changes Changes: [zshao] Removing empty files for HIVE-522. Extend GenericUDAF to support complex types. [namit] HIVE-160. Bug in sampling for subqueries. (Raghu Murthy via namit) [namit] HIVE-522. Extend GenericUDAF to

Build failed in Hudson: Hive-trunk-h0.19 #156

2009-07-15 Thread Apache Hudson Server
See http://hudson.zones.apache.org/hudson/job/Hive-trunk-h0.19/156/changes Changes: [zshao] Removing empty files for HIVE-522. Extend GenericUDAF to support complex types. [namit] HIVE-160. Bug in sampling for subqueries. (Raghu Murthy via namit) [namit] HIVE-522. Extend GenericUDAF to

[jira] Updated: (HIVE-578) Refactor partition pruning code as an optimizer transformation

2009-07-15 Thread Raghotham Murthy (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-578?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raghotham Murthy updated HIVE-578: -- Summary: Refactor partition pruning code as an optimizer transformation (was: Refator partition

[jira] Updated: (HIVE-405) Cleanup operator initialization

2009-07-15 Thread Prasad Chakka (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-405?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Prasad Chakka updated HIVE-405: --- Summary: Cleanup operator initialization (was: Operators should pass ObjectInspector in init instead

[jira] Commented: (HIVE-405) Cleanup operator initialization

2009-07-15 Thread Prasad Chakka (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-405?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12731563#action_12731563 ] Prasad Chakka commented on HIVE-405: currently the operator graph is walked multiple

[jira] Commented: (HIVE-578) Refactor partition pruning code as an optimizer transformation

2009-07-15 Thread Ashish Thusoo (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-578?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12731564#action_12731564 ] Ashish Thusoo commented on HIVE-578: yes that makes sense. In fact that needs to be done

[jira] Commented: (HIVE-634) ctrl-A is the only output delimiter used, regardless of the Hive table structure

2009-07-15 Thread Ashish Thusoo (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-634?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12731565#action_12731565 ] Ashish Thusoo commented on HIVE-634: +1 to the proposal. ctrl-A is the only output

[jira] Commented: (HIVE-599) Embedded Hive SQL into Python

2009-07-15 Thread Ashish Thusoo (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-599?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12731568#action_12731568 ] Ashish Thusoo commented on HIVE-599: Doesn't the jdbc api solve your use case (except the

[jira] Updated: (HIVE-541) Implement UDFs: INSTR and LOCATE

2009-07-15 Thread Ashish Thusoo (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-541?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ashish Thusoo updated HIVE-541: --- Assignee: Min Zhou (was: Zheng Shao) Assigning to Min as he has submitted the patch. Implement

[jira] Commented: (HIVE-541) Implement UDFs: INSTR and LOCATE

2009-07-15 Thread Yuntao Jia (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-541?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12731582#action_12731582 ] Yuntao Jia commented on HIVE-541: - The patch now uses String.indexOf(String) to find the

[jira] Commented: (HIVE-629) concat files needed for map-reduce jobs also

2009-07-15 Thread Namit Jain (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-629?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12731589#action_12731589 ] Namit Jain commented on HIVE-629: - 1. The number of reducers are determined from

[jira] Commented: (HIVE-629) concat files needed for map-reduce jobs also

2009-07-15 Thread Namit Jain (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-629?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12731591#action_12731591 ] Namit Jain commented on HIVE-629: - I will change the name of the parameter and regenerate the

[jira] Created: (HIVE-640) Add LazyBinarySerDe to Hive

2009-07-15 Thread Zheng Shao (JIRA)
Add LazyBinarySerDe to Hive --- Key: HIVE-640 URL: https://issues.apache.org/jira/browse/HIVE-640 Project: Hadoop Hive Issue Type: New Feature Reporter: Zheng Shao Assignee: Yuntao Jia

[jira] Commented: (HIVE-599) Embedded Hive SQL into Python

2009-07-15 Thread Edward Capriolo (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-599?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12731625#action_12731625 ] Edward Capriolo commented on HIVE-599: -- I am not saying that python bindings are a bad

[jira] Commented: (HIVE-635) UnionOperator fails when different inputs have different ObjectInspector (but the same TypeInfo)

2009-07-15 Thread Namit Jain (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-635?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12731628#action_12731628 ] Namit Jain commented on HIVE-635: - Do you want the common case where all input object

[jira] Updated: (HIVE-629) concat files needed for map-reduce jobs also

2009-07-15 Thread Namit Jain (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-629?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Namit Jain updated HIVE-629: Attachment: hive.629.2.patch concat files needed for map-reduce jobs also

[jira] Updated: (HIVE-636) concatenation task does not work if the destination table is partitioned

2009-07-15 Thread Namit Jain (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-636?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Namit Jain updated HIVE-636: Attachment: hive.636.3.patch concatenation task does not work if the destination table is partitioned

Hive unit testing

2009-07-15 Thread Edward Capriolo
Cloudera recently released mrunit which is a unit testing framework for hadoop. http://www.cloudera.com/blog/2009/07/03/debugging-mapreduce-programs-with-mrunit/ I see the upside of being able to specify inputs and output in the source code. Does this approach make sense for hive? Hive does have

[jira] Commented: (HIVE-454) Support escaping of ; in strings in cli

2009-07-15 Thread Edward Capriolo (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-454?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12731643#action_12731643 ] Edward Capriolo commented on HIVE-454: -- Ashish, I actually am working on a UDF where

[jira] Created: (HIVE-641) udf_reverse.q.out has broken UDF characters

2009-07-15 Thread Emil Ibrishimov (JIRA)
udf_reverse.q.out has broken UDF characters --- Key: HIVE-641 URL: https://issues.apache.org/jira/browse/HIVE-641 Project: Hadoop Hive Issue Type: Bug Reporter: Emil Ibrishimov

[jira] Updated: (HIVE-641) udf_reverse.q.out has broken UDF characters

2009-07-15 Thread Emil Ibrishimov (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-641?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Emil Ibrishimov updated HIVE-641: - Fix Version/s: 0.4.0 Status: Patch Available (was: Open) udf_reverse.q.out has broken

[jira] Updated: (HIVE-641) udf_reverse.q.out has broken UDF characters

2009-07-15 Thread Emil Ibrishimov (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-641?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Emil Ibrishimov updated HIVE-641: - Attachment: HIVE-641.1.patch Removed the EXPLAIN query that causes the problem.

[jira] Commented: (HIVE-578) Refactor partition pruning code as an optimizer transformation

2009-07-15 Thread Ashish Thusoo (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-578?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12731696#action_12731696 ] Ashish Thusoo commented on HIVE-578: Actually semantics with table sampling is a bit

[jira] Commented: (HIVE-636) concatenation task does not work if the destination table is partitioned

2009-07-15 Thread Zheng Shao (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-636?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12731703#action_12731703 ] Zheng Shao commented on HIVE-636: - Testing. Will commit when tests succeed. concatenation

[jira] Updated: (HIVE-636) concatenation task does not work if the destination table is partitioned

2009-07-15 Thread Zheng Shao (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-636?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Zheng Shao updated HIVE-636: Resolution: Fixed Fix Version/s: 0.4.0 Release Note: HIVE-636. Fix concatenation task with

[jira] Updated: (HIVE-635) UnionOperator fails when different inputs have different ObjectInspector (but the same TypeInfo)

2009-07-15 Thread Zheng Shao (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-635?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Zheng Shao updated HIVE-635: Attachment: HIVE-635.3.patch UnionOperator fails when different inputs have different ObjectInspector (but

[jira] Commented: (HIVE-635) UnionOperator fails when different inputs have different ObjectInspector (but the same TypeInfo)

2009-07-15 Thread Prasad Chakka (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-635?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12731749#action_12731749 ] Prasad Chakka commented on HIVE-635: zheng, can you add this context as a comment to

[jira] Commented: (HIVE-635) UnionOperator fails when different inputs have different ObjectInspector (but the same TypeInfo)

2009-07-15 Thread Namit Jain (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-635?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12731754#action_12731754 ] Namit Jain commented on HIVE-635: - +1 looks good - will merge after the comments requested by

[jira] Updated: (HIVE-629) concat files needed for map-reduce jobs also

2009-07-15 Thread Zheng Shao (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-629?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Zheng Shao updated HIVE-629: Resolution: Fixed Fix Version/s: 0.4.0 Release Note: HIVE-629. Concat files for map-reduce jobs.

[jira] Created: (HIVE-642) udf equivalent to string split

2009-07-15 Thread Namit Jain (JIRA)
udf equivalent to string split -- Key: HIVE-642 URL: https://issues.apache.org/jira/browse/HIVE-642 Project: Hadoop Hive Issue Type: New Feature Components: Query Processor Reporter: Namit

[jira] Created: (HIVE-643) Hive to recognized escaped strings

2009-07-15 Thread Zheng Shao (JIRA)
Hive to recognized escaped strings -- Key: HIVE-643 URL: https://issues.apache.org/jira/browse/HIVE-643 Project: Hadoop Hive Issue Type: Bug Reporter: Zheng Shao Priority: Minor The

[jira] Updated: (HIVE-644) change default size for merging files at the end of the job

2009-07-15 Thread Namit Jain (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-644?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Namit Jain updated HIVE-644: Status: Patch Available (was: Open) change default size for merging files at the end of the job

[jira] Commented: (HIVE-541) Implement UDFs: INSTR and LOCATE

2009-07-15 Thread Min Zhou (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-541?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12731764#action_12731764 ] Min Zhou commented on HIVE-541: --- hmm, It's may be a good way. I will try it soon. Implement

[jira] Updated: (HIVE-635) UnionOperator fails when different inputs have different ObjectInspector (but the same TypeInfo)

2009-07-15 Thread Namit Jain (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-635?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Namit Jain updated HIVE-635: Resolution: Fixed Hadoop Flags: [Reviewed] Status: Resolved (was: Patch Available)

[jira] Updated: (HIVE-454) Support escaping of ; in strings in cli

2009-07-15 Thread Edward Capriolo (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-454?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Edward Capriolo updated HIVE-454: - Attachment: hive-454.patch Updated CLIDriver and added test case Support escaping of ; in

[jira] Updated: (HIVE-541) Implement UDFs: INSTR and LOCATE

2009-07-15 Thread Min Zhou (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-541?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Min Zhou updated HIVE-541: -- Attachment: HIVE-541.2.patch Added a GenericUDFUtils.findText() where string encoding and decoding is avoided,

[jira] Commented: (HIVE-541) Implement UDFs: INSTR and LOCATE

2009-07-15 Thread Namit Jain (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-541?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12731812#action_12731812 ] Namit Jain commented on HIVE-541: - +1 The changes looks good - will merge if the tests pass

[jira] Commented: (HIVE-592) renaming internal table should rename HDFS and also change path of the table and partitions accordingly.

2009-07-15 Thread Prasad Chakka (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-592?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12731815#action_12731815 ] Prasad Chakka commented on HIVE-592: this patch changes the way 'alter table old_tbl_name