[jira] Created: (HIVE-1505) Support non-UTF8 data

2010-08-02 Thread bc Wong (JIRA)
Support non-UTF8 data
-

 Key: HIVE-1505
 URL: https://issues.apache.org/jira/browse/HIVE-1505
 Project: Hadoop Hive
  Issue Type: New Feature
  Components: Serializers/Deserializers
Affects Versions: 0.5.0
Reporter: bc Wong


I'd like to work with non-UTF8 data easily.

Suppose I have data in latin1. Currently, doing a "select *" will return the 
upper ascii characters in '\xef\xbf\xbd', which is the replacement character 
'\ufffd' encoded in UTF-8. Would be nice for Hive to understand different 
encodings, or to have a concept of byte string.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.



[jira] Updated: (HIVE-1211) Tapping logs from child processes

2010-06-29 Thread bc Wong (JIRA)

 [ 
https://issues.apache.org/jira/browse/HIVE-1211?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

bc Wong updated HIVE-1211:
--

Attachment: HIVE-1211-2.patch

> Tapping logs from child processes
> -
>
> Key: HIVE-1211
> URL: https://issues.apache.org/jira/browse/HIVE-1211
> Project: Hadoop Hive
>  Issue Type: Improvement
>  Components: Logging
>Reporter: bc Wong
>Assignee: bc Wong
> Fix For: 0.6.0
>
> Attachments: HIVE-1211-2.patch, HIVE-1211.1.patch
>
>
> Stdout/stderr from child processes (e.g. {{MapRedTask}}) are redirected to 
> the parent's stdout/stderr. There is little one can do to to sort out which 
> log is from which query.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.



[jira] Updated: (HIVE-1211) Tapping logs from child processes

2010-06-29 Thread bc Wong (JIRA)

 [ 
https://issues.apache.org/jira/browse/HIVE-1211?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

bc Wong updated HIVE-1211:
--

Status: Patch Available  (was: Open)

Updated patch on 0.6 branch.

> Tapping logs from child processes
> -
>
> Key: HIVE-1211
> URL: https://issues.apache.org/jira/browse/HIVE-1211
> Project: Hadoop Hive
>  Issue Type: Improvement
>  Components: Logging
>Reporter: bc Wong
>Assignee: bc Wong
> Fix For: 0.6.0
>
> Attachments: HIVE-1211-2.patch, HIVE-1211.1.patch
>
>
> Stdout/stderr from child processes (e.g. {{MapRedTask}}) are redirected to 
> the parent's stdout/stderr. There is little one can do to to sort out which 
> log is from which query.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.



[jira] Updated: (HIVE-1261) ql.metadata.Hive#close() should check for null metaStoreClient

2010-03-22 Thread bc Wong (JIRA)

 [ 
https://issues.apache.org/jira/browse/HIVE-1261?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

bc Wong updated HIVE-1261:
--

Attachment: HIVE-1261-2.patch

Attaching HIVE-1261-2.patch, which fixes some style issues.

> ql.metadata.Hive#close() should check for null metaStoreClient
> --
>
> Key: HIVE-1261
> URL: https://issues.apache.org/jira/browse/HIVE-1261
> Project: Hadoop Hive
>  Issue Type: Bug
>  Components: Query Processor
>Reporter: bc Wong
>Assignee: bc Wong
> Attachments: HIVE-1261-1.patch, HIVE-1261-2.patch
>
>
> {{ql.metadata.Hive#close()}} always does a {{metaStoreClient.close()}}, even 
> though it might be null.
> I'd like to reuse the same thread for multiple queries from different users. 
> Since it's a different user, {{HiveConf}} has a new ugi. But it's not easy to 
> get rid of (or update) the cached {{ql.metadata.Hive}}. The cleanest way 
> seems to be {{Hive.closeCurrent()}}, which unfortunately doesn't check for 
> the {{metaStoreClient}} being {{null}}.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.



[jira] Updated: (HIVE-1268) Cannot start metastore thrift server on a specific port

2010-03-22 Thread bc Wong (JIRA)

 [ 
https://issues.apache.org/jira/browse/HIVE-1268?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

bc Wong updated HIVE-1268:
--

Status: Patch Available  (was: Open)

> Cannot start metastore thrift server on a specific port
> ---
>
> Key: HIVE-1268
> URL: https://issues.apache.org/jira/browse/HIVE-1268
> Project: Hadoop Hive
>  Issue Type: Bug
>  Components: Metastore
>Reporter: bc Wong
>Assignee: bc Wong
> Attachments: HIVE-1268-1.patch
>
>
> The code to get the port number is broken. Doh.
> {code}
> 14:07:55[dsom:dist]$ METASTORE_PORT=12345 bin/hive --service metastore
> Starting Hive Metastore Server
> Exception in thread "main" java.lang.NullPointerException
> at 
> org.apache.hadoop.hive.metastore.HiveMetaStore.main(HiveMetaStore.java:960)
> at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
> at 
> sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:39)
> at 
> sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:25)
> at java.lang.reflect.Method.invoke(Method.java:597)
> at org.apache.hadoop.util.RunJar.main(RunJar.java:156)
> {code}

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.



[jira] Updated: (HIVE-1268) Cannot start metastore thrift server on a specific port

2010-03-22 Thread bc Wong (JIRA)

 [ 
https://issues.apache.org/jira/browse/HIVE-1268?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

bc Wong updated HIVE-1268:
--

Attachment: HIVE-1268-1.patch

> Cannot start metastore thrift server on a specific port
> ---
>
> Key: HIVE-1268
> URL: https://issues.apache.org/jira/browse/HIVE-1268
> Project: Hadoop Hive
>  Issue Type: Bug
>  Components: Metastore
>Reporter: bc Wong
>Assignee: bc Wong
> Attachments: HIVE-1268-1.patch
>
>
> The code to get the port number is broken. Doh.
> {code}
> 14:07:55[dsom:dist]$ METASTORE_PORT=12345 bin/hive --service metastore
> Starting Hive Metastore Server
> Exception in thread "main" java.lang.NullPointerException
> at 
> org.apache.hadoop.hive.metastore.HiveMetaStore.main(HiveMetaStore.java:960)
> at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
> at 
> sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:39)
> at 
> sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:25)
> at java.lang.reflect.Method.invoke(Method.java:597)
> at org.apache.hadoop.util.RunJar.main(RunJar.java:156)
> {code}

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.



[jira] Created: (HIVE-1268) Cannot start metastore thrift server on a specific port

2010-03-22 Thread bc Wong (JIRA)
Cannot start metastore thrift server on a specific port
---

 Key: HIVE-1268
 URL: https://issues.apache.org/jira/browse/HIVE-1268
 Project: Hadoop Hive
  Issue Type: Bug
  Components: Metastore
Reporter: bc Wong
Assignee: bc Wong


The code to get the port number is broken. Doh.

{code}
14:07:55[dsom:dist]$ METASTORE_PORT=12345 bin/hive --service metastore
Starting Hive Metastore Server
Exception in thread "main" java.lang.NullPointerException
at 
org.apache.hadoop.hive.metastore.HiveMetaStore.main(HiveMetaStore.java:960)
at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
at 
sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:39)
at 
sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:25)
at java.lang.reflect.Method.invoke(Method.java:597)
at org.apache.hadoop.util.RunJar.main(RunJar.java:156)
{code}

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.



[jira] Commented: (HIVE-1261) ql.metadata.Hive#close() should check for null metaStoreClient

2010-03-20 Thread bc Wong (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-1261?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12847750#action_12847750
 ] 

bc Wong commented on HIVE-1261:
---

The new test in the patch would currently fail.

> ql.metadata.Hive#close() should check for null metaStoreClient
> --
>
> Key: HIVE-1261
> URL: https://issues.apache.org/jira/browse/HIVE-1261
> Project: Hadoop Hive
>  Issue Type: Bug
>  Components: Query Processor
>Reporter: bc Wong
> Attachments: HIVE-1261-1.patch
>
>
> {{ql.metadata.Hive#close()}} always does a {{metaStoreClient.close()}}, even 
> though it might be null.
> I'd like to reuse the same thread for multiple queries from different users. 
> Since it's a different user, {{HiveConf}} has a new ugi. But it's not easy to 
> get rid of (or update) the cached {{ql.metadata.Hive}}. The cleanest way 
> seems to be {{Hive.closeCurrent()}}, which unfortunately doesn't check for 
> the {{metaStoreClient}} being {{null}}.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.



[jira] Updated: (HIVE-1261) ql.metadata.Hive#close() should check for null metaStoreClient

2010-03-20 Thread bc Wong (JIRA)

 [ 
https://issues.apache.org/jira/browse/HIVE-1261?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

bc Wong updated HIVE-1261:
--

Assignee: bc Wong
  Status: Patch Available  (was: Open)

> ql.metadata.Hive#close() should check for null metaStoreClient
> --
>
> Key: HIVE-1261
> URL: https://issues.apache.org/jira/browse/HIVE-1261
> Project: Hadoop Hive
>  Issue Type: Bug
>  Components: Query Processor
>Reporter: bc Wong
>Assignee: bc Wong
> Attachments: HIVE-1261-1.patch
>
>
> {{ql.metadata.Hive#close()}} always does a {{metaStoreClient.close()}}, even 
> though it might be null.
> I'd like to reuse the same thread for multiple queries from different users. 
> Since it's a different user, {{HiveConf}} has a new ugi. But it's not easy to 
> get rid of (or update) the cached {{ql.metadata.Hive}}. The cleanest way 
> seems to be {{Hive.closeCurrent()}}, which unfortunately doesn't check for 
> the {{metaStoreClient}} being {{null}}.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.



[jira] Updated: (HIVE-1261) ql.metadata.Hive#close() should check for null metaStoreClient

2010-03-20 Thread bc Wong (JIRA)

 [ 
https://issues.apache.org/jira/browse/HIVE-1261?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

bc Wong updated HIVE-1261:
--

Attachment: HIVE-1261-1.patch

> ql.metadata.Hive#close() should check for null metaStoreClient
> --
>
> Key: HIVE-1261
> URL: https://issues.apache.org/jira/browse/HIVE-1261
> Project: Hadoop Hive
>  Issue Type: Bug
>  Components: Query Processor
>Reporter: bc Wong
> Attachments: HIVE-1261-1.patch
>
>
> {{ql.metadata.Hive#close()}} always does a {{metaStoreClient.close()}}, even 
> though it might be null.
> I'd like to reuse the same thread for multiple queries from different users. 
> Since it's a different user, {{HiveConf}} has a new ugi. But it's not easy to 
> get rid of (or update) the cached {{ql.metadata.Hive}}. The cleanest way 
> seems to be {{Hive.closeCurrent()}}, which unfortunately doesn't check for 
> the {{metaStoreClient}} being {{null}}.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.



[jira] Created: (HIVE-1261) ql.metadata.Hive#close() should check for null metaStoreClient

2010-03-20 Thread bc Wong (JIRA)
ql.metadata.Hive#close() should check for null metaStoreClient
--

 Key: HIVE-1261
 URL: https://issues.apache.org/jira/browse/HIVE-1261
 Project: Hadoop Hive
  Issue Type: Bug
  Components: Query Processor
Reporter: bc Wong


{{ql.metadata.Hive#close()}} always does a {{metaStoreClient.close()}}, even 
though it might be null.

I'd like to reuse the same thread for multiple queries from different users. 
Since it's a different user, {{HiveConf}} has a new ugi. But it's not easy to 
get rid of (or update) the cached {{ql.metadata.Hive}}. The cleanest way seems 
to be {{Hive.closeCurrent()}}, which unfortunately doesn't check for the 
{{metaStoreClient}} being {{null}}.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.



[jira] Commented: (HIVE-49) [Hive] display column header on CLI

2010-03-19 Thread bc Wong (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-49?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12847580#action_12847580
 ] 

bc Wong commented on HIVE-49:
-

+1

At least a programmatic interface to get to the column names would be nice.

> [Hive] display column header on CLI
> ---
>
> Key: HIVE-49
> URL: https://issues.apache.org/jira/browse/HIVE-49
> Project: Hadoop Hive
>  Issue Type: Bug
>  Components: Clients
>Reporter: YoungWoo Kim
>Assignee: Edward Capriolo
>Priority: Minor
>
> The result row(s) on CLI are displayed without column header.
> It would be useful to be able to set header on or off by users.
> for example,
> hive> SET HEAD [ON|OFF]/* similar to oracle sql*plus syntax */
> or
> hive> SET [GLOBAL|SESSION] HEADER=[ON|OFF]/* similar to mysql's "SET" 
> syntax */

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.



[jira] Commented: (HIVE-1211) Tapping logs from child processes

2010-03-06 Thread bc Wong (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-1211?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12842346#action_12842346
 ] 

bc Wong commented on HIVE-1211:
---

I'd like to retrieve the logs related to a particular query.

SessionState.out and SessionState.err are meant for that exact purpose. 
Unfortunately, they don't capture the child processes' output, which currently 
goes directly to System.out and System.err. Therefore this patch introduces 
SessionState.childout and SessionState.childerr, which by default (null) do 
nothing. But it allows anyone to hook a stream there to get the child's output.

> Tapping logs from child processes
> -
>
> Key: HIVE-1211
> URL: https://issues.apache.org/jira/browse/HIVE-1211
> Project: Hadoop Hive
>  Issue Type: Improvement
>  Components: Logging
>Reporter: bc Wong
>Assignee: bc Wong
> Attachments: HIVE-1211.1.patch
>
>
> Stdout/stderr from child processes (e.g. {{MapRedTask}}) are redirected to 
> the parent's stdout/stderr. There is little one can do to to sort out which 
> log is from which query.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.



[jira] Updated: (HIVE-1211) Tapping logs from child processes

2010-03-04 Thread bc Wong (JIRA)

 [ 
https://issues.apache.org/jira/browse/HIVE-1211?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

bc Wong updated HIVE-1211:
--

Status: Patch Available  (was: Open)

> Tapping logs from child processes
> -
>
> Key: HIVE-1211
> URL: https://issues.apache.org/jira/browse/HIVE-1211
> Project: Hadoop Hive
>  Issue Type: Improvement
>  Components: Logging
>Reporter: bc Wong
> Attachments: HIVE-1211.1.patch
>
>
> Stdout/stderr from child processes (e.g. {{MapRedTask}}) are redirected to 
> the parent's stdout/stderr. There is little one can do to to sort out which 
> log is from which query.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.



[jira] Updated: (HIVE-1211) Tapping logs from child processes

2010-03-03 Thread bc Wong (JIRA)

 [ 
https://issues.apache.org/jira/browse/HIVE-1211?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

bc Wong updated HIVE-1211:
--

Attachment: HIVE-1211.1.patch

> Tapping logs from child processes
> -
>
> Key: HIVE-1211
> URL: https://issues.apache.org/jira/browse/HIVE-1211
> Project: Hadoop Hive
>  Issue Type: Improvement
>  Components: Logging
>Reporter: bc Wong
> Attachments: HIVE-1211.1.patch
>
>
> Stdout/stderr from child processes (e.g. {{MapRedTask}}) are redirected to 
> the parent's stdout/stderr. There is little one can do to to sort out which 
> log is from which query.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.



[jira] Commented: (HIVE-1211) Tapping logs from child processes

2010-03-03 Thread bc Wong (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-1211?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12840834#action_12840834
 ] 

bc Wong commented on HIVE-1211:
---

I'd like to change it to go through the {{SessionState}} first. 
{{SessionState}} will have two new {{PrintStream}}, called {{childOut}} and 
{{childErr}}, which are similar to the current {{out}} and {{err}}. They let 
users tap into the child's output. If they're null, then the child's out will 
continue on to stdout/stderr.

And this doesn't break any existing users who aren't interested in the child's 
output. They can tap into {{SessionState.out|err}} and ignore the new fields.

> Tapping logs from child processes
> -
>
> Key: HIVE-1211
> URL: https://issues.apache.org/jira/browse/HIVE-1211
> Project: Hadoop Hive
>  Issue Type: Improvement
>  Components: Logging
>Reporter: bc Wong
>
> Stdout/stderr from child processes (e.g. {{MapRedTask}}) are redirected to 
> the parent's stdout/stderr. There is little one can do to to sort out which 
> log is from which query.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.



[jira] Created: (HIVE-1211) Tapping logs from child processes

2010-03-03 Thread bc Wong (JIRA)
Tapping logs from child processes
-

 Key: HIVE-1211
 URL: https://issues.apache.org/jira/browse/HIVE-1211
 Project: Hadoop Hive
  Issue Type: Improvement
  Components: Logging
Reporter: bc Wong


Stdout/stderr from child processes (e.g. {{MapRedTask}}) are redirected to the 
parent's stdout/stderr. There is little one can do to to sort out which log is 
from which query.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.



[jira] Created: (HIVE-1206) Return row results in a list instead of a tab-delimited string

2010-02-28 Thread bc Wong (JIRA)
Return row results in a list instead of a tab-delimited string
--

 Key: HIVE-1206
 URL: https://issues.apache.org/jira/browse/HIVE-1206
 Project: Hadoop Hive
  Issue Type: Bug
Reporter: bc Wong


Driver.getResults() returns each row as a string, with fields tab delimited 
always. This breaks for data with tabs. It'd be really nice if the interface 
allows returning the row as a list of fields.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.