Re: hive.query.string not reflecting the current query

2013-12-11 Thread Petter von Dolwitz (Hem)
Hi, thank you all for your replies. I switched to using 'hive.io.filter.text' inline with Peters reply. I also applied the filter negotiation mechanism (HiveStoragePredicateHandler) in my storage handler. It works very well (so far) even though the filter negotiation mechanism is a bit limited in

RE: hive.query.string not reflecting the current query

2013-12-05 Thread Peter Marron
Hi, Sorry for the late reply. Maybe the property 'hive.io.filter.expr.serialized' is something that can help? It works for me, and it certainly works in the case where the query does not result in a Map/Reduce (which is something that I rely on). (If you google you should be able to find out abou

Re: hive.query.string not reflecting the current query

2013-12-03 Thread Navis류승우
Looks like a bug. I've booked this on https://issues.apache.org/jira/browse/HIVE-5935. 2013/12/4 Adam Kawa > Maybe you can parse the output of EXPLAIN operator applied on your query > https://cwiki.apache.org/confluence/display/Hive/LanguageManual+Explain > or look for other configuration prope

Re: hive.query.string not reflecting the current query

2013-12-03 Thread Adam Kawa
Maybe you can parse the output of EXPLAIN operator applied on your query https://cwiki.apache.org/confluence/display/Hive/LanguageManual+Explain or look for other configuration property (e.g. saying that number of map and reduce tasks is equal to 0, or something). 2013/12/3 Petter von Dolwitz (H

Re: hive.query.string not reflecting the current query

2013-12-03 Thread Petter von Dolwitz (Hem)
Yes, it seems related. I think the query string is not refreshed when hive decides to run without a map reduce job. Problem is that I try to interact with the query string to apply an early filter in the record reader. Any other known way to detect that a map reduce job is not spawned so that I can

Re: hive.query.string not reflecting the current query

2013-12-03 Thread Adam Kawa
Hmmm? Maybe it is related to the fact, that a query: > select * from mytable limit 100; does not start any MapReduce job. It is starts a reading operation from HDFS (and a communication with MetaStore to know what is the schema and how to parse the data using InputFormat and SerDe). For example,