Re: What is wrong with my input path?

2013-03-21 Thread Kris Coward
be the same as the input path you've pasted here). Cheers, Kris On Wed, Mar 20, 2013 at 04:14:47PM -0700, Tim Chan wrote: I'm using parameter passing to pass an input path to my pig script. This does not seem to work: -param input=/path1/{08,09,10,11,12}/*/data/,/path2/{01,02,03}/*/data/ -- Kris

Re: UDF that takes bag as input and returns another bag

2013-03-18 Thread Kris Coward
and returns another bag as output? How can this be done? Thanks, -- regards Pranjal -- Kris Coward http://unripe.melon.org/ GPG Fingerprint: 2BF3 957D 310A FEEC 4733 830E 21A4 05C7 1FEB 12B3

Re: Behaviour on a failed cast.

2013-02-01 Thread Kris Coward
that the schema is null. And you can using ... as(); to certain the schema and column type, maybe you will defeat the cast fail. 专注于Mysql,MSSQL,Oracle,Hadoop 2013/1/12 Kris Coward k...@melon.org Hi, I remember a while back that there was a setting introduced to allow a pig job to either

Re: Escaping Dollar Sign in Map in Pig 0.10

2013-01-15 Thread Kris Coward
get: Unexpected character '$' Ideas? Thanks! Eli -- Kris Coward http://unripe.melon.org/ GPG Fingerprint: 2BF3 957D 310A FEEC 4733 830E 21A4 05C7 1FEB 12B3

Storing dates in LzoJsonStorage

2012-12-18 Thread Kris Coward
. Can this be done? If so, is there a Right Way to do it? Thanks, Kris -- Kris Coward http://unripe.melon.org/ GPG Fingerprint: 2BF3 957D 310A FEEC 4733 830E 21A4 05C7 1FEB 12B3

Failing to make sense of an error.

2012-11-28 Thread Kris Coward
-- Kris Coward http://unripe.melon.org/ GPG Fingerprint: 2BF3 957D 310A FEEC 4733 830E 21A4 05C7 1FEB 12B3

Re: Group by Fetching top 100 from each group

2012-06-29 Thread Kris Coward
Juhn wrote: Hi there, I'm trying to write a group by statement, only returning the top 100 records from each group. Does pig support this? Thanks, Ben -- Kris Coward http://unripe.melon.org/ GPG Fingerprint: 2BF3 957D 310A FEEC 4733 830E 21A4 05C7 1FEB

Re: Can I check if the field is integer or not

2012-06-08 Thread Kris Coward
and in such cases when I cast it to integer it throughs an error (can't convert string into integer). Is there is any way where I can use case statement say if the field integer then cast it to integer else put '-1'. Thanks for any help, Sonia -- Kris Coward

Re: Does Pig guarantee output won't include duplicated rows?

2012-05-18 Thread Kris Coward
) AND (lat 37.817) AND (lng -122.519) AND (lng -122.356); STORE B INTO '$OUTPUT'; Thanks. -- Kris Coward http://unripe.melon.org/ GPG Fingerprint: 2BF3 957D 310A FEEC 4733 830E 21A4 05C7 1FEB 12B3

Re: Replace at position in string

2012-04-13 Thread Kris Coward
with mn mncd as output Any built in UDF or i should write own UDF?. I checked existing Replace method , it replaces value to some another value. Thanks and Regards , -- Kris Coward http://unripe.melon.org/ GPG Fingerprint: 2BF3 957D 310A FEEC 4733

Problem with a protobuf in EB

2011-04-29 Thread Kris Coward
-- Kris Coward http://unripe.melon.org/ GPG Fingerprint: 2BF3 957D 310A FEEC 4733 830E 21A4 05C7 1FEB 12B3

Re: Problem with a protobuf in EB

2011-04-29 Thread Kris Coward
29, 2011 at 04:16:05PM -0700, Dmitriy Ryaboy wrote: and the contents of '/home/kris/swineflu/logformats-0.1.2.jar' (jar -tf) D On Fri, Apr 29, 2011 at 1:15 PM, Kris Coward k...@melon.org wrote: Well I'll send up to the point where it fails and exits, since the rest seems kinda

Re: Percentile UDF

2011-03-10 Thread Kris Coward
for this? I think it would be because there's an inherently serial problem in there (i.e. numbering each entry based on its place in the ordered list). Cheers, Kris -- Kris Coward http://unripe.melon.org/ GPG Fingerprint: 2BF3 957D 310A FEEC 4733 830E 21A4

Re: Any reason a bunch of nearly-identical jobs would suddenly stop working?

2011-03-09 Thread Kris Coward
Also, reading some uncompressed data off the same cluster using PigStorage shows a failure to even read the data in the first place :| -K On Tue, Mar 08, 2011 at 09:24:18PM -0500, Kris Coward wrote: None of the nodes have more than 20% utilization on any of their disks; so it must

Any additional consultant-finding resources?

2011-03-09 Thread Kris Coward
, and most of the support options on the wiki seem to be pretty solidly full-service, or bits-of-hadoop-other-than-pig. Any pointers would be much appreciated. Thanks, Kris -- Kris Coward http://unripe.melon.org/ GPG Fingerprint: 2BF3 957D 310A FEEC 4733 830E

Re: Any reason a bunch of nearly-identical jobs would suddenly stop working?

2011-03-08 Thread Kris Coward
, 2011 at 03:24:50PM -0800, Dmitriy Ryaboy wrote: Check task logs. I am guessing you ran out of either hdfs or local disk on the nodes. Also, never let your sysadmin go on vacation, that's what makes things break! :) D On Tue, Mar 8, 2011 at 2:53 PM, Kris Coward k...@melon.org wrote

Re: Problems loading a datafile..

2011-03-02 Thread Kris Coward
extension. D On Wed, Mar 2, 2011 at 12:16 PM, Kris Coward k...@melon.org wrote: I might still be missing something useful (we're running elephant-bird from the gpl-packing distribution, and I've registered most of the jarfiles from it), but the strack trace has changed a little, so now

Re: Problems loading a datafile..

2011-03-01 Thread Kris Coward
I get the output: rw-r--r-- 2 kris supergroup 172694 2011-02-25 01:59 /path/to/file/item/ex/subdir -K On Tue, Mar 01, 2011 at 12:46:31PM -0800, Dmitriy Ryaboy wrote: What happens when you hadoop fs -lsr those paths? D On Sun, Feb 27, 2011 at 7:47 PM, Kris Coward k...@melon.org

Problems loading a datafile..

2011-02-27 Thread Kris Coward
) at org.apache.pig.PigServer.executeCompiledLogicalPlan(PigServer.java:781) at org.apache.pig.PigServer.store(PigServer.java:529) at org.apache.pig.PigServer.openIterator(PigServer.java:465) ... 6 more Anyone have any suggestions why this may be happening and how to fix it? Thanks, Kris -- Kris Coward

Re: Problems with com.twitter.elephantbird.pig.store.LzoTokenizedStorage

2011-02-13 Thread Kris Coward
0.8... D On Fri, Feb 11, 2011 at 10:47 AM, Kris Coward k...@melon.org wrote: So in the interest of being a little less i/o bound, and saving a whole mess of disk, I've started using com.twitter.elephantbird.pig.store.LzoTokenizedStorage for storage... or more accurately, will be using

Problems with com.twitter.elephantbird.pig.store.LzoTokenizedStorage

2011-02-11 Thread Kris Coward
-- Kris Coward http://unripe.melon.org/ GPG Fingerprint: 2BF3 957D 310A FEEC 4733 830E 21A4 05C7 1FEB 12B3

Casting unclean data.

2011-02-03 Thread Kris Coward
of is a number test that I could use to filter the data before trying to cast it, or do I have to write a UDF or a little program to stream the data through in order to get this sort of data cleaning. Thanks, Kris -- Kris Coward http://unripe.melon.org/ GPG

Problems with STORE

2011-01-31 Thread Kris Coward
/f3453efd460348bbaeee2e9496e25871/1294311600/apa from within grunt to verify that it wasn't a permissions problem (and then removed apa so that STORE wouldn't fail on account of the directory already existing), and the error persists. Any advice on what might be causing this problem? Thanks, Kris -- Kris Coward

Joining inner and outer bags

2011-01-07 Thread Kris Coward
to generate expected values if I'm not? (and if I am, is there something less hackish than a GROUP on a unique tuple element that I could use to load the desired values into a bag or tuple (or just plain pass the entire tuple to a UDF)? Thanks, Kris -- Kris Coward

Re: Joining inner and outer bags

2011-01-07 Thread Kris Coward
On Fri, Jan 07, 2011 at 10:44:03AM -0800, Thejas M Nair wrote: On 1/7/11 9:20 AM, Kris Coward k...@melon.org wrote: I've got an outer bag/relation consistig of a bunch of user information, one of the pieces of which is an inner bag of possible events for that user, and the value of those

Re: Taking advantage of structure when doing UDFs and whatnot?

2011-01-04 Thread Kris Coward
. But in the absence of such a counter, yeah, I think a UDF would be necessary. Cheers, Kris -- Kris Coward http://unripe.melon.org/ GPG Fingerprint: 2BF3 957D 310A FEEC 4733 830E 21A4 05C7 1FEB 12B3

Re: Cumulative totals in an ORDERed relation.

2010-12-19 Thread Kris Coward
it on the result of doing a GROUP ALL on your relation. -Dmitriy On Fri, Dec 17, 2010 at 11:31 AM, Kris Coward k...@melon.org wrote: Hello, Is there some sort of mechanism by which I could cause a value to accumulate within a relation? What I'd like to do is something along the lines of having

Re: Cumulative totals in an ORDERed relation.

2010-12-19 Thread Kris Coward
it on the result of doing a GROUP ALL on your relation. -Dmitriy On Fri, Dec 17, 2010 at 11:31 AM, Kris Coward k...@melon.org wrote: Hello, Is there some sort of mechanism by which I could cause a value to accumulate within a relation? What I'd like to do is something along

Re: should the following query work?

2010-12-09 Thread Kris Coward
, A); -- this works describe D; E = foreach C generate B.(group, A.(x)); describe E; --- pig returns syntax error, but should this work? Or is there a patch for it? thanks, lin -- Kris Coward http://unripe.melon.org/ GPG Fingerprint: 2BF3 957D 310A FEEC

IOException appearing during dump but not illustrate

2010-12-08 Thread Kris Coward
could get this to work, that'd really help me out. Thanks, Kris P.S. For those who remember my earlier post about getting httpArgParse to compile, I took the advice to ditch the InternalMap in favour of a HashMapString,Object -- Kris Coward http