Re: Pig UI

2012-01-17 Thread Prashant Kommireddi
I believe that could be used only with Cloudera distro? Sent from my iPhone On Jan 17, 2012, at 10:38 PM, Geert Van Landeghem wrote: > Prashant, > > Have a look at Hue > > Kind regards > Geert > > On Wednesday, January 18, 2012, Prashant Kommireddi > wrote: >> Is there anything out there that

Re: Pig UI

2012-01-17 Thread Geert Van Landeghem
Prashant, Have a look at Hue Kind regards Geert On Wednesday, January 18, 2012, Prashant Kommireddi wrote: > Is there anything out there that allows you to issue Pig queries/scripts > through a browser/UI? > > Thanks, > Prashant

Re:Re: could pig-0.9.1 work with hbase-0.90.5?

2012-01-17 Thread lulynn_2008
yes, sure. My current test method is running pig unit tests by "ant test". Do you have any other suggestions? At 2012-01-18 12:54:28,"Dmitriy Ryaboy" wrote: >It works with 0.90.3 for sure. If you test with 0.90.5 and find >incompatibilities, please report them here. > >On Jan 17, 2012, at

Re: could pig-0.9.1 work with hbase-0.90.5?

2012-01-17 Thread Dmitriy Ryaboy
It works with 0.90.3 for sure. If you test with 0.90.5 and find incompatibilities, please report them here. On Jan 17, 2012, at 7:36 PM, lulynn_2008 wrote: > In pig-0.9.1, current hbase is version 0.90.0. > My question is could pig-0.9.1 work with latest stable hbase with version > 0.90.5? >

could pig-0.9.1 work with hbase-0.90.5?

2012-01-17 Thread lulynn_2008
In pig-0.9.1, current hbase is version 0.90.0. My question is: could pig-0.9.1 work with the latest stable hbase, version 0.90.5? Thank you.

Pig UI

2012-01-17 Thread Prashant Kommireddi
Is there anything out there that allows you to issue Pig queries/scripts through a browser/UI? Thanks, Prashant

Re: how to control the number of mappers?

2012-01-17 Thread Yang
OK, I see. I was using Pig 0.5; I tried 0.9 and it works now. Thanks! On Tue, Jan 17, 2012 at 1:20 PM, Yang wrote: > weird > > I tried > > # head a.pg > > set job.name 'blah'; > SET mapred.map.tasks.speculative.execution false; > set mapred.min.split.size 1; > > set mapred.tasktracker.map.tasks.maxi

Re: how to control the number of mappers?

2012-01-17 Thread Yang
weird. I tried:
# head a.pg
set job.name 'blah';
SET mapred.map.tasks.speculative.execution false;
set mapred.min.split.size 1;
set mapred.tasktracker.map.tasks.maximum 1;
[root@]# pig a.pg
2012-01-17 16:19:18,407 [main] INFO org.apache.pig.Main - Logging error messages to: /mnt/pig_13

Re: how to control the number of mappers?

2012-01-17 Thread Dmitriy Ryaboy
http://pig.apache.org/docs/r0.9.1/cmds.html#set "All Pig and Hadoop properties can be set, either in the Pig script or via the Grunt command line." On Tue, Jan 17, 2012 at 12:53 PM, Yang wrote: > Prashant: > > I tried splitting the input files, yes that worked, and multiple mappers > were indee
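For illustration, here is a minimal sketch of setting Hadoop properties from inside a Pig script on 0.9.x, reusing property names that appear in Yang's script in this thread. The file name, paths, and relation name are hypothetical. Whether a given property actually changes the number of mappers depends on the input format and cluster configuration, so treat this as a syntax example only.

```pig
-- a.pig: sketch only; paths and the relation name are made up for illustration
SET job.name 'blah';
SET mapred.map.tasks.speculative.execution false;
SET mapred.min.split.size 1;

-- Any job launched after the SET statements picks up those properties.
lines = LOAD 'input/data.txt' AS (line:chararray);
STORE lines INTO 'output/copy';
```

The script would be run as in the thread, for example with `pig a.pig`.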

Re: how to control the number of mappers?

2012-01-17 Thread Yang
Prashant: I tried splitting the input files, and yes, that worked: multiple mappers were indeed created. But then I would have to create a separate stage simply to split the input files, so that is a bit cumbersome. It would be nice if there were some control to directly limit map file input size et

Re: how to control the number of mappers?

2012-01-17 Thread Yang
Thanks, but from http://pig.apache.org/docs/r0.7.0/piglatin_ref2.html#set it looks like the params that can be 'set' are very limited, and do not include the min split size and mapper count that I want. On Wed, Jan 11, 2012 at 9:52 PM, Dmitriy Ryaboy wrote: > Yes, you can use the "set" keyword to

Re: Sort databag

2012-01-17 Thread Jonathan Coveney
Is there any reason you can't use pig's order by? 2012/1/17 Harsh J > Hi Marek, > > Moving question to user@pig.apache.org which may be more relevant. > > (BCC'd mapreduce-user@, CC'd you) > > On 17-Jan-2012, at 9:57 PM, Marek Miglinski wrote: > > Hi, > > I receive a DataBag in my custom UDF and
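As a sketch of what using ORDER BY on a bag could look like, assuming the bag comes from a GROUP and the sort can happen in the script rather than inside the UDF: a nested FOREACH can order each group's bag before it is passed on. The relation names, field names, and input path below are hypothetical, and whether this fits Marek's UDF is not clear from the thread.

```pig
-- Sketch only: field names and the input path are made up for illustration
raw = LOAD 'input/events.txt' AS (key:chararray, ts:long, value:chararray);
grouped = GROUP raw BY key;

-- Sort each group's bag by one field inside a nested FOREACH
sorted = FOREACH grouped {
    ordered = ORDER raw BY ts;
    GENERATE group, ordered;
};
```

A custom UDF could then be applied to `ordered` in the GENERATE clause in place of the plain bag.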

Re: Sort databag

2012-01-17 Thread Harsh J
Hi Marek, Moving question to user@pig.apache.org which may be more relevant. (BCC'd mapreduce-user@, CC'd you) On 17-Jan-2012, at 9:57 PM, Marek Miglinski wrote: > Hi, > > I receive a DataBag in my custom UDF and want to sort it by first field in > Tuples it stores. The way I implemented is:

Re: overwrite output

2012-01-17 Thread Jonathan Coveney
You could put "rmf thefile;" before it, which just deletes the file and then stores... or do you need some different set of guarantees? 2012/1/17 Marco Cadetg > Hi there, > > AFAICT the STORE function doesn't provide a way to overwrite the output. I > guess you could use your own storage UDF to ac
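A minimal sketch of the rmf suggestion, with hypothetical paths and relation name. rmf removes the path if it exists and does not fail if it is missing, so the script can be rerun. Note that this is delete-then-write, not an atomic overwrite: if the job fails after the rmf, the old output is already gone.

```pig
-- Sketch only: paths and the relation name are made up for illustration
data = LOAD 'input/data.txt' AS (line:chararray);

-- Remove any previous output so STORE does not fail on an existing directory
rmf output/results;
STORE data INTO 'output/results';
```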

overwrite output

2012-01-17 Thread Marco Cadetg
Hi there, AFAICT the STORE function doesn't provide a way to overwrite the output. I guess you could use your own storage UDF to accomplish that but is there also another way of doing that? Thanks -Marco