Re: Run a job async

2013-02-05 Thread Prashant Kommireddi
Thanks Alan, this was helpful. On Thu, Jan 24, 2013 at 9:46 AM, Alan Gates wrote: > You might want to look at webhcat's code. It produces a servlet that it > embeds in a jetty server. You may be able to copy paste this to get what > you want. > > The code of interest is in the hcat repository

Re: Run a job async

2013-01-25 Thread Cheolsoo Park
Thank you for the suggestions. I will file a jira and add our discussion there. On Fri, Jan 25, 2013 at 4:23 PM, Rohini Palaniswamy wrote: > Jon, > Those are good areas to check. Few things I have seen regarding those are > > 1) JythonScriptEngine -PythonInterpreter is static and is not suit

Re: Run a job async

2013-01-25 Thread Rohini Palaniswamy
Jon, Those are good areas to check. Few things I have seen regarding those are 1) JythonScriptEngine -PythonInterpreter is static and is not suitable for multiple runs if the script names are same (hit this issue in PIG-2433 unit tests). 2) QueryParserDriver - There is a static cache with macr

Re: Run a job async

2013-01-25 Thread Jonathan Coveney
user to bcc, +dev Cheolsoo, Can you make a JIRA for this? I can imagine a slightly heavier test suite, but I like where you started. If it's not far off, then I think it'll be a win to make it thread safe. But we need to make sure to test the most advanced features...UDF's (esp the same name but

Re: Run a job async

2013-01-25 Thread Cheolsoo Park
>> if you have multiple threads that run a query via PigServer, there is a great chance of the internals clashing because of the use of static variable within Pig. Recently, I spent some time on this, and what I found is that the Pig front-end is quite thread-safe. Here is how I tested it: 1) Wro

Re: Run a job async

2013-01-24 Thread Ramakrishna Nalam
That clarifies it for me, thanks a lot. Regards, Rama. On Fri, Jan 25, 2013 at 10:09 AM, Jonathan Coveney wrote: > Well, when I say that Pig is not multi-threaded, what I mean is that if you > have multiple threads that run a query via PigServer, there is a great > chance of the internals clash

Re: Run a job async

2013-01-24 Thread Jonathan Coveney
Well, when I say that Pig is not multi-threaded, what I mean is that if you have multiple threads that run a query via PigServer, there is a great chance of the internals clashing because of the use of static variables within Pig. Pig itself, when running a single query, is multi-threaded. It's jus

Re: Run a job async

2013-01-24 Thread Ramakrishna Nalam
Hi Jonathan, Pardon if it's a naive question, but Interesting that you say Pig is not multithreaded. We're using Pig 0.10.0, and looking at the code, it seems to do the right things to handle multi threaded requests (ThreadLocal for ScriptState for eg). Would be great if you can point out to the

Re: Run a job async

2013-01-24 Thread Alan Gates
You might want to look at webhcat's code. It produces a servlet that it embeds in a jetty server. You may be able to copy paste this to get what you want. The code of interest is in the hcat repository under webhcat/svr. Alan. On Jan 24, 2013, at 9:42 AM, Prashant Kommireddi wrote: > Thanks

Re: Run a job async

2013-01-24 Thread Prashant Kommireddi
Thanks Alan. We are trying to plug Pig into our existing app server. We have already done this for Java MR. The difficulty we are facing is with the fact that we can use JobClient.submitJob and jobtracker's job end notification to run jobs async, whereas PigServer.executeBatch blocks until pig job

Re: Run a job async

2013-01-24 Thread Alan Gates
If you're looking for an app server for Pig I'd take a look at a couple of other projects already out there that can do this: 1) webhcat (fka Templeton, now part of the HCatalog project). It provides a REST API that launches Pig, Hive, or MR jobs and allows you to manage them, get results, etc

Re: Run a job async

2013-01-24 Thread Praveen M
Are there any plans on making the pigserver multi-threaded? since there is "PigProcessNotificationListener" to subscribe for async callbacks when the pig job completes, is there any real need to keep the pig job submitting thread waiting until the job completes? Is this just a shortcoming today o

Re: Run a job async

2013-01-23 Thread Jonathan Coveney
I think whatever way you slice it, handling thousands of pig jobs asynchronously is going to be a bear. I mean, this is essentially what the job tracker does, albeit with a lot less information. Either way, Pig is not multi-threaded so having more than one instance of Pig in the same JVM is going

Re: Run a job async

2013-01-23 Thread Prashant Kommireddi
Both. Think of it as an app server handling all of these requests. Sent from my iPhone On Jan 23, 2013, at 9:09 PM, Jonathan Coveney wrote: > Thousands of requests, or thousands of Pig jobs? Or both? > > > 2013/1/23 Prashant Kommireddi > >> Did not want to have several threads launched for thi

Re: Run a job async

2013-01-23 Thread Jonathan Coveney
Thousands of requests, or thousands of Pig jobs? Or both? 2013/1/23 Prashant Kommireddi > Did not want to have several threads launched for this. We might have > thousands of requests coming in, and the app is doing a lot more than only > Pig. > > On Wed, Jan 23, 2013 at 5:44 PM, Jonathan Coven

Re: Run a job async

2013-01-23 Thread Prashant Kommireddi
Did not want to have several threads launched for this. We might have thousands of requests coming in, and the app is doing a lot more than only Pig. On Wed, Jan 23, 2013 at 5:44 PM, Jonathan Coveney wrote: > start a separate Process which runs Pig? > > > 2013/1/23 Prashant Kommireddi > > > Hey

Re: Run a job async

2013-01-23 Thread Jonathan Coveney
start a separate Process which runs Pig? 2013/1/23 Prashant Kommireddi > Hey guys, > > I am trying to do the following: > >1. Launch a pig job asynchronously via Java program >2. Get a notification once the job is complete (something similar to >Hadoop callback with a servlet) > > I

Re: Run a job async

2013-01-23 Thread Bill Graham
You can create in instance of PigProcessNotificationListener that calls back when the job finishes. On Wed, Jan 23, 2013 at 4:48 PM, Prashant Kommireddi wrote: > Hey guys, > > I am trying to do the following: > >1. Launch a pig job asynchronously via Java program >2. Get a notification o

Run a job async

2013-01-23 Thread Prashant Kommireddi
Hey guys, I am trying to do the following: 1. Launch a pig job asynchronously via Java program 2. Get a notification once the job is complete (something similar to Hadoop callback with a servlet) I looked at PigServer.executeBatch() and it seems to be waiting until job completes.This is