Rob,
On 12/11/20 15:00, Rob Sargent wrote:
> [huge snip]
> Your “Job” example seems along the lines of get-it-off-the-servlet,
> which again points back to my current queue handler I think.
Yes, I think so. So let's get back to your original idea -- which I
think is a good one -- to use a shared queue to manage the jobs.
Just to be clear, the servlet is going to reply to the client ASAP by
saying "I have accepted this job and will do my best to complete it", or
it will return an error (see below), or it will refuse a connection (see
below). Sound okay so far?
> [My servlet] takes the payload from the client and writes “lots” of
> records in the database. Do I want that save() call in the servlet
> or should I queue it up for some other handler? All on the same
> hardware, but that frees up the servlet.
If the client doesn't care about job status information, then
fire-and-forget is a reasonable methodology. You may find that
at some point, they will want to get some job-status information. You
could implement that, later. Version 2.2 maybe?
On the other hand, if you can process some of the request in a streaming
way, then you can be writing to your database before your client is done
sending the request payload. You can still do that with fire-and-forget,
but it requires some more careful handling of the streams and stuff like
that.
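For instance, a line-oriented payload could be handled something like this
(handleRecord() here is a hypothetical stand-in for the real per-record save):

```java
import java.io.*;

// Sketch of the streaming idea: write records to the database as the
// payload arrives, instead of buffering the whole request body first.
public class StreamingHandler {
    // In the servlet, 'in' would be request.getInputStream()
    public static int process(InputStream in) throws IOException {
        BufferedReader reader =
            new BufferedReader(new InputStreamReader(in, "UTF-8"));
        int records = 0;
        String line;
        while ((line = reader.readLine()) != null) {
            handleRecord(line); // hypothetical: e.g. add to a JDBC batch
            records++;
        }
        return records;
    }

    static void handleRecord(String record) {
        // stand-in for the real save() of one record
    }

    public static void main(String[] args) throws IOException {
        byte[] payload = "row1\nrow2\nrow3".getBytes();
        System.out.println(process(new ByteArrayInputStream(payload)));
    }
}
```

The catch, as noted below, is that you must finish reading before the
service() method returns, so the careful stream handling still applies.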
The one thing you cannot do is retain a reference to the request
(response, etc.) after your servlet's service() method ends. Well,
unless you go async but that's a whole different thing which doesn't
sound like what you want to do, now that I have more info.
Calling save() from the servlet would tie up the request-processing
thread until the save completes. That's where you get your 18-hour
response times, which is not very HTTP-friendly.
Avoiding calling save() from the servlet requires that you fully read
the request payload before queuing the save() call into a thread pool
bundled-up with your data. (Well, there are some tricks you could use
but they are a little dirty and may not buy you much.)
> In the small client (my self-made DOS), there’s only a handful of
> writes, but still faster to hand that memory to a queue and let the
> servlet go back to the storm.
I would make everything work the same way unless there is a compelling
reason to have different code paths.
> That’s the thinking behind the question of accessing a ThreadPoolExecutor via
> JNDI. I know my existing impl does queue jobs (so the load is greater than
> the capacity to handle requests). I worry that without off-loading, Tomcat
> would just spin up more servlet threads and exhaust resources. I can lose a
> client, but would rather not lose the server (that loses all clients...)
Agreed: rejecting a single request is preferred over the service coming
down -- and all its in-flight jobs with it.
So I think you want something like this:
servlet {
    post {
        // Buffer all our input data
        long bufferSize = request.getContentLengthLong();
        if(bufferSize > Integer.MAX_VALUE || bufferSize < 0) {
            bufferSize = 8192; // Reasonable default?
        }
        ByteArrayOutputStream buffer =
            new ByteArrayOutputStream((int)bufferSize);
        InputStream in = request.getInputStream();
        int count;
        byte[] buf = new byte[8192];
        while(-1 != (count = in.read(buf))) {
            buffer.write(buf, 0, count);
        }
        // All data read: tell the client we are good to go
        Job job = new Job(buffer);
        try {
            sharedExecutor.submit(job); // Fire and forget
            response.setStatus(200); // OK
        } catch (RejectedExecutionException ree) {
            response.setStatus(503); // Service Unavailable
        }
    }
}
Obviously, the job needs to know how to execute itself (making it
Runnable means you can use the various Executors Java provides). Also,
you need to decide what to do about creating the executor.
I used the ByteArrayOutputStream above to avoid the complexity of
re-scaling buffers in example code. If you have huge buffers and you
need to convert to byte[] at the end, then you are going to need 2x heap
space to do it. Yuck. Consider implementing the auto-resizing
byte array yourself and avoiding ByteArrayOutputStream.
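A rough sketch of that, with made-up names: grow the byte[] in place and
let the job read the backing array directly, so there is no final
toByteArray()-style defensive copy:

```java
// Hypothetical grow-in-place buffer: unlike ByteArrayOutputStream, the
// consumer reads the internal array directly (array() plus length()),
// so there is no full final copy and no 2x heap spike at the end.
public class GrowableBuffer {
    private byte[] data = new byte[8192];
    private int length = 0;

    public void write(byte[] src, int off, int len) {
        if (length + len > data.length) {
            // Double capacity until the new chunk fits
            int newCap = data.length;
            while (newCap < length + len) newCap *= 2;
            byte[] bigger = new byte[newCap];
            System.arraycopy(data, 0, bigger, 0, length);
            data = bigger;
        }
        System.arraycopy(src, off, data, length, len);
        length += len;
    }

    // The job reads data[0..length) directly; no copy is made
    public byte[] array() { return data; }
    public int length() { return length; }

    public static void main(String[] args) {
        GrowableBuffer b = new GrowableBuffer();
        byte[] chunk = new byte[5000];
        for (int i = 0; i < 4; i++) b.write(chunk, 0, chunk.length);
        System.out.println(b.length());
    }
}
```

Resizes still copy, of course, but only the bytes read so far while the
request streams in -- you never duplicate the whole payload at the end
just to obtain a byte[].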
There isn't anything magic about JNDI. You could also put the thread
pool directly into your servlet:
servlet {
    ThreadPoolExecutor sharedExecutor;
    constructor() {
        sharedExecutor = new ThreadPoolExecutor(...);
    }
    ...
}
You get to choose the parameters for the thread pool executor. I think
you probably want to limit the number of jobs to something "reasonable".
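As an illustration (all the sizes here are invented), a bounded pool with
a bounded queue makes submit() throw RejectedExecutionException once
capacity is exhausted, which is exactly the 503 path:

```java
import java.util.concurrent.*;

public class ExecutorDemo {
    public static void main(String[] args) throws InterruptedException {
        // 2 core threads, 4 max, idle extras die after 60s,
        // at most 10 queued jobs; beyond that, submit() is rejected
        ThreadPoolExecutor sharedExecutor = new ThreadPoolExecutor(
                2, 4, 60L, TimeUnit.SECONDS,
                new ArrayBlockingQueue<Runnable>(10),
                new ThreadPoolExecutor.AbortPolicy()); // reject, don't block

        try {
            sharedExecutor.submit(() -> System.out.println("job ran"));
        } catch (RejectedExecutionException ree) {
            System.out.println("rejected"); // servlet would send 503 here
        }
        sharedExecutor.shutdown();
        sharedExecutor.awaitTermination(5, TimeUnit.SECONDS);
    }
}
```

Note that the default unbounded LinkedBlockingQueue would never reject
anything; you have to bound the queue to get back-pressure.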
You may even want to have separate executors for different kinds of jobs:
servlet {
    ThreadPoolExecutor smallJobExecutor;
    ThreadPoolExecutor bigJobExecutor;
    constructor() {
        smallJobExecutor = new ThreadPoolExecutor(10, 100);
        bigJobExecutor = new ThreadPoolExecutor(1, 5);
    }
    ...
    Job job = new Job(buffer);
    try {
        if(buffer.size() > SMALL_JOB_MAX_SIZE) {
            bigJobExecutor.submit(job); // Fire and forget
        } else {
            smallJobExecutor.submit(job); // Fire and forget
        }
    ...
}
This will limit you to 5 concurrent big jobs and 100 concurrent small
jobs. You could do the same thing with atomic counters or whatever, but
this way is pretty straightforward, too. It also means that there is
"always" some reserved capacity for big jobs even when you are facing a
huge number of small jobs. That is, the small jobs can't "starve" the
big jobs out of the server simply by submitting lots of small jobs.
If you want to put those executors into JNDI, you are welcome to do so,
but there is no particular reason to. If it's convenient to configure a
thread pool executor via some JNDI injection something-or-other, feel
free to use that.
But ultimately, you are just going to get a reference to the executor
and drop the job on it.
> Next up is SSL. One of the reasons I must switch from my naked socket impl.
Nah, you can do TLS on a naked socket. But I think using Tomcat embedded
(or not) will save you the trouble of having to learn a whole lot and
write a lot of code.
TLS should be fairly easy to get going in Tomcat as long as you already
understand how to create a key+certificate.
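For reference, the NIO connector configuration in server.xml looks
something like this on Tomcat 8.5/9 (keystore path and password are
placeholders for your own):

```xml
<Connector port="8443" protocol="org.apache.coyote.http11.Http11NioProtocol"
           maxThreads="150" SSLEnabled="true">
    <SSLHostConfig>
        <Certificate certificateKeystoreFile="conf/keystore.p12"
                     certificateKeystorePassword="changeit"
                     type="RSA" />
    </SSLHostConfig>
</Connector>
```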
-chris
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]