Hey John,

Interesting mail.

On 07/11/2007, at 7:04 AM, John Casey wrote:

The point of this exercise is to improve the readability of the code, and allow someone browsing through the source to know what the potential exception chains actually are, without requiring a full static analysis of maven's entire codebase. It has the added benefit of forcing us to look at these exception chains a little bit, and figure out where we might be able to consolidate exception classes, subclass exceptions, and attach better context information at a given exception's construction.

+1 :)

In the case of the ProjectBuildingException, I believe the profusion of constructors is bad;

I'm not quite sure why this is a problem? It is good to centralise any such conversion (if it is necessary), and the important thing is that the exception only contains one type so it is easy to represent it consistently.

The other thing I've been struggling with lately is the use of java.net.URI internally within the ProjectBuildingException. I've done static analysis of maven's core, and the URI instance is not used intact anywhere in the code. While, taken with the super-POM's location, this field type might make a sort of pure sense, I don't see the value of incurring additional instability here. We're converting pom locations from (and, supposedly, to) java.io.File instances using information in this exception, but with the new changes, we're adding all sorts of new complexity in the code by constructing File instances so we can call toUri() on them...which in some cases, causes the build to tank because the file path has a space in it.

This last sentence sounds back to front - it was the URL that caused the problems, file conversions are fine? The toURI Javadoc in fact says that it is designed to ensure things get encoded properly from files :) The URL problem simply seems a bug in 1.4 that was fixed in 1.5 and we have a workaround in place now.

I think it's good to capture the URI when you have it - but you certainly don't want all the conversions happening elsewhere - it seems to make sense to support both File and URI as inputs to the API (though URI need only be in private/protected methods), but only store it as a URI when it is stored (and do the conversion at the point of storage to avoid proliferating that code).


On top of this, we have protected (and private) constructors in this class just to allow support the DRY principle on two fields (projectId and pomUri). This means that, taken with the file construction used to support the (now deprecated) string-based constructors for pomLocation, we have two layers of indirection feeding a private constructor. This seems geared to avoid duplication of calls to set these two fields in the large number of new constructors, but the way I see it, it's a massively over- normalized approach, and it makes this class much harder to read with little or no corresponding benefit. IMO, until we have more than two fields to set, private constructors on these sorts of exceptions will only confuse their readability.

I don't think readability of the exception class is particularly concerning - I would be looking at the usages of it and the javadoc :) But yeah, private constructors in an exception seem unnecessary, and frankly I think we could do away with the projectId and come up with a better way to represent a project between the URI and the ID. The ID gets passed around far more than necessary in the builder itself.

I guess maybe what we really need here is a better discussion of exactly how we support the user through errors. I've been thinking about this stuff quite a bit lately, and I realize it's not enough to have the information for the topmost and bottom-most exceptions in the chain. Instead, each place where an exception is wrapped, it's wrapped because we're traversing a layer in the architecture (or, it should be this way, in any event). This means that there is a new layer of context information associated with the site where the new wrapper is constructed, and this additional context can be critical in making the error easier to understand. Things like certain pertinent parameters, project instances that were in play when the error occurred, extra plugin information pertinent to the error, and so forth all need to be captured.

I agree.

Here are the guidelines I think through off the top of my head when doing such things: - don't be afraid to throw a runtimeexception if the caller shouldn't be expected to be able to deal with it and it is unrecoverable - don't be afraid to rethrow the same, unwrapped exception if the caller would know what it means. For example, if you get passed a file, and it's not found, throw the FileNotFoundException, don't wrap it. - use meaningful parameters in exceptions if they'll help diagnosis independently of the message - if you're coding against an api and you catch an exception that you can't show to the user in a meaningful way right now, then rethrowing it won't help - something has already gone wrong at the level deeper. - if you throw a wrapped exception, add information that helps to the message (or extra parameters) if appropriate. If the existing message is sufficient, reuse it (or maybe just throw the original exception as above). But if you don't know that the existing message will be sufficient, see rule above. - make sure whatever you throw, the calling code will know what to do with it as meaningfully as you can in the current code. - look in nested exceptions and traces for programming errors, not recoverable problems (eg, resource not found on the repository). Describe why it happened and you'll know where it happened and so will the user - and so there should be no need to propagate the nested exception. - and all the usual things about being specific (don't toss and catch Exception and Throwable, etc).


When it comes to actually traversing the tree of possible exceptions and constructing a coherent error message for the user, we also need to have some reasonable assurance that we have 90% of the possible exception chains accounted for. If new functionality results in new exception chains, we should have to *think* about the reporting implications of adding these new chains. To me, this means forcing the developer to deliberately enable the new chain by adding a constructor for the new sub-exception in the wrapper exception. It's not perfect, obviously, but it should help us remember that we need to add reporting code for this new variant of the wrapper.

The error diagnoser you added was a cool way to figure some of this out, but it was kind of covering over the mistakes made elsewhere too :) Ideally, I don't think having to traverse the exception chain should be necessary based on the guidelines above because you should always know enough about what went wrong at each level to express it to the caller explicitly. If you think about a project building exception - someone using maven-project directly, maven-embedder, and the CLI should all be able to get the same, meaningful explanation of a particular failure, right?

WDYT?

Cheers,
Brett

--
Brett Porter - [EMAIL PROTECTED]
Blog: http://www.devzuz.org/blogs/bporter/


---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]

Reply via email to