I understand that Sharan brought up the framework rewrite subject
at
ApacheCon, and some attendees felt that the framework is fine and
no
action
needs to be taken.
In this message, I will try to give a detailed explanation of why
a
framework rewrite is necessary. I don't plan to take any further
action on
this subject, because I've brought it up before without success,
and
I'm
tired of discussing it. It is my hope that the light bulb will
click
on in
someone's head and they will take action.
My Background
-------------
I became a member of the OFBiz community in 2004. I immediately
started
making contributions to the project by supplying patches to the
issue
tracker. In 2007, I became a committer. Most of my initial work
was
on the
UI and some work in the applications (mainly Asset Maintenance and
Work
Effort). I stayed away from touching the framework code because it
was
deep, dark, and scary.
Eventually, I started to understand how the framework code works
and I
made some minor modifications. As my understanding grew, I
progressed
to
rewriting large swaths of framework code - making it thread-safe,
fault
tolerant, efficient, and easier to use.
I will list some of my contributions here, so everyone can have a
clear
understanding of my experience with the framework code:
New Features
User Preferences
Visual Themes
Custom UI Label XML File Format
Temporal Expressions
Data Type Conversion Framework
Screen Widget Boundary Comments
Metrics
Integrations
UEL
iCalendar
JSR 223
WebDAV
LDAP
Refactorings/Improvements
FlexibleStringExpander
FlexibleMapExpander
FOP Integration
FreeMarkerWorker
Date-Time Handling
Mini-language
Job Scheduler
In addition, I have performed innumerable framework bug fixes.
So, the contents of this message come from years of experience
mucking
about in the framework code.
Okay, let's get started...
Initial Problem Statement
-------------------------
In 2009, David Jones started a framework rewrite in a branch:
https://svn.apache.org/repos/asf/ofbiz/branches/executioncontext20090716
At the time, there was some agreement that a rewrite was
necessary,
but
there was disagreement as to how the rewrite should be
incorporated
into
the project:
https://mail-archives.apache.org/mod_mbox/ofbiz-dev/200908.mbox/%3c455601.62605...@web63102.mail.re1.yahoo.com%3E
There were concerns that a rewrite would break backward
compatibility.
Work on the rewrite branch stopped. Eventually, Jacopo suggested
the
community be more accepting of backward-incompatible changes:
https://mail-archives.apache.org/mod_mbox/ofbiz-dev/201004.mbox/%3cd24f129d-4f9f-444e-84af-aca46f499...@hotwaxmedia.com%3e
Despite an effort to convince David to proceed with the framework
rewrite,
he ended up doing it in a separate project:
http://mail-archives.apache.org/mod_mbox/ofbiz-dev/201104.mbox/%3c07565c88-4023-4d24-93a3-a4906e86f...@me.com%3E
This page describes differences between OFBiz and Moqui, and
within
it you
can extract information on the problems David was trying to solve:
http://sourceforge.net/p/moqui/discussion/1086127/thread/4c52f240/
There was an email he sent out on the OFBiz dev list where he
listed
the
problems he saw in the framework, but I can't find it. The rest of
this
message will include the issues he mentioned (the ones I
remember).
I
was
in agreement with him at the time, and I still agree that a
framework
rewrite is necessary.
The Problems
------------
Code is scattered everywhere - due to an initial effort to make
the
framework modular. This causes serious problems. The mere fact
that
components like entityext and securityext EXIST makes it clear
that
there
are problems - those components should not be there. Also, we run
into the
recurring problem of circular dependencies (component A will not
build
unless component B is built, and component B will not build unless
component A is built).
Bad separation of concerns. There are far too many examples of
classes
that try to be everything to everyone. This makes debugging
difficult, and
it makes maintenance/improvements a nightmare. [Using an analogy,
consider
an automobile design where a spark plug is not separate from a
transmission. Instead, the automobile uses a
spark-plug-transmission.
So
when the engine is running rough because the spark plug is bad,
you
have to
replace the spark plug AND the transmission.] A good framework
example can
be found in my rewrite of the mini-language code. Originally, the
models
AND the script execution context both contained script behaviors -
making
debugging/improvements difficult. I changed it so only the models
contain
script behavior and the script execution context contains only the
script
execution state.
Lack of good OO design. There are many places where a bit of
framework
functionality is contained in a single method that is hundreds or
thousands
of lines long. There is a term for that: Brittle Code. Code isn't
reused.
Instead, it is copy-and-pasted all over - so when a problem is
found
in the
C&P code, it has to be fixed in many places instead of one.
Fail-slow design. There are a lot of places in low-level code
where
an
error condition is encountered, but instead of throwing an
exception,
the
error is ignored and maybe it is logged, or the code tries to
"guess"
at a
solution and then provide an arbitrary default behavior. I've seen
many
developers struggle with debugging a problem because they didn't
look
at
the logs, or because the error was not logged and there is no way
of
knowing what caused it. They end up spending hours single-stepping
through
code until it reaches the error.
Out-of-date code. A good example is the use of Javolution. That
library
was beneficial in the Java 1.4 days, but it is not necessary today
because
of improved garbage collection. Another good example is DCL code.
DCL
was
used extensively in OFBiz, but it is clearly documented to be an
unreliable
design (I can get it to fail 90% of the time). Some DCL code has
been
replaced, but a lot of it is still there.
Portions of the API are overly complicated. Some methods require a
collection of user-specified artifacts/arguments, which makes
client
code
complicated and verbose. (David solved that problem with his
Execution
Context.) Portions of the API are cluttered with unnecessary
"convenience
methods" - making the API harder to learn and memorize. In some
places, a
domain-specific API is spread across instance methods and static
methods
and across different classes - making the API hard to understand
and
use.
Yes, there can be good designs that require something like that,
but
in the
OFBiz framework, it exists because of a bad design, not a good
one.
Use of thread-local variables. This makes multi-threaded design
impossible. The J2EE specification and the Servlet API require one
thread
per request (and most J2EE libraries depend on that behavior), so
the
current design makes sense from a J2EE perspective, but what if I
don't
want to run the framework in a J2EE container? Which leads to the
next
problem...
Dependence on J2EE designs/APIs/libraries. There are developers in
the
Java community (myself included) who are beginning to question if
J2EE is
really necessary to run web applications. The folks at Atomikos
are
a
good
example. OFBiz does not use EJBs, so tying the framework to J2EE
does
not
make sense. It would be better if the framework was designed to
run
outside
a J2EE container, and then have container integration as an
option.
Configuration files are scattered everywhere. Anyone who has
deployed
OFBiz in a production environment will agree this is a problem.
Try
changing the HTTP/HTTPS and port settings - it is a nightmare.
Some
configuration settings are in nonsensical places.
An abysmal lack of unit testing. I don't have an exact figure for
code
coverage, but my gut feeling is coverage is less than 10%.
Basically,
we
all have our fingers crossed - hoping that the framework code
works
as
expected. This was made painfully obvious a while back when I was
looking
at some entity caching code and thought to myself "this code can't
work."
So I wrote some entity cache unit tests and confirmed that the
entity
cache
had serious problems. Think about that - years passed with no
entity
cache
unit tests and consequently we had no idea it wasn't working.
Fix Versus Rewrite
------------------
Jira issues could be created for these problems and teams of
developers
could work to fix them.
Or, we could create a branch and start over from scratch. This
time
around, there should be less push-back from people concerned about
backwards compatibility. A rewrite offers the advantage of
reconsidering
everything - like API design, general problem solving, and new
features.
I created a Wiki page for a framework design:
https://cwiki.apache.org/confluence/display/OFBADMIN/Another+Framework+Vision
but there hasn't been much interest in it. If the community
decides
to go
ahead with a rewrite, then please feel free to use the Wiki pages
as a
guide.
Sandglass Foundation
--------------------
Like David, I came to the conclusion that a framework rewrite
would
be
easier outside the OFBiz community. So, I created my own library
called
Foundation:
http://www.sandglass-software.com/products/sandglass/documents/Foundation_Brochure.pdf
(PDF)
and I only mention it here to stress how wonderful it can be to
start
with
a clean slate and design an API that is concise yet powerful.
(Please
do
not discuss Foundation here, contact me privately if you want more
information.)
Some examples of what can be done with a rewrite:
A single configuration file
Use ANSI/ISO SQL SELECT statement strings instead of
constructing
complicated Java structures
Simultaneous asynchronous queries
Relational integrity across multiple datasources
Multi-table SELECT across multiple datasources
Automatic and transparent row version control
Automatic and transparent multi-language datasource support
Abstract entities (similar to SQL user types)
Service engine throttling (protects against server
over-utilization)
Simplified security (authorization) (
https://cwiki.apache.org/confluence/display/OFBIZ/OFBiz+Security+Redesign
)
Pure interface-based API - so developers are free to modify
framework
behavior by using decorators
Thorough unit tests
Benefits of a rewrite:
Reduced resource requirements (lower hosting fees)
Reduce application development time - due to a simplified
API
Easier framework code maintenance
Better reliability
Conclusion
----------
Like I said at the start, this is all I will say about the
subject.
I'm
done trying to convince everyone. I hope someone agrees with me
and
they
are able to build support for the idea.
--
Adrian Crum
Sandglass Software
www.sandglass-software.com