It may be worth noting that:
- it is quite livable to be subscribed to commons-dev without a filter (you just need a quick delete key)
- the filtering practice is deeply anchored in the netiquette of both commons-dev and commons-user, so mails without the project prefix are very quickly flagged and corrected

paul

On 29 Dec 2008, at 04:56, Henri Yandell wrote:

I don't see why it's not a good fit for Commons. We've a few xml
related components in the dormant section. I think a component with
things for both SAX and DOM would be fine, and letting it grow
organically would also work well.

Hen

On Wed, Dec 17, 2008 at 5:09 AM, Jukka Zitting <jukka.zitt...@gmail.com> wrote:
Hi,

In the Apache Tika project [1] we use SAX quite a lot, and have
written a set of quite useful general utility classes for SAX
handling.

For example, in org.apache.tika.sax [2] we have the following:

* ContentHandlerDecorator - Convenient base class for writing
ContentHandler decorators
* EmbeddedContentHandler - Decorator that blocks startDocument() and
endDocument() calls
* TeeContentHandler - Forwards SAX events to multiple handlers
* TextContentHandler - Decorator that blocks everything but character
events (and start/endDocument)
* WriteOutContentHandler - Writes the contents of all character events
to a Writer
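
To illustrate the "tee" idea from the list above, here is a minimal sketch that forwards SAX events to several handlers at once. It is not the Tika source: the class names TeeSketch, TextCollector and ElementCounter are hypothetical, only three event types are forwarded, and it relies solely on the org.xml.sax classes shipped with the JDK.

```java
import java.io.StringReader;
import javax.xml.parsers.SAXParserFactory;
import org.xml.sax.Attributes;
import org.xml.sax.InputSource;
import org.xml.sax.SAXException;
import org.xml.sax.helpers.DefaultHandler;

// Hypothetical sketch of a tee handler; not Tika's TeeContentHandler.
class TeeSketch extends DefaultHandler {
    private final DefaultHandler[] targets;

    TeeSketch(DefaultHandler... targets) {
        this.targets = targets;
    }

    @Override
    public void startElement(String uri, String localName, String qName,
                             Attributes atts) throws SAXException {
        // Forward the event to every target, in registration order.
        for (DefaultHandler t : targets) {
            t.startElement(uri, localName, qName, atts);
        }
    }

    @Override
    public void endElement(String uri, String localName, String qName)
            throws SAXException {
        for (DefaultHandler t : targets) {
            t.endElement(uri, localName, qName);
        }
    }

    @Override
    public void characters(char[] ch, int start, int length)
            throws SAXException {
        for (DefaultHandler t : targets) {
            t.characters(ch, start, length);
        }
    }

    // Helper (an assumption for this sketch): parse a string with one handler.
    static void parse(String xml, DefaultHandler handler) throws Exception {
        SAXParserFactory.newInstance().newSAXParser()
                .parse(new InputSource(new StringReader(xml)), handler);
    }
}

// Keeps only character events, ignoring element structure.
class TextCollector extends DefaultHandler {
    final StringBuilder text = new StringBuilder();

    @Override
    public void characters(char[] ch, int start, int length) {
        text.append(ch, start, length);
    }
}

// Counts start-element events.
class ElementCounter extends DefaultHandler {
    int count;

    @Override
    public void startElement(String uri, String localName, String qName,
                             Attributes atts) {
        count++;
    }
}
```

With this sketch, a single parse can feed both a text extractor and a counter: TeeSketch.parse(xml, new TeeSketch(collector, counter)).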

In org.apache.tika.sax.xpath [3] we have a simple XPath subset
implementation that supports streaming and filtering of SAX events. In
other words, the implementation doesn't need a DOM tree to evaluate
XPath statements.
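
As a rough sketch of what evaluating a path on a SAX stream without a DOM tree can look like: the handler below tracks the path of the currently open elements and collects the direct text of elements matching one absolute path. This is an assumption-laden illustration, not the org.apache.tika.sax.xpath implementation; the class name PathTextHandler and its select() helper are hypothetical, and only plain child steps such as /doc/item/title are supported.

```java
import java.io.StringReader;
import java.util.ArrayList;
import java.util.List;
import javax.xml.parsers.SAXParserFactory;
import org.xml.sax.Attributes;
import org.xml.sax.InputSource;
import org.xml.sax.helpers.DefaultHandler;

// Hypothetical sketch; not the Tika XPath subset implementation.
class PathTextHandler extends DefaultHandler {
    private final String path;                                 // e.g. "/doc/item/title"
    private final StringBuilder current = new StringBuilder(); // path of open elements
    private final StringBuilder buffer = new StringBuilder();  // text of the current match
    final List<String> matches = new ArrayList<>();

    PathTextHandler(String path) {
        this.path = path;
    }

    private boolean atMatch() {
        return current.toString().equals(path);
    }

    @Override
    public void startElement(String uri, String localName, String qName,
                             Attributes atts) {
        current.append('/').append(qName);
        if (atMatch()) {
            buffer.setLength(0); // a new matching element starts
        }
    }

    @Override
    public void characters(char[] ch, int start, int length) {
        // Only direct text of the matching element is kept.
        if (atMatch()) {
            buffer.append(ch, start, length);
        }
    }

    @Override
    public void endElement(String uri, String localName, String qName) {
        if (atMatch()) {
            matches.add(buffer.toString());
        }
        // Pop the last path step.
        current.setLength(current.lastIndexOf("/"));
    }

    // Helper (an assumption for this sketch): stream a string and return the matches.
    static List<String> select(String xml, String path) throws Exception {
        PathTextHandler handler = new PathTextHandler(path);
        SAXParserFactory.newInstance().newSAXParser()
                .parse(new InputSource(new StringReader(xml)), handler);
        return handler.matches;
    }
}
```

Memory use depends on the element depth and the size of the matched text rather than on the size of the whole document, which is the point of the streaming approach.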

I believe this code would be useful also outside Tika, and I was
thinking that it might perhaps make sense to create a Commons project
for this. I also know of some SAX processing classes in Cocoon and
Jackrabbit that could well be of interest to a wider audience.

Do you think something like this would be interesting as a Commons
project? Are there other similar efforts that I should know of? I
looked at XML Commons in xml.apache.org, but it seems pretty dormant.

[1] http://lucene.apache.org/tika/
[2] http://lucene.apache.org/tika/apidocs/org/apache/tika/sax/package-summary.html
[3] http://lucene.apache.org/tika/apidocs/org/apache/tika/sax/xpath/package-summary.html

BR,

Jukka Zitting

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscr...@commons.apache.org
For additional commands, e-mail: dev-h...@commons.apache.org


