I've modified the tag-value embl parser under demos/tagvalue/EmblParser to tokenize the whole of the document including the feature table and the feature properties. Take it for a spin. It's running at a reasonable speed for me (tested on some of the hum*.dat files from embl). To make this usefull we need to add the layer of listeners that build objects from the events.

Matthew

Keith James wrote:
I started on a hybrid EMBL parser which combined tag-value and
JFlex/CUP for the feature table, but gave it up for more interesting
things. (It was a real drag trying to get conflicts in the feature
table BNF to resolve and then there's the syntax errors in the DB
itself.)

I'd help with this. I'm messing with the same thing in Lisp, so it
would be an interesting excercise. (Dammit! I *swore* I'd never do
another EMBL parser!)

Keith


--
BioJava Consulting LTD - Support and training for BioJava
http://www.biojava.co.uk

__________________________________________________
Do You Yahoo!?
Everything you'll ever need on one web page
from News and Sport to Email and Music Charts
http://uk.my.yahoo.com

_______________________________________________
Biojava-l mailing list  -  [EMAIL PROTECTED]
http://biojava.org/mailman/listinfo/biojava-l

Reply via email to