a. Two regular expressions might work better in some cases (the second works
on the first) - hard to say without an example.
b. XPATH extractor memory used might be an issue.
Give an example of what you want to do

On Mon, Jan 24, 2011 at 7:27 AM, thanh nguyen <[email protected]>wrote:

> Hi everyone,
>
> I have a big HTML table from which I need to extract data. The table has
> several columns. The regulation expression required to do the extraction
> job
> is very long and complex. The code is hard to debug and to maintain. I'd
> like to know what are the alternatives? Is there HTML parser that create
> DOM
> objects? I could program a postprocessor in beanshell...
>
> Thanks a lot
>
> Thanh
>

Reply via email to