a. Two regular expressions might work better in some cases, with the second applied to the output of the first. It's hard to say without an example.
b. The XPath Extractor is another option, but its memory use might be an issue.

Can you give an example of what you want to do?

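
To illustrate point (a), here is a rough sketch of the two-pass idea in plain Java. It is only an assumption about what your table looks like: the class name TwoPassTableExtractor, the method extractRows and the sample markup are made up for illustration. The first expression pulls out each row, and the second runs on that row's text to pull out the cells, so neither expression has to describe the whole table.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

// Hypothetical helper, not part of JMeter: two short regexes instead of one long one.
class TwoPassTableExtractor {
    // Pass 1: capture the content of each <tr>...</tr> element.
    private static final Pattern ROW = Pattern.compile(
            "<tr\\b[^>]*>(.*?)</tr>", Pattern.DOTALL | Pattern.CASE_INSENSITIVE);
    // Pass 2: capture the content of each <td>...</td> inside one row.
    private static final Pattern CELL = Pattern.compile(
            "<td\\b[^>]*>(.*?)</td>", Pattern.DOTALL | Pattern.CASE_INSENSITIVE);

    static List<List<String>> extractRows(String html) {
        List<List<String>> rows = new ArrayList<>();
        Matcher rowMatcher = ROW.matcher(html);
        while (rowMatcher.find()) {
            List<String> cells = new ArrayList<>();
            // The second expression works only on the text matched by the first.
            Matcher cellMatcher = CELL.matcher(rowMatcher.group(1));
            while (cellMatcher.find()) {
                cells.add(cellMatcher.group(1).trim());
            }
            // Rows without <td> cells (for example header rows using <th>) are skipped.
            if (!cells.isEmpty()) {
                rows.add(cells);
            }
        }
        return rows;
    }

    public static void main(String[] args) {
        // Made-up sample markup; replace with the real response body.
        String html = "<table>"
                + "<tr><th>Name</th><th>Qty</th></tr>"
                + "<tr><td>apple</td><td> 3 </td></tr>"
                + "<tr class=\"odd\"><td>pear</td><td>5</td></tr>"
                + "</table>";
        for (List<String> row : extractRows(html)) {
            System.out.println(row);
        }
    }
}
```

This will not cope with nested tables or unclosed tags, and it would need adapting before it could run in a BeanShell PostProcessor (older BeanShell versions do not accept generics, for instance).
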
On Mon, Jan 24, 2011 at 7:27 AM, thanh nguyen <[email protected]> wrote:

> Hi everyone,
>
> I have a big HTML table from which I need to extract data. The table has
> several columns. The regular expression required to do the extraction job
> is very long and complex. The code is hard to debug and to maintain. I'd
> like to know what the alternatives are. Is there an HTML parser that creates
> DOM objects? I could program a postprocessor in BeanShell...
>
> Thanks a lot
>
> Thanh

