[Pharo-dev] Getting some tag in an HTML file

Alexandre Bergel Thu, 13 Aug 2015 16:32:31 -0700

Hi!

Together with Nicolas, we are trying to extract all the <script …> … </script> 
elements from HTML files.
We have tried XMLDOMParser, but many web pages are not well formed, so the 
parser complains.


Has anyone tried to extract particular tags from HTML files? This looks like a 
classic task, so maybe some of you have already done it.
Is there a way to configure the parser to accept broken XML/HTML content?
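
To make the goal concrete, here is a minimal sketch of the extraction we have in mind. It is written in Python rather than Pharo, purely as an illustration, and relies on the standard-library html.parser module, which does not require well-formed input. The names ScriptCollector and extract_scripts are hypothetical helpers made up for this example; nothing here is an existing Pharo API.

```python
from html.parser import HTMLParser


class ScriptCollector(HTMLParser):
    """Collect the attributes and body text of every <script> element.

    Hypothetical helper for illustration only. HTMLParser is lenient, so
    unclosed or mismatched tags elsewhere in the page do not stop it.
    """

    def __init__(self):
        super().__init__()
        self.scripts = []    # list of (attributes dict, body text) pairs
        self._attrs = None   # attributes of the <script> currently open
        self._chunks = []    # pieces of body text seen so far

    def handle_starttag(self, tag, attrs):
        if tag == "script":
            self._attrs = dict(attrs)
            self._chunks = []

    def handle_data(self, data):
        # Only keep text that appears inside a <script> element.
        if self._attrs is not None:
            self._chunks.append(data)

    def handle_endtag(self, tag):
        if tag == "script" and self._attrs is not None:
            self.scripts.append((self._attrs, "".join(self._chunks)))
            self._attrs = None


def extract_scripts(html):
    """Return (attributes, body) for each <script> found in html."""
    collector = ScriptCollector()
    collector.feed(html)
    collector.close()
    return collector.scripts
```

Calling extract_scripts on the text of a page returns one pair per script element, including elements that only carry a src attribute and have an empty body. What we are looking for is an equivalent in Pharo.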

Cheers,
Alexandre
-- 
_,.;:~^~:;._,.;:~^~:;._,.;:~^~:;._,.;:~^~:;._,.;:
Alexandre Bergel  http://www.bergel.eu
^~:;._,.;:~^~:;._,.;:~^~:;._,.;:~^~:;._,.;:~^~:;.
