I've been using the Jericho HTML Parser (a Java API). JTidy also works well, but Jericho seems to be a much more active project, and it's easy to use.
Clay

Sent from my iPhone

On Oct 1, 2009, at 10:43 AM, Matthias Samwald <[email protected]> wrote:

> Dear SIOC community,
>
> At the moment, I am thinking about possible ways of turning existing
> bulletin boards (often based on the popular vBulletin software) into
> SIOC, by crawling them and extracting the content.
>
> Does any of you have experience with crawling bulletin boards? Is
> there any existing software that could be built upon?
>
> Cheers,
> Matthias
