I've been using the Jehrico Java API. JTidy works well also, but  
Jehrico seems like a much more active project and it's easy to use.

Clay

Sent from my iPhone

On Oct 1, 2009, at 10:43 AM, Matthias Samwald <[email protected]> wrote:

>
> Dear SIOC community,
>
> At the moment, I am thinking about possible ways of turning existing
> bulletin boards (often based on the popular vBulletin software) into
> SIOC, by crawling them and extracting the content.
>
> Does any of you have experience with crawling bulletin boards? Is
> there any existing software that could be built upon?
>
> Cheers,
> Matthias
> >

--~--~---------~--~----~------------~-------~--~----~
You received this message because you are subscribed to the Google Groups 
"SIOC-Dev" group.
To post to this group, send email to [email protected]
To unsubscribe from this group, send email to 
[email protected]
For more options, visit this group at 
http://groups.google.com/group/sioc-dev?hl=en
-~----------~----~----~----~------~----~------~--~---

Reply via email to