Dear Matthias,
we made some work on scrapping forums and mailing lists:
Diego Berrueta, Sergio Fernández and Lian Shi. Bootstrapping the
Semantic Web of Social Online Communities. WWW2008 Workshop on
Social Web Search and Mining (SWSM2008), Beijing, China, April
22, 2008.
http://www.wikier.org/stuff/research/publications/2008/SWSM2008-bootstrapping-social-semantic-web.pdf
There are many many technologies (TagSoup in Java, pyquery in python,
XSLT or many others...) that can be deployed adapting any current
crawler. But I don't know any packaged open-source product that fullfil
your requirements.
BTW, have RDFa in that forum would be cool.
Cheers,
On Thu, 2009-10-01 at 07:43 -0700, Matthias Samwald wrote:
> Dear SIOC community,
>
> At the moment, I am thinking about possible ways of turning existing
> bulletin boards (often based on the popular vBulletin software) into
> SIOC, by crawling them and extracting the content.
>
> Does any of you have experience with crawling bulletin boards? Is
> there any existing software that could be built upon?
>
> Cheers,
> Matthias
> >
--
Sergio Fernández - [email protected]
Departamento I+D+i
Fundación CTIC - www.fundacionctic.org
Tlfn: +34 984 29 12 12
Fax: +34 984 39 06 12
Edificio Centros Tecnológicos
Parque Científico Tecnológico
33203 Cabueñes - Gijón - Asturias - Spain
--~--~---------~--~----~------------~-------~--~----~
You received this message because you are subscribed to the Google Groups
"SIOC-Dev" group.
To post to this group, send email to [email protected]
To unsubscribe from this group, send email to
[email protected]
For more options, visit this group at
http://groups.google.com/group/sioc-dev?hl=en
-~----------~----~----~----~------~----~------~--~---