Dear Matthias,

we made some work on scrapping forums and mailing lists:

        Diego Berrueta, Sergio Fernández and Lian Shi. Bootstrapping the
        Semantic Web of Social Online Communities. WWW2008 Workshop on
        Social Web Search and Mining (SWSM2008), Beijing, China, April
        22, 2008.
        
http://www.wikier.org/stuff/research/publications/2008/SWSM2008-bootstrapping-social-semantic-web.pdf

There are many many technologies (TagSoup in Java, pyquery in python,
XSLT or many others...) that can be deployed adapting any current
crawler. But I don't know any packaged open-source product that fullfil
your requirements.

BTW, have RDFa in that forum would be cool.  

Cheers,

On Thu, 2009-10-01 at 07:43 -0700, Matthias Samwald wrote:
> Dear SIOC community,
> 
> At the moment, I am thinking about possible ways of turning existing
> bulletin boards (often based on the popular vBulletin software) into
> SIOC, by crawling them and extracting the content.
> 
> Does any of you have experience with crawling bulletin boards? Is
> there any existing software that could be built upon?
> 
> Cheers,
> Matthias
> > 


-- 
Sergio Fernández - [email protected]
Departamento I+D+i
Fundación CTIC - www.fundacionctic.org
Tlfn: +34 984 29 12 12
Fax:  +34 984 39 06 12
Edificio Centros Tecnológicos
Parque Científico Tecnológico
33203 Cabueñes - Gijón - Asturias - Spain


--~--~---------~--~----~------------~-------~--~----~
You received this message because you are subscribed to the Google Groups 
"SIOC-Dev" group.
To post to this group, send email to [email protected]
To unsubscribe from this group, send email to 
[email protected]
For more options, visit this group at 
http://groups.google.com/group/sioc-dev?hl=en
-~----------~----~----~----~------~----~------~--~---

Reply via email to