Hi,

I want to use Solr 3.1 to index the content of a website. Rather than using a 
web crawler to fetch the content and load it into Solr I want to use the DIH to 
get the data from the Content Management Database that supports the website.

It would be possible to write SQL to obtain a complete set of metadata (for 
example DC.subject or DC.type) for each page or binary document stored in the 
database, using the JDBCDataSource. One of the values obtained would be the 
HTTP URL of the actual page or document, and I would need to obtain and index 
this content as well.

Could you tell me if its possible to nest entities that use a URLDataSource 
inside entities that use a JDBCDataSource ?

Andy


Reply via email to