This sounds like, for the database source, that using SolrJ would
be the way to go. Assuming you can access the database from
Java this is pretty easy.

As for the website, Nutch is certainly an option...

But I'm a little puzzled. You mention a website, and sharepoint
as your sources, then ask about accessing the DB. How are
all these related?

Best
Erick

On Tue, Jan 17, 2012 at 8:38 AM, ahammad <ahmed.ham...@gmail.com> wrote:
> Hello,
>
> I am looking into indexing two data sources. One of those is a standard
> website and the other is a Sharepoint site. The problem is that I have no
> direct database access. Normally I would just use the DIH and get what I
> need from the DB. I do have a java DAO (data access object) class that I am
> using to directly to fetch information for a different purpose.
>
> In cases like this, what would be the best way to index the data? Should I
> somehow integrate Nutch as the crawler? Should I write a custom DIH? Can I
> use the DAO that I have in conjunction with the DIH?
>
> I am really looking for some recommendations here. I do have a few hacks
> that can be done (copy the data in a DB and index with DIH), but I am
> interested in the proper way. Any insight will be greatly appreciated.
>
> Cheers
>
> --
> View this message in context: 
> http://lucene.472066.n3.nabble.com/How-can-I-index-this-tp3666106p3666106.html
> Sent from the Solr - User mailing list archive at Nabble.com.

Reply via email to