I have searched through the parent forum and child forums and have not found an answer to this question yet. I have seen a few of the same questions, but none have responded. I have data in a database that is using .NET Lucene to index. We have written our own query builders and moving everything to Nutch is not an option. What I would like to do is move the content, boost, and urls to the database and use our existing Lucene indexer to index this. I am not opposed to writing java code for this, so keep that option open.
One approach that I see is to write a parser for the segread to grab this information. Another approach I have seen is that you can add JDBC Directory, but I haven't seen any good documentation on this or how to implement it. One other approach that I can think of is to add a match url during indexing which will search Nutch index for the same url as the database and then somehow pull the content from that. This seems like a slow process as well. These are all theories and haven't been applied. I am trying to get this done ASAP, so any and all information is appreciated. Please, Please be as ellaborate as possible. Thanks and best Regards... Mike -- View this message in context: http://www.nabble.com/Need-Help-ASAP-tf3477041.html#a9705651 Sent from the Nutch - User mailing list archive at Nabble.com. ------------------------------------------------------------------------- Take Surveys. Earn Cash. Influence the Future of IT Join SourceForge.net's Techsay panel and you'll get the chance to share your opinions on IT & business topics through brief surveys-and earn cash http://www.techsay.com/default.php?page=join.php&p=sourceforge&CID=DEVDEV _______________________________________________ Nutch-general mailing list [email protected] https://lists.sourceforge.net/lists/listinfo/nutch-general
