There is also a CLucene project... which is a re-implementation of Java-Lucene in C++. It's LGPL also.
http://sourceforge.net/projects/clucene/ I saw this months ago but hesitated posting it as I didn't want to step on toes.... plenty of people have pored sweat into our current code. ;-) Lucene & CLucene are very fast with respect to searching large indexes.... I don't know how it compares to HtDig.... or how Lucene/CLucene does compressing indexes..... and there are only a few active developers for CLucene. It is creeping towards a 1.0 release. Lucene/CLucene is an indexing/retrieval library.... it has NO spidering code. One proposal would be for HtDig 4.0, we do code cleanup and rework the codebase into using a substantially similar API to Lucene/CLucene for storing & searching. This is what I'm hoping to do with libhtdig (which I am still working on). At that point we could see where CLucene has gone and decide what to do next. There is also an effort to keep the Java Lucene compiling (natively) with gjc. This might be of interest here: http://www.stanford.edu/class/cs276a/projects/reports/dsu800.pdf On Sun, 18 Jan 2004, Lachlan Andrew wrote: > I hadn't heard of Lucene, so thanks for pointing out its existence. > > It would be great if some of the Lucene developers want to join the > ht://Dig team and help us port ht://Dig to Java+Lucene. Personally, > I don't know Java, so I wouldn't be able to contribute much to that > effort... > On Sun, 18 Jan 2004 15:42, Jeff Breidenbach wrote: > > What's the current thinking with respect to Lucene? To me, Lucene > > looks high quality, Java, and mostly a search library. As opposed > > to htdig, which I think of as mostly a drop-in-place website search > > engine application. So the question on my mind: Does Lucene look > > like a possible candidate for powering HtDig4? I see lots of > > chatter about htdig on the lucene-dev mailing list, but not much of > > the reverse. > > > > -Jeff > > > > http://jakarta.apache.org/lucene/docs/index.html > > http://www.mail-archive.com/[EMAIL PROTECTED] > > http://www.mail-archive.com/[EMAIL PROTECTED] > > > > > > ------------------------------------------------------- > > The SF.Net email is sponsored by EclipseCon 2004 > > Premiere Conference on Open Tools Development and Integration > > See the breadth of Eclipse activity. February 3-5 in Anaheim, CA. > > http://www.eclipsecon.org/osdn > > _______________________________________________ > > ht://Dig Developer mailing list: > > [EMAIL PROTECTED] > > List information (subscribe/unsubscribe, etc.) > > https://lists.sourceforge.net/lists/listinfo/htdig-dev > > -- > [EMAIL PROTECTED] > ht://Dig developer DownUnder (http://www.htdig.org) > > > ------------------------------------------------------- > The SF.Net email is sponsored by EclipseCon 2004 > Premiere Conference on Open Tools Development and Integration > See the breadth of Eclipse activity. February 3-5 in Anaheim, CA. > http://www.eclipsecon.org/osdn > _______________________________________________ > ht://Dig Developer mailing list: > [EMAIL PROTECTED] > List information (subscribe/unsubscribe, etc.) > https://lists.sourceforge.net/lists/listinfo/htdig-dev > Neal Richter Knowledgebase Developer RightNow Technologies, Inc. Customer Service for Every Web Site Office: 406-522-1485 ------------------------------------------------------- The SF.Net email is sponsored by EclipseCon 2004 Premiere Conference on Open Tools Development and Integration See the breadth of Eclipse activity. February 3-5 in Anaheim, CA. http://www.eclipsecon.org/osdn _______________________________________________ ht://Dig Developer mailing list: [EMAIL PROTECTED] List information (subscribe/unsubscribe, etc.) https://lists.sourceforge.net/lists/listinfo/htdig-dev
