There's already a Lucene port to .NET: http://lucene.apache.org/lucene.net/
Although you are posting on the mono-vb list, I personally wouldn't use or recommend VB.NET for that (Mono's VB compiler is only VB.NET 2.0 [VB8] compatible). Try C# 4.0 (the dmcs compiler on Mono) or Boo as the main language. Your arms, wrists and fingers will thank you later. :)

Rafael "Monoman" Teixeira
---------------------------------------
"To be creative means to be in love with life. You can be creative only if you love life enough that you want to enhance its beauty, you want to bring a little more music to it, a little more poetry to it, a little more dance to it."
Osho

2010/2/18 quandary <[email protected]>

> I've wanted to do that for a long time.
>
> You can take a look at Apache Lucene, a Java search library, which you
> could port to .NET.
> Perhaps you can find a way to compile the Lucene library from Java
> source/bytecode directly to .NET.
>
> Another way is to extend this CodeProject project:
> http://www.codeproject.com/KB/IP/Crawler.aspx
>
> Then you need a ranking algorithm, such as Google PageRank, or perhaps
> better something like Yahoo TrustRank, plus a parallel computation library
> and cluster software for computing the eigenvectors of the Markov chains
> (indexing).
>
> I found this site about PageRank particularly useful because of its
> incredible simplicity:
> http://www.peterbe.com/PageRank-in-Python
>
> On 02/17/2010 03:21 PM, Mauro Risonho de Paula Assumpção wrote:
>
> I am developing open source software which needs a web crawler. I would
> like help from the list. The idea is to scan the structure of the site
> (HTTP and HTTPS), showing it in a treeview in VB.NET with GTK (Mono). Does
> anyone have any ideas?
>
> Thanks
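
On the ranking step mentioned in the quoted message: PageRank is the stationary distribution of a Markov chain over the link graph, which is the "eigenvector" being referred to. For a small site it can be approximated by plain power iteration on one machine. The following is only a minimal sketch in Python (the language of the PageRank page linked above), not code from any of the projects named in this thread. The function name pagerank, the damping value 0.85 and the four-page link graph are all assumptions made for illustration.

```python
def pagerank(links, damping=0.85, tol=1e-10, max_iter=200):
    """Approximate PageRank by power iteration.

    links maps each page to the list of pages it links to
    (hypothetical input format, e.g. what a crawler might collect).
    """
    # Every page that appears as a source or as a link target.
    pages = sorted(set(links) | {t for ts in links.values() for t in ts})
    n = len(pages)
    rank = {p: 1.0 / n for p in pages}  # start from a uniform distribution

    for _ in range(max_iter):
        # Rank held by pages without outgoing links is spread evenly.
        dangling = sum(rank[p] for p in pages if not links.get(p))
        new = {p: (1.0 - damping) / n + damping * dangling / n for p in pages}

        # Each page passes its damped rank on in equal shares to its targets.
        for src, targets in links.items():
            if targets:
                share = damping * rank[src] / len(targets)
                for t in targets:
                    new[t] += share

        # Stop once the ranks have effectively stopped changing.
        if sum(abs(new[p] - rank[p]) for p in pages) < tol:
            return new
        rank = new
    return rank


# Assumed toy link graph: page -> pages it links to.
graph = {"a": ["b", "c"], "b": ["c"], "c": ["a"], "d": ["c"]}
ranks = pagerank(graph)
for page in sorted(ranks, key=ranks.get, reverse=True):
    print(page, round(ranks[page], 4))
```

The ranks always sum to 1, so they can be read as the probability that a random surfer is on each page. A crawler would only need to supply the links dictionary; a cluster becomes relevant only once the graph no longer fits on one machine.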
_______________________________________________
Mono-vb mailing list
[email protected]
http://lists.ximian.com/mailman/listinfo/mono-vb
