Change IndexFiles to record term frequency as well?

2011-11-09 Thread Daniel Quach
I am currently using Lucene to index a dump of Wikipedia. I'm using the demo's IndexFiles function for the most part, but I also want to store the term frequency of a document in the index as well, is this possible? Right now, the index just stores the (term -> document pathname) mappings, b

setting up lucene for use on mac OSX

2011-10-24 Thread Daniel Quach
Hi all, I am unable to get the lucene demo to run on my macbook pro. I downloaded the jars into my home directory and then I set the CLASSPATH variable to point to them. However, once I run the example command for the lucene demo, it still complains to me about the missing class. Is there som

Using Lucene to index Wikipedia

2011-10-20 Thread Daniel Quach
How do I use the Lucene Benchmark to index a wikipedia dump? I want to be able to execute phrase queries on the latest english wikipedia page dump. I'm trying to look for example use cases but I haven't found any. I downloaded the latest english dump, named: enwiki-latest-pages-articles.xml.bz