About "join.search" in 3.4 version.

2011-10-20 Thread Mead Lai
Hello all, I see there is an "org.apache.lucene.search.join" package in Lucene 3.4, but I found no demo for the join function in the source package "lucene-3.4.0-src.tar". I have some articles that can be modified by editors, with a relationship like this: an article : modify

Re: How do you see if a tokenstream has tokens without consuming the tokens ?

2011-10-20 Thread Paul Taylor
On 19/10/2011 15:17, Steven A Rowe wrote: Hi Paul, What version of Lucene are you using? The JFlex spec you quote below looks pre-v3.1? Yes, we copied a version of StandardTokenizer from 2.4 to make some changes, we are actually on 3.1 now but haven't spent any time looking at the new token

Re: this IndexReader is closed only with jar

2011-10-20 Thread Mead Lai
It's a complex situation. We don't know what you changed. I suggest you compare/diff your code with the official version. Regards, Mead On Tue, Oct 18, 2011 at 1:13 AM, Zeynep P. wrote: > Hi, > > I am having a weird experience. I made a few changes with the source code > (Lucene 3.3). I creat

Re: How can i search lucene java user list archive?

2011-10-20 Thread Mihai Caraman
http://apache-pivot-users.399431.n3.nabble.com/how-to-search-mailing-list-td1876948.html 2011/10/20 janwen > I want to know how to search the java user list archive. > There is no search function on the site: > http://mail-archives.apache.org/mod_mbox/lucene-java-user/ > Any idea? > thanks > > 2

Re: How can i search lucene java user list archive?

2011-10-20 Thread Otis Gospodnetic
Have a look at http://search-lucene.com/ where you can search Lucene mailing list archives (user, dev, common), its web site, wiki, source code, jira, etc., as well as the same types of data for Solr, Nutch, and so on. Otis Sematext :: http://sematext.com/ :: Solr - Lucene - Nutch Lucene eco

Re: About "join.search" in 3.4 version.

2011-10-20 Thread Michael McCandless
I don't think the new join package in Lucene 3.4 will work for this case; you need a more general join implementation, which e.g. Solr and ElasticSearch have implemented. Generic join hasn't yet been factored out into Lucene (but I think it really needs to be... any volunteers!?). Lucene's join packa

Return Lucene field name when a query is matched

2011-10-20 Thread damian2b
Hi, I was given a task to investigate whether it is possible to return the Lucene field name when a query is matched. At the moment our application returns the usual matched docs, but the new requirement would be to also know which field matched the query (e.g. found in title, header, etc.). We use

Re: Return Lucene field name when a query is matched

2011-10-20 Thread Mihai Caraman
So now you have something like query[title,content,header,...]. Evidently you can find out by running query[title], query[content], and query[header] separately, but you'd have to merge the results. Maybe there's a collector for this. 2011/10/20 damian2b > Hi, > > I was given a task to investigate whether it is po

Re: Return Lucene field name when a query is matched

2011-10-20 Thread Ian Lea
You can work it out from the Explanation returned by the IndexSearcher.explain method. Note the performance warning in the javadocs. -- Ian. On Thu, Oct 20, 2011 at 2:35 PM, Mihai Caraman wrote: > So now you have something like query[title,content,header,...]. > Evidently you can find out by query

Using Lucene to index Wikipedia

2011-10-20 Thread Daniel Quach
How do I use the Lucene Benchmark to index a Wikipedia dump? I want to be able to execute phrase queries on the latest English Wikipedia page dump. I've been looking for example use cases but haven't found any. I downloaded the latest English dump, named: enwiki-latest-pages-articles.xml.bz

idf calculation in Lucene ?

2011-10-20 Thread David Ryan
According to https://lucene.apache.org/java/3_4_0/api/core/org/apache/lucene/search/Similarity.html idf(t) = 1 + log(numDocs / (docFreq + 1)). For example, in the following example, ln(26
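
For reference, the formula quoted in this message can be checked numerically. The following is a minimal sketch in plain Java, not Lucene's own code: the class name IdfSketch and its idf method are made up for illustration, the input values are not taken from the thread, and it assumes "log" means the natural logarithm (as the "ln" in the poster's example suggests).

```java
public class IdfSketch {
    // Hypothetical helper mirroring the quoted formula:
    //   idf(t) = 1 + ln(numDocs / (docFreq + 1))
    // docFreq = number of documents containing the term,
    // numDocs = total number of documents in the index.
    public static double idf(long docFreq, long numDocs) {
        return 1.0 + Math.log((double) numDocs / (double) (docFreq + 1));
    }

    public static void main(String[] args) {
        // Illustrative values only.
        // docFreq + 1 == numDocs, so the log term is ln(1) = 0.
        System.out.println(idf(9, 10)); // prints 1.0

        // A rarer term (smaller docFreq) gets a larger idf.
        System.out.println(idf(1, 1000) > idf(100, 1000)); // prints true
    }
}
```

Under this reading, the "+1" in the denominator keeps the division defined when docFreq is 0, and the leading "1 +" keeps idf positive for terms that occur in nearly every document.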

Re: About "join.search" in 3.4 version.

2011-10-20 Thread Mead Lai
Thank you, Mike. Are you sure Solr has implemented a join function? I just skimmed through some tour guides about Solr and am not sure about that. I appreciate your help very much. I figured out another way to handle this problem. Our system also has duplication of these articles and the records (about wh

Re: Return Lucene field name when a query is matched

2011-10-20 Thread Mead Lai
Your description was not clear. A query returns many results, so if you query with Boolean OR, the matched field name can differ from one hit to the next. If you use Boolean AND, then every field will match. Regards, Mead On Thu, Oct 20, 2011 at 11:22 PM, Ian Lea wrote: > You can work it out

Re: About "join.search" in 3.4 version.

2011-10-20 Thread Mead Lai
Now I have created a filter by overriding "DocIdSet getDocIdSet(IndexReader reader) throws IOException". It works nicely, but I am anxious about the efficiency. The *limit[]* would contain one hundred thousand article_id inside(10,000), and fetch one thousand articles by querying keywords on conte