Re: Sub-Graphs in Hnsw

2025-06-05 Thread Michael Sokolov
; > > > > > > > Hi Michael, > > > > > > > > > > The docs range could vary in extremes from few 10s to > > tens-of-thousands > > > > > and in very heavy usage cases, 100k and above… in a single segment > > >

Re: Sub-Graphs in Hnsw

2025-06-05 Thread Michael Froh
gt; > > On Mon, Jun 2, 2025 at 11:32 AM Ravikumar Govindarajan > > > wrote: > > > > > > > > Hi Michael, > > > > > > > > The docs range could vary in extremes from few 10s to > tens-of-thousands > > > > and in very heavy us

Re: Sub-Graphs in Hnsw

2025-06-05 Thread Michael Sokolov
gt; > > > > > Hi Michael, > > > > > > The docs range could vary in extremes from few 10s to tens-of-thousands > > > and in very heavy usage cases, 100k and above… in a single segment > > > > > > Filtered Hnsw like you said uses a single gra

Re: Sub-Graphs in Hnsw

2025-06-04 Thread Ravikumar Govindarajan
avikumar Govindarajan > wrote: > > > > Hi Michael, > > > > The docs range could vary in extremes from few 10s to tens-of-thousands > > and in very heavy usage cases, 100k and above… in a single segment > > > > Filtered Hnsw like you said uses a single gr

Re: Sub-Graphs in Hnsw

2025-06-02 Thread Michael Sokolov
e docs range could vary in extremes from few 10s to tens-of-thousands > and in very heavy usage cases, 100k and above… in a single segment > > Filtered Hnsw like you said uses a single graph.., which could be better if > designed as sub-graphs > > On Mon, 2 Jun 2025 at 5:42 PM, Mic

Re: Sub-Graphs in Hnsw

2025-06-02 Thread Ravikumar Govindarajan
Hi Michael, The docs range could vary in extremes from few 10s to tens-of-thousands and in very heavy usage cases, 100k and above… in a single segment Filtered Hnsw like you said uses a single graph.., which could be better if designed as sub-graphs On Mon, 2 Jun 2025 at 5:42 PM, Michael

Re: Sub-Graphs in Hnsw

2025-06-02 Thread Michael Sokolov
How many documents do you anticipate in a typical sub range? If it's in the hundreds or even low thousands you would be better off without hnsw. Instead you can use a function score query based on the vector distance. For larger numbers where hnsw becomes useful, you could try using filtered

Sub-Graphs in Hnsw

2025-06-02 Thread Ravikumar Govindarajan
given OrdRange. A sub-graph, to be precise.. The generated segment will contain a lot of these sub-graphs but without any neighbour links to each other at Level-0. Level-1 and above can have cross-links, which should be fine.. Searches will be based on OrdRange and should stop once the sub-gr

Re: Exact sub-phrase matching?

2020-09-25 Thread Erick Erickson
Have you looked at edismax, pf2 and pf3? On Fri, Sep 25, 2020, 15:07 Gregg Donovan wrote: > Hello! > > I'm wondering what the state-of-the-art for matching exact sub phrases > within Lucene is. As a bonus, I'd love to attach a boost to each of the > subphrases match

Exact sub-phrase matching?

2020-09-25 Thread Gregg Donovan
Hello! I'm wondering what the state-of-the-art for matching exact sub phrases within Lucene is. As a bonus, I'd love to attach a boost to each of the subphrases matched (if possible). For example: doc 1: "field": "tree skirt spring skirt spring dress" doc 2:

sub

2017-06-19 Thread kenny mcfarland

Re: Statically store sub-collections for search (faceted search?)

2013-04-15 Thread SUJIT PAL
; Sent: Monday, April 15, 2013 9:43 PM >> To: java-user@lucene.apache.org >> Subject: Re: Statically store sub-collections for search (faceted search?) >> >> Hi Uwe, >> >> Thanks for the info, I was under the impression that it didn't... I got this &g

RE: Statically store sub-collections for search (faceted search?)

2013-04-15 Thread Uwe Schindler
Hi, Original Message- > From: Sujit Pal [mailto:sujitatgt...@gmail.com] On Behalf Of SUJIT PAL > Sent: Monday, April 15, 2013 9:43 PM > To: java-user@lucene.apache.org > Subject: Re: Statically store sub-collections for search (faceted search?) > > Hi Uwe, > > Th

Re: Statically store sub-collections for search (faceted search?)

2013-04-15 Thread SUJIT PAL
> - > Uwe Schindler > H.-H.-Meier-Allee 63, D-28213 Bremen > http://www.thetaphi.de > eMail: u...@thetaphi.de > > >> -Original Message- >> From: Carsten Schnober [mailto:schno...@ids-mannheim.de] >> Sent: Monday, April 15, 2013 9:53 AM >>

Re: Statically store sub-collections for search (faceted search?)

2013-04-15 Thread Carsten Schnober
Am 15.04.2013 13:43, schrieb Uwe Schindler: Hi, > Passing NULL means all documents are allowed, if this would not be the case, > whole Lucene queries and filters would not work at all, so if you get 0 docs, > you must have missed something else. If this is not the case, your filter may > behav

RE: Statically store sub-collections for search (faceted search?)

2013-04-15 Thread Uwe Schindler
Hi, > Hi again, > > >>> You are somehow "misusing" acceptDocs and DocIdSet here, so you > have > >> to take care, semantics are different: > >>> - For acceptDocs "null" means "all documents allowed" -> no deleted > >>> documents > >>> - For DocIdSet "null" means "no documents matched" > >> > >> O

Re: Statically store sub-collections for search (faceted search?)

2013-04-15 Thread Carsten Schnober
Am 15.04.2013 11:27, schrieb Uwe Schindler: Hi again, >>> You are somehow "misusing" acceptDocs and DocIdSet here, so you have >> to take care, semantics are different: >>> - For acceptDocs "null" means "all documents allowed" -> no deleted >>> documents >>> - For DocIdSet "null" means "no docume

RE: Statically store sub-collections for search (faceted search?)

2013-04-15 Thread Uwe Schindler
Hi, > > AcceptDocs in Lucene are generally all non-deleted documents. For your > call to Filter.getDocIdSet you should therefor pass > AtomicReader.getLiveDocs() and not Bits.MatchAllBits. > > I see. As far as I understand the documentation, getLiveDocs() returns null if > there are no deleted d

Re: Statically store sub-collections for search (faceted search?)

2013-04-15 Thread Carsten Schnober
Am 15.04.2013 10:42, schrieb Uwe Schindler: > Not every DocIdSet supports bits(). If it returns null, then bits are not > supported. To enforce a bitset availabe use CachingWrapperFilter (which > internally uses a BitSet to cache). > It might also happen that Filter.getDocIdSet() returns null, w

RE: Statically store sub-collections for search (faceted search?)

2013-04-15 Thread Uwe Schindler
There might be 2 problems: Not every DocIdSet supports bits(). If it returns null, then bits are not supported. To enforce a bitset availabe use CachingWrapperFilter (which internally uses a BitSet to cache). It might also happen that Filter.getDocIdSet() returns null, which means that no docum

Re: Statically store sub-collections for search (faceted search?)

2013-04-15 Thread Carsten Schnober
Am 15.04.2013 10:04, schrieb Uwe Schindler: > The limit also applies for filters. If you have a list of terms ORed > together, the fastest way is not to use a BooleanQuery at all, but instead a > TermsFilter (which has no limits). Hi Uwe, thanks for the pointer, this looks promising! The only mi

RE: Statically store sub-collections for search (faceted search?)

2013-04-15 Thread Uwe Schindler
> -Original Message- > From: Carsten Schnober [mailto:schno...@ids-mannheim.de] > Sent: Monday, April 15, 2013 9:53 AM > To: java-user@lucene.apache.org > Subject: Re: Statically store sub-collections for search (faceted search?) > > Am 12.04.2013 20:08, schrieb SUJI

Re: Statically store sub-collections for search (faceted search?)

2013-04-15 Thread Carsten Schnober
Am 12.04.2013 20:08, schrieb SUJIT PAL: > Hi Carsten, > > Why not use your idea of the BooleanQuery but wrap it in a Filter instead? > Since you are not doing any scoring (only filtering), the max boolean clauses > limit should not apply to a filter. Hi Sujit, thanks for your suggestion! I wasn

Re: Statically store sub-collections for search (faceted search?)

2013-04-13 Thread Shai Erera
sujit > > On Apr 12, 2013, at 7:34 AM, Carsten Schnober wrote: > > > Dear list, > > I would like to create a sub-set of the documents in an index that is to > > be used for further searches. However, the criteria that lead to the > > creation of that sub-set are not p

Re: Statically store sub-collections for search (faceted search?)

2013-04-12 Thread SUJIT PAL
ike to create a sub-set of the documents in an index that is to > be used for further searches. However, the criteria that lead to the > creation of that sub-set are not predefined so I think that faceted > search cannot be applied my this use case. > > For instance: > A user sea

Statically store sub-collections for search (faceted search?)

2013-04-12 Thread Carsten Schnober
Dear list, I would like to create a sub-set of the documents in an index that is to be used for further searches. However, the criteria that lead to the creation of that sub-set are not predefined so I think that faceted search cannot be applied my this use case. For instance: A user searches for

multiply sub-scores in boolean query

2011-07-17 Thread Pavel Goncharik
Hi, as far as I can see, boolean scorers always sum up scores of their sub-scorers. It works, but in case of my application it's required to multiply sub-scores. Is there a simple/efficient way to do this (apart from modifying lucene's source code)? It seems to me that standard t

Re: sub-scores for all clauses in a BooleanQuery

2009-04-28 Thread Chris Hostetter
be to eliminate the queryNorm completely (you can override it in your Similarity class) ... depending on your use case you might not need it at all. : So how can I get the correct sub-scores for *all* clauses of a : BooleanQuery, regardless of whether they matched or not? well, first off: if a

sub-scores for all clauses in a BooleanQuery

2009-04-17 Thread Steven Bethard
I have a BooleanQuery with several clauses. After running a search, in addition to seeing the overall score of each document, I need to see the sub-score produced by each clause. When all clauses match, this is relatively easy to get back by ".explain(...)", which gives me something

RE: matching sub phrases in user entered query...

2008-07-15 Thread Preetham B.R
user query as is using some set of lucene queries and get most relevant results without worrying much about the internals of scoring. In my case, I know that each field will most likely match some sub phrase of the user query and need to have a query or solr request handler which handles this case

Re: matching sub phrases in user entered query...

2008-07-15 Thread Preetam Rao
That is very good performance. But, If I take, on an average, 6 terms per user query, and looking at shingles of size 2 I will have a boolean OR of 5 shingle phrase queries. How better is this compared to a single sub phrase query which would internally be just like another phrase query with

Re: matching sub phrases in user entered query...

2008-07-15 Thread Karl Wettin
e user query as is using some set of lucene queries and get most relevant results without worrying much about the internals of scoring. In my case, I know that each field will most likely match some sub phrase of the user query and need to have a query or solr request handler which handles

Re: matching sub phrases in user entered query...

2008-07-14 Thread Preetam Rao
user query as is using some set of lucene queries and get most relevant results without worrying much about the internals of scoring. In my case, I know that each field will most likely match some sub phrase of the user query and need to have a query or solr request handler which handles this case

RE: matching sub phrases in user entered query...

2008-07-14 Thread Steven A Rowe
Hi Preetam, On 07/14/2008 at 1:40 PM, Preetam Rao wrote: > Is there a query in Lucene which matches sub phrases ? > [snip] > > I was redirected to Shingle filter which is a token filter > that spits out n-grams. But it does not seem to be best solution > since one does not kn

matching sub phrases in user entered query...

2008-07-14 Thread Preetam Rao
Hi, Sorry if you get this mail second or third time. Getting mail delivery errors from gmail for some unknown reason. This is my last attempt at sending the mail for the day.. :-) Is there a query in Lucene which matches sub phrases ? For example if the document text is "new york exi

matching sub phrases in user entered query...

2008-07-14 Thread Preetam Rao
Hi, Sorry if you get this mail second time. Having some trouble with mail box. Is there a query in Lucene which matches sub phrases ? For example if the document text is "new york existing homes *3 bed 2 bath*homes 3 miles from city center 2 rooms" and if user enters "Brookly

matching sub phrases from user entered query...

2008-07-14 Thread Preetam Rao
Hi, Is there a query in Lucene which matches sub phrases ? For example if the document text is "new york existing homes *3 bed 2 bath*homes 3 miles from city center 2 rooms" and if user enters "Brooklyn homes with *3 bed *rooms and swimming pools", I would like to recogn

Re: Boost Sub-Query

2007-10-24 Thread Erick Erickson
> > > Is it possible to boost subqueries with QueryParser? > > For example: > ((apple AND banana)^10 OR orange) > > Thanks > > > > > -- > View this message in context: > http://www.nabble.com/Boost-Sub-Query-tf4685212.html#a13388793 > Sent from

Boost Sub-Query

2007-10-24 Thread java_user_
Is it possible to boost subqueries with QueryParser? For example: ((apple AND banana)^10 OR orange) Thanks -- View this message in context: http://www.nabble.com/Boost-Sub-Query-tf4685212.html#a13388793 Sent from the Lucene - Java Users mailing list archive at Nabble.com

Invoking luke when No sub-file with id ... found

2006-05-07 Thread Paul Elschot
Dear readers, I had this error message from luke when opening a recent index: No sub-file with id ... found After looking around a bit on the web I found the problem mentioned several times, but no solution. Putting the lucene jar that created the index on the classpath before lukeall.jar

Re: sub search

2006-03-07 Thread Daniel Noll
Anton Potehin wrote: Is it possible to make search among results of previous search? After it I want to not make a new search, I want to make search among found results... Simple. Create a new BooleanQuery and put the original query into it, along with the new query. Daniel -- Daniel

Re: sub search

2006-03-07 Thread Eric Jain
Anton Potehin wrote: After it I want to not make a new search, > I want to make search among found results... Perhaps something like this would work: final BitSet results = toBitSet(Hits); searcher.search(newQuery, new Filter() { public BitSet bits(IndexReader reader) { return results;

Re: sub search

2006-03-07 Thread Erik Hatcher
,true,false); hits = Searcher.search(bq,queryFilter); -Original Message- From: hu andy [mailto:[EMAIL PROTECTED] Sent: Tuesday, March 07, 2006 12:40 PM To: java-user@lucene.apache.org Subject: Re: sub search Importance: High 2006/3/7, Anton Potehin <[EMAIL PROTECTED]>: Is it possib

Re: sub search

2006-03-07 Thread hu andy
EMAIL PROTECTED] > Sent: Tuesday, March 07, 2006 12:40 PM > To: java-user@lucene.apache.org > Subject: Re: sub search > Importance: High > > 2006/3/7, Anton Potehin <[EMAIL PROTECTED]>: > > > > Is it possible to make search among results of previous search? >

RE: sub search

2006-03-07 Thread anton
); -Original Message- From: hu andy [mailto:[EMAIL PROTECTED] Sent: Tuesday, March 07, 2006 12:40 PM To: java-user@lucene.apache.org Subject: Re: sub search Importance: High 2006/3/7, Anton Potehin <[EMAIL PROTECTED]>: > > Is it possible to make search among results of previous s

Re: sub search

2006-03-07 Thread hu andy
2006/3/7, Anton Potehin <[EMAIL PROTECTED]>: > > Is it possible to make search among results of previous search? > > > > > > For example: I made search: > > > > Searcher searcher =... > > > > Query query = ... > > > > Hits hits = > > > > hits = Searcher.search(query); > > > > > > > > After it

sub search

2006-03-07 Thread Anton Potehin
Is it possible to make search among results of previous search?   For example: I made search: Searcher searcher =... Query query = ... Hits hits = hits = Searcher.search(query);   After it I want to not make a new search, I want to make search among found results

Re: No sub-file with id _18.f0 found

2006-01-24 Thread Otis Gospodnetic
06 07:31:45 AM EST Subject: Re: No sub-file with id _18.f0 found is there a web page that lists all the files created in a index so i can track down the problem im having im using the latest source via svn and have rebuild using ant everytime i create an index no-matter how basic i get error

Re: No sub-file with id _18.f0 found

2006-01-24 Thread gekkokid
kokid" <[EMAIL PROTECTED]> To: Sent: Monday, January 23, 2006 2:35 PM Subject: No sub-file with id _18.f0 found hi, when i try to view my index with luke i get the loading error: "No sub-file with id _18.f0 found". any ideas what could be causing this? im using IndexWriter.se

No sub-file with id _18.f0 found

2006-01-23 Thread gekkokid
hi, when i try to view my index with luke i get the loading error: "No sub-file with id _18.f0 found". any ideas what could be causing this? im using IndexWriter.setUseCompoundFile(true) in the past it has worked fine without any problems, im on win xp with java 1.5 Regar