question about

2009-08-02 Thread Leonard Gestrin
Hello, I have question about KEYWORD type and searching/updating. I am getting strange behavior that I can't quite comprehend. My index is created using standard analyzer, which used for writing and searching. It has three fields userpin - alphanumeric field which is stored as TEXT docume

Question about Indexsearcher.search()

2019-01-25 Thread NDelt
Hello. I'm trying to make sample search application with Lucene. search() method of IndexSearcher class searches documents with given query, and returns TopDocs instance. TopDocs instance has array of ScoreDoc instances. My question is: 1. Will query be tokenized during search? 2. If so, does Sco

Question about Benchmark

2022-05-16 Thread balmukund mandal
Hi All, I was trying to run the benchmark and had a couple of questions. Indexing takes a long time, so is there a way to configure the benchmark to use an already existing index for search? Also, is there a way to configure the benchmark to use multiple threads for indexing (looks to me that it’s

question about IndexWriter.maxFieldLength

2005-05-17 Thread Pablo Gomes Ludermir
Dear all, I would like to know about the maxFieldLength. It says on the Javadocs that it limits "The maximum number of terms that will be indexed for a single field in a document." So, for instance, in my "contents" field, I would have it limited by default to 10.000 terms. But which terms are tho

Re: question about

2009-08-03 Thread Erick Erickson
s what the effect of different analyzers is. Best Erick On Sun, Aug 2, 2009 at 10:44 PM, Leonard Gestrin < leonard.gest...@markettools.com> wrote: > Hello, > I have question about KEYWORD type and searching/updating. I am getting > strange behavior that I can't quite comp

RE: question about

2009-08-03 Thread Leonard Gestrin
Thank you -Original Message- From: Erick Erickson [mailto:erickerick...@gmail.com] Sent: Monday, August 03, 2009 6:21 AM To: java-user@lucene.apache.org Subject: Re: question about When you construct a Term manually, no analyzers are applied, it'sconstructed with whatever you p

Question about IndexCommit

2009-09-01 Thread Ted Stockwell
Hi All, I am interested in using Lucene to index RDF (Resource Description Format) data. Ultimately I want to create a transactional interface to the data with proper transaction isolation. Therefore I am trying to educate myself on the details of index readers and writers, I am using v2.9rc2.

Question About Hits

2008-04-04 Thread Matthew Hall
This is more of a trying to understand the design sort of question, but its still something I need to able to succinctly express to my project manager. I know that lucene is by design not allowing us to see which fields were hit for a given document in an easy manner. Instead it presents us w

Question about QueryParser

2008-10-23 Thread James liu
public class AnalyzerTest { @Test public void test() throws ParseException { QueryParser parser = new MultiFieldQueryParser(new String[]{"title", "body"}, new StandardAnalyzer()); Query query1 = parser.parse("中文"); Query query2 = parser.parse("中 文"); System.out.pri

Question about relevance

2010-01-08 Thread Yannick Caillaux
Hi, I index 2 documents. the first contains the word "Wallis" in the title field. The second has the same title but "Wallis" is replaced by "Wall". I execute the query : "title:wallis" During the search, "Wallis" is cut by the FrenchAnalyzer and becomes "wall". So the two documents are results

question about Scorer.freq()

2010-10-03 Thread Koji Sekiguchi
Hello, I'd like to know which field got hit in each doc in the hit results. To implement it, I thought I could use Scorer.freq() which was introduced 3.1/4.0: https://issues.apache.org/jira/browse/LUCENE-2590 But I didn't become successful so far. What I did is: - in each visit methods in MockS

question about ScoreDocComparator

2007-03-01 Thread Ulf Dittmer
Hello- One of the fields in my index is an ID, which maps to a full text description behind the scenes. Now I want to sort the search results alphabetically according to the description, not the ID. This can be done via SortComparatorSource and a ScoreDocComparator without problems. But t

question about HitCollector

2007-03-28 Thread is_maximum
very strange when I move between documents I saw that it shows all the document regardless of given query!!! thanks regards -- View this message in context: http://www.nabble.com/question-about-HitCollector-tf3484569.html#a9727557 Sent from the Lucene - Java Users mailing list archive at Nabble.com

question about lucene

2007-06-01 Thread Tanya Levshina
Hi, I've just downloaded Lucene, tried demo and looked at the documentation. The Indexing and Searching work great and fast but I also need to display all the actual "hits": the lines from the files that match a particular query. Does Lucene provide means to do it? Thanks a lot, Tanya

Question about MultiSimilarity

2013-06-26 Thread Nikos Voskarides
I am using MultiSimilarity to compute CombSum and I have noticed that the computeNorm() method takes the value of the first Similarity in the array. Is it safe to use MultiSimilarity with similarities that have different computeNorm() implementations? Kind Regards, Nikos Voskarides

Question about JoinUtil

2014-12-11 Thread Glen Newton
Is there any reason JoinUtil (below) does not have a 'Query toQuery' available? I was wanting to filter on the 'to' side as well. I feel I am missing something here. To make sure this is not an XY problem, here is my use case: I have a many-to-many relationship. The left, join, and right 'table'

question about wildcards

2011-06-21 Thread G.Long
Hi :) I've got the following text indexed with simpleAnalyzer : "security is a real problem." If I try to search for secu*, it will find the document. But if I try to search for curi*, there are no results. I raed that it's not possible to add a * wildcard at the begining of the query so wh

Question about MaxFieldLength

2011-08-27 Thread roz dev
Hi All I have a question regarding MaxFieldLength. Is it a limit for number of tokens in 1 field per document or entire index? Example: If MaxFieldLength is set to 100 and I add a document which has 105 tokens for 1 field then I expect 5 tokens to be ignored. but, if i add another document which

Question about CustomScoreQuery

2012-02-16 Thread Dominika Puzio
Hello, I'm trying to understand the behavior of CustomScoreQuery. It seemed to me, that default CustomScoreQuery(Query subQuery, ValueSourceQuery valSrcQuery) should return a score that is a product of subQuery score and valSrcQuery score. So I wrote a simple test case given below: @Test

Question about BooleanQuery

2012-08-23 Thread heikki
View this message in context: http://lucene.472066.n3.nabble.com/Question-about-BooleanQuery-tp4002822.html Sent from the Lucene - Java Users mailing list archive at Nabble.com. - To unsubscribe, e-mail: java-user-unsubscr...@l

Question about Field.TermVector

2006-07-18 Thread Liao Xuefeng
hi, i'm using lucene 2.0. To index a very long text i use Field.Index.TOKENIZED & Field.Store.NO. I don't know how to get its content (actually, only need words near keywords, like google's results: ... found this keyword here...) without querying database? Someone told me using term vector to p

Re: Question about Indexsearcher.search()

2019-01-25 Thread Tomoko Uchida
Hi, Tokenization is usually performed by a query parser before searching and the result documents may include all terms or some of the terms or only one term in the query string (it depends on your query configuration). > I'm trying to make sample search application with Lucene. Have you checked

Question about PhraseQuery's capacity...

2020-01-10 Thread 小鱼儿
I use SmartChineseAnalyzer to do the indexing, and add a document with a TextField whose value is a long sentence, when anaylized, will get 18 terms. & then i use the same value to construct a PhraseQuery, setting slop to 2, and adding the 18 terms concequently... I expect the search api to find

Re: Question about Benchmark

2022-05-16 Thread Adrien Grand
Hi Balmukund, What benchmark are you talking about? On Mon, May 16, 2022 at 4:35 PM balmukund mandal wrote: > > Hi All, > I was trying to run the benchmark and had a couple of questions. Indexing > takes a long time, so is there a way to configure the benchmark to use an > already existing index

Re: Question about Benchmark

2022-05-16 Thread Mikhail Khludnev
Hi, Balmukund. Assuming you are asking about Lucene benchmark module. 1) If one build index once, it's possible to start benchmark with ResetSystemSoft that keep index files intact and allow to benchmark search again and again, without waiting long for reindex. 2) Check indexing-multithreaded.alg

Re: Question about Benchmark

2022-05-17 Thread balmukund mandal
Hi All, It's my apologies for not mentioning the benchmark which i was using. Also, i realized that i've not subscribed to this group,hence duplicating this mail. The below queries are for ANN-Benchmark https://issues.apache.org/jira/browse/LUCENE-9625 Indexing takes a long time, so is there a way

Re: Question about Benchmark

2022-05-17 Thread Michael Sokolov
OK I replied on the issue. This ann-benchmarks is a separate project, and I think you are asking about how to change it. Probably should take it up with erikbern or whatever community is supporting that actively. I just created a "plugin" so we could use it to test Lucene's KNN implementation, but

Re: question about IndexWriter.maxFieldLength

2005-05-17 Thread Erik Hatcher
On May 17, 2005, at 5:33 AM, Pablo Gomes Ludermir wrote: Dear all, I would like to know about the maxFieldLength. It says on the Javadocs that it limits "The maximum number of terms that will be indexed for a single field in a document." So, for instance, in my "contents" field, I would have it lim

Re: question about IndexWriter.maxFieldLength

2005-05-17 Thread Paul Elschot
On Tuesday 17 May 2005 11:33, Pablo Gomes Ludermir wrote: > Dear all, > > I would like to know about the maxFieldLength. It says on the Javadocs > that it limits "The maximum number of terms that will be indexed for a > single field in a document." So, for instance, in my "contents" field, > I wou

question about grouping text

2009-03-24 Thread MFM
://www.nabble.com/question-about-grouping-text-tp22682433p22682433.html Sent from the Lucene - Java Users mailing list archive at Nabble.com. - To unsubscribe, e-mail: java-user-unsubscr...@lucene.apache.org For additional commands, e

Re: Question about IndexCommit

2009-09-01 Thread Shai Erera
If I'm not mistaken, IndexReader reads the .del file into memory, and therefore subsequent updates to it won't be visible to it. Shai On Tue, Sep 1, 2009 at 3:54 PM, Ted Stockwell wrote: > Hi All, > > I am interested in using Lucene to index RDF (Resource Description Format) > data. > Ultimatel

Re: Question about IndexCommit

2009-09-01 Thread Michael McCandless
Further, when IndexWriter writes new .del files, it's always to a new (next generation) filename, so that the old .del file remains present. This means if a fresh IndexReader is opened, it will load the old .del file, and still not see any of IndexWriter's pending changes. Mike On Tue, Sep 1, 20

Re: Question about IndexCommit

2009-09-01 Thread Ted Stockwell
That's excellent. Thanks very much for the explanations - Original Message > From: Michael McCandless > To: java-user@lucene.apache.org > Sent: Tuesday, September 1, 2009 8:26:45 AM > Subject: Re: Question about IndexCommit > > Further, when IndexWrite

Re: Question about IndexCommit

2009-09-01 Thread Chris Hostetter
: Subject: Question about IndexCommit : In-Reply-To: <9ac0c6aa0909010403k3306307dxa7751ecff3fa2...@mail.gmail.com> http://people.apache.org/~hossman/#threadhijack Thread Hijacking on Mailing Lists When starting a new discussion on a mailing list, please do not reply to an existing m

Question about Boolean Operators

2008-01-01 Thread CowboyX
? Are each of these pair of operators functionally equivalent? >From the examples provided in the Query Syntax documentation at: http://lucene.apache.org/java/docs/queryparsersyntax.html#Boolean%20operators it looks as if these are the same. Any comments would be welcome on this. -- View this mess

Question about Search formula

2008-01-11 Thread thrgroovyboy
ld be most relevant than the only one ppt slide... Is there something that I can do ? Thanks a lot, Fab -- View this message in context: http://www.nabble.com/Question-about-Search-formula-tp14757377p14757377.html Sent from the Lucene - Java Users mailing list archive at

A question about IndexerReader.termPositions()

2008-01-15 Thread Terry Yang
Hi,ALL Playing with an algorithm(Summarize/Highlight Based on Slide Windows), i find that IndexerReader.termPositions(Term term) not support wildcard term. Is it meaningful or not to write a patch to support wildcard term? - To u

Question about indexing (BrazilianAnalyzer)

2008-06-03 Thread Vinicius Carvalho
Hello there! I'm indexing documents using the BrazilianAnalyzer, and I've noticed that many words are not being indexed. I store and index the entire doc (I'm doing this in order to present the fragments on the results, don't know if its the best way, mostly on large docs, any ideas?). Well using l

Re: Question about QueryParser

2008-10-23 Thread Erick Erickson
It looks to me like you've got a space between the characters in the second example Best Erick 2008/10/23 James liu <[EMAIL PROTECTED]> > public class AnalyzerTest { > @Test > public void test() throws ParseException { > QueryParser parser = new MultiFieldQueryParser(new String[]{"

RE: Question about QueryParser

2008-10-23 Thread Steven A Rowe
Hi James, On 10/23/2008 at 8:30 AM, James liu wrote: > public class AnalyzerTest { >@Test >public void test() throws ParseException { >QueryParser parser = new MultiFieldQueryParser(new String[]{"title", > "body"}, new StandardAnalyzer()); >Query query1 = parser.parse("中文"

Re: Question about QueryParser

2008-10-23 Thread James liu
thks steve, i get it. 2008/10/24 Steven A Rowe <[EMAIL PROTECTED]> > Hi James, > > On 10/23/2008 at 8:30 AM, James liu wrote: > > public class AnalyzerTest { > >@Test > >public void test() throws ParseException { > >QueryParser parser = new MultiFieldQueryParser(new > String[]{"ti

Re: Question about relevance

2010-01-08 Thread Erik Hatcher
One technique I've seen commonly used is to index both stemmed and unstemmed fields, and during search query both and boost the unstemmed field matches higher. Erik On Jan 8, 2010, at 4:05 AM, Yannick Caillaux wrote: Hi, I index 2 documents. the first contains the word "Wallis" in

Question about Field.setOmitTermFreqAndPositions(true)

2010-05-31 Thread Li Li
I read in 'lucene in action" that to save space, we can omit termfreq and postion information. But as far as I know, lucene's default scoring model is vsm, which need tf(term,doc) to calcuate score. If there is no tf saved. Will the relevance score be correct? -

Re: question about Scorer.freq()

2010-10-04 Thread Koji Sekiguchi
Hi Mike, Hmm are you only gathering the MUST_NOT TermScorers? (In which case I'd expect that the .docID() would not match the docID being collected). Or do you also see .docID() not matching for SHOULD and MUST sub queries? The snippet I copy-n-paste at previous mail was not appropriate. Sor

Re: question about Scorer.freq()

2010-10-04 Thread Michael McCandless
Hmm are you only gathering the MUST_NOT TermScorers? (In which case I'd expect that the .docID() would not match the docID being collected). Or do you also see .docID() not matching for SHOULD and MUST sub queries? Also, are you sure you are getting BooleanScorer2? Because I think this feature

Re: question about Scorer.freq()

2010-10-04 Thread Michael McCandless
On Mon, Oct 4, 2010 at 7:32 AM, Koji Sekiguchi wrote: > Hi Mike, > >> Hmm are you only gathering the MUST_NOT TermScorers?  (In which case >> I'd expect that the .docID() would not match the docID being >> collected).  Or do you also see .docID() not matching for SHOULD and >> MUST sub queries? >

Re: question about Scorer.freq()

2010-10-04 Thread Michael McCandless
On Mon, Oct 4, 2010 at 1:12 PM, Michael McCandless wrote: > I'll open an issue for this. OK I opened https://issues.apache.org/jira/browse/LUCENE-2686 Mike - To unsubscribe, e-mail: java-user-unsubscr...@lucene.apache.org For

Question about custom Analyzer

2010-11-04 Thread heikki
hello Lucene list, I have a question about a custom Analyzer we're trying to write. The intention is that it tokenizes on whitespace, and abstracts over upper/lowercase and accented characters. It is used both when indexing documents, and before creating lucene queries from search terms. I

Question about Case Sensitive?!

2011-02-10 Thread Gong Li
Hi, I use standardAnalyzer, queryParser, highlighter in my program, but they lowercase the keywords. Now i need to search the keywords CASE SENSITIVE. Is there any methods to achieve this and also use standardAnalyzer and queryParser? Or some other ways? HOW??? Thx.

simple (?) question about scoring

2006-11-02 Thread Michele Amoretti
Hello, I am completely new at Lucene. I browsed the web site and the source code, searching for an example which illustrates the following operation: given a Query and a Document, return the score To me, this is a very basic operation, but I cannot find a class which easily provide this function

Re: question about ScoreDocComparator

2007-03-04 Thread Erick Erickson
Maybe I'm missing something in turn, but why not just use a Sort object at search time? You can have a Hits object or TopFIeldDocs object returned (the Filter in some of these calls can be null). Best Erick On 3/1/07, Ulf Dittmer <[EMAIL PROTECTED]> wrote: Hello- One of the fields in my ind

Re: question about ScoreDocComparator

2007-03-06 Thread Ulf Dittmer
Well, I am using a Sort object ("Hits = Search.search(Query, Filter, Sort)" actually). In setting up the SortField array for that Sort object with a SortComparatorSource the issue comes up that I need to access the field value that is being used for sorting. Maybe that's just the way Lucene

Re: question about lucene

2007-06-01 Thread Erick Erickson
No. Lucene is an *engine*, not an app that has a lot of stuff built on top of it out of the box. You have to index enough information to figure this out somehow. Best Erick On 6/1/07, Tanya Levshina <[EMAIL PROTECTED]> wrote: Hi, I've just downloaded Lucene, tried demo and looked at the do

RE: question about lucene

2007-06-01 Thread Tanya Levshina
: question about lucene No. Lucene is an *engine*, not an app that has a lot of stuff built on top of it out of the box. You have to index enough information to figure this out somehow. Best Erick On 6/1/07, Tanya Levshina <[EMAIL PROTECTED]> wrote: > > Hi, > > > > I&#

Re: question about lucene

2007-06-01 Thread Erick Erickson
07, Tanya Levshina <[EMAIL PROTECTED]> wrote: Wow, it was fast! Thanks. Do you know about any existing application that is built on top of lucene that provides this functionality? Tanya -Original Message- From: Erick Erickson [mailto:[EMAIL PROTECTED] Sent: Friday, June 01, 2007 7

RE: question about lucene

2007-06-01 Thread Will Johnson
: java-user@lucene.apache.org Subject: Re: question about lucene Nope. But here's what I think you can do (although I haven't tried this exactly, so caveat emptor). Document doc = new Document(); doc.add("text", line1); doc.add("text", line2); doc.add("text&qu

Re: question about lucene

2007-06-01 Thread mark harwood
e From: Will Johnson <[EMAIL PROTECTED]> To: java-user@lucene.apache.org Sent: Friday, 1 June, 2007 2:02:17 PM Subject: RE: question about lucene Solr, which is built on top of lucene and adds highlighting among other features, gets close to what you want. Check out: http://wiki.apache.

Re: question about lucene

2007-06-01 Thread Chris Lu
Thanks. Do you know about any existing application that is built on top of lucene that provides this functionality? Tanya -Original Message- From: Erick Erickson [mailto:[EMAIL PROTECTED] Sent: Friday, June 01, 2007 7:18 AM To: java-user@lucene.apache.org Subject: Re: question about luce

Re: Question about search

2007-06-27 Thread tanya
Hi, >Have you used Luke to examine your index and try queries? This will tell you a >LOT about what's *really* happening. >Google 'lucene' 'luke' and try it. I've tried Luke but still have no clue what is going on: I have the following entry: 2007-06-26T10:56:20-05:00 globus-gatekeeper:

Re: Question about search

2007-06-27 Thread Erick Erickson
Please take the time, before asking others "what's going on" to at least format your mail so we can tell what's what. For instance, what's a field and what's a value in what you sent? I sure can't tell because there are so many colons. Remember that you're asking people to contribute time to solve

simple question about decRef

2013-05-31 Thread Yonghui Zhao
After we use IndexReader do we always need call decRef explicitly? What will happen, if I don't call decRef? Thanks Sent from my iPad - To unsubscribe, e-mail: java-user-unsubscr...@lucene.apache.org For additional commands, e-m

Question about the CompoundWordTokenFilterBase

2013-09-18 Thread Alex Parvulescu
Hi, While trying to play with the CompoundWordTokenFilterBase I noticed that the behavior is to include the original token together with the new sub-tokens. I assume this is expected (haven't found any relevant docs on this), but I was wondering if it's a hard requirement or can I propose a small

Question about SearcherManager.maybeReopen() method.

2013-11-06 Thread Alexei Morgado
Hi guys, I have a question about a problem we had with the SearcherManager.maybeReopen() method. This works as charm except when the segments in the previous index has the exact same names as the segments in the new one. In this case the maybeReopen() doesn't load the new index, even t

Question about SearcherManager.maybeReopen() method.

2013-11-06 Thread Alexei Morgado
Hi guys, I have a question about a problem we had with the SearcherManager.maybeReopen() method. This works as charm except when the segments in the previous index has the exact same names as the segments in the new one. In this case the maybeReopen() doesn't load the new index, even t

Question about SearcherManager.maybeReopen() method.

2013-11-06 Thread Alexei Morgado
Hi guys, I have a question about a problem we had with the SearcherManager.maybeReopen() method. This works as charm except when the segments in the previous index has the exact same names as the segments in the new one. In this case the maybeReopen() doesn't load the new index, even t

Re: Question about JoinUtil

2014-12-16 Thread Glen Newton
Anyone? On Thu, Dec 11, 2014 at 2:53 PM, Glen Newton wrote: > Is there any reason JoinUtil (below) does not have a 'Query toQuery' > available? I was wanting to filter on the 'to' side as well. I feel I > am missing something here. > > To make sure this is not an XY problem, here is my use case:

Re: Question about JoinUtil

2014-12-16 Thread Gregory Dearing
Glen, Lucene isn't relational at heart and may not be the right tool for what you're trying to accomplish. Note that JoinQuery doesn't join 'left' and 'right' answers; rather it transforms a 'left' answerset into a 'right' answerset. JoinQuery is able to perform this transformation with a single

Re: Question about JoinUtil

2014-12-17 Thread Glen Newton
Hi Gregory, Thanks for your reply. In reading it, I realized that one side of my relational join wasn't that large, and I could bring it in as a couple of fields to the main document without any penalty, so my need to join two different document types then goes away. Thanks! :-) Glen On Tue,

Question about open files

2011-04-05 Thread Jean-Baptiste Reure
Hi all, I have been looking for information about this and found a few things here and there but nothing very clear on when files are opened and closed by Lucene. We have an application that uses Lucene quite heavily in the following fashion: there are multiple indexes in use at all times. For ea

Re: question about wildcards

2011-06-21 Thread Ian Lea
See the javadocs for QueryParser.setAllowLeadingWildcard(boolean allowLeadingWildcard). And from the FAQ, see http://wiki.apache.org/lucene-java/LuceneFAQ#What_wildcard_search_support_is_available_from_Lucene.3F Be sure to heed the warnings about performance. -- Ian. On Tue, Jun 21, 2011 at 4:

Re: question about wildcards

2011-06-21 Thread G.Long
Thank you for the tip :) I'll try it. Regards, Gary Le 21/06/2011 17:38, Ian Lea a écrit : See the javadocs for QueryParser.setAllowLeadingWildcard(boolean allowLeadingWildcard). And from the FAQ, see http://wiki.apache.org/lucene-java/LuceneFAQ#What_wildcard_search_support_is_available_from_

RE: question about wildcards

2011-06-21 Thread Hiller, Dean x66079
...@gmail.com] Sent: Tuesday, June 21, 2011 9:43 AM To: java-user@lucene.apache.org Subject: Re: question about wildcards Thank you for the tip :) I'll try it. Regards, Gary Le 21/06/2011 17:38, Ian Lea a écrit : > See the javadocs for QueryParser.setAllowLeadingWildcard

Re: question about wildcards

2011-06-21 Thread Danny Lade
ords > reversed.depends on your application profile I guess and what you want, > but an additional index may not be too bad in some cases to speed up the > search. > > Dean > > -Original Message- > From: G.Long [mailto:jde...@gmail.com] > Sent: Tuesday, June

Re: question about wildcards

2011-06-27 Thread G.Long
Thank you for your advices :) I'll try this. Regards, Gary Le 21/06/2011 22:28, Danny Lade a écrit : IMO, a "reversed word Index" does not work in this case, because he's looking for a word in the middle (See curi*). Another idea is to build word chunks and save them in a second index plus d

Some question about Lucene

2011-07-10 Thread Ing. Yusniel Hidalgo Delgado
Hello I'm a new Lucene user. I have the following question: is posible to build a crawler/spider with Lucene library or Lucene is only for index/search phases. I am studying three project: Nutch, Lucene and Solr but I don't see what is the main difference between them. Greetings . --

Re: Question about MaxFieldLength

2011-08-27 Thread Li Li
It will affect the entire index because it 's a parameter of IndexWriter. but you can modify it anytime you like before IndexWriter.addDocument. If you want to truncate different fields with different maxLength. you should avoid multithreads' race condition. maybe you can add a TokenFilter t

Re: Question about MaxFieldLength

2011-08-27 Thread roz dev
Thanks Li. It makes sense On Sat, Aug 27, 2011 at 7:28 PM, Li Li wrote: > It will affect the entire index because it 's a parameter of IndexWriter. > but you can modify it anytime you like before IndexWriter.addDocument. > If you want to truncate different fields with different maxLength. you

RE: Question about MaxFieldLength

2011-08-28 Thread Uwe Schindler
o: java-user@lucene.apache.org > Subject: Re: Question about MaxFieldLength > >It will affect the entire index because it 's a parameter of IndexWriter. > but you can modify it anytime you like before IndexWriter.addDocument. > If you want to truncate different fields with

Question about prefix query

2011-09-06 Thread Akos Tajti
Dear List, I'm running a prefix query, something like this: text:dummy*. The problem: in the result some non-exact matches get higher scores than the exact ones. For example the document containing dummythales comes before the document containing dummy exactly. How can this behavious be changed?

my question about lucene

2011-11-09 Thread 强继朋
lucene, I hava a problem i don't know how to do, it's about Score Formula of lucene. In the package of lucene, it provide a method in Class Similarity. My question : if i want to only use some factors of Formula, such as TF and IDF. And then i add some additional factors, in aim to

Re: Question about CustomScoreQuery

2012-02-20 Thread Ian Lea
I can't explain this. Can you get at an oal.search.Explanation? You could write your own CustomScoreProvider - that might help you to double check what is being passed to it, and/or allow you to provide your own calculation. -- Ian. On Thu, Feb 16, 2012 at 8:32 PM, Dominika Puzio wrote: > He

Re: Question about CustomScoreQuery

2012-02-21 Thread Dominika Puzio
Thanks for your answer. I checked what explain() says about my queries, and: MatchAllDocsQuery: 1.0 = (MATCH) MatchAllDocsQuery, product of: 1.0 = queryNorm FieldScoreQuery: 0.5 = (MATCH) float(ratio), product of: 0.5 = float(ratio)=0.5 1.0 = boost 1.0 = queryNorm CustomScoreQuery: 0.24

RE: Question about CustomScoreQuery

2012-02-21 Thread Uwe Schindler
ne.apache.org > Subject: Re: Question about CustomScoreQuery > > Thanks for your answer. I checked what explain() says about my queries, and: > > MatchAllDocsQuery: > 1.0 = (MATCH) MatchAllDocsQuery, product of: >1.0 = queryNorm > > FieldScoreQuery: > 0.5 = (MATCH) float

Re: Question about CustomScoreQuery

2012-02-21 Thread Dominika Puzio
thetaphi.de eMail: u...@thetaphi.de -Original Message- From: Dominika Puzio [mailto:dominika.pu...@gmail.com] Sent: Tuesday, February 21, 2012 10:27 AM To: java-user@lucene.apache.org Subject: Re: Question about CustomScoreQuery Thanks for your answer. I checked what explain() says ab

Lucene Question about Query

2012-05-06 Thread Yogesh patel
Hi I am using lucene for search implementation . I have created index with two columns (1) Name (2) Year I stored values like AC 2001 AC 2002 ACC 2004 ACC 2001 ACJ 2006 ACJ 2007 I want to query in index to get year of AC ..When i query with Name:AC or Name: "AC" , it gives me all record with

Re: Question about BooleanQuery

2012-08-23 Thread Jack Krupansky
cedence. If you need strict evaluation order, use parentheses (or multi-level BooleanQuery at the Lucene Query level.) Read more detail at: http://searchhub.org/dev/2011/12/28/why-not-and-or-and-not/ -- Jack Krupansky -Original Message- From: heikki Sent: Thursday, August 23, 201

Re: Question about BooleanQuery

2012-08-23 Thread heikki
http://lucene.472066.n3.nabble.com/Question-about-BooleanQuery-tp4002822p4002836.html Sent from the Lucene - Java Users mailing list archive at Nabble.com. - To unsubscribe, e-mail: java-user-unsubscr...@lucene.apache.org For addition

Re: Question about BooleanQuery

2012-08-23 Thread Jack Krupansky
cene.apache.org Subject: Re: Question about BooleanQuery thanks Jack for your answer, however I'm not quite sure what to do with it: the query is like +( someField_1:0 someField_2:0 ) +booleanField_1:false -( +( someField_1:0 someField_2:0 ) +booleanField_2:true ) (I put

Re: Question about BooleanQuery

2012-08-23 Thread heikki
this message in context: http://lucene.472066.n3.nabble.com/Question-about-BooleanQuery-tp4002822p4002854.html Sent from the Lucene - Java Users mailing list archive at Nabble.com. - To unsubscribe, e-mail: java-user-unsubscr...@lu

Re: Question about BooleanQuery

2012-08-23 Thread Jack Krupansky
August 23, 2012 9:13 AM To: java-user@lucene.apache.org Subject: Re: Question about BooleanQuery OK, it's not the idea that the nested NOT query has got anything to do with booleanField_1, so I'll try to phrase very clearly what I want : the query should return docs where ( someField

Question about scoring normalisation

2005-11-05 Thread Karl Koch
Hello all, I am wondering how many of you actually work with own scoring mechanism (overwriting Lucenes standard scoring) and how many of you do work on how to normalise this score. I would like to add a second score on top of Lucenes TF/IDF score. The resulting score is most likely higher then

Question about special characters

2006-05-24 Thread Dan Wiggin
I need some functionality and I don't know how to do. The problem is special characters like à, ä , ç or ñ latin characters in the text. Now I use iso latin filter, but the problem is when I want to obtain most term used. These term are stored without ` ´ ^ or another "character attribute". For ex

Re: Question about Field.TermVector

2006-07-18 Thread freeman
http://www.lucene.com.cn/fc.htm maybe it's helpful. On 7/18/06, Liao Xuefeng <[EMAIL PROTECTED]> wrote: hi, i'm using lucene 2.0. To index a very long text i use Field.Index.TOKENIZED & Field.Store.NO. I don't know how to get its content (actually, only need words near keywords, like google's

Question about threading in search

2017-09-03 Thread Peilin Yang
I was wondering if anyone can shed some light on an issue we're having: we're comparing two different indexes on the same collection - one with lots of different segments (default settings), and one with a force merged into one segment. It seems that search is sometimes faster with multiple segmen

Question about usage of LuceneTestCase

2018-08-21 Thread Tomoko Uchida
Hello, Could you give me some advice or comments about usage of LuceneTestCase. Some of our unit tests extending LuceneTestCase fail by assertion error -- sometimes, randomly. I suppose we use LuceneTestCase in inappropriate way, but cannot find out how to fix it. Here is some information about

Question about BytesRef and BinaryDocValues

2018-08-23 Thread Kevin Manuel
Hi, I'm using lucene version 4.3.1 and I've implemented a custom score query. I'm trying to read the value for a field from the field cache. It's a text field so I'm using getTerms which returns a binarydocvalues object. However on trying to get the bytes ref object for a document and converting

Re: Question about PhraseQuery's capacity...

2020-01-10 Thread Adrien Grand
It should match. My guess is that you might not reusing the same positions as set by the analysis chain when creating the phrase query? Can you show us how you build the phrase query? On Fri, Jan 10, 2020 at 9:24 AM 小鱼儿 wrote: > I use SmartChineseAnalyzer to do the indexing, and add a document w

Re: Question about PhraseQuery's capacity...

2020-01-10 Thread 小鱼儿
Hi Adrien, I find i might make a mistake: There is 2 level processing in a Analyzer class: one is Tokenizer, which is HMMChineseTokenizer, and the other is Analyzer which may apply some filtering... I'm using lucene's default interface to set a Analyzer instance to do the indexing, b

Re: Question about PhraseQuery's capacity...

2020-01-10 Thread 小鱼儿
After i directly call Analyzer.tokenStream() method to extract terms from query, i still cannot get results. Doesn't know the why... Code when build index: IndexWriterConfig iwc = new IndexWriterConfig(analyzer); //new SmartChineseAnalyzer(); Code do query: (1) extract terms from query

Re: Question about PhraseQuery's capacity...

2020-01-10 Thread Mikhail Khludnev
Hello, Sometimes IndexSearcher.explain(Query, int) allows to analyse mismatches. On Fri, Jan 10, 2020 at 1:13 PM 小鱼儿 wrote: > After i directly call Analyzer.tokenStream() method to extract terms from > query, i still cannot get results. Doesn't know the why... > > Code when build index: >

  1   2   3   4   5   >