Re: Maven Lucene plugin ?

2007-11-28 Thread Patrick
Hi, Take a look at Proximity (http://proximity.abstracthorizon.org/px1/) a Maven Proxy that include Lucene search. Patrick Olivier Dehon wrote: Hello, Has anyone worked on a lucene maven plugin? I am thinking of embedding a lucene index as part of a maven artifact, so that artifact

Re: Inverted letters

2008-02-12 Thread Patrick
Did you take a look at the org.apache.lucene.analysis.ngram.NGramTokenFilter? Or other ngram implementation? Works great for us. Patrick Ulrich Vachon wrote: Hi all, It's possible to use simplely (without java preprocessing, if possible) Lucene to find items with this constraints

Match all documents with non empty field

2008-07-02 Thread Patrick
th it? Many thanks. Patrick - To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-mail: [EMAIL PROTECTED]

Re: Multiple merge-runs from same set of segments

2021-05-24 Thread Patrick Zhai
log.thetaphi.de/2012/07/use-lucenes-mmapdirectory-on-64bit.html Best Patrick Ravikumar Govindarajan 于2021年5月24日周一 上午9:54写道: > Thanks Michael! > > This was just what I was looking for!!. Just a couple of questions. > > >- When we call addIndexes(IndexReader...),

Re: Multiple merge-runs from same set of segments

2021-05-27 Thread Patrick Zhai
Sorry for the delayed response, as for caching termDict data across threads, I do not aware of any existing lucene mechanism could do that (and it might be tricky since it is across threads), but maybe worth trying to see whether we can get some extra speed based on that! Patrick Ravikumar

Re: ContainingIntervalsSource alternative

2021-06-02 Thread Patrick Zhai
Hi Elbek, Maybe go with ContainedByIntervalsSource? ContainingIntervalsSource is actually the big source filtered by small source, and ContainedByIntervalsSource is the opposite so it should give the expect behavior? Best Patrick elbek kamoliddinov 于2021年6月2日周三 下午2:55写道: > Hello every

Multi-Value query test

2022-06-23 Thread Patrick Bernardina
How to test if a value in a multi-value field matches a specific query? Example of the problem: I've created a query to return all documents of some specific authors. The authors field contains multi-value sorted set. When showing the result, I want to show only the name of the authors specified

Multi-Value Query Test

2022-06-23 Thread Patrick Bernardina
How to test if a value in a multi-value field matches a specific query? Example of the problem: I've created a query to return all documents of some specific authors. The authors field contains multi-value sorted set. When showing the result, I want to show only the name of the authors specified

Re: Multi-Value query test

2022-06-23 Thread Patrick Bernardina
Let me clarify: Example query: "(author:Patrick author:Michael) && type:pdf" Example result: 2 items: Doc1 with authors "Patrick, Adalberto" and Doc2 with authors "Patrick, Michael, Elias" I want to show the 2 items, but when I show the authors, I only wa

Is there a way to customize segment names?

2022-12-15 Thread Patrick Zhai
<https://github.com/apache/lucene/blob/main/lucene/core/src/java/org/apache/lucene/index/StandardDirectoryReader.java#L218>, and seems all we can do right now is to reload the whole index and that could be potentially a high cost. Sorry for the long email and thank you in advance for any replies! Best Patrick

Re: Is there a way to customize segment names?

2022-12-15 Thread Patrick Zhai
replicator/nrt module has not provided a solution on when the primary node (main indexer) is down, how would we recover with a back up indexer? Thanks Patrick On Thu, Dec 15, 2022 at 7:16 PM Robert Muir wrote: > This multiple-writer isn't going to work and customizing names won't >

Re: Is there a way to customize segment names?

2022-12-16 Thread Patrick Zhai
at playing with filenames can become quite troublesome, but still, even out of my own curiosity, I want to understand whether we're able to control the segment names in some way? Best Patrick On Fri, Dec 16, 2022 at 6:36 AM Michael Sokolov wrote: > +1 trying to coordinate multiple writer

Re: Is there a way to customize segment names?

2022-12-30 Thread Patrick Zhai
; No, you can't control them. And we must not open up anything to try to > support this. > > On Fri, Dec 16, 2022 at 7:28 PM Patrick Zhai wrote: > > > > Hi Mike, Robert > > > > Thanks for replying, the system is almost like what Mike has described: > one wri

Re: Question about searcherManager applyAllDeletes parameter and maybeRefresh method

2023-03-02 Thread Patrick Zhai
s://github.com/apache/lucene-solr/blob/branch_7_4/lucene/core/src/java/org/apache/lucene/index/StandardDirectoryReader.java#L288> ) So basically the applyAllDeletes you passed into SearcherManager will affect every call to the maybeRefresh. Best Patrick On Thu, Mar 2, 2023 at 3:03 PM Ningshan

Re: Question about searcherManager applyAllDeletes parameter and maybeRefresh method

2023-03-03 Thread Patrick Zhai
knows more can chime in, but in the unit test since you're just deleting one doc, it's quite possible that IndexWriter will apply the delete right away regardless of what you have passed in. Hope that helps Patrick On Thu, Mar 2, 2023 at 3:50 PM Ningshan Li wrote: > Hi Patrick,

Re: Question about index segment search order

2023-05-02 Thread Patrick Zhai
<https://github.com/apache/lucene/blob/main/lucene/core/src/java/org/apache/lucene/index/DirectoryReader.java#L75> when you're opening the IndexReader Patrick On Tue, May 2, 2023 at 5:28 PM Wei wrote: > Hello, > > We have a index that has multiple segments generated with continuous >

Re: Question about index segment search order

2023-05-03 Thread Patrick Zhai
For that part I'm not entirely sure, if other folks know it please chime in :) On Wed, May 3, 2023 at 8:48 AM Wei wrote: > Thanks Patrick! In the default case when no LeafSorter is provided, are the > segments traversed in the order of creation time, i.e. the oldest segment > is

Re: Question about index segment search order

2023-05-04 Thread Patrick Zhai
; Thanks, > Wei > > On Thu, May 4, 2023 at 3:33 AM Michael Sokolov wrote: > > > There is no meaning to the sequence. The segments are created > concurrently > > by many threads and the merge process will merge them without regards to > > any ordering. > > &g

[ANNOUNCE] Apache Lucene 9.8.0 released

2023-09-28 Thread Patrick Zhai
The Lucene PMC is pleased to announce the release of Apache Lucene 9.8.0. Apache Lucene is a high-performance, full-featured search engine library written entirely in Java. It is a technology suitable for nearly any application that requires structured search, full-text search, faceting, nearest-n

Order of fields returned by Document.getFields()

2008-12-16 Thread Patrick Johnstone
I'm using Lucene via Solr and recently upgraded from an early Summer nightly build to the released version of Solr 1.3 (which seems to use something in the neighborhood of Lucene 2.3). I'm posting this here because I believe that my issue is with Lucene, not Solr. After the upgrade, I noticed th

RE: Order of fields returned by Document.getFields()

2008-12-17 Thread Patrick Johnstone
this would cause but there would be a fair amount of repetitive data. (The documents tend to fall into types which are somewhat similar as a group, thus their field orders would be similar.) Having the field order implicitly defined by the indexing process would be more efficient but I don't ha

RE: Order of fields returned by Document.getFields()

2008-12-17 Thread Patrick Johnstone
't state it explicitly, I have verified this behavior with the most recently released version of Lucene (2.4) without Solr being involved. Patrick - To unsubscribe, e-mail: java-user-unsubscr...@lucene.apache

Re: ANNOUNCE: Welcome Patrick O'Leary as Contrib Committer

2009-01-16 Thread patrick o'leary
mors my user name "pjaol" is actually an acronym of my full name, which is only ever used by my mother when I'm in trouble :-) It will be a pleasure to continue working with all of you, and thank you again for this honor. Thanks Patrick O'Leary > On Jan 16, 2009, at 1:54 P

Re: Words that need protection from stemming, i.e., protwords.txt

2009-01-16 Thread patrick o'leary
Porter is a little outdated I've found KStem much better http://wiki.apache.org/solr/AnalyzersTokenizersTokenFilters/Kstem You'll still need a good protected word list, but KStem is just a little nicer On Fri, Jan 16, 2009 at 6:20 PM, David Woodward wrote: > Hi. > > Any good protwords.txt out t

Committed revision 735928.

2009-01-19 Thread patrick o'leary
Committed revision 735928. Adding myself to contrib committers list / testing karma Thanks Patrick scootie:site pjaol$ svn diff docs/*.html Index: docs/whoweare.html === --- docs/whoweare.html (revision 735927) +++ docs

Re: 答复: 答复: Lucene in large database contexts

2009-03-05 Thread Patrick Turcotte
mkjjyy On 8/10/07, Askar Zaidi wrote: Hey Guys, I am trying to do something similar. Make the content search-able as soon as it is added to the website. The way it can work in my scenario is that , I create the Index for a every new user account created. Then, whenever a new document is

Re: similarity function

2009-03-05 Thread patrick o'leary
Sounds like your most difficult part will be the question parser using POS. This is kind of old school but use something like the AliceBot AIML library http://en.wikipedia.org/wiki/AIML Where the subjective terms can be extracted from the questions, and indexed separately. Or as Grant and others

Re: Autonomy search technology

2009-04-03 Thread patrick o'leary
I think you need to ask the question what do you want? A person asked me one, which is better, a gold pen fountain pen or a plastic bic pen? The answer - depends If you want the most fluid writing instrument, which gives you a certain level of accomplishment as you use it, and looks superb then th

Re: How to customize score according to field value?

2009-04-07 Thread patrick o'leary
You might want to play with both boosting and multiple sorting. You might want to look at something like Solr's boost queries or boost functions http://wiki.apache.org/solr/DisMaxRequestHandler#head-6862070cf279d9a09bdab971309135c7aea22fb3 Or if you want to go down the path of a custom score, most

Binary indexing / query efficiency

2009-04-14 Thread Eger, Patrick
,3,4})) searcher.search(q,...); Which would allow me to avoid the Integer -> String -> Padded String -> String -> Integer coding/decoding to index an integer, and avoid Object -> String -> Object conversion (which per abo

FW: Binary indexing / query efficiency

2009-04-16 Thread Eger, Patrick
version (which per above is quite expensive). Thanks for any help! Regards, Patrick - To unsubscribe, e-mail: java-user-unsubscr...@lucene.apache.org For additional commands, e-mail: java-user-h...@lucene.apache.org

Re: A Challenge!: Combining 2 searches into a single resultset?

2009-04-17 Thread patrick o'leary
I'd start by doing some research on the question rather than asking for a solution.. What your asking for can be considered 'Federated Search' http://en.wikipedia.org/wiki/Federated_search And it can be conceived in as many ways as you have document types. Any answer will probably end up customize

Re: A Challenge!: Combining 2 searches into a single resultset?

2009-04-17 Thread patrick o'leary
Why not put the keywords into the same document as another field? and search both fields at once, you can then use lucene syntax to give a boosting to the keyword fields. e.g. body:A good game last night by the redskins keyword: redskins Query= body:(game OR redskins) keyword:(game OR redskins)^10

Re: LocalLucene/Lucene Spatial

2009-04-19 Thread patrick o'leary
1) The only reason for ConstantScoreQuery is because it lets me convert the LocalLucene filter to a query. 2) TierRangeQuery- not at the moment, we're not really doing a range search, we're pulling predefined id's (e.g. a shape) from the index using the term enumerator. Average time take for a bo

Re: LocalLucene/Lucene Spatial

2009-04-20 Thread patrick o'leary
Uwe Schindler > H.-H.-Meier-Allee 63, D-28213 Bremen > http://www.thetaphi.de > eMail: u...@thetaphi.de > > > -Original Message- > > From: patrick o'leary [mailto:pj...@pjaol.com] > > Sent: Monday, April 20, 2009 3:33 AM > > To: java-user@lucene.apach

Re: Servlets Sharing Resources

2009-04-21 Thread patrick o'leary
Why not have 1 servlet and based on a parameter / url, serve 2 different outputs? if(request.getString("asXML") !=null) showXML(); else showOtherStuff(); Save yourself the hassle of dealing with jndi / contexts / spring or SingleTons On Tue, Apr 21, 2009 at 12:01 PM, David Seltzer wrote:

Re: Servlets Sharing Resources

2009-04-21 Thread patrick o'leary
everything hit the same > doGet(). > > -Original Message- > From: patrick o'leary [mailto:pj...@pjaol.com] > Sent: Tuesday, April 21, 2009 12:51 PM > To: java-user@lucene.apache.org > Subject: Re: Servlets Sharing Resources > > Why not have 1 servlet and base

Re: Lucene Index Encryption

2009-05-08 Thread patrick o'leary
There will always be levels of where data will be insecurely available. Most notably within the memory of an application once it's running. Unless you want to go down the path of encrypting and decrypting each and every string. At which point you loose dictionary functionality and well any useful e

Re: Indexing puncuation and symbols

2007-10-01 Thread Patrick Turcotte
Hi, Don't know the size of your dataset. But, couldn't you index in 2 fields, with PerFieldAnalyzer, tokenizing with Standard for 1 field, and WhiteSpace for the other. Then use multiple field query (there is a query parser for that, just don't remember the name right now). Patr

Re: Indexing puncuation and symbols

2007-10-01 Thread Patrick Turcotte
the trick quite simply. Patrick On 10/1/07, John Byrne <[EMAIL PROTECTED]> wrote: > Well, the size wouldn't be a problem, we could afford the extra field. > But it would seem to complicate the search quite a lot. I'd have to run > the search terms through both analyzers. It

IndexMergeTool - Close indexes before merge?

2007-10-10 Thread Patrick Kimber
d before they are added to the new index. http://svn.apache.org/repos/asf/lucene/java/trunk/contrib/miscellaneous/src/java/org/apache/lucene/misc/IndexMergeTool.java Thanks for your help, Patrick - To unsubscribe, e-mail: [E

Re: how to get the programmatic control over index's document id

2008-02-09 Thread Patrick Turcotte
Add a field to your document. document.add(new Field("id", idString)); Or something like that. (Don't have the doc handy right now). Hope this helps. Patrick On Feb 9, 2008 7:38 AM, Gauri Shankar <[EMAIL PROTECTED]> wrote: > Hi, > > I would like to get the contro

Build Lucene maven artifacts

2008-03-13 Thread Patrick Turcotte
oy contrib locally without having to create all the pom.xml files. Another way to put my question (not being sure it is clear) is : what should the "generate-maven-artifacts" ant task be used for? And where can I get the latest version of the *.pom.template files? M

Re: confused about an entry in the FAQ

2008-05-10 Thread Patrick Turcotte
Did you try the IndexSearcher.doc(int i, FieldSelector fieldSelector) method? Could be faster because Lucene don't have do "prepare" the whole document. Patrick On Sat, May 10, 2008 at 9:35 AM, Stephane Nicoll <[EMAIL PROTECTED]> wrote: > From the FAQ: > > &quo

Re: How to make Lucene search for parts of terms?

2008-11-10 Thread Patrick Turcotte
Take a look at the ngram classes (probably in contrib, don't remember for sure right now). Patrick - To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-mail: [EMAIL PROTECTED]

Re: [POLL] Where do you get Lucene/Solr from? Maven? ASF Mirrors?

2011-01-18 Thread Patrick Samborski
[X] ASF Mirrors (linked in our release announcements or via the Lucene website) [] Maven repository (whether you use Maven, Ant+Ivy, Buildr, etc.) [] I/we build them from source via an SVN/Git checkout. [] Other (someone in your company mirrors them internally or via a downstream project)

How to define different similarity scores per field ?

2011-03-01 Thread Patrick Diviacco
I need to define different similarity scores per document field. For example for field A I want to use Lucene tf.idf score, for the numerical field B I want to use a different metric (difference between values) and so on... thanks

Re: How to define different similarity scores per field ?

2011-03-01 Thread Patrick Diviacco
; return sim.lengthNorm(fieldName, numTokens); >} > } > // same for scorePayload. For the others, I just delegate > // to defaultSimilarity (all I really need is scorePayload in > // my case). > } > > and in the schema.xml, I just set this class to be the similarity

Lucene nightly build javadoc ?

2011-03-02 Thread Patrick Diviacco
Can i read the javadocs for Lucene nightly build 4.0 ? How ? thanks

Lucene nightly build: similarity score per field

2011-03-03 Thread Patrick Diviacco
I've downloaded Lucene nightly build because I need to customize the similarity *per field*. However I don't see the field parameter passed to the methods to compute the score such as "tf" and "idf"... how can I implement different similarities score per document field then ? thanks

Re: WhitespaceAnalyzer in Lucene nightly build ?

2011-03-04 Thread Patrick Diviacco
the modules as JARs): > > https://hudson.apache.org/hudson/job/Lucene-Solr-Maven-trunk/lastSuccessfulB > uild/artifact/maven_artifacts/org/apache/lucene/ > > Uwe > > - > Uwe Schindler > H.-H.-Meier-Allee 63, D-28213 Bremen > http://www.thetaphi.de > e

Lucene 4.0 and WhitespaceAnalyzer

2011-03-04 Thread Patrick Diviacco
What's the best way to replace WhitespaceAnalyzer in this line in Lucene nightly build 4.0 ? Is there a generic analyzer I can use ? writer = new IndexWriter(FSDirectory.open(INDEX_DIR), new WhitespaceAnalyzer(), true, IndexWriter.MaxFieldLength.LIMITED); thanks

Re: WhitespaceAnalyzer in Lucene nightly build ?

2011-03-04 Thread Patrick Diviacco
etaphi.de > > > -Original Message- > > From: Patrick Diviacco [mailto:patrick.divia...@gmail.com] > > Sent: Friday, March 04, 2011 2:34 PM > > To: java-user@lucene.apache.org > > Subject: Re: WhitespaceAnalyzer in Lucene nightly build ? > > >

Re: Lucene nightly build: similarity score per field

2011-03-04 Thread Patrick Diviacco
) thanks On 3 March 2011 16:34, Robert Muir wrote: > On Thu, Mar 3, 2011 at 10:25 AM, Patrick Diviacco > wrote: > > I've downloaded Lucene nightly build because I need to customize the > > similarity *per field*. > > > > However I don't see the

Re: Lucene nightly build: similarity score per field

2011-03-04 Thread Patrick Diviacco
ass implementing the SimilarityProvider and then implement the get method ? Also, inside the get method should I check the passed string field and return different custom similarities classes ? thanks Patrick On 4 March 2011 19:57, Robert Muir wrote: > On Fri, Mar 4, 2011 at 1:18 PM, Patrick Di

Re: Lucene nightly build: similarity score per field

2011-03-04 Thread Patrick Diviacco
20:39, Robert Muir wrote: > On Fri, Mar 4, 2011 at 2:12 PM, Patrick Diviacco > wrote: > > hey Robert, > > > > I know there is the documentation, I'm sorry I've confused setSimilarity > > with setSimilarityProvider. > > > > However, my questio

Re: Lucene nightly build: similarity score per field

2011-03-05 Thread Patrick Diviacco
Nevermind, I've finally solved. I just now need to figure out how to retrieve the scores per fields in my results. I need to know how much similar each field is. I know I can use explain() but it slows down computations... thanks On 4 March 2011 21:21, Patrick Diviacco wrote: > ok tha

Re: Lucene 4.0 and WhitespaceAnalyzer

2011-03-13 Thread Patrick Diviacco
:07, Simon Willnauer wrote: > Why do you want to replace the WhitespaceAnalyzer? I don't really > understand what you are up to. > > simon > > On Fri, Mar 4, 2011 at 3:21 PM, Patrick Diviacco > wrote: > > What's the best way to replace WhitespaceAnalyzer in this li

Am I correctly parsing the strings ? Terms or Phrases ?

2011-03-21 Thread Patrick Diviacco
I'm new to Lucene. If I use description = new TermQuery(new Term("description", "my string")); I ask Lucene to consider "my string" as unique word, right ? I actually need to consider each word, should I use PhraseQuery instead ? Or is it correct ? thanks

How to normalize Lucene scores... (over all queries)

2011-03-21 Thread Patrick Diviacco
I'm combining several scores for my queries performed with Lucene and other software. My issue is that I have lucene scores + other scores (not related to Lucene) for each query result. The other scores are all normalized between 1 and 0. I need to normalize Lucene scores (over all queries) beca

Re: Am I correctly parsing the strings ? Terms or Phrases ?

2011-03-21 Thread Patrick Diviacco
> > > If description field is tokenized/analyzed during indexing you need to use > PhraseQuery. > Uhm yeah I'm using a WhitespaceAnalyzer. This is the code using for indexing: writer = new IndexWriter(FSDirectory.open(INDEX_DIR), new IndexWriterConfig(org.apache.lucene.util.Version.LUCENE_40, new

Re: Am I correctly parsing the strings ? Terms or Phrases ?

2011-03-21 Thread Patrick Diviacco
One more thing: It is actually not clear to me how to use PhraseQuery... I thought I can just pass a phrase to it, but I see only add(Term) method... should I parse the string by myself to single terms ? On 21 March 2011 18:05, Patrick Diviacco wrote: > >> If description field is

Building a query of single terms...

2011-03-21 Thread Patrick Diviacco
I'm new to Lucene and I would like to know what's the difference (if there is any) between PhraseQuery.add(Term1) PhraseQuery.add(Term2) PhraseQuery.add(Term3) and term1 = new TermQuery(new Term(...)); booleanQuery.add(term1, BooleanClause.Occur.SHOULD); term2 = new TermQuery(new Term(...)); bo

Re: Am I correctly parsing the strings ? Terms or Phrases ?

2011-03-22 Thread Patrick Diviacco
OK, so I'm currently doing this: booleanQuery.add(new QueryParser(org.apache.lucene.util.Version.LUCENE_40, "tags", new WhitespaceAnalyzer(org.apache.lucene.util.Version.LUCENE_40)).parse(phrase[i]); , BooleanClause.Occur.SHOULD); I just want to add single terms to my booleanQuery. if I pass a q

how to get all documents in the results ?

2011-03-22 Thread Patrick Diviacco
I'm using the following code because I want to see the entire collection in my query results: //adding wildcards-term to see all results rest = new TermQuery(new Term("*","*")); booleanQuery.add(rest, BooleanClause.Occur.SHOULD); But it doesn't work, I only see the relevant docs and not all the o

Results: get per field scores ?

2011-03-22 Thread Patrick Diviacco
Is there a way to display Lucene scores per field instead of the global one ? Both my query and my docs have 3 fields. I would like to see the scores for each field in the results. Can I ? Or should I run the query 3 times for each single field ? thanks

Re: how to get all documents in the results ?

2011-03-22 Thread Patrick Diviacco
I don't think the link you suggested can help, but maybe I'm wrong. Also, the parameter MAX_HITS is not useful, it just limit the results, it doesn't add the not relevant docs. On 22 March 2011 12:10, Anshum wrote: > Hi Patrick, > You may have a look at this, perhaps th

Re: how to get all documents in the results ?

2011-03-22 Thread Patrick Diviacco
ll' documents or only docs matching your query? > 2. if its about fetching all docs, why not use the matchalldocs query? > 3. did you try using a collector instead of topdocs? > > -- > Anshum Gupta > http://ai-cafe.blogspot.com > > > On Tue, Mar 22, 2011 at 4:46

Re: Building a query of single terms...

2011-03-22 Thread Patrick Diviacco
the queries after they're assembled. I believe you'll > find that the difference is that the PhraseQuery would find text like > "Term1 Term2 Term3" but not text like "Term1 some stuff Term2 more > stuff Term3" whereas BooleanQuery would. > > Best > Eri

Re: Results: get per field scores ?

2011-03-22 Thread Patrick Diviacco
I've been told search explain should be used for debugging only because it slows down a lot computations. Is it true ? On 22 March 2011 14:29, Erick Erickson wrote: > Try Searcher.explain. > > Best > Erick > > On Tue, Mar 22, 2011 at 4:34 AM, Patrick Diviacco > wr

Re: Results: get per field scores ?

2011-03-22 Thread Patrick Diviacco
t > you only cared about this for debugging. What is the use-case > for having it on all the time? > > Best > Erick > > On Tue, Mar 22, 2011 at 12:40 PM, Patrick Diviacco > wrote: > > I've been told search explain should be used for debugging only becau

Re: Am I correctly parsing the strings ? Terms or Phrases ?

2011-03-22 Thread Patrick Diviacco
d is probably perferred. > > Best > Erick > > On Tue, Mar 22, 2011 at 3:41 AM, Patrick Diviacco > wrote: > > OK, so I'm currently doing this: > > > > booleanQuery.add(new > QueryParser(org.apache.lucene.util.Version.LUCENE_40, > > "tags"

Re: Am I correctly parsing the strings ? Terms or Phrases ?

2011-03-23 Thread Patrick Diviacco
s:f14 tags:usm tags:canonef50mmf14 tags:canonef50mmf14usm I can see the tags field repeated multiple times, so it seems to me correctly parsed... correct ? On 23 March 2011 07:50, Patrick Diviacco wrote: > Your answer is quite clear, but my question is a bit more specific: > as you s

Re: how to get all documents in the results ?

2011-03-23 Thread Patrick Diviacco
ou may have a completely different option that you > haven't read which someone could advice if they know the exact intent. > > Hope this helps. > > -- > Anshum Gupta > http://ai-cafe.blogspot.com > > > On Tue, Mar 22, 2011 at 4:59 PM, Patrick Diviacco < > patr

QueryParser syntax French Operator

2006-10-03 Thread Patrick Turcotte
ser to do it? We know we could always modify QueryParser.jj to add them to the list, but we'd rather like not to have to recompile/rejar each time there is a new version of Lucene. Thanks -- Patrick Turcotte - To unsub

Re: QueryParser syntax French Operator

2006-10-04 Thread Patrick Turcotte
I've started to look into this (and the whole javacc syntax) I'll keep you posted on my results. Patrick Erik Hatcher wrote: Currently AND/OR/NOT are hardcoded into the .jj file. A patch to make this configurable would be welcome! Erik On Oct 3, 2006, at 11:15 AM, Patric

RE: QueryParser syntax French Operator

2006-10-09 Thread Patrick Turcotte
runtime using the ResourceBundle paradigm. I'll keep you posted. Patrick > -Message d'origine- > De : karl wettin [mailto:[EMAIL PROTECTED] > Envoyé : 8 octobre, 2006 10:14 > À : java-user@lucene.apache.org > Objet : Re: QueryParser syntax French Operator > >

QueryParser syntax French Operator : DONE!

2006-10-13 Thread Patrick Turcotte
to the those who can decide to integrate it? Where? In what format? Etc. Thanks, Patrick Turcotte - To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-mail: [EMAIL PROTECTED]

Re: QueryParser syntax French Operator : DONE!

2006-10-13 Thread Patrick Turcotte
rocessing with matchedToken.image to get to the matched string, set matchedToken.kind accordingly. // USES fields set by the QueryParser to decide on behavior } Hope this answer your question. Patrick thanks, -Mark On 10/13/06, Patrick Turcotte <[EMAIL PROTECTED]> wrote: Hello!

Re: QueryParser syntax French Operator : DONE!

2006-10-13 Thread Patrick Turcotte
Submitted to Jira with key LUCENE-682 Patrick Grant Ingersoll wrote: Hi Patrick, Thanks for the work. Create a bug in JIRA and upload a patch (see svn diff). See the Wiki for information on how to contribute. Thanks, Grant

Re: QueryParser syntax French Operator : DONE!

2006-10-13 Thread Patrick Turcotte
Thanks Mark! I have to mention Benoit Mercier here who worked with me so we could understand how to expand a term and use TOKEN_MGR_DECLS. Patrick On 10/13/06, Mark Miller <[EMAIL PROTECTED]> wrote: Great work Patrick. I was unfamiliar with the use of TOKEN_MGR_DECLS. Looks like a pow

Help with design

2006-10-17 Thread Patrick Turcotte
it possible? How would I go about it? Design III ? Thanks! Patrick

Re: Lucene search priorities

2006-10-30 Thread Patrick Turcotte
I don't remember the syntax right now, but how about giving a boost to certain fields, either while indexing or while searching ? Patrick On 10/30/06, Amit Soni <[EMAIL PROTECTED]> wrote: Hi Erick, Thanks for the reply. Actually the priorities mean when i search for example for

Re: how to handle words with accent?

2006-10-31 Thread Patrick Turcotte
I was referring to http://lucene.apache.org/java/docs/api/org/apache/lucene/analysis/ISOLatin1AccentFilter.htmlwhich is in the 2.0 version of Lucene. Patrick On 10/31/06, Valerio Schiavoni <[EMAIL PROTECTED]> wrote: actually yes, it would be a good result! is the class you mentioned t

Re: how to handle words with accent?

2006-10-31 Thread Patrick Turcotte
Should both results be returned in both cases? If so, take a look at the IsoLatin1Filter class, it will remove those accents for indexing and searching if needed. Patrick On 10/31/06, Valerio Schiavoni <[EMAIL PROTECTED]> wrote: hello, i use lucene to index documents in Italian. many

Re: for admins: mailing list like spam

2006-11-03 Thread Patrick Turcotte
It will make mails list more easy to read (I am using gmail and I do not have client-side filters). That is not true. You can have labels, and, if you look at the top of the page, right beside the "Search the Web" button, you have a "create filter" link. Patrick

Re: whats the correct way to do normalisation?

2006-11-06 Thread Patrick Turcotte
Hi, Did you take a look at IsoLatin1AccentFilter ? Patrick On 11/6/06, hans meiser <[EMAIL PROTECTED]> wrote: Hi, Lucene indexes documents from 3 different countries here (English, German and French). I want to normalize some characters like umlauts. ä -> ae I did it in the

Re: Specific Query on multiple fields

2006-11-09 Thread Patrick Turcotte
title","examples" Take a look at org.apache.lucene.queryParser.MultiFieldQueryParser, it probably does what you want. Patrick

Re: lucene functionality

2006-12-13 Thread Patrick Turcotte
boost performance even more. Don't hesitate to contact me directly if you have more questions. Patrick On 12/13/06, Mark Mei <[EMAIL PROTECTED]> wrote: At the bottom of this email is the sample xml file that we are using today. We have about 10 million of these. We need to know whether Lucene

Re: lucene functionality

2006-12-14 Thread Patrick Turcotte
On 12/14/06, Erik Hatcher <[EMAIL PROTECTED]> wrote: On Dec 13, 2006, at 1:51 PM, Patrick Turcotte wrote: > I would suggest you take a look at exist-db (http://exist-db.org/). I really doubt eXist can handle 10M XML files. Last time I tried it, it choked on 20k of them. It

Re: Rebuilding index on a regular basis

2006-12-20 Thread Patrick Turcotte
previous location for next round, where you won't need the initial copy to a fresh folder. That way, you won't have to reindex all your documents (assuming only a small subset needs updating) and will be able to switch to a more up to date index more easily and often. Patrick On 12/20

Re: French stemmer problem

2006-12-22 Thread Patrick Turcotte
) for more info. Patrick On 12/22/06, Mark Miller <[EMAIL PROTECTED]> wrote: Non of the stemmers always stem to a valid word. It is not important as you should be stemming the query as well. The only thing that is important is that each word always stems to the same base. Many English w

Re: Is there any way to optimize existing unoptimized index?

2007-02-07 Thread Patrick Turcotte
ll be optimized for you. I think you can also use Luke (google Luke Lucene) to do this. In any cases, I suggest you backup your index before trying, just to be sure. Hope this helps. Patrick On 2/7/07, maureen tanuwidjaja <[EMAIL PROTECTED]> wrote: Hi, May I also ask wheter there is a

Re: Lucene Web Service

2007-02-09 Thread Patrick Kimber
Hi You could try SOLR http://lucene.apache.org/solr/ This is obviously Java but you can access it using .NET... Hope this helps Patrick On 09/02/07, Kainth, Sachin <[EMAIL PROTECTED]> wrote: Hello all, Does anyone know if there is a .NET version of Lucene Web Service? Cheers This

Re: QueryParser and auto wildcard searches

2007-03-05 Thread Patrick Turcotte
aybe you should consider using a stemming analyzer instead. Look for stemmer filter/analyzer in contrib. Patrick

Re: Query String for a phrase?

2007-03-07 Thread Patrick Turcotte
Hi, Please suggest what should be the query String for a pharse search. Did you take a look at: http://lucene.apache.org/java/docs/queryparsersyntax.html ? Patrick

Re: How to implement AJAX search~Lucene Search part?

2007-06-08 Thread Patrick Turcotte
Hi, What we did was this: 1) When you application starts, it scans the index for terms values and store them in a map or something. 2) When you receive "ajax requests", you compare with the map data and return the relevant part. Works quite fast for us, without round trips to Lucene

Re: How to implement AJAX search~Lucene Search part?

2007-06-13 Thread Patrick Turcotte
7;t Lucene wonderful! Patrick Mathieu Lecarme wrote: > If you do that, you enumerate every terms!!! > If you use a alphabeticaly sorted collection, you can stop, when match > stop, but, you have to test every terms before matching. > Lucene gives you tools to match begining of a term, just

Lucene 2.2, NFS, Lock obtain timed out

2007-06-29 Thread Patrick Kimber
something stupid... but I cannot think what to try next. Thanks for your help Patrick - To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-mail: [EMAIL PROTECTED]

Re: Lucene 2.2, NFS, Lock obtain timed out

2007-06-29 Thread Patrick Kimber
investigations as soon as possible. Thanks for your help Patrick On 29/06/07, Doron Cohen <[EMAIL PROTECTED]> wrote: hi Patrick, Mike is the expert in this, but until he gets in, can you add details on the update pattern - note that the DeletionPolicy you describe below is not (afaik) related

  1   2   >