Re: Info required on licensing of Lucene component

2023-03-22 Thread David Smiley
I suppose this begs the question, why are we including NOTICE.txt in our distribution for *anything* we don't distribute? ~ David Smiley Apache Lucene/Solr Search Developer http://www.linkedin.com/in/davidwsmiley On Tue, Mar 21, 2023 at 7:57 PM Michael Sokolov wrote: > Lucene is

Re: [External] Re: How to highlight fields that are not stored?

2023-02-16 Thread Shifflett, David [USA]
e document due to RAM size constraints. Not having the original file text in the highlight isn't ideal, but it is better than not being able to highlight text in large documents. David Shifflett On 2/16/23, 4:01 PM, "Michael Sokolov" mailto:msoko...@gmail.com>> wrote: So

How to highlight fields that are not stored?

2023-02-13 Thread Shifflett, David [USA]
gives me the terms from the document, and the positions, iterating over this, and matching to the queries I’m running, seems cumbersome, and inefficient. Any suggestions for highlighting query matches without the searched field being stored? Thanks, David Shifflett Senior Lead

Compare / Diff between stored values of two lucene indexes.

2023-02-13 Thread David Port Louis
hanks in advance, David Port Louis

Re: [External] Streaming documents into the index breaks highlighting

2022-11-17 Thread Shifflett, David [USA]
Just to clarify, Is there a highlighting option that doesn't require the text from the matched document? David Shifflett On 11/17/22, 1:57 PM, "Shifflett, David [USA]" wrote: Hi, I am converting my application from reading documents into memory, then indexin

Streaming documents into the index breaks highlighting

2022-11-17 Thread Shifflett, David [USA]
. Any suggestions for highlighting query matches without the searched field being stored? I was hoping storing the offsets and positions would be enough to enable highlighting. David Shifflett Senior Lead Technologist Enterprise Cross Domain Solutions (ECDS) Booz Allen Hamilton M: 831-920-8341

Migrating WhitespaceTokenizerFactory from 8.2 to 9.4

2022-10-28 Thread Shifflett, David [USA]
tory.class. I am not familiar with how Java SPI is configured and built. I tried creating META-INF/services/org.apache.lucene.analysis.TokenizerFactory containing: org.apache.lucene.analysis.core.WhitespaceTokenizerFactory What am I missing? Any help would be appreciated. Thanks, David Shifflett

Re: [External] Re: Can lucene be used in Android ?

2022-09-11 Thread Shifflett, David [USA]
Hi Uwe, I am a little confused by your 2 statements. > Lucene 9.x series requires JDK 11 to run > The main branch is already on JDK 17 Will Lucene 9.x run on JDK 17? Is 9.x 'the main branch'? Thanks, David Shifflett Senior Lead Technologist Enterprise Cross Domain Solutions (

Re: JCC build fails with Python>=3.8

2021-11-18 Thread David Allouche
python%s.%s' % (sys.version_info[0:2])] kwds["force_shared"] = True# requires jcc/patches/patch.43 elif platform in IMPLIB_LFLAGS: jcclib = 'jcc%s%s.lib' %(py_version_suffix, debug and '_d' or '') > On 18 Nov 2021

Re: JCC build fails with Python>=3.8

2021-11-18 Thread David Allouche
python%s.%s' % (sys.version_info[0:2])] kwds["force_shared"] = True# requires jcc/patches/patch.43 elif platform in IMPLIB_LFLAGS: jcclib = 'jcc%s%s.lib' %(py_version_suffix, debug and '_d' or '') > On 18 Nov 2021

Re: JCC build fails with Python>=3.8

2021-11-18 Thread David Allouche
python%s.%s' % (sys.version_info[0:2])] kwds["force_shared"] = True# requires jcc/patches/patch.43 elif platform in IMPLIB_LFLAGS: jcclib = 'jcc%s%s.lib' %(py_version_suffix, debug and '_d' or '') > On 18 Nov 2021

Re: JCC build fails with Python>=3.8

2021-11-18 Thread David Allouche
python%s.%s' % (sys.version_info[0:2])] kwds["force_shared"] = True# requires jcc/patches/patch.43 elif platform in IMPLIB_LFLAGS: jcclib = 'jcc%s%s.lib' %(py_version_suffix, debug and '_d' or '') > On 18 Nov 2021

JCC build fails with Python>=3.8

2021-11-18 Thread David Allouche
Hello, https://issues.apache.org/jira/projects/PYLUCENE/issues/PYLUCENE-52 Currently porting a code base to Python3, I have found about this issue. If I understand correctly, that means that pylucene cannot be built on Python

I am getting an exception in ComplexPhraseQueryParser when fuzzy searching

2021-11-12 Thread Shifflett, David [USA]
using ComplexPhraseQueryParser wrong? Is this a bug in Lucene? I have also tested this with a query string like ""dog~2 word~1"". This causes the same exception if the content has ‘.d’, ‘.o’, or ‘.g’. Looks like a fuzzy term that reduces to 1 character runs into trouble when encountering a matching single character term in the content. Thanks in advance for any suggestions, or guidance, David Shifflett

I am getting an exception in ComplexPhraseQueryParser when fuzzy searching

2021-11-01 Thread Shifflett, David [USA]
using ComplexPhraseQueryParser wrong? Is this a bug in Lucene? I have also tested this with a query string like ""dog~2 word~1"". This causes the same exception if the content has ‘.d’, ‘.o’, or ‘.g’. Looks like a fuzzy term that reduces to 1 character runs into trouble when encountering a matching single character term in the content. Thanks in advance for any suggestions, or guidance, David Shifflett

Why would a search using a ComplexPhraseQueryParser throw an exception for some content, but not all content?

2021-08-17 Thread Shifflett, David [USA]
Am I using ComplexPhraseQueryParser wrong? Is this a bug in Lucene? I have also tested this with a query string like ""dog~2 word~1"". This causes the same exception if the content has ‘.d’, ‘.o’, or ‘.g’. Looks like a fuzzy term that reduces to 1 character runs into trouble when encountering a matching single character term in the content. Thanks in advance for any suggestions, or guidance, David Shifflett

Re: [VOTE] Lucene logo contest, third time's a charm

2020-09-03 Thread David Smiley
(binding) vote: D, A1 (thanks Ryan for your thorough vote instructions & preparation)

Re: [VOTE] Lucene logo contest

2020-06-15 Thread David Smiley
C. The current Lucene logo [4] ~ David Smiley Apache Lucene/Solr Search Developer http://www.linkedin.com/in/davidwsmiley On Mon, Jun 15, 2020 at 6:08 PM Ryan Ernst wrote: > Dear Lucene and Solr developers! > > In February a contest was started to design a new logo for Lucene [1

Re: ComplexPhraseQueryParser performance question

2020-02-12 Thread David Smiley
duates" to Lucene core some day. It's placement in sandbox is why it can't be added to any of Lucene's query parsers like complex phrase. ~ David Smiley Apache Lucene/Solr Search Developer http://www.linkedin.com/in/davidwsmiley On Wed, Feb 12, 2020 at 11:07 AM wrote: > H

Re: [External] Re: ComplexPhraseQueryParser isn't switching search terms to lowercase with StandardAnalyzer

2019-10-22 Thread Shifflett, David [USA]
I saw the changes in the diff. But without looking into the test, I am asking to confirm if it matches my conditions: 1) Uses a StandardAnalyzer 2) Does the actual query.toString() return lowercase J and S David Shifflett On 10/22/19, 10:44 AM, "Mikhail Khludnev" wrote: On T

Re: [External] Re: ComplexPhraseQueryParser isn't switching search terms to lowercase with StandardAnalyzer

2019-10-22 Thread Shifflett, David [USA]
o'. I changed the code snippet to "somefield", and missed changing that part of the output David Shifflett On 10/22/19, 5:51 AM, "Mikhail Khludnev" wrote: Hello, I wonder how it come up with this particular field : content:foo Anyway I added some upper

Re: [External] Re: ComplexPhraseQueryParser isn't switching search terms to lowercase with StandardAnalyzer

2019-10-21 Thread Shifflett, David [USA]
Baris, Sorry I neglected to add that piece. This test was run against 8.0.0, but I also want it to work in later versions. Another piece of my project is using 8.2.0. Thanks again for any info, David Shifflett On 10/21/19, 3:23 PM, "baris.ka...@oracle.com" wrote:

ComplexPhraseQueryParser isn't switching search terms to lowercase with StandardAnalyzer

2019-10-21 Thread Shifflett, David [USA]
because the terms are inside double quotes? How can I specify a search term that I want treated as a Phrase, but also have the query parser apply the LowerCaseFilter? I am hoping to avoid the need to handle this using PhraseQuery, and continue to use the QueryParser. Thanks in advance for any help you can give me, David Shifflett

Re: [External] Re: How to ignore certain words based on query specifics

2019-07-11 Thread Shifflett, David [USA]
Evert, It is definitely not a bug. I was asking about how to do something, I couldn't quite figure out. Stop words is the way to go. David Shifflett On 7/11/19, 11:26 AM, "evert.wagenaar" wrote: I see it as a feature, not a bug. The appearance of stop words in the Searc

Re: [External] Re: How to ignore certain words based on query specifics

2019-07-11 Thread Shifflett, David [USA]
for inclusion in the highlight context (surrounding a match)? The answer is yes the stop words do appear in the context, and are not highlighted. Thanks, David Shifflett On 7/10/19, 9:12 PM, "Michael Sokolov" wrote: I'm not au courant with highlighters as I used to be.

Re: [External] Re: How to ignore certain words based on query specifics

2019-07-09 Thread Shifflett, David [USA]
available for highlighting? Thanks, David Shifflett On 7/9/19, 11:58 AM, "Michael Sokolov" wrote: I think what you're saying in you're example is that "free*" should match anything with a term matching that pattern, but not *only* freedom. In other word

Re: How to ignore certain words based on query specifics

2019-07-09 Thread Shifflett, David [USA]
. I want to keep the ignore words separate so they apply to all queries, but then realized the ignore words should only apply to relevant (matching) queries. I don’t want the users to be required to add ‘and not WORD’ many times to each of the listed queries. David Shifflett From: Diego

How to ignore certain words based on query specifics

2019-07-09 Thread Shifflett, David [USA]
ignore words that contains C, but not A or B. Thanks in advance, for any suggestions or advice, David Shifflett

Re: Live index upgrading

2019-06-21 Thread David Allouche
The bottom line for me, is that I am not going to upgrade to Lucene8 for a while. The index migration would either cause a service interruption, or would require a little while to implement. I have more urgent technical debt to deal with. > On 21 Jun 2019, at 19:11, David Allouche wr

Re: Live index upgrading

2019-06-21 Thread David Allouche
omputed index from them. Yes, Solr/ES can add database-like > behavior where they hold the true original source of the document and use > that to rebuild Lucene indices over time. But Lucene really is just a > "search index" and we need to be free to make important improvem

Re: Live index upgrading

2019-06-21 Thread David Allouche
om scratch on whatever version of Lucene > you want to use. > > Best, > Erick > > > >> On Jun 17, 2019, at 8:41 AM, David Allouche wrote: >> >> Hello, >> >> I use Lucene with PyLucene on a public-facing web application. We have a >> moder

Live index upgrading

2019-06-17 Thread David Allouche
Hello, I use Lucene with PyLucene on a public-facing web application. We have a moderately large index (~24M documents, ~11GB index data), with a constant stream of new documents. I recently upgraded to PyLucene 7. When trying to test the new release of PyLucene 8, I encountered an IndexForma

Re: How can I decode geo point postings?

2019-03-31 Thread David Smiley
Yup. And if you have the original lat/lon then you can forgo the complexity of reverse-engineering it from postings. ~ David Smiley Apache Lucene/Solr Search Developer http://www.linkedin.com/in/davidwsmiley On Thu, Mar 28, 2019 at 2:49 PM Adrien Grand wrote: > Hi Trejkaz, > > My b

Please help me

2019-01-23 Thread Ayuketah David Tabi
Greetings. I am using Lucene 7.6. To do indexing i am using StandardAnalyzer() and to index a numeric field this is what I did:  doc.add(new IntPoint("niveauexp", 4)); doc.add(new StoredField("niveauexp", 4));  Problem: when I tried to search "niveauexp" using 4 as search parameter, i recieve 

Re: More Spatial Relations

2018-06-01 Thread David Smiley
For predicates other than "intersects", that is true :-/ Any help you might be interested in offering here is most welcome. On Fri, Jun 1, 2018 at 8:38 PM Bingtao Yin wrote: > Hi David, > > Thanks for you reply. > > Compared to the prefix tree, implementation through d

Re: More Spatial Relations

2018-06-01 Thread David Smiley
lStrategy and ShapeValuesPredicate. You could cast the value, a Shape, to a JtsGeometry (a Spatial4j shape) and then call getGeom() to get the underlying JTS Geometry instance. If you find you need to fork entire classes then feel free to suggest improvements to the extensibility. ~ David On Tue, May 29, 201

Re: Spatial Indexing of Polygons

2017-08-15 Thread David Smiley
ng it or trying to document it. BTW I'm in FOSS4G Boston the next few days. Perhaps you might be there? ~ David On Mon, Aug 14, 2017 at 2:27 PM Tom Hirschfeld wrote: > Hey, > Is there a way to spatially index polygons that takes advantage of the new > BKD tree functionality? I was

Re: Lucene GeoNear Search and Sort Performance

2017-07-19 Thread David Smiley
here is an issue in your approach to measuring this. On Tue, Jul 18, 2017 at 9:26 PM sc wrote: > David, > > I was able to get it working with minor changes in my codebase. I didn't > have back port PointVectorStrategy class from 6.6.0 to 5.5.4 > > Code: > fina

Re: Lucene GeoNear Search and Sort Performance

2017-07-17 Thread David Smiley
port it to Lucene 5x. It shouldn't be too hard. Since you only need this for distance sorting, you could only port what's needed; have makeQuery(...) throw an exception. createIndexableFields need only output a 2-element array, one for each DoubleDocValuesField. ~ David On Mon, Jul 1

Re: Lucene GeoNear Search and Sort Performance

2017-07-16 Thread David Smiley
As I mentioned that PointVectorStrategy has an argument that accepts a Lucene FieldType that you can add docValues to. On Sun, Jul 16, 2017 at 2:07 PM sc wrote: > Thanks for the suggestion. > > I changed the strategy to > > this.strategy = new PointVectorStrategy(ctx, "pointVector"); > > And the

Re: Lucene GeoNear Search and Sort Performance

2017-07-14 Thread David Smiley
pose. For that strategy, you only need it to do just that, so you can enable docValues and disable the "index". That strategy accepts a FieldType in the constructor. PointVectorStrategy is limited to one point per document per field, and always uses double precision on both "x&quo

Re: Term Dictionary taking up lots of memory, looking for solutions, lucene 5.3.1

2017-07-02 Thread David Smiley
If there are no filters, then LatLonDocValuesField is going to be asked to sort all of your docs, which is obviously going to take awhile. Can you simply add a filter? Like a distance filter using LatLonPoint? On Thu, Jun 29, 2017 at 11:49 AM sc wrote: > Hi, > >I have similar requirement o

Re: Term Dictionary taking up lots of memory, looking for solutions, lucene 5.3.1

2017-06-14 Thread David Smiley
Nice! On Tue, Jun 13, 2017 at 11:12 PM Tom Hirschfeld wrote: > Hey All, > > I was able to solve my problem a few weeks ago and wanted to update you > all. The root issue was with the caching mechanism in > "makedistancevaluesource" method in the lucene spatial module, it appears > that documents

Re: Term Dictionary taking up lots of heap memory, looking for solutions, lucene 5.3.1

2017-06-06 Thread David Smiley
m super pleased with the performance. ~ David On Wed, May 17, 2017 at 10:59 PM Tom Hirschfeld wrote: > Hey! > > I am working on a lucene based service for reverse geocoding. We have a > large index with lots of unique terms (550 million) and it appears that > we're running in

Re: Highlighting and delineating Passages (fragmenting)

2017-05-30 Thread David Smiley
proach: https://issues.apache.org/jira/browse/LUCENE-5455 Or are the overlaps coming from passage offset ranges from separate queries to the same content? That I could understand better based on everything you said. I'm not sure how your code could be contributed in a way that fits in

Re: Highlighting and delineating Passages (fragmenting)

2017-05-30 Thread David Smiley
Looks like you should use the original Highlighter until requirement #2,3 can be done with the UnifiedHighlighter. Other than #2,3, the UH can handle all these requirements, and the OH can do all. On Sat, May 27, 2017 at 6:08 AM Dawid Weiss wrote: > Thanks for your explanation, David. &g

Highlighting and delineating Passages (fragmenting)

2017-05-26 Thread David Smiley
terface for UH-aware BreakIterators. The former (a new abstraction) would be cleaner, and might also remove a wart in the API due to the statefulness of BreakIterators. It's also kinda hard to write a BI correctly. I've implemented a few already and I know. It's an old API. ~

Re: TimeLimitingCollector accuracy

2016-12-22 Thread David Causse
Le 21/12/2016 à 13:27, David Causse a écrit : But given that some efforts have been done to separate sub scorers from "top-level" scorers (see https://issues.apache.org/jira/browse/LUCENE-5487) would it make sense now to make BulkScorers aware of some time constraints? Looking a

TimeLimitingCollector accuracy

2016-12-21 Thread David Causse
Hi, This subject has been discussed in the past but I don't think that any real solution was implemented yet. Here is a small test case to illustrate the problem: https://github.com/nomoa/lucene-solr/commit/2f025b18899038c8606da64c2cf9f4e1f643607f#diff-65ae49ceb38e45a3fc05115be5e61a2dR387 T

java.lang.IndexOutOfBoundsException: Index: 9634, Size: 97 opening an index

2016-11-24 Thread David Sitsky
ex? It is 120 GB in size and there are no backups.. :-/ Cheers, David

Lucene paid development for "SpanAndQuery" / "SpanAllNearQuery" support

2016-09-14 Thread David Sitsky
s and are interested, please send me an email to get the ball rolling. Many thanks. Cheers, David

Re: "Point in polygon" search with Lucene / Spatial4j / JTS

2016-06-05 Thread David Smiley
n that example in the 4x branch but are unaware it exists in 5x & 6x or wether you deliberately referenced 4x because you must use that version. Good luck, ~ David On Sat, Jun 4, 2016 at 12:34 PM Randall Tidd wrote: > Hello, > > I have what I think is a relatively simple use case t

Re: highlighter with query over more than one word

2016-06-03 Thread David Smiley
It would help tremendously if you can give a specific code example showing the problem. On Thu, Jun 2, 2016 at 6:41 AM Sascha Janz wrote: > > we use highlighter to get textfragments for our hit list. > > the code is straight forward like this > >Analyzer analyzer = new StandardAnalyzer(; >

Re: Best way to plug in alternative range query support

2016-05-25 Thread David Smiley
Ken, See BooleanQuery.Builder. p.s. nice to see you at Apache Big Data in Vancouver. ~ David On Thu, May 19, 2016 at 4:28 PM Ken Krugler wrote: > Hi all, > > I’ve got an alternative representation in the index for numeric fields, > and I need to construct an alternative approa

Classpath issue

2015-07-12 Thread David Yanay
thing wrong? I would appreciate help on this issue. Many Thanks! David. -- David Yanay CTO SmartMedia Marketing S.M.M. Derech HaYam 11, Haifa, 3463106, Israel http://www.smartmediamarketing.com Mobile: +972-50-6856644 Tel: +972-4-8583435 Fax: +972-4-8583436 LinkedIn: https://www.linkedin.com/in/yanay

Re: Part of speech search with lucene

2015-03-04 Thread David Villarejo
Hi Mike, Your solution work! I've been trying it with PhraseQuery and It works pretty good. Thank you so much. David. 2015-03-03 23:00 GMT+01:00 Michael Sokolov : > I believe you can accomplish what you are talking about using PhraseQuery, > say: note that it has > > public v

Re: Part of speech search with lucene

2015-03-03 Thread David Villarejo
} { fox | > noun:fox } > > with punctuation to suggest the token graph > > -Mike > > > On 03/03/2015 01:21 PM, David Villarejo wrote: > >> After many google searchs I decided to post my problem here hoping that >> someone help me. What I want to achieve is to pe

Part of speech search with lucene

2015-03-03 Thread David Villarejo
e queries will work. (correct me if I'm wrong) The second thing I thought was to index extra info as synonyms of the term but, this way, the second query won't work since I can't ask if the first term is an adj and the specific word "brown" simultaneously. Any way to address this problem, suggestions, etc. will be appreciated. David.

Re: Lucene search in attachments

2015-02-10 Thread David Pilato
I don’t understand. If you don’t raise this restriction to a higher value (or to -1), all the text won’t be extracted so only a subset of the text will be indexed. Non indexed parts of the text won’t be searchable. Did I misunderstand your question? -- David Pilato | Technical Advocate

Re: Lucene search in attachments

2015-02-10 Thread David Pilato
-mapper-attachments/blob/master/src/main/java/org/elasticsearch/index/mapper/attachment/AttachmentMapper.java#L456 <https://github.com/elasticsearch/elasticsearch-mapper-attachments/blob/master/src/main/java/org/elasticsearch/index/mapper/attachment/AttachmentMapper.java#L456> -- David

Re: Search "_all" field with a term

2014-10-11 Thread David Pilato
You should ask this on the elasticsearch mailing list. BTW, look at elasticsearch copy_to feature. Better than _all field. My 2 cents. -- David ;-) Twitter : @dadoonet / @elasticsearchfr / @scrutmydocs > Le 11 oct. 2014 à 11:31, "haiwei.xie-soulinfo" a > écrit : > > H

Issues with lucene 4.10.0 on android

2014-09-25 Thread David Linde
“DirectoryReader.” try { sis.read(dir, fileName); } catch (NoSuchFileException | FileNotFoundException var10) { sis = null; } Is this something you would consider changing to make it compatible with android? Thanks in Advance, David

RE: How to properly correlate relevance in a search across multiple collections

2014-09-09 Thread Baldwin, David
g. Anyone? -Original Message- From: atawfik [mailto:contact.txl...@gmail.com] Sent: Tuesday, September 09, 2014 12:42 AM To: java-user@lucene.apache.org Subject: RE: How to properly correlate relevance in a search across multiple collections Hi David, It seems that MultiSearcher is deprecated i

RE: How to properly correlate relevance in a search across multiple collections

2014-09-08 Thread Baldwin, David
ested. Best, Erick On Mon, Sep 8, 2014 at 8:59 AM, Baldwin, David wrote: > Would it be possible, or does anyone have any experience, in using the raw > score from each separate collection to order and then after a merge come up > with relevancy? > > -Original Message- &

RE: How to properly correlate relevance in a search across multiple collections

2014-09-08 Thread Baldwin, David
as performant as a simple non-df similarity approach. It might also be nice for apps to offer users pure-tf scoring if it provides faster search results, and then the user could click on a "refine results" button to re-do the search with the more expensive cross-corpus df-based

RE: How to properly correlate relevance in a search across multiple collections

2014-09-08 Thread Baldwin, David
Would it be possible, or does anyone have any experience, in using the raw score from each separate collection to order and then after a merge come up with relevancy? -Original Message- From: atawfik [mailto:contact.txl...@gmail.com] Sent: Sunday, September 07, 2014 9:50 AM To: java-use

How to properly correlate relevance in a search across multiple collections

2014-09-05 Thread Baldwin, David
anyone have any good experience with this and could they share some tid-bits or info I may not have run across yet? -David - To unsubscribe, e-mail: java-user-unsubscr...@lucene.apache.org For additional commands, e-mail: java

2.9.2 Memory issue 8.0GB or more / OOM with Term / Highlighting

2014-07-30 Thread Baldwin, David
I am looking to track down an issue in 2.9.2 where during highlighting, certain data may cause rapid memory usage and OOM exception in java: --- java.lang.OutOfMemoryError: Java heap space at org.apache.lucene.analysis.Token.growTermBuffer(Token.java:470) at

Re: How to handle words that stem to stop words

2014-07-07 Thread David Murgatroyd
rom the web.) The work integrating OpenNLP <https://issues.apache.org/jira/browse/LUCENE-2899> might also prove helpful. Best, David Murgatroyd ww.linkedin.com/in/dmurga/ <http://www.linkedin.com/in/dmurga/> [image: Inline image 1] On Mon, Jul 7, 2014 at 5:53 PM, Sujit Pal wrote: &g

RE: Lucene Upgrade from 2.9.x to 4.7.x

2014-05-29 Thread Baldwin, David
I am curious what type of time-frame it took you to do the upgrade, including acceptable unit and QA tesing? -Original Message- From: Xiaolong Zheng [mailto:xiaolong.zh...@mathworks.com] Sent: Thursday, May 29, 2014 9:14 AM To: java-user@lucene.apache.org Subject: Re: Lucene Upgrade from

RE: Lucene Upgrade from 2.9.x to 4.7.x

2014-05-29 Thread Baldwin, David
I am looking for the same. Need to upgrade from 2.9.2 . -Original Message- From: Buddhavarapu, Suresh [mailto:suresh.buddhavar...@emc.com] Sent: Thursday, May 29, 2014 7:57 AM To: java-user@lucene.apache.org Subject: Lucene Upgrade from 2.9.x to 4.7.x Hello, I'm looking for some docum

RE: best choice for ramBufferSizeMB

2014-05-16 Thread Baldwin, David
Is this true as well for 2.9.2? -Original Message- From: Michael McCandless [mailto:luc...@mikemccandless.com] Sent: Wednesday, May 14, 2014 8:54 AM To: Lucene Users Subject: Re: best choice for ramBufferSizeMB Generally larger is better, as long as JVM's heap is big enough to allow IW t

BooleanScorer - Maximum Prohibited Scorers?

2014-04-16 Thread David Stimpert
Hello, I have found useful functionality in BooleanQuery which allows me to specify a minimum number of matching optional terms (i.e. setMinimumNumberShouldMatch). I do not, however, see similar functionality available for setting the maximum number of MUST_NOTs (i.e. setMaximumNumberMustNotMatch)

FW: [Jts-topo-suite-user] Persistent STR tree

2014-01-17 Thread Smiley, David W.
FYI for those with spatial interests… From: , "Smiley, David W." mailto:dsmi...@mitre.org>> Date: Friday, January 17, 2014 at 11:53 AM To: Demeter Sztanko mailto:szta...@gmail.com>> Cc: "jts-topo-suite-u...@lists.sourceforge.net<mailto:jts-topo-suite-u...@lists.

How to use Lucene-spatial

2013-10-23 Thread Smiley, David W.
/branch_4x/lucene/spatial/src/test/org/apache/lucene/spatial/SpatialExample.java?view=markup In addition, this page from Solr's wiki should be useful even if you aren't using Solr: http://wiki.apache.org/solr/SolrAdaptersForLuceneSpatial4 Cheers, David Smiley

Re: Equivalent LatLongDistanceFilter in Lucene 4.4 API

2013-10-08 Thread David Smiley (@MITRE.org)
sort or boost by the actual distance. ~ David james bond wrote > Hi All, > > Can you please let me know if there is an equivalent of > LatLongDistanceFilter in Lucene 4.4 API. > This API was present in Lucene 3.6 API. > > I have to mainly compute whether a point(lat,la

Search in a specific ScoreDoc result

2013-09-17 Thread David Miranda
Hi, I want to do a kind of 'facet search', that initial research in a field of all documents in the Lucene index, and second search in other field of the documents returned to the first research. Currently I'm do the first research : Query q = new QueryParser(Version.LUCENE_43, "label", analyzer

Re: Lucene Concurrent Search

2013-09-06 Thread David Miranda
; } > } > > Usually does the trick. You need to put some parameters ("indexDir") into > your web.xml and make sure that it knows that SearchListener is a > ServletListener for your Web app. > > But, to re-iterate what someone else said: if you really just want RE

Re: Lucene Concurrent Search

2013-09-05 Thread David Miranda
> > > I use a singleton class but there are other ways in tomcat. Can't > > remember what - maybe application scope. > > > > > > -- > > Ian. > > > > > > On Thu, Sep 5, 2013 at 4:46 PM, David Miranda > > > wrote: > > >

Re: Lucene Concurrent Search

2013-09-05 Thread David Miranda
- Search from 1 Million open source projects. > > > > On Thu, Sep 5, 2013 at 6:46 AM, David Miranda >wrote: > > > Hi, > > > > I'm developing a web application, that contains a REST service in the > > Tomcat, that receives several requests per second. &

Re: Lucene Concurrent Search

2013-09-05 Thread David Miranda
rcherManager sm = new SearcherManager(dir, new SearcherFactory()); > > to get default behaviour. The javadocs for SearcherFactory explain > that you can write your own implementation if you want custom > behaviour such as warming. > > > -- > Ian. > > > On Thu, Sep 5, 2013 a

Lucene Concurrent Search

2013-09-04 Thread David Miranda
Hi, I'm developing a web application, that contains a REST service in the Tomcat, that receives several requests per second. The REST requests do research in a Lucene index, to do this i use the IndexSearch. My questions are: - There are concurrency problems in multiple research? - What the best

Re: Lucene Text Similarity

2013-09-04 Thread David Miranda
st, > >Tim > > > From: Ivan Krišto [ivan.kri...@gmail.com] > Sent: Wednesday, September 04, 2013 3:17 AM > To: java-user@lucene.apache.org > Subject: Re: Lucene Text Similarity > > On 09/03/2013 07:33 PM, David Miranda wrote: > > Is there any wa

Lucene Text Similarity

2013-09-03 Thread David Miranda
r (Version.LUCENE_43, "abstract", analyzer). Parse (mytext); Search field abstract the "mytext", not me no results are returned. What can I do to implement this feature? Thanks in advance, David - To

Re: Searching within a Search Result

2013-08-19 Thread David Miranda
smallest possible > self-contained example that shows the problem. > > > -- > Ian. > > > On Tue, Aug 6, 2013 at 9:55 PM, David Miranda > wrote: > > I experimented with the previous code, but no results are returned from > the > > index. Someone can give me an example? I h

Re: Searching within a Search Result

2013-08-06 Thread David Miranda
I experimented with the previous code, but no results are returned from the index. Someone can give me an example? I have been for some time trying to implement this functionality. Thanks. 2013/8/6 David Miranda > Do this with the BooleanQuery: > >> Query q1 = new QueryParser(Versi

Re: Searching within a Search Result

2013-08-06 Thread David Miranda
e query generated by QueryParser or direct > construction of a TermQuery. See also CachingWrapperFilter if the > index isn't constantly changing. > > > -- > Ian. > > > On Tue, Aug 6, 2013 at 3:19 PM, David Miranda > wrote: > > Hi, > > > > I have a

Searching within a Search Result

2013-08-06 Thread David Miranda
scoreDocs; After the results of the first survey, I want to do a search in the abstract field only in results obtained from the first survey. How i do this? Thanks in advance. David

Re: handling nonexistent fields in an index

2013-07-03 Thread David Carlton
ng field, boolean negate)) > > -- Jack Krupansky > > -Original Message- From: David Carlton > Sent: Wednesday, July 03, 2013 4:27 PM > To: java-user@lucene.apache.org > Subject: handling nonexistent fields in an index > > > I have a bunch of Lucene indices l

handling nonexistent fields in an index

2013-07-03 Thread David Carlton
field, *) will have good performance if field only takes a small number of values. -- David Carlton carl...@sumologic.com

Re: Spatial indexing: IndexOutOfBounds in QuadPrefixTree

2013-03-09 Thread David Smiley (@MITRE.org)
Just finished: http://wiki.apache.org/solr/SpatialForTimeDurations - Author: http://www.packtpub.com/apache-solr-3-enterprise-search-server/book -- View this message in context: http://lucene.472066.n3.nabble.com/Spatial-indexing-IndexOutOfBounds-in-QuadPrefixTree-tp4040511p4045998.html Se

Re: Spatial indexing: IndexOutOfBounds in QuadPrefixTree

2013-03-08 Thread David Smiley (@MITRE.org)
Paul, FYI: http://lucene.472066.n3.nabble.com/InvalidShapeException-when-using-SpatialRecursivePrefixTreeFieldType-with-custom-worldBounds-tt4045351.html I suggested to file a bug report. ~ David Paul Alexandrow wrote > Hi List, > > I've encountered this problem using Solr (

Re: DiskDocValues vs Lucene42Codec

2013-03-08 Thread David Smiley (@MITRE.org)
Thanks Robert; that's very helpful. - Author: http://www.packtpub.com/apache-solr-3-enterprise-search-server/book -- View this message in context: http://lucene.472066.n3.nabble.com/DiskDocValues-vs-Lucene42Codec-tp4044061p4045935.html Sent from the Lucene - Java Users mailing list archive

DiskDocValues vs Lucene42Codec

2013-03-08 Thread David Smiley (@MITRE.org)
ust discovered Lucene42Codec.getDocValuesFormatForField which I can customize. But that still leaves the 1st question. It would be nice to not have to re-index. ~ David - Author: http://www.packtpub.com/apache-solr-3-enterprise-search-server/book -- View this message in context: http://lucene.472066

Re: Lucene vs RDBMS indexing at scale

2013-02-05 Thread David Pilato
So you probably should ask your question to the Elasticsearch mailing list. I think that some ES users already scales to x billion docs. Even if ES is Lucene based, it adds features to scale out (sharding, routing...). HTH -- David ;-) Twitter : @dadoonet / @elasticsearchfr / @scrutmydocs Le

Re: Configuration for edge ngram typeahead

2013-01-04 Thread David Pilato
Oh. My bad! Sorry. I misread your JSON. BTW I see that you solve yourself your problem on StackOverFlow. -- David ;-) Twitter : @dadoonet / @elasticsearchfr / @scrutmydocs Le 4 janv. 2013 à 23:21, "C. Benson Manica" a écrit : Do I have to do it that way, i.e. POST a separate settin

Re: Configuration for edge ngram typeahead

2013-01-04 Thread David Pilato
Did you define mappings for your docs and fields to use that analyzer? See: http://www.elasticsearch.org/guide/reference/api/admin-indices-put-mapping.html -- David ;-) Twitter : @dadoonet / @elasticsearchfr / @scrutmydocs Le 4 janv. 2013 à 22:30, "C. Benson Manica" a écrit : I

Re: what is the offsets and payload in DocsAndPositionsEnum for ??

2012-11-27 Thread David Causse
unsubscr...@lucene.apache.org For additional commands, e-mail: java-user-h...@lucene.apache.org - To unsubscribe, e-mail: java-user-unsubscr...@lucene.apache.org For additional commands

Re: Ex.

2012-11-09 Thread David Pilato
Hi Fabio, Did you consider using Hibernate Search? It's based on Lucene. It may help you. David. Le 9 novembre 2012 à 14:15, Fabio Ebner a écrit : > Hi.. can someone help me? i need to integrate LUCENE + SPRING + HIBERNATE > and don't find any articlhe about this.. > >

Re: question re lucene spatial toolkit aka LSP aka spatial4j

2012-08-08 Thread David Smiley (@MITRE.org)
I responded to the solr-user thread. - Author: http://www.packtpub.com/apache-solr-3-enterprise-search-server/book -- View this message in context: http://lucene.472066.n3.nabble.com/question-re-lucene-spatial-toolkit-aka-LSP-aka-spatial4j-tp3999889p425.html Sent from the Lucene - Java

  1   2   3   >