RE: What is stemming?

2005-11-20 Thread anton
About stemmers you can read on http://snowball.tartarus.org/ -Original Message- From: Koji Sekiguchi [mailto:[EMAIL PROTECTED] Sent: Monday, November 21, 2005 2:37 AM To: java-user@lucene.apache.org Subject: RE: What is stemming? Gekkokid, Daniel, Giovanni, Thank you very much for your

RE: OutOfMemory during optimize

2005-12-22 Thread anton
We increase system parameter "max number open files". For do it use command ulimit -n [max number]. -Original Message- From: Steve Rajavuori [mailto:[EMAIL PROTECTED] Sent: Friday, December 23, 2005 12:42 AM To: lucene-user@jakarta.apache.org Subject: OutOfMemory during optimize I am pe

RE: Indexing speed

2006-02-27 Thread anton
maxBufferedDocs parameters. You can also look for my article about indexing with Lucene (link in the Wiki), which includes code for playing with various parameters and explains what's going on, etc. Sorry, but where this link ?

RE: Indexing speed

2006-02-27 Thread anton
maxBufferedDocs parameters. You can also look for my article about indexing with Lucene (link in the Wiki), which includes code for playing with various parameters and explains what's going on, etc. Sorry, but where this link ? Where placed your article ? Please, give me url. -

RE: search problem

2006-03-03 Thread anton
2) assuming what you want is not all brands and all prices, but just the prices and brands of the documents in you results, then i would strongly recommend doing your search twice -- once do get the Hits object you need and once using a HitCollector. Within the HitCollector, use FieldCaches to lo

RE: sub search

2006-03-07 Thread anton
); -Original Message- From: hu andy [mailto:[EMAIL PROTECTED] Sent: Tuesday, March 07, 2006 12:40 PM To: java-user@lucene.apache.org Subject: Re: sub search Importance: High 2006/3/7, Anton Potehin <[EMAIL PROTECTED]>: > > Is it possible to make search among results of previous s

RE: Get only count

2006-03-07 Thread anton
g Subject: Re: Get only count Importance: High Anton Potehin wrote: > Now I create new search for get number of results. For example: > > IndexSearcher is = ... > > Query q = ... > > numberOfResults = Is.search(q).length(); > > Can I accelerate this example ? And how ? P

RE: Get only count

2006-03-07 Thread anton
g Subject: Re: Get only count Importance: High Anton Potehin wrote: > Now I create new search for get number of results. For example: > > IndexSearcher is = ... > > Query q = ... > > numberOfResults = Is.search(q).length(); > > Can I accelerate this example ? And how ? P

RE: Get only count

2006-03-07 Thread anton
Can have matching document score equals zero ? -Original Message- From: Yonik Seeley [mailto:[EMAIL PROTECTED] Sent: Tuesday, March 07, 2006 6:20 PM To: java-user@lucene.apache.org Subject: Re: Get only count Importance: High On 3/7/06, [EMAIL PROTECTED] <[EMAIL PROTECTED]> wrote: > Whil

RE: Get only count

2006-03-07 Thread anton
Can have matching document score equals zero ? -Original Message- From: Yonik Seeley [mailto:[EMAIL PROTECTED] Sent: Tuesday, March 07, 2006 6:20 PM To: java-user@lucene.apache.org Subject: Re: Get only count Importance: High On 3/7/06, [EMAIL PROTECTED] <[EMAIL PROTECTED]> wrote: > Whil

RE: Get only count

2006-03-08 Thread anton
Signifies this that method collect can be called for document with score <= 0 ? -Original Message- From: Yonik Seeley [mailto:[EMAIL PROTECTED] Sent: Tuesday, March 07, 2006 6:35 PM To: java-user@lucene.apache.org Subject: Re: Get only count Importance: High On 3/7/06, [EMAIL PROTECTED]

RE: speed

2006-03-10 Thread anton
Sent: Friday, March 10, 2006 4:33 PM To: java-user@lucene.apache.org Subject: Re: speed Importance: High I'm curious, how many documents in your index and what kind of queries are you issuing? Even 1.6 seconds is a LONG time for a search in most Lucene indexes. Erik On Mar

RE: speed

2006-03-10 Thread anton
many documents in your index and what kind of queries are you issuing? Even 1.6 seconds is a LONG time for a search in most Lucene indexes. Erik On Mar 10, 2006, at 5:38 AM, Anton Potehin wrote: > I try 2 way for get count results: > > 1) > Query q = > IndexSear

RE: speed

2006-03-10 Thread anton
ect: Re: speed > Importance: High > > I'm curious, how many documents in your index and what kind of > queries are you issuing? Even 1.6 seconds is a LONG time for a > search in most Lucene indexes. > > Erik > > > On Mar 10, 2006, at 5:38 AM, Anton Po

RE: speed

2006-03-10 Thread anton
Each variant started in different process -Original Message- From: Eric Jain [mailto:[EMAIL PROTECTED] Sent: Friday, March 10, 2006 5:15 PM To: java-user@lucene.apache.org Subject: Re: speed Importance: High [EMAIL PROTECTED] wrote: > When I make search I get count = 37. > May be

RE: speed

2006-03-10 Thread anton
Sorry. I was repeated test. And now both way have same speed. -Original Message- From: Anton Potehin [mailto:[EMAIL PROTECTED] Sent: Friday, March 10, 2006 1:39 PM To: java-user@lucene.apache.org Subject: speed Importance: High I try 2 way for get count results: 1) Query q

RE: speed

2006-03-10 Thread anton
repeated test. And now both way have same speed. -Original Message- From: Anton Potehin [mailto:[EMAIL PROTECTED] Sent: Friday, March 10, 2006 1:39 PM To: java-user@lucene.apache.org Subject: speed Importance: High I try 2 way for get count results: 1) Query q = IndexSearcher is

RE: Grouping results by choosen field

2006-03-21 Thread anton
Good grouping by domain realized in nutch... Nutch can serve good example of group on certain field. -Original Message- From: Java Programmer [mailto:[EMAIL PROTECTED] Sent: Tuesday, March 21, 2006 3:56 PM To: java-user@lucene.apache.org Subject: Re: Grouping results by choosen field On

Clusterization of searching

2006-04-11 Thread anton
What be way for clusterizations of searching? - To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-mail: [EMAIL PROTECTED]

Re: is Document match Query

2018-12-17 Thread Anton Zenkov
https://lucene.apache.org/core/7_6_0/memory/org/apache/lucene/index/memory/MemoryIndex.html Anton On Mon, Dec 17, 2018 at 8:06 AM Valentin Popov wrote: > Hello. > > I need implement a feature, that answer for a question: is a Document > match a Query. > > Right now, I’m imp

Re: Lucene Index Cloud Replication

2019-07-11 Thread Anton Zenkov
to download the relevant files. > > > > I'd like to share what I've got for 1 and 3, based on S3 and DynamoDB, > but > > I'd like to do it with interfaces that lend themselves to other > > implementations for blob and metadata storage. > > > &

question about spatial module in lucene 5

2015-03-30 Thread Anton Lyska
Hi I have tried to upgrade lucene from 4.x to 5.0 recently.I found out from documentation that FieldCache is removed from lucene 5, and I should use DocValues fields for sorting. I upgraded my sources, and everything works fine except spatial sorting by distance. When I looked into PointVectorStra

scanning whole index stored fields while using best compression mode

2015-06-03 Thread Anton Zenkov
tor(); fieldsReader.visitDocument(i, visitor); visitor.getDocument(); } } I was wondering if there is better way of doing this and if there are plans to make access to the faster document loading through some API. Should I try to come up with a patch for this? Thanks! Anton

Re: scanning whole index stored fields while using best compression mode

2015-06-03 Thread Anton Zenkov
Reindexing. If I want to add new fields or change existing fields in the index I need to go through all documents of the index. On Wed, Jun 3, 2015 at 4:46 PM, Robert Muir wrote: > On Wed, Jun 3, 2015 at 4:00 PM, Anton Zenkov > wrote: > > > > > for (int i = 0; i <

CachingWrapperQuery performance

2015-06-15 Thread Anton Lyska
Hi, I have performance issues with CachingWrapperQuery with lucene 5.2 and dont know how to solve it. Prehistory: I have search with different parameters, where some parameters are used more frequently then others. For these params I used filters(and cached them), and my search looked li

Re: CachingWrapperQuery performance

2015-06-17 Thread Anton Lyska
t any caching 115ms. Huh, I think that's all. I would appreciate any advice. Regards, Anton 2015-06-16 0:35 GMT+03:00 Adrien Grand : > Hi Anton, > > Thanks for reporting this. It is indeed a bit surprising given that > both classes work in a very similar way. Can you confirm tha

Re: Lucene [3.0] Skip Matching Document.

2015-07-28 Thread Anton Zenkov
You can always throw an exception in the collector to stop the collection process. Anton On Tue, Jul 28, 2015 at 4:26 AM, Muhammad Ismail wrote: > Can we skip matching lucene document by using custom collector or some > other way. Like I want to bring all document created by user

Re: IndexWriter is not closing the FDs (deleted files)

2015-08-31 Thread Anton Zenkov
Are you sure you are not holding open readers somewhere? On Mon, Aug 31, 2015 at 7:46 PM, Marcio Napoli wrote: > Hey! :) > > It seems IndexWriter is not closing the descriptors of the removed files, > see the log below. > > Thanks, > Napoli > > [root@server01 log]# ls -l /proc/59491/fd | grep i

sizes of non-fdt flies affected by compression settings

2015-10-01 Thread Anton Zenkov
each) text docs. I guess I could go through the code myself to understand this but may be someone can shed some light on this. Thanks! Anton

Re: 500 millions document for loop.

2015-11-12 Thread Anton Zenkov
Which version of Lucene are you using? On Thu, Nov 12, 2015 at 11:39 AM, Valentin Popov wrote: > Hello everyone. > > We have ~10 indexes for 500M documents, each document has «archive date», > and «to» address, one of our task is calculate statistics of «to» for last > year. Right now we are us

non-standard query

2006-01-19 Thread Anton Potehin
I've the following problem: I've a big number of documents indexed. Suppose that the search query contains 20 terms. It is necessary to find all documents which contains at least 5 terms from search query. Is it possible to implement? If yes, what problems may arise during the solving of thi

Duplicates recods in index

2006-02-08 Thread Anton Potehin
Is it possible to add records into lucene index using following algorithm: 1) create Document object 2) add 5 fields into Document (id, name, field1, field2, field3). All fields are stored, indexed and tokenized 3) check if the document with current id and name was added before 4) if yes

Problem with TermDocs

2006-02-20 Thread Anton Potehin
Ir is IndexReader. termIdent is Term int freq = ir.docFreq(termIdent); if (freq > 1) { TermDocs termDocs = ir.termDocs(termIdent); int[] docsArr = new int[freq]; int[] freqArr = new int[freq]; int number = termDocs.read(docsArr,freqArr); System.out.println(number)

search problem

2006-02-28 Thread Anton Potehin
I have a problem. There is an index, which contains about 6,000,000 records (15,000,000 will be soon) the size is 4GB. Index is optimized and consists of only one segment. This index stores the products. Each product has brand, price and about 10 more additional fields. I want to not just find so

sub search

2006-03-07 Thread Anton Potehin
Is it possible to make search among results of previous search?   For example: I made search: Searcher searcher =... Query query = ... Hits hits = hits = Searcher.search(query);   After it I want to not make a new search, I want to make search among found results

Get only count

2006-03-07 Thread Anton Potehin
Now I create new search for get number of results. For example: IndexSearcher is = ... Query q = ... numberOfResults = Is.search(q).length(); Can I accelerate this example ? And how ?

RangeQuery and RangeFilter

2006-03-08 Thread Anton Potehin
What faster RangeQuery or RangeFilter ?

speed

2006-03-10 Thread Anton Potehin
1.644 sec. Second way return results fot 3.088 sec. Why first way faster then second? Can I accelerate it? And how? Can you offer any ways for get number of results? --CountCollector import org.apache.lucene.search.HitCollector; /** * @author Anton Potekhin * @date: 03.03.2006

How to serach in sentence and dispaly the whole sentence

2006-04-23 Thread anton feldmann
, how I have to change the IndexFile, whether that conforms with the Lucene Team. cheers anton feldmann - To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-mail: [EMAIL PROTECTED]

Highlight

2006-04-26 Thread anton feldmann
Hi I wrote a program that make a pdf document to an Lucene document. The field ate "contents", "sentence", : How do i display the sentence the query String is in? and how do I Highlight the String? cheers anton feldmann package de.coli.seek.lucene; import

How to display a field value

2006-04-26 Thread anton feldmann
Hi how do i display the whole field value of an document the query string is found? cheers anton - To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-mail: [EMAIL PROTECTED]

Re: How to serach in sentence and dispaly the whole sentence

2006-04-26 Thread anton feldmann
Are the names of a field in a document unique or can i make a field with the name "sentence" for each sentence in an text document? Grant Ingersoll schrieb: Anton, I think there are at least a couple of ways of doing this. I assume you have a program that does sentence detection a

lucene search sentence

2006-04-27 Thread Anton Feldmann
in an document? 2) How do I display the sentence the hit is in? and color the hit. 3) How do I display the sentence before and after the sentence the hit is in? Cherrs anton - To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-mail: [EMAIL PROTECTED]