Scoring question - Get Score of Best Query in BooleanQuery

2007-05-06 Thread Thomas Thomas
Hello everyone, Whenever I search a word in my web application, I search in some default fields, e.g. I search the word "hello", I generate these queries : title:hello headlines:hello summary:hello content:hello Which I add in a BooleanQuery (BooleanClause.Occur.SHOULD) What I want to achieve

Lucene and XML Architecture

2007-07-19 Thread Thomas
ombination of native XML stores and Lucene? Are there any problems that could arise from this combination? - Thomas - To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-mail: [EMAIL PROTECTED]

Re: Lucene and XML Architecture

2007-07-20 Thread Thomas
Thanx a lot Patrick! That's exactly what I was hoping for. I'll give it a shot. -Thomas Patrick Turcotte wrote: Hi, There is a Lucene-eXist trigger that allows you to do just that. Take a look at patch http://sourceforge.net/tracker/index.php?func=detail&aid=1654205&g

problem understanding the documentation for the TieredMergePolicy class

2012-06-12 Thread thomas
ePolicy.html#findMerges%28org.apache.lucene.index.SegmentInfos%29> Would somebody be so kind to explain it to me? Thanks, thanks a lot Thomas

How to ignore a ,

2016-11-28 Thread Thomas Johnson
; when we search for "Doe*" Thank you. Thomas W. Johnson, Senior Programmer 678-397-1663 tjohn...@paperhost.com<mailto:tjohn...@paperhost.com> [PaperHost] [asdf]<http://bit.ly/PaperHost_Twitter> Follow PaperHost on T

java 17 and older lucene (4.x)

2022-09-26 Thread Thomas Matthijs
Hello, Just wondering if anyone has patched lucene 4.x for usage with java 17+ and willing to share their work? anything would be appreciated. No we cannot upgrade lucene, and will likely spend time to try to backport/patch it ourselves, but maybe someone already has? if anyone has interest in

Indexing puncutation

2005-06-28 Thread Aigner, Thomas
Hello all, I am VERY new to Lucene and we are trying out Lucene to see if it will accomplish the vast majority of our search functions. I have a question about a good way to index some of our product description codes. We have description codes like 21-MA-GAB and other punctuatio

RE: Indexing puncutation

2005-06-28 Thread Aigner, Thomas
on On 6/28/05, Aigner, Thomas <[EMAIL PROTECTED]> wrote: > Hello all, > > I am VERY new to Lucene and we are trying out Lucene to see if > it will accomplish the vast majority of our search functions. > > I have a question about a good way to in

RE: Indexing puncutation

2005-06-29 Thread Aigner, Thomas
#x27;t matter. Two other issues are code/data size (ICU can be big) and the performance hit while indexing documents. -- Ken >Aigner, Thomas wrote: > >>Hello all, >> >> I am VERY new to Lucene and we are trying out Lucene to see if >>it will accomplish the v

question on filters

2005-07-05 Thread Aigner, Thomas
Hi all, I am creating a punctuation filter to filter certain punctuation out of the token stream. I am getting a "The field t.termText is not visible" error. I'm not sure what I would need to include to make this property visible (I am still new to Lucene and Java for that matter). I copi

Filtering accents

2008-12-30 Thread legrand thomas
Dear all, I'd like my lucene searches to be insensitive to (French) accents. For example, considering a indexed term "métal", I want to get it when searching for "metal" or "métal" . I use lucene-2.3.2 and the searches are performed with: IndexSearcher.search(query,filter,sorter), Another filte

Creating document fields by providing termvector directly (bypassing the analyzing/tokenizing stage)

2009-04-21 Thread Thomas Pönitz
] b[2] c[1]. The old discussion had no real solution but it is also a bit outdated, maybe someone has a better idea now. Greets, Thomas - To unsubscribe, e-mail: java-user-unsubscr...@lucene.apache.org For additional commands, e-

Index and search terms containing character "-"

2009-05-31 Thread legrand thomas
Hi, I have a problem using TermQuery and FuzzyQuery for terms containing the character "-". Considering I've indexed "jack" and "jack-bauer" as 2 tokenized captions, I get no result when searching for "jack-bauer". Moreover, "jack" with a TermQuery returns the two captions.   What should I do t

Re: Index and search terms containing character "-"

2009-06-02 Thread legrand thomas
d strongly recommend you get a copy of Luke, it's invaluable for questions like this because it lets you look at what's actually in your index. It'll also show you how queries get broken down when pushed through various analyzers... BTW, nice test case for demonstrating what you w

Re: Loading an index into memory

2009-07-24 Thread Thomas Becker
/www.windowslive.com/Online/Hotmail/Campaign/QuickAdd?ocid=TXT_TAGLM >>>> _WL_QA_HM_sports_photos_072009&cat=sports >>>> >>> - >>> To unsubscribe, e-mail: java-user-u

2.9 - leftover (deleted) filehandles after upgrade

2009-07-29 Thread Thomas Becker
mpDir); with IndexSearcher indexSearcherTmp = new IndexSearcher(tmpDir, true); No errors in the logfiles, no catched exceptions, etc. I'm a kinda out of ideas at the moment. I googled and tried couple of things (IndexWriter.setUseCompoundFile(true), etc.) but didn't find a solution. A

lucene 2.9.0RC4 slower than 2.4.1?

2009-09-15 Thread Thomas Becker
y took {} ms", durationMillis); } return docs; } I'm wondering why others are experiencing better performance with 2.9 and why our implementations performance is going bad. Maybe our way of using the 2.9 api is not the best and sorting is definetly

Re: lucene 2.9.0RC4 slower than 2.4.1?

2009-09-15 Thread Thomas Becker
Missed the attachment, sorry. Thomas Becker wrote: > Hi all, > > I'm experiencing a performance degradation after migrating to 2.9 and running > some tests. I'm getting out of ideas and any help to identify the reasons why > 2.9 is slower than 2.4 are highly appreci

Re: lucene 2.9.0RC4 slower than 2.4.1?

2009-09-15 Thread Thomas Becker
Urm and uploaded here: http://ankeschwarzer.de/tmp/graph.jpg Sorry. Thomas Becker wrote: > Missed the attachment, sorry. > > Thomas Becker wrote: >> Hi all, >> >> I'm experiencing a performance degradation after migrating to 2.9 and running >> some tests.

Re: lucene 2.9.0RC4 slower than 2.4.1?

2009-09-15 Thread Thomas Becker
nerCache is a Map containing field + parser * (contracttocontentgroup prefix) as the key and as a value yet another map. * The latter map finally contains the docIds as key and positionvalue for this * prefix as value. * * @author Thomas Becker (thomas.bec...@net-m.de) * */ pub

Re: lucene 2.9.0RC4 slower than 2.4.1?

2009-09-15 Thread Thomas Becker
with lucene 2.4. I will now try a freshly build 2.9 index and see if performance improves. Maybe that already solves the issue...stupid me... We're updating the index every 30 min. at the moment and it gets optimized after each update. Mark Miller wrote: > Thomas Becker wrote: >> Hey Mar

Re: lucene 2.9.0RC4 slower than 2.4.1?

2009-09-15 Thread Thomas Becker
ry as well?! Will check that. Thanks a lot for your support! Cheers, Thomas Mark Miller wrote: > A few quick notes - > > Lucene 2.9 old api doesn't appear much worse than Lucene 2.4? > > You save a lot with the new Intern impl, because thats not a hotspot > anymore. But t

Re: lucene 2.9.0RC4 slower than 2.4.1?

2009-09-15 Thread Thomas Becker
org/jira/browse/LUCENE-753 > -- Thomas Becker Senior JEE Developer net mobile AG Zollhof 17 40221 Düsseldorf GERMANY Phone:+49 211 97020-195 Fax: +49 211 97020-949 Mobile: +49 173 5146567 (private) E-Mail: mailto:thomas.bec...@net-m.de Internet: http://www.net-m.de Registergericht: Amts

Re: lucene 2.9.0RC4 slower than 2.4.1?

2009-09-15 Thread Thomas Becker
Mark Miller wrote: > Thomas Becker wrote: >> Hey Mark, >> >> yes. I'm running the app on unix. You see the difference between 2.9 and 2.4 >> here: >> >> http://ankeschwarzer.de/tmp/graph.jpg >> > Right - I know your measurements showed

Re: lucene 2.9.0RC4 slower than 2.4.1?

2009-09-15 Thread Thomas Becker
Hi Uwe, already done. See my last message. Cheers, Thomas Uwe Schindler wrote: > On 2.9. NIOFS is only used, if you use FSDirectory.open() instead of > FSDirectory.getDirectory (Deprecated). Can you compare when you use instead > of FSDirectory.open() the direct ctor of SimpleFSDir vs.

Problems with ItemBasedRecommender with Lucene

2009-09-16 Thread Thomas Rewig
e fields... I'm using lucene 2.4.1 and java version "1.6.0_16". Do anyone have an idea to avoid the growing memory. Or do somebody know an other approche for a "realtime Item based Recommender" with Lucene? Regards Thomas --

Re: lucene 2.9.0RC4 slower than 2.4.1?

2009-09-16 Thread Thomas Becker
gt; https://issues.apache.org/jira/browse/LUCENE-753 > -- Thomas Becker Senior JEE Developer net mobile AG Zollhof 17 40221 Düsseldorf GERMANY Phone:+49 211 97020-195 Fax: +49 211 97020-949 Mobile: +49 173 5146567 (private) E-Mail: mailto:thomas.bec...@net-m.de Internet: http:/

Re: lucene 2.9.0RC4 slower than 2.4.1?

2009-09-16 Thread Thomas Becker
g, can you fill us > in on the query types you are using as well? (eg qualities) > > And grab invocations if its possible. > -- Thomas Becker Senior JEE Developer net mobile AG Zollhof 17 40221 Düsseldorf GERMANY Phone:+49 211 97020-195 Fax: +49 211 97020-949

Re: lucene 2.9.0RC4 slower than 2.4.1?

2009-09-16 Thread Thomas Becker
ry types you are using as well? (eg qualities) >> >> And grab invocations if its possible. >> >> -- >> - Mark >> >> http://www.lucidimagination.com >> >> >> >> Thomas Becker wrote: >>> Tests run on tmpfs: >>> config: impl=Sepa

Re: lucene 2.9.0RC4 slower than 2.4.1?

2009-09-16 Thread Thomas Becker
guess, based on > the 2.9 new api profiling, is that your queries may not be agreeing with > some of the changes somehow. Along with the profiling, can you fill us > in on the query types you are using as well? (eg qualities) > > And grab invocations if its possible. > -- Thomas B

Re: lucene 2.9.0RC4 slower than 2.4.1?

2009-09-16 Thread Thomas Becker
IndexSearcher.search was called only > once. > > Uwe > > > - > To unsubscribe, e-mail: java-user-unsubscr...@lucene.apache.org > For additional commands, e-mail: java-user-h...@lucene.apache.org > -- Thomas Becker Senior JEE Deve

Re: lucene 2.9.0RC4 slower than 2.4.1?

2009-09-16 Thread Thomas Becker
t was only > one search, you must have two segments and therefore no optimized index for > this to be correct? > > Uwe > > > - > To unsubscribe, e-mail: java-user-unsubscr...@lucene.apache.org > Fo

Re: Problems with ItemBasedRecommender with Lucene

2009-09-17 Thread Thomas Rewig
You use Lucene 2.9 is there a way to do this with Lucene 2.4.1 because I can't find e.g. the "PayloadEncoder" or do I have to wait for the release? Regards Thomas You might want to ask on mahout-user, but I'm guessing Ted didn't mean a new field for every item-item,

Using TermVectorMapper to compute term frequency across documents

2009-10-12 Thread Thomas D'Silva
getTermFreqVector(). I do not require the term frequency within a document. Thanks, Thomas HashMap termDocCount = new HashMap(); TermQuery tagQuery = new TermQuery(tagTerm); TopDocs docs = searcher.search(tagQuery, numDocs); for (int i=0 ; i public void map(String term, int frequency

Re: Using TermVectorMapper to compute term frequency across documents

2009-10-15 Thread Thomas D'Silva
while to compute the document,tag probabilities. Thanks, Thomas On Wed, Oct 14, 2009 at 8:15 AM, Grant Ingersoll wrote: > > On Oct 12, 2009, at 10:46 PM, Thomas D'Silva wrote: > >> Hi, >> >> I am trying to compute the counts of terms of the documents return

Sorting with MultiSearcher

2007-11-08 Thread WATHELET Thomas
Hi, I have few Indexes with the same structure. I'm using MultiSearcher to search into those indexes and when I try to sort the result by field the result is sort by field and by index (we have all results from index1 and then index2,...) but I would like to have the result sorted on the all result

RE: Sorting with MultiSearcher

2007-11-08 Thread WATHELET Thomas
EMAIL PROTECTED] Sent: 08 November 2007 13:22 To: java-user@lucene.apache.org Subject: Re: Sorting with MultiSearcher Any other info or code snippets? I sort on multisearchers all the time and have never seen that behavior. - Mark (sorting on multisearchers since Lucene 1.4 ) WATHELET Thomas w

Date sorting problem [ IndexSearcher | Hits | Sort | Float ]

2008-03-08 Thread legrand thomas
Dear all, I'm trying to sort query results using a date criteria. My dates are stored as "long" in the database (I cannot change this) and indexed as untokenized. The sorted resuIts I get aren't consistent. This problem does not occur if the number are "smaller". Am I doing something wrong ? I

Max length

2008-04-15 Thread WATHELET Thomas
iddle. Any body can help me thank's Thomas WATHELET Development Team Tel.: (+352) 4300 24752 E-mail: [EMAIL PROTECTED] <mailto:[EMAIL PROTECTED]> <>

lucene can't find segments file

2008-04-17 Thread Hoelzl, Thomas
file. It contains the following files. master:/home/thomas/keywordsearch/etc # ls /usr/local/jboss-3.2.7/server/default/conf/index/ .. _2.cfs segments.gen segments_9 I have checked the index using luke and it is good. In addition it works on Windows. Can anybody tell me why it is se

AW: lucene can't find segments file

2008-04-17 Thread Hoelzl, Thomas
e.org Betreff: Re: lucene can't find segments file It seems likely you are using an older version of Lucene to access an index created by a newer version of Lucene? Mike Hoelzl, Thomas wrote: > Hi all! > > I have some problems running my lucene application on linux (suse). > > l

Exact string

2008-04-30 Thread WATHELET Thomas
et something special to my search query? I need help... Thanks in advence. Thomas WATHELET Development Team Tel.: (+352) 4300 24752 E-mail: [EMAIL PROTECTED] <mailto:[EMAIL PROTECTED]> <>

Search for long titles - wildcard queries

2008-05-10 Thread legrand thomas
Dear all, I'm a recent Lucene user and I'm looking for the best way to perform searches over long titles (ad titles on a website). For example, if the following documents exist: - TITLE, "Fender telecaster" - TITLE, "Land rover defender" - TITLE, "I sale a wonderful fender st

Re: Question about indexing (BrazilianAnalyzer)

2008-06-04 Thread Thomas Arni
; c). Probably the problem is with this accents.. You can check this if you adapt the method tokenStream() in the BrazilianAnalzyer by including the ISOLatin1AccentFilter in the filter chain. Thomas Vinicius Carvalho said the following on 03/06/08 20:51: Hello there! I'm indexing documents u

advanced WildcardQuery

2008-07-16 Thread legrand thomas
ardQuery with the term "pretty*car". I also want to get this document when searching for "pretty*sale*". How should I do ? Is it really possible ? I use lucene 2.3.1. Thanks in advance, Thomas Legrand

Range Query Question

2008-07-25 Thread Thomas Becker
Name fields in a range between "A Balladeer*" TO "A Perfect Circle*" and get only terms back which are starting with that terms? Is there a way to accomplish that in Java and try it in luke? And is there a way to sort resultsets in luke? Cheers, Thomas -- Thomas Be

Re: Range Query Question

2008-07-25 Thread Thomas Becker
t Circle"] gives zero results. Tried it also with braces around the term and such stupid things, even if they shouldn't be needed in a range query. I'm kinda clueless. Cheers, Thomas Ian Lea wrote: Hi Are you sure your range queries should have wild card asterisks on the end? Loo

Re: Range Query Question

2008-07-25 Thread Thomas Becker
Btw. I tried the wildcard since I found something on google, which noted wildcards together with StartsWith queries. Thomas Becker wrote: Hi Ian, no the wild cards should not be necessary. That was just the last try out of some. I now the exact content of both fields in my range query. The

is there an histogram feature in lucene ak Magelan

2008-10-13 Thread Thomas Birnbaum
350 damage unrepaired 30 metallic 60 something like this... is there a way to do the same with lucene? thx thomas. -- GMX Kostenlose Spiele: Einfach online spielen und Spaß haben mit Pastry Passion! http://games.entertainment.gmx.net/de/entertainment/games/free/puzzle/6169196

Re: Performance tips when creating a large index from database.

2009-10-22 Thread Thomas Becker
be careful. Load on the DB Server will surely increase. Hope that helps. Cheers, Thomas Paul Taylor wrote: > I'm building a lucene index from a database, creating 1 about 1 million > documents, unsuprisingly this takes quite a long time. > I do this by sending a query to the db o

Concurrent access IndexReader / IndexWriter - FileNotFoundException

2010-01-08 Thread legrand thomas
Hi, I often get a FileNotFoundException when my single IndexWriter commits while the IndexReader also tries to read. My application is multithreaded (Tomcat uses the business APIs); I firstly thought the read/write access was thread-safe but I probably forget something.  Please help me to unde

Re: Concurrent access IndexReader / IndexWriter - FileNotFoundException

2010-01-09 Thread legrand thomas
xWriter is committing) is perfectly fine.  The reader searches the point-in-time snapshot of the index as of when it was opened. But: what filesystem are you using?  NFS presents challenges, for example. Mike On Fri, Jan 8, 2010 at 5:35 AM, legrand thomas wrote: > Hi, > > I often get a Fi

Re: Concurrent access IndexReader / IndexWriter - FileNotFoundException

2010-01-09 Thread legrand thomas
McCandless a écrit : De: Michael McCandless Objet: Re: Concurrent access IndexReader / IndexWriter - FileNotFoundException À: java-user@lucene.apache.org Date: Samedi 9 Janvier 2010, 14h51 Can you post the full FNFE stack trace? Mike On Fri, Jan 8, 2010 at 5:35 AM, legrand thomas wrote: >

Re: Concurrent access IndexReader / IndexWriter - FileNotFoundException

2010-01-09 Thread legrand thomas
ginal questions...: commit/read does not require any external synchronization or locking.  You should generally keep your IW open indefinitely and just periodically commit and/or get a new reader (IndexWriter.getReader()) as needed. Mike On Sat, Jan 9, 2010 at 10:06 AM, legrand thomas wrote: > &g

Re: If you could have one feature in Lucene...

2010-02-25 Thread Thomas Guttesen
For additional commands, e-mail: java-user-h...@lucene.apache.org > > -- Med venlig hilsen Thomas Guttesen

google's index layout, lucene on hbase(?)

2010-03-11 Thread Thomas Koch
; ( or http://tinyurl.com/yjr45ut ) The mail is about a lucene index{reader|writer} on top of cassandra and whether sth. like this could also be done with hbase. Best regards, Thomas Koch, http://www.koch.ro - To unsubscribe, e-mail

[ANN] Eclipse GIT plugin beta version released

2010-03-31 Thread Thomas Koch
http://www.infoq.com/news/2010/03/egit-released http://aniszczyk.org/2010/03/22/the-start-of-an-adventure-egitjgit-0-7-1/ Maybe, one day, some apache / hadoop projects will use GIT... :-) (Yes, I know git.apache.org.) Best regards, Thomas Koch, http://www.ko

IndexSearcher - open file handles by deleted files

2010-05-26 Thread Thomas Rewig
s not automatically if i close searcher.close()? Do I have to close something else, than all IndexSearchers and Directorys? Or am I wrong with my assumption, and the problem is somewhere else? Best Thomas - To unsubscri

Introduction to flexible indexing?

2010-06-14 Thread Thomas Koch
understand this page and help to get it in shape. Best regards, Thomas Koch, http://www.koch.ro - To unsubscribe, e-mail: java-user-unsubscr...@lucene.apache.org For additional commands, e-mail: java-user-h...@lucene.apache.org

Fielded Queries Question

2010-07-06 Thread Thomas Nguyen
Hello All, Can someone explain to me how fielded queries work with phrases? My first thought is that the phrase is broken down into terms and those terms are then fielded and separated with the AND operator. An example would be the following: name:"Tom Jones" --> name:"Tom" AND name:"Jones" I

Need help in understanding output of searcher.explain() function

2010-08-07 Thread Soby Thomas
Hello Guys, I trying to understand how lucene score is calculated. So 'm using the searcher.explain() function. But the output it gives is really confusing for me. Below are the details of the query that I gave and o/p it gave me Query: *It is definitely a CES deal that will be over in Sep or Oct

Re: Need help in understanding output of searcher.explain() function

2010-08-07 Thread Soby Thomas
m frequency, idf and field norm > > 0.07028562 = (MATCH) fieldWeight(payload:ces in 550), product of: > > 1.0 = *tf(*termFreq(payload:ces)=1) > > 2.2491398 = *idf(*docFreq=157, maxDocs=551) > > 0.03125 = *fieldNorm*(field=payload, doc=550) > >

Restore documents marked as deleted

2010-10-06 Thread Philippe Thomas
Hi, I was indexing some documents, but my program crashed after several days of work. If I reopen this index it is empty. I guess the reason is that auto-commit was not set and I never performed a commit. (Lesson learned) So probably all documents are marked as "deleted" and re-opening the i

Re: File Handle Leaks During Lucene 3.0.2 Merge

2010-11-10 Thread Thomas Rewig
amount of the deleted file handles will be stable - but first at a amount of 500 or so. Thanks in advance Thomas I integrated your SearchManager class into our code, but I am still seeing file handles marked deleted in the index directory. I am running the following command on Linux: sudo watch

Deleted File Handles - Index Writer

2010-11-12 Thread Thomas Rewig
used" by the indexwriter) grows. Is that possible and if yes why does the indexwriter do it? Is there a max Value of deleted handles an IndexWriter could own, because I don't want to chrash the system because of too much open filehandles? Thanks in advance. Thomas --

Re: Deleted File Handles - Index Writer

2010-11-18 Thread Thomas Rewig
help. Thomas I've found a case, only with compound file, where IndexWriter holds open a SegmentReader on the pre-compound-file files... I'm working on a test case& fix. Mike On Fri, Nov 12, 2010 at 5:49 AM, Thomas Rewig wrote: Hello, I use the searcherManager for LiveIndexin

Re: Deleted File Handles - Index Writer

2010-11-19 Thread Thomas Rewig
.0.2 Release version or have I wait for a future release? Thanks for your help. Thomas Listen Read phonetically

Check Numeric Fields

2011-03-11 Thread Thomas Rewig
t the NumericRangeQuery query does not work? I use lucene v. 3.0.2. Thanks in advance! Thomas

Question about termDocs.read(docs, freqs)

2006-09-19 Thread Kroehling, Thomas
index and I wonder why my WildcardFilter sometimes returns the correct search results and sometimes not. What is the difference between steping through the term docs with termDocs.next() and using the read-method. Can anybodey explain that? Thanks in advance, Thomas

AW: Question about termDocs.read(docs, freqs)

2006-09-19 Thread Kroehling, Thomas
dcard filters. But filters really seem to be quite fast. Thanks again, Thomas -Ursprüngliche Nachricht- Von: Erick Erickson [mailto:[EMAIL PROTECTED] Gesendet: Tuesday, September 19, 2006 3:59 PM An: java-user@lucene.apache.org Betreff: Re: Question about termDocs.read(docs, freqs) I&#x

Possible memory issue?

2006-10-26 Thread Aigner, Thomas
Howdy all, I have a issue with java running out of memory after the search has been running for a while. We are using 1.9.1 release and I check the indexreader's version to determine if I need to get a new searcher for searches (so I pick up any changes to the index). I am seeing jumps i

RE: Possible memory issue?

2006-10-26 Thread Aigner, Thomas
objects aren't properly disposed. -drj On 10/26/06, Aigner, Thomas < [EMAIL PROTECTED]> wrote: > > Howdy all, > > I have a issue with java running out of memory after the search > has been running for a while. We are using 1.9.1 release and I check > the indexrea

Warming up a Searcher

2006-11-02 Thread Aigner, Thomas
I have seen numerous posts on warming up a searcher, but was wondering if someone could post their code that would spin off another thread to warm up a searcher, then switch to the new one when it is warmed up? - To unsubscri

Update an existing index

2006-11-08 Thread WATHELET Thomas
how to update a field in lucene?

stemmer

2006-11-18 Thread Thomas Klein
lt stemmer or do I have to stem the texts before indexing ? Does a multi-language stemmer exists ? (sorry if the answers are in the documentation, I didn't manage to fully read it) Thanks in advance ! Thomas Klein. - To

Reading Performance

2006-12-07 Thread Aigner, Thomas
Howdy all, I have a question on reading many documents and time to do this. I have a loop on the hits object reading a record, then writing it to a file. When there is only 1 user on the Index Searcher, this process to read say 100,000 takes around 3 seconds. This is slow, but can

RE: Reading Performance

2006-12-07 Thread Aigner, Thomas
have it in the loop? On Dec 7, 2006, at 1:14 PM, Aigner, Thomas wrote: > > > > > Howdy all, > > > > I have a question on reading many documents and time to do this. > I have a loop on the hits object reading a record, then writing it > to a > file

RE: Reading Performance

2006-12-08 Thread Aigner, Thomas
is.search(query, hc); -Original Message- From: Aigner, Thomas [mailto:[EMAIL PROTECTED] Sent: Thursday, December 07, 2006 1:36 PM To: java-user@lucene.apache.org Subject: RE: Reading Performance Thanks Grant and Erik for your suggestions. I will try both of them and let you know i

Index creation

2007-01-29 Thread WATHELET Thomas
How could I optimize my index creation? // setUseCompoundFile(?); // setMaxBufferedDocs(?); // setMergeFactor(?); How could I reduce the disk access because I work with more than 100 documents? Thanks

RE: Index creation

2007-01-30 Thread WATHELET Thomas
ink. Otis - Original Message From: WATHELET Thomas <[EMAIL PROTECTED]> To: java-user@lucene.apache.org Sent: Monday, January 29, 2007 4:16:22 AM Subject: Index creation How could I optimize my index creation? // setUseCompoundFile(?); // setMaxBufferedDocs(?); /

RAMDirectory

2007-01-30 Thread WATHELET Thomas
I'm using RAMDirectory If the number of documents to index is less than the maxBufferedDocs properties nothing is write into my index. ex: RAMDirectory ramDir =new RAMDirectory (); this.indexWriter.addIndexes(new Directory[] { ramDir }); ramWriter.close(); indexWriter.

RAMDirectory 2

2007-01-30 Thread WATHELET Thomas
P.S. At one point I tried doing an in-memory index using the RAMDirectory and then merging it with an on-disk index and it didn't work. The RAMDirectory never flushed to disk... leaving me with an empty index. Only when the number of documents is greather than the maxBufferedDocs properties. What

RE: RAMDirectory 2

2007-01-30 Thread WATHELET Thomas
s was fixed a while back. Running HEAD or at least 2.0.0? Otis - Original Message From: WATHELET Thomas <[EMAIL PROTECTED]> To: java-user@lucene.apache.org Sent: Tuesday, January 30, 2007 11:22:52 PM Subject: RAMDirectory 2 P.S. At one point I tried doing an in-memory inde

RE: Deleting document by file name

2007-02-01 Thread WATHELET Thomas
do something like this: public class Index extends IndexModifier { ... public int deleteDocuments(String field, String value) throws IOException { return super.deleteDocuments(new Term(field, value)); } use like this : index.deleteDocuments(filed name, field value); _

RE: how to query range of Date by given date string?

2007-02-26 Thread WATHELET Thomas
Parse your date with this classe DateTools.stringToDate to search and DateTools.dateToString() to store into index. -Original Message- From: 李寻欢晕菜了 [mailto:[EMAIL PROTECTED] Sent: 26 February 2007 11:17 To: java-user@lucene.apache.org Subject: how to query range of Date by given date st

RamDirectory vs IndexWriter

2007-02-28 Thread WATHELET Thomas
I don't really understand the difference between using the ramDirectory and using IndexWriter. What's the difference between using ramDirectory instead of using IndexWriter with those properties set to: setMergeFactor(1000);setMaxMergeDocs(1);setMaxBufferedDocs(1);

RE: RamDirectory vs IndexWriter

2007-02-28 Thread WATHELET Thomas
lto:[EMAIL PROTECTED] Sent: 28 February 2007 16:29 To: java-user@lucene.apache.org Subject: Re: RamDirectory vs IndexWriter Le Mercredi 28 Février 2007 16:19, WATHELET Thomas a écrit : > I don't really understand the difference between using the ramDirectory > and using IndexWriter. &

RE: Soliciting Design Thoughts on Date Searching

2007-02-28 Thread Aigner, Thomas
Walt, I am no expert, but it sounds like you need to associate many dates to a single record. Can this be handled as you would a synonym? Basically add a token at the same offset as the row itself? i.e. you would have a record that would also have a date field that has 3 offsets that woul

Package org.apache.lucene.search.highlight

2007-03-04 Thread WATHELET Thomas
How can I add the Package org.apache.lucene.search.highlight into my projects because the standart Lucene api 2.1.0 do not content this package?

RE: Package org.apache.lucene.search.highlight

2007-03-04 Thread WATHELET Thomas
org.apache.lucene.search.highlight The contrib/highlighter directory contains the jar file that is needed. Ulf On 04.03.2007, at 10:58, WATHELET Thomas wrote: > How can I add the Package org.apache.lucene.search.highlight i

date range querys

2007-03-11 Thread Thomas Klein
tting and returning docs IDs. I'm indexing a some date field like that : 20060612223456 2006061222 20060612 200606 2006 In your opinion, will range query be fast enough or will I need to create, for exemple, one index per month and then open and parse them regarding querys ? R

Re: Common Words ignoring problem

2007-03-19 Thread thomas arni
process. http://www.getopt.org/luke/ Stop words affect the performance. The seize of the index without stop words is much small ( up to 40%), because they occur soo often. Thomas aslam bari wrote: Ok, Thats fine. Thanks Now what if i don't want to stop any word, means i want lucene not to ignore

Re: PorterStemFilter

2007-03-27 Thread thomas arni
ss (Analyzer), which is a simple copy of the StandardAnalyzer, and expand it with the appropriate Filter in the method "tokenStream". Hope this helps... Thomas [EMAIL PROTECTED] wrote: Hi, Lucene provides a PorterStemFilter which uses PorterStemmer. Is there any way I c

Re: TF-IDF API

2007-03-28 Thread thomas arni
rrent document. For the calculation of the idf, you can use the provided formula from the "DefaultSimilarity". To get the document frequency, which is necessary to calculate the idf, you can call: reader.docFreq(term) Hope this helps... Thomas Sengly Heng wrote: Hello Luceners,

Re: Get the total term frequency vector of a specific field from the hit results

2007-04-10 Thread thomas arni
//toDo: count the occurrence of the terms } } } Hope this helps. Thomas Sengly Heng wrote: Hello all, I would like to extract the term freq vector from the hit results as a total vector not by document. I have searched the

Help IndexWriter,Multi-threaded index access

2007-05-11 Thread legrand thomas
complete JUnit test which behaves exactly as my manager. The main methods to look at are testIndexManagement (the test),getIndexWriter,addDocument,performExactSearchIntoIndex,performApproximativeSearchIntoIndex. Thank you for your help, Best regards, T

Help IndexWriter,Multi-threaded index access

2007-05-11 Thread legrand thomas
complete JUnit test which behaves exactly as my manager. The main methods to look at are testIndexManagement (the test),getIndexWriter,addDocument,performExactSearchIntoIndex,performApproximativeSearchIntoIndex. Thank you for your help, Best regards, T

Update documents

2007-06-27 Thread WATHELET Thomas
High, Is-it possible to update a document's field without deleting the document and add it again into the index?

Query in lucene

2007-07-18 Thread WATHELET Thomas
Witch analyser I have to use to find text like this ''?

Indexing PDF documents with structure information

2007-08-13 Thread Thomas Arni
like the page or the chapter, where the relevant information is. As anyone have similar requirements? Which of these tools are the best to fit my requirements? Thanks for your help Thomas - To unsubscribe, e-mail: [EMAIL PROT

  1   2   3   >