Score Range....

2004-07-15 Thread Karthik N S
Hey Guys Apologies.. I hava Silly Question. On a avaliable Hit returns, How would one be able to get score between an upper and lower limit value Say' X > 0.4 and X < 1.0 ' Do u think this will work with regards Karthik WITH WARM REGARDS HAVE A NICE DA

Re: Searching against Database

2004-07-15 Thread Peter M Cipollone
- Original Message - From: "Hetan Shah" <[EMAIL PROTECTED]> To: "Lucene Users List" <[EMAIL PROTECTED]> Sent: Thursday, July 15, 2004 7:51 PM Subject: Re: Searching against Database > Is it possible to search against the column in the table ? If so are > there any limitations on the # o

Re: Searching against Database

2004-07-15 Thread Hetan Shah
Is it possible to search against the column in the table ? If so are there any limitations on the # of columns one should target to search against? any other suggestions? Thanks. -H - To unsubscribe, e-mail: [EMAIL PROTECTED] Fo

Re: release & migration plan

2004-07-15 Thread Doug Cutting
fp235-5 wrote: I am looking at the code to implement setIndexInterval() in IndexWriter. I'd like to have your opinion on the best way to do it. Currently the creation of an instance of TermInfosWriter requires the following steps: ... IndexWriter.addDocument(Document) IndexWriter.addDocument(Docume

Re: Token or not Token, PerFieldAnalyzer

2004-07-15 Thread Doug Cutting
Florian Sauvin wrote: Everywhere in the documentation (and it seems logical) you say to use the same analyzer for indexing and querying... how is this handled on not tokenized fields? Imperfectly. The QueryParser knows nothing about the index, so it does not know which fields were tokenized and wh

Token or not Token, PerFieldAnalyzer

2004-07-15 Thread Florian Sauvin
Hello, When indexing a field, we have the choice of tokenizing it or not. I have a custom analyzer that contains a tokenizer... does it mean that if the boolean token is set to false, the analyzer is not applied on the field content? Everywhere in the documentation (and it seems logical) you say to

Re: release & migration plan

2004-07-15 Thread fp235-5
I am looking at the code to implement setIndexInterval() in IndexWriter. I'd like to have your opinion on the best way to do it. Currently the creation of an instance of TermInfosWriter requires the following steps: ... IndexWriter.addDocument(Document) IndexWriter.addDocument(Document, Analyser)

Re: Re: Scoring without normalization!

2004-07-15 Thread Joshua Lewis
Hi, Note that as http://jakarta.apache.org/lucene/docs/api/org/apache/lucene/search/Similarity.html#lengthNorm(java.lang.String,%20int) points out: "[T]hese values are computed under IndexWriter.addDocument(Document) and stored then using {#encodeNorm(float)}. Thus they have limited precision,

Re: Re: Scoring without normalization!

2004-07-15 Thread Jones G
Thanks. I tried overriding Similarity, returning 1 in lengthNorm and queryNorm and setSimilarity on IndexSearcher with this. Query: 1 Found: 1540632 Rank: 1 ID: 8157438 Score: 0.9994 3.73650457E11 = weight(title:iron in 159395), product of: 7.0507255 = queryWeight(title:iron), product o

Re: Searching against Database

2004-07-15 Thread Hetan Shah
It seems to me that the solutions offered are based on the documents already existing in the database. My scenario is a little more complex then that. In my application the documents get generated dynamically and I search for various fields across various tables. How do I start this task? Has a

Re: Scoring without normalization!

2004-07-15 Thread Doug Cutting
Have you looked at: http://jakarta.apache.org/lucene/docs/api/org/apache/lucene/search/Similarity.html in particular, at: http://jakarta.apache.org/lucene/docs/api/org/apache/lucene/search/Similarity.html#lengthNorm(java.lang.String,%20int) http://jakarta.apache.org/lucene/docs/api/org/apache/lucen

Re: RE: Scoring without normalization!

2004-07-15 Thread Jones G
Sadly, I am still running into problems Explain shows the following after the modification. Rank: 1 ID: 11285358Score: 5.5740864E8 5.5740864E8 = product of: 8.3611296E8 = sum of: 8.3611296E8 = product of: 6.6889037E9 = weight(title:iron in 1235940), product of: 0.12621

Powered By Lucene image?

2004-07-15 Thread yahootintin . 1247688
Hi, Are there any "powered by Lucene" images? I thought there used to be some on the site but I can't find them now. Any help is appreciated! Thanks. - To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-m

RE: Problems indexing Japanese with CJKAnalyzer ... Or French with UTF-8 and MetaData

2004-07-15 Thread Bruno Tirel
I don't think I understand correctly your proposal. As a basis, I am using Demo3 with indexHTML, HTMLDocument and HTMLParser. Inside HTML parser, I am calling getMetaTags (calling addMetaData) wich return Properties object. My issue is coming fron this definition : Properties are stored into ISO-88

RE: Searching against Database

2004-07-15 Thread Daniel de Souza Teixeira
See this document! http://www.jguru.com/faq/view.jsp?EID=1074229 Regards! -- Daniel > Hi, > > You how to convert RTF file file txt file. > Any API available? > > If u have any sample code pls send it to me. > > Regards, > Natarajan. > > -Original Message- > From: Sergiu Gordea [mailto:[E

Re: Wildcard search with my own analyzer

2004-07-15 Thread Erik Hatcher
On Jul 15, 2004, at 10:02 AM, Morus Walter wrote: Joel Shellman writes: What do I need to do so that wildcard searching will work on this? I am using the same analyzer for indexing and searching (otherwise the first search wouldn't work either). Check what query is produced (query.toString(...))

RE: Searching against Database

2004-07-15 Thread Natarajan.T
Hi, You how to convert RTF file file txt file. Any API available? If u have any sample code pls send it to me. Regards, Natarajan. -Original Message- From: Sergiu Gordea [mailto:[EMAIL PROTECTED] Sent: Thursday, July 15, 2004 2:16 PM To: Lucene Users List Subject: Re: Searching against

Re: Wildcard search with my own analyzer

2004-07-15 Thread Morus Walter
Joel Shellman writes: > > What do I need to do so that wildcard searching will work on this? I am > using the same analyzer for indexing and searching (otherwise the first > search wouldn't work either). > Check what query is produced (query.toString(...)). I guess that query parser which seems

RE: Anyone use MultiSearcher class

2004-07-15 Thread Mark Florence
Don, I think I finally understand your problem -- and mine -- with MultiSearcher. I had tested an implementation of my system using ParallelMultiSearcher to split a huge index over many computers. I was very impressed by the results on my test data, but alarmed after a trial with live data :) Cons

Wildcard search with my own analyzer

2004-07-15 Thread Joel Shellman
I wanted to support categories, and so I created my own analyzer so that: Root Category||My Category||Some Other Things Would be split up into three terms split by ||, and I wanted it to stay case sensitive. If I do a search for: categories:"Root Category" it works fine. But if I do a search for:

Re: Searching against Database

2004-07-15 Thread Sergiu Gordea
This is not a solution in my case, becasue the permissions of the groups, and the user groups can be changed, and it will make managing index to be a nightmare. anyway, I appreciate the advice, maybe it will be useful for the other guys that asked this question. Sergiu [EMAIL PROTECTED] wr

Re: Problems indexing Japanese with CJKAnalyzer

2004-07-15 Thread Praveen Peddi
If its a web application, you have to cal request.setEncoding("UTF-8") before reading any parameters. Also make sure html page encoding is specified as "UTF-8" in the metatag. most web app servers decode the request paramaters in the system's default encoding algorithm. If u call above method, I th

RE: Searching against Database

2004-07-15 Thread wallen
If you know ahead of time which documents are viewable by a certain user group you could add a field, such as group, and then when you index the document you put the names of the user groups that are allowed to view that document. Then your query tool can append, for example "AND group:developers"

Re: Search +QueryParser+Score

2004-07-15 Thread Erik Hatcher
I don't really understand what QueryParser has to do with your question. If you want only Hits that have a score of 1.0 (keep in mind that Hits normalizes scores if they are over 1.0), why not just walk all the Hits in order until you get to one that is not 1.0? Or, use a HitCollector to colle

RE: Search +QueryParser+Score

2004-07-15 Thread Karthik N S
Hey Guys... Apologies Let me be more Specific regarding the last mail I would like to get all Hits returned with score = 1.0 ONLY using Query Parser . What are my Options. with regards Karthik -Original Message- From: Erik Hatcher [mailto:[EMAIL PROTECTED] Sent: Thur

Re: Search +QueryParser+Score

2004-07-15 Thread Erik Hatcher
Kathik, I have a really hard time following your questions, otherwise I'd chime in on them more often. Your meaning is not often clear. In the case of normalizing the score to 1.0 or less - this is precisely what Hits does for you. I'm not sure what you mean by "BEFORE" doing QueryParser - a

Re: Searching against Database

2004-07-15 Thread Erik Hatcher
In this situation, you may want to investigate implementing a custom Filter which is user-specific and constrains the search space to only the rows a specific user is allowed to search. Erik On Jul 15, 2004, at 3:04 AM, Sergiu Gordea wrote: Hi again, I'm thinking to get the list of IDs

Re: One Field!

2004-07-15 Thread Erik Hatcher
On Jul 14, 2004, at 10:19 PM, Jones G wrote: I have an index with multiple fields. Right now I am using MultiFieldQueryParser to search the fields. This means that if the same term occurs in multiple fields, it will be weighed accordingly. Is there any way to treat all the fields in question as

Search +QueryParser+Score

2004-07-15 Thread Karthik N S
Hey Guy's Apologies. I have a Question Is there any API avaliable in Lucene1.4 to set the "Score value to 1.0f or lesser " BEFORE doing the Query Parser for search , so that the returns Hits for the Score settings only. with regards Karthik ---

RE: Problems indexing Japanese with CJKAnalyzer

2004-07-15 Thread Bruno Tirel
Hi All, I am also trying to localize everything for French application, using UTF-8 encoding. I have already applied what Jon described. I fully confirm his recommandation for HTML Parser and HTML Document changes with UNICODE and "UTF-8" encoding specification. In my case, I have still one case

Re: Searching against Database

2004-07-15 Thread Sergiu Gordea
Hi again, I'm thinking to get the list of IDs from the database and the list of hits from Lucene Index and to create a comparator in order to eliminate the not permitted Hits from the list. Which solution do you think is better? Thanks, Sergiu Sergiu Gordea wrote: Hi, I have a simillar problem.