Re: lucene for Arabic and Urdu

2007-09-18 Thread Grant Ingersoll
http://wiki.apache.org/lucene-java/IndexingOtherLanguages is slightly out of date, but still has some tips. You may also want to consider starting with Solr, as it has many features a search engine needs. In the past, I have written Arabic Analyzers for Lucene (sorry, can't share them) but

Re: Is there a FilterQueryParser?

2007-09-18 Thread markharw00d
Scott Tiger wrote: I want get BooleanFilter contains two RangeFilters from query string. The XMLQueryParser may be of interest. See BooleanFilter.xml and CachedFilter.xml examples in the XMLQueryParser Junit tests. I typically use QueryTemplateManager to transform user input provided in a

Re: lucene for Arabic and Urdu

2007-09-18 Thread Karl Wettin
18 sep 2007 kl. 23.23 skrev Liaqat Ali: I m new to the field of Information Retrieval and now working to develop search engine for language like Arabic and Urdu. Kindly guide me in this regard that how can Lucene be utilized for this purpose. Lucene makes no distinction between languag

lucene for Arabic and Urdu

2007-09-18 Thread Liaqat Ali
Hello All I m new to the field of Information Retrieval and now working to develop search engine for language like Arabic and Urdu. Kindly guide me in this regard that how can Lucene be utilized for this purpose. Can anybody tell me exactly what I should do to design a search engine from the

Re: Lucene equivalent of grep -A -B command

2007-09-18 Thread Chris Hostetter
: Is there a way we can get contents of the file surrounding to the term (head : 10 lines and tail 10 lines) we are searching through a lucene indexed file? FAQ: Is there a way to get a text summary of an indexed document with Lucene (a.k.a. a "snippet" or "fragment") to display along with the

Is there a FilterQueryParser?

2007-09-18 Thread Scott Tiger
I want get BooleanFilter contains two RangeFilters from query string. Can I use FilterQuery to get RangeFilter? example: my query string: field1:[0 TO 100] AND field2:[1000 TO 2000] BooleanFilter I want: BooleanFilter bf = new BooleanFilter(); bf.add( new FilterClause( new Range

Re: NumberTools - Range Searches

2007-09-18 Thread Mohammad Norouzi
Hi you may put two fields in your document one contains decoded values and another is original values, but still you need to implement your query builder so while searching you must specify the decoded filed and you show the original field to the user On 9/18/07, [EMAIL PROTECTED] <[EMAIL PROTECTE

Re: Span Query on different fields

2007-09-18 Thread Grant Ingersoll
I think the semantics of positions across fields isn't well defined, which is why SpanQuery requires a single field. The workaround is to add a 4th field that is a combo of 1, 2 and 3. Alternatively, since you are doing an OrQuery, just process the Spans from each SpanTermQuery separately

NumberTools - Range Searches

2007-09-18 Thread marc.dumontier
Hi, I'm trying to do Range searches on a numeric field and have come across the lexicographical behavior, such that mylength: [0-2] brings back values such as 1172 I've seen prior posts about the NumberTools class, and have indexed a field using the longToString() method. Do

Span Query on different fields

2007-09-18 Thread gsanctus
Hello, I have an index with tree fields (field1, field2,field3). I have to do an or-query and get term positions within a document. So I'm using span query but I have a problem because with spans I can't write ... SpanTermQuery field1 = new SpanTermQuery(new Term("field1", "hello")); SpanTermQuery

Re: Sorting a search result

2007-09-18 Thread Zach Bailey
Sure. You'll have to do away with the List abstraction in favor of the SortedSet abstraction which in your case is probably ok for the following reasons: 1.) List does not ensure the elements are unique (you probably don't want to have duplicate results), while Set and by extension SortedSet d

Re: Sorting a search result

2007-09-18 Thread anorman
H. Right now my search results are contained in an ArrayList object like so: List searchResult = new ArrayList(); I am adding my "resultBean" objects (with it's data, i.e. title, author, score, etc) into the searchResult (ArrayList). Would this be possible with a SortedSet? Thanks, Albe

Lucene equivalent of grep -A -B command

2007-09-18 Thread Ashish Parikh
Is there a way we can get contents of the file surrounding to the term (head 10 lines and tail 10 lines) we are searching through a lucene indexed file? -- View this message in context: http://www.nabble.com/Lucene-equivalent-of-grep--A--B-command-tf4475418.html#a12760645 Sent from the Lucene -

Re: Sorting a search result

2007-09-18 Thread Zach Bailey
You could use a SortedSet, which automatically inserts an object into its sorted position when calling add()? Cheers, -Zach anorman wrote: I have set up a search result made up of a resultBean object containing the information that I am fetching. Currently it is sorted by score (relevance) wh

Sorting a search result

2007-09-18 Thread anorman
I have set up a search result made up of a resultBean object containing the information that I am fetching. Currently it is sorted by score (relevance) which is great and what I want. However, I have created a loop which continues to add results from several indexes to the search result (e.g. ar

Re: How to tokenize with comma in standard tokenizer

2007-09-18 Thread Bhavin Pandya
Thanks mark. Take the comma out of: | <#P: ("_"|"-"|"/"|"."|",") > in the .jj file Its working for me... - Bhavin pandya - Original Message - From: "Mark Miller" <[EMAIL PROTECTED]> To: Sent: Monday, September 17, 2007 8:34 PM Subject: Re: How to tokenize with comma in standard t

Re: Caching in lucene

2007-09-18 Thread Karl Wettin
18 sep 2007 kl. 08.53 skrev Shailendra Mudgal: Ok let me explain you. By warming up the reader i used to understand is that it memorizes the index terms. Therefore subsequent queries will be answered by using these terms. Is this correct ? Are you asking if all terms are flyweighted/inte