Re: Need help for conversion code from Lucene 2.4.0 to 8.11.2

2023-03-03 Thread Mikhail Khludnev
or such need, where > Token and Payload class both are not there now? > > Regards > Rajib > > -Original Message- > From: Uwe Schindler > Sent: 10 February 2023 15:36 > To: java-user@lucene.apache.org > Subject: Re: Need help for conversion code from Lucene 2.4.0 t

RE: Need help for conversion code from Lucene 2.4.0 to 8.11.2

2023-03-03 Thread Saha, Rajib
Sent: 10 February 2023 15:36 To: java-user@lucene.apache.org Subject: Re: Need help for conversion code from Lucene 2.4.0 to 8.11.2 Hi, the reason for this is that files in Lucene are always write-once. We never ever change a file after it was written and committed in the 2-phase-commit. If you

RE: Need help for conversion code from Lucene 2.4.0 to 8.11.2

2023-02-13 Thread Saha, Rajib
file with the new logic. Regards Rajib -Original Message- From: Uwe Schindler Sent: 10 February 2023 15:36 To: java-user@lucene.apache.org Subject: Re: Need help for conversion code from Lucene 2.4.0 to 8.11.2 Hi, the reason for this is that files in Lucene are always write-once. We

Re: Need help for conversion code from Lucene 2.4.0 to 8.11.2

2023-02-10 Thread Uwe Schindler
Exception e) { e.printStackTrace(); } == Regards Rajib -Original Message- From: Uwe Schindler Sent: 06 February 2023 16:46 To: java-user@lucene.apache.org Subject: Re: Need help for conversion code from Lucene 2.4.0 to 8.11.2 Hi, Since around Lucene 4 (maybe alre

RE: Need help for conversion code from Lucene 2.4.0 to 8.11.2

2023-02-09 Thread Saha, Rajib
output.close(); } catch(Exception e) { e.printStackTrace(); } == Regards Rajib -Original Message- From: Uwe Schindler Sent: 06 February 2023 16:46 To: java-user@lucene.apache.org Subject: Re: Need help for conversion code f

Re: Need help for conversion code from Lucene 2.4.0 to 8.11.2

2023-02-06 Thread Uwe Schindler
places. Currently, this API is not there. Could you please suggest, how we can handle the API in 8.11.2? Regards Rajib -Original Message- From: Mikhail Khludnev Sent: 01 February 2023 12:22 To: java-user@lucene.apache.org Subject: Re: Need help for conversion code from Lucene 2.4.0 to

RE: Need help for conversion code from Lucene 2.4.0 to 8.11.2

2023-02-06 Thread Saha, Rajib
. Currently, this API is not there. Could you please suggest, how we can handle the API in 8.11.2? Regards Rajib -Original Message- From: Mikhail Khludnev Sent: 01 February 2023 12:22 To: java-user@lucene.apache.org Subject: Re: Need help for conversion code from Lucene 2.4.0 to 8.11.2

Re: Need help for conversion code from Lucene 2.4.0 to 8.11.2

2023-01-31 Thread Mikhail Khludnev
ndexWriter.optimize() > > Is there any similar concept in 8.11? If so, can you please help with APIs > org.apache.lucene.index.IndexWriter#addIndexes(org.apache.lucene.store.Directory...) But it kicks merge underneath. Should be fine. === > > Regard

RE: Need help for conversion code from Lucene 2.4.0 to 8.11.2

2023-01-31 Thread Saha, Rajib
37 To: java-user@lucene.apache.org Subject: RE: Need help for conversion code from Lucene 2.4.0 to 8.11.2 Hi Mikhail, Thanks for your suggestion. It solved lots of cases today in my end. 馃槉 I need some more suggestions from your end. I am putting together as below one b

RE: Need help for conversion code from Lucene 2.4.0 to 8.11.2

2023-01-30 Thread Saha, Rajib
? If so, can you please help with APIs === Regards Rajib -Original Message- From: Mikhail Khludnev Sent: 29 January 2023 18:05 To: java-user@lucene.apache.org Subject: Re: Need help for conversion code from Lucene 2.4.0 to 8.11.2 Hello, You can use

Re: Need help for conversion code from Lucene 2.4.0 to 8.11.2

2023-01-29 Thread Mikhail Khludnev
---Original Message- > From: Mikhail Khludnev > Sent: 19 January 2023 04:26 > To: java-user@lucene.apache.org > Subject: Re: Need help for conversion code from Lucene 2.4.0 to 8.11.2 > > [You don't often get email from m...@apache.org. Learn why this is > important at

RE: Need help for conversion code from Lucene 2.4.0 to 8.11.2

2023-01-29 Thread Saha, Rajib
n 8.11.2. Could you please suggest someway to extract all the Terms with an IndexReader or some alternative ways? Regards Rajib -Original Message- From: Mikhail Khludnev Sent: 19 January 2023 04:26 To: java-user@lucene.apache.org Subject: Re: Need help for conversion code from Lucene 2

Re: Need help for conversion code from Lucene 2.4.0 to 8.11.2

2023-01-18 Thread Mikhail Khludnev
Hello, Rajib. API were evolved since 2.4, but it should be clear https://lucene.apache.org/core/8_11_2/core/org/apache/lucene/index/package-summary.html#fields On Wed, Jan 18, 2023 at 1:11 PM Saha, Rajib wrote: > Hi All, > > We are in a process for conversion of Lucene from 2.4.0 to 8.11.2 for

Need help for conversion code from Lucene 2.4.0 to 8.11.2

2023-01-18 Thread Saha, Rajib
Hi All, We are in a process for conversion of Lucene from 2.4.0 to 8.11.2 for our platform code. We have used extensively Lucene in our code. We have replaced several of our code to Lucene 8.11.2 APIs. But, few places, we are stuck of which New Lucene APIs to use, as not getting any suitable

Re: Need help of example of Lucene use.

2023-01-04 Thread Mikhail Khludnev
Hello Rajib. You can start from https://lucene.apache.org/core/8_11_1/core/org/apache/lucene/analysis/package-summary.html#package.description Also, it might make sense to go through for analysis/token mentions in https://lucene.apache.org/core/8_11_1/MIGRATE.html and also MIGRATE.txt in every (??

Need help of example of Lucene use.

2023-01-04 Thread Saha, Rajib
Hi, We are in a project of migration of Lucene from 2.4.1 to 8.11.2. In our project Lucene is been used extensively. We are looking for examples/sample code of uses of Lucene 8.11.2. Currently I am badly required of some examples of using TokenStream, tokenAttributes, *Filter. I need to replac

Re: Need help on defining custom scorer in Lucene 9

2022-04-03 Thread Michael Wechner
Hi Lokesh IIUC each document (like for example a shop description) has a longitude and a latitude associated with. The user search input are some keywords and the the user's geo location. The keywords you use to search for the documents and the users's geo location you would like to use for

Need help on defining custom scorer in Lucene 9

2022-04-03 Thread Lokesh Mavale
Hi Team, I am little bit familiar with Lucene, and I have a problem statement in hand to score each document based on the value of the field. Value will be of type GeoPoint { Lat Long } And in the qry I will be getting other pair of lat, long and distance from that lat long. I have figured out

Re: Need help on aggregation of nested documents

2021-11-16 Thread Gopal Sharma
I have created a custom Collector extending SimpleCollector. I can see the methods scoreMode() and collect(int doc). I am seeing that the collect method is invoked by lucene with the child docId. Am I moving in the right direction? But to collect the values I would need the Document by using read

Re: Need help on aggregation of nested documents

2021-11-16 Thread Adrien Grand
Indeed you shouldn't load all hits, you should register a org.apache.lucene.search.Collector that will aggregate data while matches are being collected. Since you are already using a ToChildBlockJoinQuery, you should be able to use it in conjunction with utility classes from lucene/facets. Have yo

Re: Need help on aggregation of nested documents

2021-11-15 Thread Gopal Sharma
Hi Adrien, Thanks for the reply. I am able to retrieve the child docId's using the .ToChildBlockJoinQuery. Now for me to do aggregates i need to find the document using reader.document(int docID) right?. If that is the case won't getting all the documents would be a costly operation and then fina

Re: Need help on aggregation of nested documents

2021-11-15 Thread Adrien Grand
It's not straightforward as we don't provide high-level tooling to do this. You need to use the BitSetProducer that you pass to the ToParentBlockJoinQuery in order to resolve the range of child doc IDs for a given parent doc ID (see e.g. how ToChildBlockJoinQuery does it), and then aggregate over t

Need help on aggregation of nested documents

2021-11-14 Thread Gopal Sharma
Hi Team, I have a document structure as a customer which itself has few attributes like gender, location etc. Each customer will have a list of facts like transaction, product views etc. I want to do an aggregation of the facts. For example find all customers who are from a specific location and

Re: need help in search

2015-10-05 Thread will
Hi Bhaskar: or everyone's benefit, I hope you will collate the emails into a wiki page and carry it forward. Meritocracy's might have rtfm'd the whole thing. With all respect: Will On 10/5/15 1:06 PM, Bhaskar wrote: Hi, Actually I am looking for auto complete only. Do we have auto sugg

Re: need help in search

2015-10-05 Thread Bhaskar
Hi, Actually I am looking for auto complete only. Do we have auto suggest module in lucene? can you suggest some examples? Thanks in advance. Regards, Bhaskar On Mon, Oct 5, 2015 at 10:30 PM, Alessandro Benedetti < benedetti.ale...@gmail.com> wrote: > +1 on Jack, > furthermore, are you taking ab

Re: need help in search

2015-10-05 Thread Alessandro Benedetti
+1 on Jack, furthermore, are you taking about search or autocomplete ? If you only need autocompletion on the term, maybe it's even better if you take a look to the Lucene suggest module ! Cheers 2015-10-05 14:34 GMT+01:00 Jack Krupansky : > Sounds like you need the edge n-gram filter at index t

Re: need help in search

2015-10-05 Thread Jack Krupansky
Sounds like you need the edge n-gram filter at index time to index all of the prefix strings for each term. Just be aware that using an n-gram filter will explode the size of the index (all the extra terms) The standard tokenizer and word delimiter filter will split terms on special characters, so

Re: need help in search

2015-10-05 Thread Doug Turnbull
Curious if you've tried escaping with \ ie 143\-00098 On Monday, October 5, 2015, Bhaskar wrote: > Hi, > > > when I type 143-00098 I should get all matched result i.e ( 143-00098, > 143-000981, 143-0009823). also If i type 143-000 then i should 143-00098, > 143-0009, 143-0001) > > Looks like the

need help in search

2015-10-05 Thread Bhaskar
Hi, when I type 143-00098 I should get all matched result i.e ( 143-00098, 143-000981, 143-0009823). also If i type 143-000 then i should 143-00098, 143-0009, 143-0001) Looks like the searching is treating "-"( hyphen) as subtract or exclude pages that contain a specific term.. In my case it sho

Re: Need help in alphanumeric search

2015-10-05 Thread Bhaskar
data and the queries you want to do. > Maybe > >> use WhitespaceAnalyzer or better StandardAnalyzer as a first step. Be > sure > >> to reindex your data before querying. The Analyzer used on the search > side > >> must be the same like on the query side. If you want to

Re: Need help in alphanumeric search

2015-10-01 Thread Jack Krupansky
> > > > > > > > > > https://lucene.apache.org/core/5_3_1/analyzers-common/org/apache/lucene/analysis/core/LetterTokenizer.html > > > > >> "A LetterTokenizer is a tokenizer that divides text at > non-letters. > > > > That's >

Re: Need help in alphanumeric search

2015-10-01 Thread Bhaskar
okenizer.html > > > >> "A LetterTokenizer is a tokenizer that divides text at non-letters. > > > That's > > > >> to say, it defines tokens as maximal strings of adjacent letters, as > > > >> defined by java.lang.Character.isLetter() predic

Re: Need help in alphanumeric search

2015-10-01 Thread Jack Krupansky
I'd suggest to first inform yourself about analysis and choose a > better > > >> one that suits your underlying data and the queries you want to do. > > Maybe > > >> use WhitespaceAnalyzer or better StandardAnalyzer as a first step. Be > > sure > > >> to r

Re: Need help in alphanumeric search

2015-10-01 Thread Bhaskar
ries you want to do. > Maybe > >> use WhitespaceAnalyzer or better StandardAnalyzer as a first step. Be > sure > >> to reindex your data before querying. The Analyzer used on the search > side > >> must be the same like on the query side. If you want to use wildc

Re: Need help in alphanumeric search

2015-10-01 Thread Ian Lea
be the same like on the query side. If you want to use wildcards, you >> have to take care more, because wildcards are not really natural for "full >> text search engine" and may cause inconsistent results. >> >> Uwe >> >> - >> Uwe Schindler >>

Re: Need help in alphanumeric search

2015-10-01 Thread Bhaskar
> must be the same like on the query side. If you want to use wildcards, you > have to take care more, because wildcards are not really natural for "full > text search engine" and may cause inconsistent results. > > Uwe > > - > Uwe Schindler > H.-H.-Meier-Al

Re: Need help in alphanumeric search

2015-09-30 Thread Bhaskar
213 Bremen > http://www.thetaphi.de > eMail: u...@thetaphi.de > > > -Original Message----- > > From: Bhaskar [mailto:bhaskar1...@gmail.com] > > Sent: Wednesday, September 30, 2015 4:28 AM > > To: java-user@lucene.apache.org > > Subject: Re: Need help in alphanu

RE: Need help in alphanumeric search

2015-09-29 Thread Uwe Schindler
/www.thetaphi.de eMail: u...@thetaphi.de > -Original Message- > From: Bhaskar [mailto:bhaskar1...@gmail.com] > Sent: Wednesday, September 30, 2015 4:28 AM > To: java-user@lucene.apache.org > Subject: Re: Need help in alphanumeric search > > Hi Uwe, > > Below is my indexing

Re: Need help in alphanumeric search

2015-09-29 Thread Bhaskar
gt; searching. > > Uwe > > - > Uwe Schindler > H.-H.-Meier-Allee 63, D-28213 Bremen > http://www.thetaphi.de > eMail: u...@thetaphi.de > > > > -Original Message- > > From: Erick Erickson [mailto:erickerick...@gmail.com] > > Sent: Monday, Sep

Re: Need help in alphanumeric search

2015-09-28 Thread Erick Erickson
e > > >> -Original Message- >> From: Erick Erickson [mailto:erickerick...@gmail.com] >> Sent: Monday, September 28, 2015 6:01 PM >> To: java-user >> Subject: Re: Need help in alphanumeric search >> >> You need to supply the definitions of this field

RE: Need help in alphanumeric search

2015-09-28 Thread Uwe Schindler
- Uwe Schindler H.-H.-Meier-Allee 63, D-28213 Bremen http://www.thetaphi.de eMail: u...@thetaphi.de > -Original Message- > From: Erick Erickson [mailto:erickerick...@gmail.com] > Sent: Monday, September 28, 2015 6:01 PM > To: java-user > Subject: Re: Need help in alphanumeric sea

Re: Need help in alphanumeric search

2015-09-28 Thread Erick Erickson
You need to supply the definitions of this field from your schema.xml file, both the and Additionally, please provide the results of the query you're trying with &debug=true appended. The adminUI/analysis page is very helpful in these situations as well. Select the appropriate core from the dro

Re: Need help in alphanumeric search

2015-09-28 Thread Bhaskar
Thanks Lan for reply. cpn values are like 123-0049, 342-043, ab23-090, hedwsdg my application is working when i gave search for below inputs 1) ab* 2)hedwsdg 3) hed* but it is not working for 1) 123* 2) 123-0049 3) ab23* Note: if the search input has number then it is not working. Thanks in

Re: Need help in alphanumeric search

2015-09-28 Thread Ian Lea
Hi Can you provide a few examples of values of cpn that a) are and b) are not being found, for indexing and searching. You may also find some of the tips at http://wiki.apache.org/lucene-java/LuceneFAQ#Why_am_I_getting_no_hits_.2F_incorrect_hits.3F useful. You haven't shown the code that create

Need help in alphanumeric search

2015-09-28 Thread Bhaskar
Hi, I am beginner in Apache lucene, I am using 5.3.1. I have created the index on the database result. The index values are having alphanumeric and strings values. I am able to search the strings but I am not able to search alphanumeric values. Can someone help me here. Below is indexing code...

Need Help To understand feasibility

2015-06-16 Thread suraj kumar
Hi Team, I am working on a project where I have to create reports using data from ElasticSearch and Microsoft Sql Server. Like some data is getting dump in ElasticSearc document and some data is in Sql server. I am in big trouble how to bring ElasticSarch data to SSRS reports. Please let me kn

Re: Need help to do simple line by line indexing and search

2014-09-17 Thread atawfik
Hi, Can you share the implementation of your analyzer. It might be the problem. It will be helpful to share also a sample of your indexed documents. Regards Ameer -- View this message in context: http://lucene.472066.n3.nabble.com/Need-help-to-do-simple-line-by-line-indexing-and-search

Need help to do simple line by line indexing and search

2014-09-16 Thread Ulaganathan, Udhayakumar
Hi, I am a new user to Lucene Search I want to index the contents of a file. Contents of the file will be a single file name with extension (file name may contain special characters) While indexing, I created a new analyzer to tokenize only on new line character. Path of the file name is -> Str

Re: Need help "teaching" Japanese tokenizer to pick up slangs

2014-03-10 Thread Me
Hi everybody UerDictionary is right. I am using yahoo Japanese tokenizer API (鏃ユ湰瑾炲舰鎱嬬礌瑙f瀽) to teach my own user dictionary. http://developer.yahoo.co.jp/webapi/jlp/ On 2014/03/11, at 8:10, Rahul Ratnakar wrote: > Worked perfectly for Japanese. > > I have the same issue with Chinese Analyzer, I am

Re: Need help "teaching" Japanese tokenizer to pick up slangs

2014-03-10 Thread Rahul Ratnakar
Worked perfectly for Japanese. I have the same issue with Chinese Analyzer, I am using SmartChinese (lucene-analyzers-smartcn-4.6.0.jar) but I don't see a similar interface as the Japanese analyzer. Is there an easy way to implement the same for Chinese? On Mon, Mar 10, 2014 at 3:26 PM, Rahul R

Re: Need help "teaching" Japanese tokenizer to pick up slangs

2014-03-10 Thread Rahul Ratnakar
Thanks Robert. This was exactly what I was looking for, will try this. On Mon, Mar 10, 2014 at 3:13 PM, Robert Muir wrote: > You can pass UserDictionary with your own entries to do this. > > On Mon, Mar 10, 2014 at 3:08 PM, Rahul Ratnakar > wrote: > > Thanks Furkan, This is the exact tool that

Re: Need help "teaching" Japanese tokenizer to pick up slangs

2014-03-10 Thread Robert Muir
You can pass UserDictionary with your own entries to do this. On Mon, Mar 10, 2014 at 3:08 PM, Rahul Ratnakar wrote: > Thanks Furkan, This is the exact tool that I am using, albeit in my code, I > have tried all search modes e.g. > > new JapaneseAnalyzer(Version.LUCENE_46, null, JapaneseTokenizer

Re: Need help "teaching" Japanese tokenizer to pick up slangs

2014-03-10 Thread Rahul Ratnakar
Thanks Furkan, This is the exact tool that I am using, albeit in my code, I have tried all search modes e.g. new JapaneseAnalyzer(Version.LUCENE_46, null, JapaneseTokenizer.Mode.NORMAL, JapaneseAnalyzer.getDefaultStopSet(), JapaneseAnalyzer.getDefaultStopTags()) new JapaneseAnalyzer(Version.LUCENE

Re: Need help "teaching" Japanese tokenizer to pick up slangs

2014-03-10 Thread Furkan KAMACI
Hi; Here is the page of it that has a online Kuromoji tokenizer and information: http://www.atilika.org/ It may help you. Thanks; Furkan KAMACI 2014-03-10 19:57 GMT+02:00 Rahul Ratnakar : > I am trying to analyze some japanese web pages for presence of slang/adult > phrases in them using lucen

Need help "teaching" Japanese tokenizer to pick up slangs

2014-03-10 Thread Rahul Ratnakar
I am trying to analyze some japanese web pages for presence of slang/adult phrases in them using lucene-analyzers-kuromoji-4.6.0.jar. While the tokenizer breaks up the word into proper words, I am more interested in catching the slangs which seems to result from combining various "safe" words. Few

Re: Need help to start contributing in apache lucene.

2014-02-25 Thread Furkan KAMACI
There is also this page: http://wiki.apache.org/lucene-java/HowToContribute 2014-02-25 12:41 GMT+02:00 Furkan KAMACI : > Hi; > > You can check this page: http://wiki.apache.org/solr/HowToContribute > > Thanks; > Furkan KAMACI > > > 2014-02-25 12:32 GMT+02:00 chandresh pancholi < > chandreshpanch

Re: Need help to start contributing in apache lucene.

2014-02-25 Thread Furkan KAMACI
Hi; You can check this page: http://wiki.apache.org/solr/HowToContribute Thanks; Furkan KAMACI 2014-02-25 12:32 GMT+02:00 chandresh pancholi < chandreshpancholi...@gmail.com>: > Hi Fellow members, > > I am new to apache lucene community. i clone the svn repo to my local. I am > planning to con

Need help to start contributing in apache lucene.

2014-02-25 Thread chandresh pancholi
Hi Fellow members, I am new to apache lucene community. i clone the svn repo to my local. I am planning to contribute in the project by bug fixing. can someone help me to start working? How can i proceed? It would be very helpful if you share any lucene documents -- Chandresh Pancholi Flipkart.co

Re: New to Apache Lucene: Need help in querying data - text with wildCards

2014-02-10 Thread gudiseashok
ssage in context: http://lucene.472066.n3.nabble.com/New-to-Apache-Lucene-Need-help-in-querying-data-text-with-wildCards-tp4116515p4116519.html Sent from the Lucene - Java Users mailing list archive at Nabble.com. - To unsubscribe

Re: New to Apache Lucene: Need help in querying data - text with wildCards

2014-02-10 Thread Michael McCandless
but only time I am getting results is when I mentioned a > small string like "pop" (in above logMessage), in all other cases which has > any special characters I am not getting the results. Can anyone suggest what > would be the pattern I have to use to satisfy above mentione

New to Apache Lucene: Need help in querying data - text with wildCards

2014-02-10 Thread gudiseashok
ch has any special characters I am not getting the results. Can anyone suggest what would be the pattern I have to use to satisfy above mentioned three cases user request? I appreciate your help in this regard. -- View this message in context: http://lucene.472066.n3.nabble.com/New-to-Ap

Re: Need Help In code

2014-01-30 Thread Priyanka Tufchi
Hello Mike We tried the following code, but it is giving null : TopGroups hits = c.getTopGroups( productitemQuery, Sort.RELEVANCE, 0, // offset 10, // maxDocsPerGroup 0, // withinGroupOffset true // fillSortFields ); On Thu, Jan 30, 2014 at 2:35 AM, Michael McCandless

Re: Need Help In code

2014-01-30 Thread Michael McCandless
After indexsearcher.search you should call c.getTopGroups? See the TestBlockJoin.java example... Can you boil this down to a runnable test case, i.e. include createProductItem/createProduct sources, etc. Mike McCandless http://blog.mikemccandless.com On Thu, Jan 30, 2014 at 2:20 AM, Priyanka

Need Help In code

2014-01-29 Thread Priyanka Tufchi
Hello This is the Sample Code Of BlockJoinQuery( we tried . Issues: 1)Dont know how to get hits and score from it 2) This code is not giving output. I have attached the code for easy view StandardAnalyzer analyzer = new StandardAnalyzer(Version.LUCENE_41); // 1. create the index Directory inde

Re: need Help with lucene attributes

2013-09-13 Thread Michael McCandless
Hi, I just responded on your previous thread about this ... maybe you didn't see it (you need to subscribe to java-user@lucene.apache.org to see responses). Mike McCandless http://blog.mikemccandless.com On Fri, Sep 13, 2013 at 1:12 AM, nischal reddy wrote: > Hi, > > I am confused a bit about

need Help with lucene attributes

2013-09-13 Thread nischal reddy
Hi, I am confused a bit about the lucene attributes, can someone please help me out with this, can we store all the attributes of a term in the index? i have set following attributes for a term, CharacterTermAttribute, TypeAttribute, PayloadAttribute and OffsetsAttribute, but when i examine my i

Re: Need help regarding understanding internals of Lucene Index.

2013-01-25 Thread Adrien Grand
Hi Vignesh, This is a very broad question! The following links might help you: - Lucene documentation: http://lucene.apache.org/core/4_1_0/index.html - File formats: http://lucene.apache.org/core/4_1_0/core/org/apache/lucene/codecs/lucene41/package-summary.html#package_description - The block t

Need help regarding understanding internals of Lucene Index.

2013-01-25 Thread VIGNESH S
Hi, I recently started using Lucene. There are lot of articles and news about undestanding Lucene at the API levels . Can anyone help me understanding in depth what happens internally inside a lucene index how it stores and details about the which Datastructure lucene uses for inverted index.

Re: Need help About Lucene Query

2012-04-15 Thread Adriano Crestani
plemented in our project.But I > need help about my lucene question. > > I have one Index which has two fields. A and B > Some values are like below > A B > ___ > > AC 10 > AC 20 > ACC30 > ACC40 > ACP70 > > I need l

RE: Need Help for Wild Card Query Highlighting

2011-10-18 Thread Vidya Kanigiluppai Sivasubramanian
highlight wild card query results from 2.2.0 onwards also, I am confident that it will work in 2.4.1. Thanks, Vidya -Original Message- From: Ian Lea [mailto:ian@gmail.com] Sent: Tuesday, October 18, 2011 1:52 PM To: java-user@lucene.apache.org Subject: Re: Need Help for Wild Card Query

Re: Need Help for Wild Card Query Highlighting

2011-10-18 Thread Ian Lea
Why 2.4.1? That is ancient and there have been many improvements since then. Google finds hits for "lucene highlight wild card" some of which contain some solutions some of which may or may not be relevant for your problem. -- Ian. On Tue, Oct 18, 2011 at 8:17 AM, Vidya Kanigiluppai Sivasubra

Need Help for Wild Card Query Highlighting

2011-10-18 Thread Vidya Kanigiluppai Sivasubramanian
Hi, I am new to lucene. I am using lucene 2.4.1 in my project to do a search in a text document. I need to perform a wild card query. I am using the code given in Hrycon - blog. It is working fine with complete words. When we do a wild card query, we can only see the search hits but the text fra

Re: Need Help: Business Scenario to lucene implementation

2011-09-01 Thread Saurabh Gokhale
Hi Grant, Thanks for the reply. I would definitely look into Solr Deduplication approch. But since I am using pure lucene and not Solr, I am not sure how feasible that would be to find something in lucene or try duplicating it. But thats looks to be the way forward. Also regarding the question a

Re: Need Help: Business Scenario to lucene implementation

2011-09-01 Thread Grant Ingersoll
I'd probably treat this as a deduplication problem and look to use a fuzzy matching approach, such as the TextProfileSignature in Solr/Nutch: http://wiki.apache.org/solr/Deduplication, which I believe is tunable as to it's threshold of acceptance. I'd also likely give pushback on the notion of

Re: Need Help: Business Scenario to lucene implementation

2011-08-31 Thread Saurabh Gokhale
Can some one pls help with the logic that can be applied to decide on the closeness requirement given below (like 50% matching). This matching is a pure text matching. Since the current lucene score does not translate into the percentage of closeness, is there anything else that can give this info

Need Help: Business Scenario to lucene implementation

2011-08-30 Thread Saurabh Gokhale
Hi All, I need your help to understand how I can have Lucene applied to the following business scenario. Question is in RED *Business Scenario:* Analyze newly created document "A" with existing documents in the system and if document A matches more than (similar to) 50% with any of the existing d

Re: Need Help: Index directory files getting deleted due to server restart.

2011-07-29 Thread Chris Hostetter
: It is strange that I was suggested not to call commit explicitly and leave : it to the lucene but it seems it has its own disadvantages. as long as you commit/close the writer cleanly on shutdown you should be fine ... i don't think you need to be so agressive as to call it on ever X docs (un

Re: Need Help: Index directory files getting deleted due to server restart.

2011-07-29 Thread Saurabh Gokhale
Hi Ian Thanks for looking into the issue. And you are right. Its not this code which was causing the issue. The issue was as follows: (I just successfully performed a test run) *ISSUE:* My code had following characteristics. 1. CREATE_OR_APPEND way of opening indexWriter. 2. No explicit call to

Re: Need Help: Index directory files getting deleted due to server restart.

2011-07-29 Thread Ian Lea
Code looks fine and will not zap the current contents of indexDir. Something else must be - another call with OpenMode.CREATE? Where is indexDir - could tomcat be zapping it on startup? Some other job? -- Ian. On Thu, Jul 28, 2011 at 8:12 PM, Saurabh Gokhale wrote: > Hi All, > > I am using f

Need Help: Index directory files getting deleted due to server restart.

2011-07-28 Thread Saurabh Gokhale
Hi All, I am using following code to create Lucene index using writer object if (indexWriter == null) { Directory idx = new NIOFSDirectory(new File(indexDir),null); IndexWriterConfig iConfig = new IndexWriterConfig(Version.LUCENE_31, getAnalyzer()); iConfig.setOpenMode(IndexWriterConfig.O

Re: I have an error and need help.. please

2011-07-12 Thread Simon Willnauer
maybe you should try the nutch mailing list, this one is for lucene users. -- > http://nutch.apache.org/mailing_lists.html simon On Tue, Jul 12, 2011 at 6:48 PM, Marlen wrote: > I forgot to say .. Im using Nutch, >> >> The index looks to be fine but when I tip "http://localhost:8080/nutch/"

Re: I have an error and need help.. please

2011-07-12 Thread Marlen
I forgot to say .. Im using Nutch, The index looks to be fine but when I tip "http://localhost:8080/nutch/"; this come out Estado HTTP 500 - type Informe de Excepci贸n mensaje descripci贸n El servidor encontr贸 un error interno () que hizo que no pudiera rellenar este requerimiento. excepci

I have an error and need help.. please

2011-07-12 Thread zmachado
The index looks to be fine but when I tip "http://localhost:8080/nutch/"; this come out Estado HTTP 500 - type Informe de Excepci贸n mensaje descripci贸n El servidor encontr贸 un error interno () que hizo que no pudiera rellenar este requerimiento. excepci贸n org.apache.jasper.JasperException: ja

RE: need help

2011-06-21 Thread karl.wright
You might want to look at ManifoldCF too. http://incubator.apache.org/connectors/ Karl -Original Message- From: ext Marlen [mailto:zmach...@facinf.uho.edu.cu] Sent: Tuesday, June 21, 2011 9:49 AM To: java-user@lucene.apache.org Subject: need help I need to create a search engine that

RE: need help

2011-06-21 Thread Vinaya Kumar Thimmappa
Hello Cheta, Check this site : http://www.lucidimagination.com/blog/2009/03/09/nutch-solr/ Vinaya -Original Message- From: Marlen [mailto:zmach...@facinf.uho.edu.cu] Sent: Tuesday, June 21, 2011 7:19 PM To: java-user@lucene.apache.org Subject: need help I need to create a search

need help

2011-06-21 Thread Marlen
I need to create a search engine that searches on a intranet and my FTP .. .. I want to use Lucene as search engine .. my question is: which best fits my needs .. Nutch or Solr? thank a lot cheta On 13/06/2011 9:17, Ian Lea wrote: Hello Lucene can be used for searching pretty much anything.

Re: need help

2011-06-13 Thread Marlen
thank you very much Ian. On 13/06/2011 9:17, Ian Lea wrote: Hello Lucene can be used for searching pretty much anything. But it is a library, not an application, and you'll have to write code to make it do what you want. You might be better off using Solr. It uses lucene but provides lots of

Re: need help

2011-06-13 Thread Ian Lea
Hello Lucene can be used for searching pretty much anything. But it is a library, not an application, and you'll have to write code to make it do what you want. You might be better off using Solr. It uses lucene but provides lots of stuff on top. http://lucene.apache.org/solr/features.html -

need help

2011-06-13 Thread Marlen
hello I麓m new with lucene.. I wold like to know if I can use it to make searchs on my web site and FTP.. and know if it v隆can make search over pdf,*.doc, and any other non plane text Thanks - To unsubscribe, e-mail: java-user-u

RE: Need Help: Lucene with PHP/Java Bridge

2010-10-24 Thread Uwe Schindler
- Uwe Schindler H.-H.-Meier-Allee 63, D-28213 Bremen http://www.thetaphi.de eMail: u...@thetaphi.de > -Original Message- > From: dian puma [mailto:dianp...@gmail.com] > Sent: Monday, October 25, 2010 6:03 AM > To: java-user@lucene.apache.org > Subject: Re: Need Help: Lucene with PHP

Re: Need Help: Lucene with PHP/Java Bridge

2010-10-24 Thread dian puma
Hi. I still have problem with it My code worked well when I run it by command line, ex."php srcLucene.php" But it didn't work on web browser, still got an error like this. indexing ... Exception occured: [[o:Exception]:"java.lang.Exc

Re: Need Help: Lucene with PHP/Java Bridge

2010-10-24 Thread dian puma
> Uwe Schindler > H.-H.-Meier-Allee 63, D-28213 Bremen > http://www.thetaphi.de > eMail: u...@thetaphi.de > > > > -Original Message- > > From: dian puma [mailto:dianp...@gmail.com] > > Sent: Saturday, October 23, 2010 6:01 PM > > To: java-use

RE: Need Help: Lucene with PHP/Java Bridge

2010-10-23 Thread Uwe Schindler
ctor? - Uwe Schindler H.-H.-Meier-Allee 63, D-28213 Bremen http://www.thetaphi.de eMail: u...@thetaphi.de > -Original Message- > From: dian puma [mailto:dianp...@gmail.com] > Sent: Saturday, October 23, 2010 6:01 PM > To: java-user@lucene.apache.org > Subject: Need He

RE: Need Help: Lucene with PHP/Java Bridge

2010-10-23 Thread Zhang, Lisheng
...@gmail.com] Sent: Saturday, October 23, 2010 9:01 AM To: java-user@lucene.apache.org Subject: Need Help: Lucene with PHP/Java Bridge Dear All, Currently, I'm using PHP/Java Bridge to have Lucene in my PHP web application, and also using the java extension for PHP. FYI, I'd setup lu

Need Help: Lucene with PHP/Java Bridge

2010-10-23 Thread dian puma
Dear All, Currently, I'm using PHP/Java Bridge to have Lucene in my PHP web application, and also using the java extension for PHP. FYI, I'd setup lucene on my PC several months ago and my code below worked well. But, Today I try to setup lucene on another PC, and I get an error message: ==

Re: Need help regarding Lucene Document Iteration...

2010-10-05 Thread Ian Lea
What exactly is the problem? The standard idiom nowadays for iterating through a List is what you're using in "for (Field field : ...)". I haven't used an iterator for a long time. But perhaps your iteration code is working and the problem is in your search code. The javadoc for search(query, n

Need help regarding Lucene Document Iteration...

2010-10-04 Thread Altaf Vasi
Hi, i am doing the following in java. I am searching through a lucene index and getting a Collection of Documents. Below is the code that i have written. Collection docCollection= new ArrayList(); try { Query query = new TermQuery(new Term(nameType, queryStr));

Re: Need help in understanding output of searcher.explain() function

2010-08-07 Thread Soby Thomas
thanks Jayendra...it was really helpful On Sat, Aug 7, 2010 at 6:07 PM, jayendra patil wrote: > Trying to put up an explanation :- > > 0.022172567 = (MATCH) product of: > 0.07760398 = (MATCH) sum of: > 0.02287053 = (MATCH) weight(payload:ces in 550), product of: > 0.32539415 = queryWeight(

Re: Need help in understanding output of searcher.explain() function

2010-08-07 Thread jayendra patil
Trying to put up an explanation :- 0.022172567 = (MATCH) product of: 0.07760398 = (MATCH) sum of: 0.02287053 = (MATCH) weight(payload:ces in 550), product of: 0.32539415 = queryWeight(payload:ces), product of: 2.2491398 = *idf*(docFreq=157, maxDocs=551) 0.14467494 = queryNor

Need help in understanding output of searcher.explain() function

2010-08-07 Thread Soby Thomas
Hello Guys, I trying to understand how lucene score is calculated. So 'm using the searcher.explain() function. But the output it gives is really confusing for me. Below are the details of the query that I gave and o/p it gave me Query: *It is definitely a CES deal that will be over in Sep or Oct

  1   2   >