general question

2015-03-30 Thread Fielder, Todd Patrick
Hello, I'm new to Lucene and am looking for advice. I'm wanting to search the entire DB (or almost the entire DB) for a keyword. The users also want to know which field the string occurred in. I can think of two ways to do this, but neither are ideal and I'm looking for suggestions: 1)

lucene - general question

2006-12-03 Thread Eshwaramoorthy Babu
Hi , we have a requirement to compare 2 xml files and generate result(reconcilation report). The xml file size is 6MB each and the flrmat is as below 123 123 I have to implement the below logic Number of matching ID'S in both xml Number of non matching ID'S in both xml Number of non matching

Re: general question

2015-03-30 Thread Michael McCandless
You could do this with a custom Collector which, for every hit visits all child scorers asking each one whether it matched the current hit. Your collector would have to somehow store this information away so that once the search is done and you pull the top N hits, you know which fields those hits

Re: lucene - general question

2006-12-03 Thread Lukas Vlcek
Hi Babu, Sorry but I don't see any point in using Lucene if you don't need search functionality. Also for parsing XML files I would consider using some scripting language (as opposed to pure Java based solution). The reason is that scripting languages can be more effectire when simplicity of resu

Re: lucene - general question

2006-12-04 Thread Eshwaramoorthy Babu
Hi Lukas, Thanks for your response. I was planning to search for 1st xml ID's in 2nd XML. so I thought of using lucene for search. Can you please suggest me some scripting solution. Is perl right solution? Thanks, Babu On 12/4/06, Lukas Vlcek <[EMAIL PROTECTED]> wrote: Hi Babu, Sorry but

Re: lucene - general question

2006-12-04 Thread Lukas Vlcek
Hi, Try to look at Groovy (I haven't used it yet but some people say it is much easire to work with XML file in Groovy then in Java). It produces class files so it can be integrated with your exisitng Java code. 6MB file is not that much unless you are working in limited environment (like mobile

Re: lucene - general question

2006-12-04 Thread [EMAIL PROTECTED]
Hi Babu, your sample xml schema contains only few fields, why not consider to use db (mysql) todo: read your xml file, then use digester to convert to java object after that insert it your db. when your done with your insert stuff, you can simply query your db anytime you like. cheers, Buics

Re: lucene - general question

2006-12-04 Thread Eshwaramoorthy Babu
Hi Buics, Thanks for your response.. I will receive 2 xml files, I have to compare these 2 and generate a xml report with below 1. Matching id's from both xml 2. Duplicate id's from both xml The requirement is for reconcilation of 2 application data. For this I have to get the get all id's fr

Re: lucene - general question

2006-12-04 Thread Erick Erickson
I also think Lucene is overkill, assuming that you don't need to persist the XML you're talking about. Why not just use Java Sets? See http://java.sun.com/docs/books/tutorial/collections/interfaces/set.html for how to do set intersection, union, difference, etc. And what's the difference between

RE: [EXTERNAL] Re: general question

2015-03-30 Thread Fielder, Todd Patrick
child : scorer.getChildren()){ System.out.println("relationship: " + child.relationship); } } -Original Message- From: Michael McCandless [mailto:luc...@mikemccandless.com] Sent: Monday, March 30, 2015 11:20 AM To: Lucene Users; tpfi...@sandia.go Subject: [EXTERNAL] R

Re: [EXTERNAL] Re: general question

2015-03-31 Thread Terry Smith
" + child.relationship); > } > > } > > -Original Message- > From: Michael McCandless [mailto:luc...@mikemccandless.com] > Sent: Monday, March 30, 2015 11:20 AM > To: Lucene Users; tpfi...@sandia.go > Subject: [EXTERNAL] Re: general question > > You could d

Re: [EXTERNAL] Re: general question

2015-03-31 Thread Michael McCandless
ct(int docID) throws IOException { >> >> for(ChildScorer child : scorer.getChildren()){ >> System.out.println("relationship: " + child.relationship); >> } >> >> } >> >> -Original Message- >> From: Michael McCandless [mailto:l

Re: [EXTERNAL] Re: general question

2015-04-01 Thread Sanne Grinovero
dren()){ >>> System.out.println("relationship: " + child.relationship); >>> } >>> >>> } >>> >>> -Original Message- >>> From: Michael McCandless [mailto:luc...@mikemccandless.com] >>> Sent: Monday, M

RE: [EXTERNAL] Re: general question

2015-04-02 Thread Fielder, Todd Patrick
rinov...@gmail.com] Sent: Wednesday, April 01, 2015 5:33 PM To: java-user@lucene.apache.org Subject: Re: [EXTERNAL] Re: general question Hello all, I don't need to do the same, but the suggestions got me curious. Why would you consider it more efficient to iterate on the child scorers, r

some general question about Nutch Search engine.

2005-09-16 Thread Legolas Woodland
Hi Thank you for reading my post I have some general question : 1-does Nutch support multilanguage indexing and searching ? 2-does it has capability to index and search more than 500,000 site in a timely manner? 3-does it have capabilities to add ADs System , sponsored result first and other

Re: some general question about Nutch Search engine.

2005-09-16 Thread Andrzej Bialecki
Legolas Woodland wrote: Hi Thank you for reading my post I have some general question : Please see http://nutch.org for information about Nutch. 1-does Nutch support multilanguage indexing and searching ? Yes, to large degree (there are always issues when making assumptions about the