lucene - general question

2006-12-03 Thread Eshwaramoorthy Babu
Hi, we have a requirement to compare 2 XML files and generate a result (reconciliation report). Each XML file is 6MB and the format is as below 123 123 I have to implement the below logic: Number of matching IDs in both XML files, Number of non-matching IDs in both XML files, Number of non matching

Re: lucene - general question

2006-12-03 Thread Lukas Vlcek
Hi Babu, Sorry, but I don't see any point in using Lucene if you don't need search functionality. Also, for parsing XML files I would consider using a scripting language (as opposed to a pure Java-based solution). The reason is that scripting languages can be more effective when simplicity of resu

Re: lucene - general question

2006-12-04 Thread Eshwaramoorthy Babu
Hi Lukas, Thanks for your response. I was planning to search for the IDs from the 1st XML file in the 2nd XML file, so I thought of using Lucene for the search. Can you please suggest a scripting solution? Is Perl the right solution? Thanks, Babu On 12/4/06, Lukas Vlcek <[EMAIL PROTECTED]> wrote: Hi Babu, Sorry but

Re: lucene - general question

2006-12-04 Thread Lukas Vlcek
Hi, Try looking at Groovy (I haven't used it yet, but some people say it is much easier to work with XML files in Groovy than in Java). It produces class files, so it can be integrated with your existing Java code. A 6MB file is not that much unless you are working in a limited environment (like mobile

Re: lucene - general question

2006-12-04 Thread [EMAIL PROTECTED]
Hi Babu, your sample XML schema contains only a few fields, so why not consider using a database (MySQL)? To do: read your XML file, use Digester to convert it to Java objects, then insert them into your database. When you're done with the inserts, you can simply query your database anytime you like. Cheers, Buics

Re: lucene - general question

2006-12-04 Thread Eshwaramoorthy Babu
Hi Buics, Thanks for your response. I will receive 2 XML files, and I have to compare them and generate an XML report with the following: 1. Matching IDs from both XML files 2. Duplicate IDs from both XML files. The requirement is for reconciliation of data from 2 applications. For this I have to get all IDs fr
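
One way the "duplicate IDs" part of the report above might be computed, shown as a minimal Java sketch. It assumes the IDs have already been extracted from an XML file into a list of strings; the class name `DuplicateIds`, the method name `duplicates`, and the sample values are hypothetical and do not come from the thread.

```java
import java.util.Arrays;
import java.util.HashSet;
import java.util.List;
import java.util.Set;
import java.util.TreeSet;

// Hypothetical helper: the names and sample IDs are illustrative only.
public class DuplicateIds {

    /** Returns every ID that occurs more than once in the given list. */
    static Set<String> duplicates(List<String> ids) {
        Set<String> seen = new HashSet<>();
        Set<String> dups = new TreeSet<>(); // sorted for a stable report order
        for (String id : ids) {
            // Set.add returns false when the element was already present.
            if (!seen.add(id)) {
                dups.add(id);
            }
        }
        return dups;
    }

    public static void main(String[] args) {
        // Assumed sample input standing in for IDs read from one XML file.
        List<String> ids = Arrays.asList("7", "8", "7", "9", "8");
        System.out.println("duplicates: " + duplicates(ids));
    }
}
```

Running the same method once per file gives the duplicates within each file; whether "duplicate" should also cover IDs repeated across files is not stated in the thread.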

Re: lucene - general question

2006-12-04 Thread Erick Erickson
I also think Lucene is overkill, assuming that you don't need to persist the XML you're talking about. Why not just use Java Sets? See http://java.sun.com/docs/books/tutorial/collections/interfaces/set.html for how to do set intersection, union, difference, etc. And what's the difference between
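
The Java Sets suggestion above could look roughly like the following sketch. It assumes the IDs from each XML file have already been loaded into a `Set<String>`; the class name `IdReconciler`, the method names, and the sample IDs are hypothetical. `retainAll` and `removeAll` are the standard `java.util.Collection` methods for intersection and difference.

```java
import java.util.Arrays;
import java.util.Set;
import java.util.TreeSet;

// Hypothetical helper: the names and sample IDs are illustrative only.
public class IdReconciler {

    /** IDs present in both sets (set intersection). */
    static Set<String> matching(Set<String> first, Set<String> second) {
        Set<String> result = new TreeSet<>(first); // copy so the input is not modified
        result.retainAll(second);
        return result;
    }

    /** IDs present in first but missing from second (set difference). */
    static Set<String> onlyIn(Set<String> first, Set<String> second) {
        Set<String> result = new TreeSet<>(first);
        result.removeAll(second);
        return result;
    }

    public static void main(String[] args) {
        // Assumed sample data standing in for the IDs of the two XML files.
        Set<String> a = new TreeSet<>(Arrays.asList("101", "102", "103"));
        Set<String> b = new TreeSet<>(Arrays.asList("102", "103", "104"));

        System.out.println("matching: " + matching(a, b));       // [102, 103]
        System.out.println("only in first: " + onlyIn(a, b));    // [101]
        System.out.println("only in second: " + onlyIn(b, a));   // [104]
    }
}
```

The sizes of these result sets give the counts of matching and non-matching IDs, and two files of 6MB each should fit comfortably in memory on an ordinary desktop JVM.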