Seattle / NW Hadoop, HBase Lucene, etc. Meetup , Wed August 26th, 6:45pm

2009-08-25 Thread Bradford Stephens
Hey there, Apologies for this not going out sooner -- apparently it was sitting as a draft in my inbox. A few of you have pinged me, so thanks for your vigilance. It's time for another Hadoop/Lucene/Apache Stack meetup! We've had great attendance in the past few months, let's keep it up! I'm alwa

Re: How to give a score for all documents?

2009-08-25 Thread Fabrício Raphael
I am continuing a work about wavelets in IR. In the bellow article you will to find a example. http://www.ieeexplore.ieee.org/search/srchabstract.jsp?arnumber=4740460&isnumber=4740405&punumber=4740404&k2dockey=4740...@ieeecnfs&query=%28%28using+wavelets+to+classify+documents%29%3Cin%3Eti+%29&pos=0

Re: How to give a score for all documents?

2009-08-25 Thread Simon Willnauer
Hi Fabricio, I will try to recap what you are trying to say... you IR model does score documents that would not be returned by a particular query. So you have some other indicator that make a document relevant?! If it is not a term could you give use an example? How would you decide if a doc is r

Re: Extending Sort/FieldCache

2009-08-25 Thread Chris Hostetter
: I'd like to extend Lucene's FieldCache such that it will read native values ... : payload value to long, and store it in the cache. The reason I want to : extend Lucene's FieldCache is because I'd like Lucene to take care of : updating this cache when necessary (such as after reopen for

Re: How to give a score for all documents?

2009-08-25 Thread Fabrício Raphael
First, that a document is relevant to a query does not necessarily mean that this document has to contain some query term. You can have other ways to assert that a document is relevant to a query. My IR model is different of the vector model, so it can to give score not null for documents irreleva

Lucene release 2.9

2009-08-25 Thread Mark Miller
Hello all Lucene users, I just wanted to let you in on the current release schedule for Lucene 2.9 (still subject to change): Currently, we plan to go into official feature freeze tomorrow (Wednesday, August 26 2009). That means we will try and keep the 2.9 code as stable as possible, only commit

Re: How to give a score for all documents?

2009-08-25 Thread Shai Erera
Can you please elaborate more on the use case? Why if a certain document is irrelevant to a certain query, you'd like to give it a score? Are you perhaps talking about certain documents which should always appear in search results, no matter what the query is? And instead of always showing them, yo

Re: How to give a score for all documents?

2009-08-25 Thread Fabrício Raphael
I already know about this, but I want to give a customized score for all documents in collection, independent if wache document is or isn't relevant to the vector model. The similarity function is called only when the document is relevant to the vector model. Do you understand me? Thanks! On Sa

Purpose of the file modification date methods in Directory?

2009-08-25 Thread cemerick
I've built a Lucene Directory implementation for jdbm, an embedded Java database. Part of the Directory API are two methods related to "file" modification dates: touchFile and fileModified. My question is, what is the purpose of these methods? I've searched the Lucene core source tree, and found n

Lucene Meetup - September 3, Mountain View, CA

2009-08-25 Thread Erik Hatcher
Announcing a new Meetup for SFBay Apache Lucene/Solr Meetup! What: SFBay Apache Lucene/Solr June Meetup When: September 3, 2009 6:30 PM Where: Computer History Museum, 1401 N Shoreline Blvd, Mountain View, CA 94043 Presentations and discussions on Lucene/Solr, the Apache Open Source Search

Re: Lucene query syntax using grouping, MUST, MUST_NOT and SHOULD

2009-08-25 Thread AHMET ARSLAN
--- On Tue, 8/25/09, Henric Müller wrote: > From: Henric Müller > Subject: Lucene query syntax using grouping, MUST, MUST_NOT and SHOULD > To: java-user@lucene.apache.org > Date: Tuesday, August 25, 2009, 12:20 PM > Hi, > > I have experience some strange things when dealing with > Lucene quer

Lucene query syntax using grouping, MUST, MUST_NOT and SHOULD

2009-08-25 Thread Henric Müller
Hi, I have experience some strange things when dealing with Lucene queries. I've read some threads on this list and realize that Lucene shouldn't be seen as a complete boolean query language. However I try to understand the Lucene-way and have one example that I wonder if someone can help me

Re: Faster way for faceting?

2009-08-25 Thread Ivan Vasilev
Hi Simon, 10x for your answer. Unfortunately the code that you suggest is compatible in speed with the code that we use in our app (it was even a bit slower). 10x, Ivan Simon Willnauer wrote: Hi there, I'm not sure if the performance is considerable for you but you could try: TermDocs term