Re: phrase search in a particular case

2010-06-17 Thread Michael McCandless
SpanFirstQuery? Mike On Thu, Jun 17, 2010 at 3:23 PM, rakesh rakesh wrote: > Hi, > > I have thousands of article titles in lucene index. So for a query "Oil > spill" I want to return all the article title starts with "Oil spill". I do > not want those titles which has this phrase but do not star

[ANN] Free Webinar: June 24: How Cisco uses Lucene/Solr w/ Social Networks

2010-06-17 Thread Chris Hostetter
(cross posted announcement, please keep any replies to gene...@lucene) On behalf of Lucid Imagination, I'd like to invite folks to a free Webinar we're hosting on June 24th... How Cisco’s Pulse uses Lucene/Solr to put Social Networks to Work Thursday, June 24, 2010 9am

phrase search in a particular case

2010-06-17 Thread rakesh rakesh
Hi, I have thousands of article titles in lucene index. So for a query "Oil spill" I want to return all the article title starts with "Oil spill". I do not want those titles which has this phrase but do not start with this. Can anyone help me. Thanks in advance Thanks rakesh

Exception when running search

2010-06-17 Thread Siraj Haider
I am trying to run a search using search(query, filter, n, sort) method which return TopFieldDocs. The sort is defined like: sort = new Sort(new SortField("DATEISSUED", SortField.LONG, true)); and I am passing filter as null. The query I am passing is : +SK:1J +TEAMID:1 which return results s

parsing document title

2010-06-17 Thread Mango
I'm supposed to index documents which do not have all the information I need stored in the Metadata fields. I would like to extract the document title from the document body when the Title Metadata field contains no information. In addition, many of the documents contain a table with information o

Re: Strange behaviour of StandardTokenizer

2010-06-17 Thread Ahmet Arslan
> I ran into a strange behaviour of the StandardTokenizer. > Terms containing a '-' are tokenized differently depending > on the context. > For example, the term 'nl-lt' is split into 'nl' and 'lt'. > The term 'nl-lt0' is tokenized into 'nl-lt0'. > Is this a bug or a feature? It is designed tha

Strange behaviour of StandardTokenizer

2010-06-17 Thread Anna Hunecke
Hi! I ran into a strange behaviour of the StandardTokenizer. Terms containing a '-' are tokenized differently depending on the context. For example, the term 'nl-lt' is split into 'nl' and 'lt'. The term 'nl-lt0' is tokenized into 'nl-lt0'. Is this a bug or a feature? Can I avoid it somehow? I'm