no result when searching sentences in solr

2011-09-21 Thread hadi
I index some PDF and DOCX files with SolrJ, and when I query for a sentence such as "We'd be glad to have you accompany" or anything else, the result is empty. Is there some configuration I am missing? Note that I run the query in /solr/browse.

boost a document which has a field not empty

2011-09-21 Thread Zoltan Altfatter
Hi, I have one entity called organisation. I am indexing their name to be able to search afterwards on their name. I store also the website of the organisation. Some organisations have a website some don't. Can I achieve that when searching for organisations even if I have a match on their name I

Solr Indexing - Null Values in date field

2011-09-21 Thread mechravi25
Hi, I have a field in my source with data type as string and that field has NULL values. I am trying to index this field in solr as a date data type with multivalued = true. Following is the entry for that field in my schema.xml field name=startdate type=date indexed=true stored=true

Re: Solr Indexing - Null Values in date field

2011-09-21 Thread Gora Mohanty
On Wed, Sep 21, 2011 at 4:08 PM, mechravi25 mechrav...@yahoo.co.in wrote: Hi, I have a field in my source with data type as string and that field has NULL values. I am trying to index this field in solr as a date data type with multivalued = true. Following is the entry for that field in my

Fuzzy Suggester

2011-09-21 Thread O. Klein
From http://wiki.apache.org/solr/Suggester: "JaspellLookup can provide fuzzy suggestions, though this functionality is not currently exposed (it's a one line change in JaspellLookup)." Does anybody know what change this would have to be?

Problem using EdgeNGram

2011-09-21 Thread Kissue Kissue
Hi, I am using Solr 3.3 with SolrJ. I am trying to use EdgeNGram to power an auto-suggest feature in my application. My understanding is that using EdgeNGram would mean that results are only returned for records starting with the search criteria, but this is not happening for me. For example if

JSON response with SolrJ

2011-09-21 Thread Kissue Kissue
Hi, I am using Solr 3.3 with SolrJ. Does anybody have any idea how I can retrieve a JSON response with SolrJ? Is it possible? It seems to be more focused on XML and beans. Thanks.

Re: JSON response with SolrJ

2011-09-21 Thread Parvin Gasimzade
Hi, A similar question was asked before. Maybe it can help: http://lucene.472066.n3.nabble.com/SolrJ-Response-JSON-td1002024.html On Wed, Sep 21, 2011 at 3:01 PM, Kissue Kissue kissue...@gmail.com wrote: Hi, I am using solr 3.3 with SolrJ. Does anybody have any idea how i can retrieve JSON

Re: Problem using EdgeNGram

2011-09-21 Thread O. Klein
Try using KeywordTokenizerFactory instead of StandardTokenizerFactory to get the results you want.
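The effect of that suggestion can be illustrated without Solr. The sketch below is plain Python, not Solr's analysis chain: `edge_ngrams`, `index_terms`, the gram sizes, and the sample value are all hypothetical, whitespace splitting is only a rough stand-in for StandardTokenizerFactory, and treating the whole lowercased value as one token stands in for KeywordTokenizerFactory.

```python
def edge_ngrams(token, min_gram=1, max_gram=15):
    """Return the leading-edge n-grams of a single token."""
    upper = min(len(token), max_gram)
    return [token[:n] for n in range(min_gram, upper + 1)]


def index_terms(value, keyword=False):
    """Simulate the terms indexed for one field value.

    keyword=True  -> the whole value is one token (KeywordTokenizer-like)
    keyword=False -> split on whitespace (rough stand-in for StandardTokenizer)
    """
    text = value.lower()
    tokens = [text] if keyword else text.split()
    terms = set()
    for token in tokens:
        terms.update(edge_ngrams(token))
    return terms


title = "Red Apple"  # hypothetical field value
per_word = index_terms(title, keyword=False)
whole_value = index_terms(title, keyword=True)

# Per-word tokens: a prefix of the second word is also an indexed term.
print("ap" in per_word)
# Single keyword token: only prefixes of the whole value are indexed.
print("ap" in whole_value)
print("red a" in whole_value)
```

With per-word tokens every word contributes its own prefixes, which is why a record can match on a word in the middle of the value. With a single token, only the start of the whole value can match.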

Re: boost a document which has a field not empty

2011-09-21 Thread Alexei Martchenko
Can you assign a doc boost at index time? 2011/9/21 Zoltan Altfatter altfatt...@gmail.com Hi, I have one entity called organisation. I am indexing their name to be able to search afterwards on their name. I store also the website of the organisation. Some organisations have a website some

Sort five random Top Offers to the top

2011-09-21 Thread MOuli
Hey Community. I have a Lucene/Solr index with many offers. Some of them are marked as top offers by a flag field, topoffer. Now I want to sort 5 of these offers, chosen at random, to the top. For example: HTC Sensation - topoffer = true HTC Desire - topoffer = false Samsung Galaxy S2 - topoffer

Re: boost a document which has a field not empty

2011-09-21 Thread Ahmet Arslan
I have one entity called organisation. I am indexing their name to be able to search afterwards on their name. I store also the website of the organisation. Some organisations have a website some don't. Can I achieve that when searching for organisations even if I have a match on their

Re: MMapDirectory failed to map a 23G compound index segment

2011-09-21 Thread Robert Muir
On Tue, Sep 20, 2011 at 12:32 PM, Michael McCandless luc...@mikemccandless.com wrote: Or: is it possible you reopened the reader several times against the index (ie, after committing from Solr)?  If so, I think 2.9.x never unmaps the mapped areas, and so this would accumulate against the

Re: boost a document which has a field not empty

2011-09-21 Thread Zoltan Altfatter
Yes, I am using edismax and the bq parameter did the trick. Thanks a lot. On Wed, Sep 21, 2011 at 3:59 PM, Ahmet Arslan iori...@yahoo.com wrote: I have one entity called organisation. I am indexing their name to be able to search afterwards on their name. I store also the website of the
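For readers following the thread, a request of the kind described might look like the sketch below. It only builds the query string with the Python standard library. The field names `name` and `website`, the `/solr/select` path, and the boost value are assumptions for illustration and are not taken from the thread; only `edismax` and `bq` are.

```python
from urllib.parse import urlencode, parse_qs


def build_query(user_query, boost=5):
    """Build an edismax request that boosts documents having a website value."""
    params = {
        "q": user_query,
        "defType": "edismax",
        "qf": "name",  # assumed name of the searched field
        # Boost query: an open-ended range matches any document that has
        # some value in the (assumed) `website` field.
        "bq": f"website:[* TO *]^{boost}",
    }
    return "/solr/select?" + urlencode(params)


url = build_query("acme")
print(url)

# Decode the query string again to inspect the parameters that were sent.
decoded = parse_qs(url.split("?", 1)[1])
print(decoded["bq"])
```

Because `bq` only adds to the score, organisations without a website still match on their name; they simply rank lower than otherwise equal matches that have one.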

LocalParams, bq, and highlighting

2011-09-21 Thread Demian Katz
I've run into another strange behavior related to LocalParams syntax in Solr 1.4.1. If I apply Dismax boosts using bq in LocalParams syntax, the contents of the boost queries get used by the highlighter. Obviously, when I use bq as a separate parameter, this is not an issue. To clarify, here

Selective values for facets

2011-09-21 Thread ntsrikanth
Hi, The dataset I have is for special offers. We have a lot of offer codes, but I need to create a few facets for specific conditions only. For example, I have the following codes: ABCD, AGTR, KUYH, NEWY, NEWA, NEWB, EAS1, EAS2 And I need to create a facet like 'New Year Offers' mapped with

Best Practices for indexing nested XML in Solr via DIH

2011-09-21 Thread Pulkit Singhal
Hello Everyone, I was wondering what are the various best practices that everyone follows for indexing nested XML into Solr. Please don't feel limited by examples, feel free to share your own experiences. Given an xml structure such as the following: categoryPath category

Re: How to write core's name in log

2011-09-21 Thread Pulkit Singhal
Not sure if this is a good lead for you but when I run out-of-the-box multi-core example-DIH instance of Solr, I often see core name thrown about in the logs. Perhaps you can look there? On Thu, Sep 15, 2011 at 6:50 AM, Joan joan.monp...@gmail.com wrote: Hi, I have multiple core in Solr and I

Re: Example setting TieredMergePolicy for Solr 3.3 or 3.4?

2011-09-21 Thread Shawn Heisey
On 9/20/2011 4:09 PM, Robert Muir wrote: yes, mergeFactor=10 is interpreted as both segmentsPerTier and maxMergeAtOnce. yes, specifying explicit TieredMP parameters will override whatever you set in mergeFactor (which is basically only interpreted to be backwards compatible) this is why i

strange copied field problem

2011-09-21 Thread Tanner Postert
i have 3 fields that I am working with: genre, genre_search and text. genre is a string field which comes from the data source. genre_search is a text field that is copied from genre, and text is a text field that is copied from genre_search and a few other fields. Text field is the default search

Re: FW: MMapDirectory failed to map a 23G compound index segment

2011-09-21 Thread Yongtao Liu
I hit a similar issue recently. I am not sure MMapDirectory is the right way to go. When an index file is mapped into RAM, the JVM calls the OS file-mapping function. The memory used is shared memory, so it may not be counted against the JVM process space. One problem I saw is that if the index file is bigger than physical RAM,

SolrCloud state

2011-09-21 Thread Miguel Coxo
Hi there. I'm starting a new project using Solr and I would like to know whether Solr can set up a cluster with fault tolerance. I'm setting up an environment with two shards. Each shard should have a replica. What I would like to know is: if a shard master fails, will the replica be promoted to

Re: strange copied field problem

2011-09-21 Thread Pulkit Singhal
I am NOT claiming that making a copy of a copy field is wrong or leads to a race condition. I don't know that. BUT did you try to copy into the text field directly from the genre field? Instead of the genre_search field? Did that yield working queries? On Wed, Sep 21, 2011 at 12:16 PM, Tanner

Re: strange copied field problem

2011-09-21 Thread Tanner Postert
i believe that was the original configuration, but I can switch it back and see if that yields any results. On Wed, Sep 21, 2011 at 10:54 AM, Pulkit Singhal pulkitsing...@gmail.com wrote: I am NOT claiming that making a copy of a copy field is wrong or leads to a race condition. I don't know

Re: OOM errors and -XX:OnOutOfMemoryError flag not working on solr?

2011-09-21 Thread Pulkit Singhal
Usually any good piece of java code refrains from capturing Throwable so that Errors will bubble up unlike exceptions. Having said that, perhaps someone in the list can help, if you share which particular Solr version you are using where you suspect that the Error is being eaten up. On Fri, Sep

Re: Solr Indexing - Null Values in date field

2011-09-21 Thread Pulkit Singhal
Also you may use the script transformer to explicitly remove the field from the document if the field is null. I do this for all my sdouble and sdate fields ... its a bit manual and I would like to see Solr enhanced to simply skip stuff like this by having a flag for its DIH code but until then it

Debugging DIH by placing breakpoints

2011-09-21 Thread Pulkit Singhal
Hello, I was wondering where I can find the source code for DIH? I want to check out the source and step through it breakpoint by breakpoint to understand it better :) Thanks! - Pulkit

Re: strange copied field problem

2011-09-21 Thread Tanner Postert
sure enough that worked. could have sworn we had it this way before, but either way, that fixed it. Thanks. On Wed, Sep 21, 2011 at 11:01 AM, Tanner Postert tanner.post...@gmail.com wrote: i believe that was the original configuration, but I can switch it back and see if that yields any

Re: Debugging DIH by placing breakpoints

2011-09-21 Thread Gora Mohanty
On Thu, Sep 22, 2011 at 12:08 AM, Pulkit Singhal pulkitsing...@gmail.com wrote: Hello, I was wondering where I can find the source code for DIH? I want to check out the source and step through it breakpoint by breakpoint to understand it better :) Should be under contrib/dataimporthandler in

Re: Debugging DIH by placing breakpoints

2011-09-21 Thread Pulkit Singhal
Correct! With that additional info, plus http://wiki.apache.org/solr/HowToContribute (ant eclipse), plus a refreshed (close/open) eclipse project ... I'm all set. Thanks Again. On Wed, Sep 21, 2011 at 1:43 PM, Gora Mohanty g...@mimirtech.com wrote: On Thu, Sep 22, 2011 at 12:08 AM, Pulkit

Production Issue: SolrJ client throwing this error even though field type is not defined in schema

2011-09-21 Thread roz dev
Hi All We are getting this error in our Production Solr Setup. Message: Element type t_sort must be followed by either attribute specifications, or /. Solr version is 1.4.1 Stack trace indicates that solr is returning malformed document. Caused by:

Re: strange copied field problem

2011-09-21 Thread Pulkit Singhal
No probs. I would still hope someone would comment on your thread with some expert opinions about making a copy of a copy :) On Wed, Sep 21, 2011 at 1:38 PM, Tanner Postert tanner.post...@gmail.com wrote: sure enough that worked. could have sworn we had it this way before, but either way, that

Re: Sort five random Top Offers to the top

2011-09-21 Thread Sujit Pal
Hi MOuli, AFAIK (and I don't know that much about Solr), this feature does not exist out of the box in Solr. One way to achieve this could be to construct a DocSet with topoffer:true and intersect it with your result DocSet, then select the first 5 off the intersection, randomly shuffle them,
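The approach outlined in this reply can be sketched on plain Python lists. This is not Solr's DocSet API: `promote_random_top_offers` and the document ids are hypothetical, and since the reply is cut off after the shuffle step, placing the shuffled offers ahead of the remaining results is an assumption taken from the original question.

```python
import random


def promote_random_top_offers(results, top_offer_ids, count=5, rng=None):
    """Move up to `count` top offers, in random order, to the front of `results`.

    results        -- document ids in their original ranking order
    top_offer_ids  -- set of ids flagged topoffer = true
    rng            -- optional random.Random, e.g. seeded for repeatable tests
    """
    if rng is None:
        rng = random.Random()
    # Intersect the result list with the top offers, keeping ranking order.
    candidates = [doc_id for doc_id in results if doc_id in top_offer_ids]
    # Take the first `count` of the intersection and shuffle them.
    promoted = candidates[:count]
    rng.shuffle(promoted)
    # Everything else keeps its original relative order.
    promoted_set = set(promoted)
    rest = [doc_id for doc_id in results if doc_id not in promoted_set]
    return promoted + rest


results = ["a", "b", "c", "d", "e", "f", "g", "h"]
top_offers = {"b", "d", "e", "f", "g", "h"}
ordered = promote_random_top_offers(results, top_offers, rng=random.Random(42))
print(ordered)
```

Note that taking the first five of the intersection favours the best-ranked top offers. To pick five at random from all matching top offers instead, one could sample from `candidates` rather than slicing it.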

Implementing a custom ResourceLoader

2011-09-21 Thread Jithin Emmanuel
Hi, As part of writing a Solr plugin I need to override the ResourceLoader. My plugin is intended as a stop word analyzer filter factory and I need to change the way stop words are fetched. My assumption is that overriding ResourceLoader-getLines() will help me meet my target of fetching stop word

Re: Two unrelated questions

2011-09-21 Thread Erick Erickson
For #1, I don't quite get what you're driving at. Your DIH query assigns the uniqueKey; it's not like it's something auto-generated. Perhaps a concrete example would help. For #2, there's a limit you can adjust that defaults to 1024 (maxBooleanClauses in solrconfig.xml). You can bump this very high, but

Re: Slow autocomplete(terms)

2011-09-21 Thread Erick Erickson
Think about ngrams if you really need infix searches, you're right that the regex is very probably the root of your problem. The index has to examine *every* term in the field to determine if the regex will match. Best Erick On Tue, Sep 20, 2011 at 12:57 AM, roySolr royrutten1...@gmail.com
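Why n-grams help with infix search can be shown with a small sketch. It is plain Python rather than Solr's NGram filter: `ngrams`, `build_index`, the gram sizes, and the sample terms are all hypothetical. The point is that the substrings are computed once at index time, so an infix query becomes a single lookup instead of a regex test against every term.

```python
def ngrams(term, min_gram=2, max_gram=4):
    """Return every substring of `term` with length in [min_gram, max_gram]."""
    grams = set()
    for size in range(min_gram, max_gram + 1):
        for start in range(len(term) - size + 1):
            grams.add(term[start:start + size])
    return grams


def build_index(terms):
    """Map each n-gram to the set of original terms that contain it."""
    index = {}
    for term in terms:
        for gram in ngrams(term.lower()):
            index.setdefault(gram, set()).add(term)
    return index


index = build_index(["Amsterdam", "Rotterdam", "Utrecht"])

# An infix query is now one dictionary lookup, not a scan over all terms.
print(sorted(index.get("dam", set())))   # ['Amsterdam', 'Rotterdam']
print(sorted(index.get("rech", set())))  # ['Utrecht']
```

The trade-off is index size: each term contributes many grams. In this sketch, queries longer than `max_gram` would also have to be split into grams before lookup.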

Re: Example setting TieredMergePolicy for Solr 3.3 or 3.4?

2011-09-21 Thread Chris Hostetter
: With no mergeFactor defined, maxMergeAtOnce and segmentsPerTier seem to be : ignored. I've got both set to 35, but Solr is merging every 10 segments. I ... : Here's the relevant config pieces. These two sections are in separate files : incorporated into solrconfig.xml using xinclude:

RE: Two unrelated questions

2011-09-21 Thread Olson, Ron
Thanks for the reply. As far as #1, my table that I'm indexing via DIH has a PK field, generated by a sequence, so there are records with ID of 1, 2, 3, etc. That same id is the one I use in my unique id field in the document (uniqueKeyID/uniqueID). I've noticed that the table has, say, 10

Re: Two unrelated questions

2011-09-21 Thread Rob Casson
for #1, i don't use DIH, but is there any possibility of that column having duplicate keys, with subsequent docs replacing existing ones? and for #2, for some cases you could use a negative filterquery: http://wiki.apache.org/solr/SimpleFacetParameters#Retrieve_docs_with_facets_missing so

Re: OOM errors and -XX:OnOutOfMemoryError flag not working on solr?

2011-09-21 Thread Chris Hostetter
: Usually any good piece of java code refrains from capturing Throwable : so that Errors will bubble up unlike exceptions. Having said that, Even if some piece of code catches an OutOfMemoryError, the JVM should have already called the -XX:OnOutOfMemoryError hook - Although from what i can

Re: Example setting TieredMergePolicy for Solr 3.3 or 3.4?

2011-09-21 Thread Shawn Heisey
On 9/21/2011 3:10 PM, Chris Hostetter wrote: : With no mergeFactor defined, maxMergeAtOnce and segmentsPerTier seem to be : ignored. I've got both set to 35, but Solr is merging every 10 segments. I ... : Here's the relevant config pieces. These two sections are in separate files :

SOLR error with custom FacetComponent

2011-09-21 Thread Ravi Bulusu
Hi All, I'm trying to write a custom SOLR facet component and I'm getting some errors when I deploy my code into the SOLR server. Can you please let me know what I'm doing wrong? I appreciate your help on this issue. Thanks. *Issue* I'm getting an error saying Error instantiating

Re: Example setting TieredMergePolicy for Solr 3.3 or 3.4?

2011-09-21 Thread Shawn Heisey
On 9/21/2011 11:18 AM, Shawn Heisey wrote: With no mergeFactor defined, maxMergeAtOnce and segmentsPerTier seem to be ignored. I've got both set to 35, but Solr is merging every 10 segments. I haven't tried explicitly setting mergeFactor yet to see if that will make the other settings

Re: SOLR error with custom FacetComponent

2011-09-21 Thread Erik Hatcher
Why create a custom facet component for this? Simply add lines like this to your request handler(s): <str name="facet.field">manu_exact</str> either in defaults or appends sections. Erik On Sep 21, 2011, at 14:00 , Ravi Bulusu wrote: Hi All, I'm trying to write a custom

RE: Example setting TieredMergePolicy for Solr 3.3 or 3.4?

2011-09-21 Thread Michael Ryan
I think the problem is that the mergePolicy config needs to be inside of the indexDefaults config, rather than after it as you have. -Michael
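A quick way to check for this kind of misplacement is to parse the config and look at where the element sits. The sketch below uses only the Python standard library. The XML fragments are an assumption about what such a Solr 3.x config might look like (the value 35 comes from the thread; the exact child-element syntax is not confirmed there), and `merge_policy_is_nested` is a hypothetical helper.

```python
import xml.etree.ElementTree as ET

# Assumed shape of a config with mergePolicy nested inside indexDefaults.
NESTED = """
<config>
  <indexDefaults>
    <mergePolicy class="org.apache.lucene.index.TieredMergePolicy">
      <int name="maxMergeAtOnce">35</int>
      <int name="segmentsPerTier">35</int>
    </mergePolicy>
  </indexDefaults>
</config>
"""

# Same settings, but mergePolicy placed after indexDefaults.
MISPLACED = """
<config>
  <indexDefaults>
  </indexDefaults>
  <mergePolicy class="org.apache.lucene.index.TieredMergePolicy">
    <int name="maxMergeAtOnce">35</int>
    <int name="segmentsPerTier">35</int>
  </mergePolicy>
</config>
"""


def merge_policy_is_nested(xml_text):
    """Return True if every mergePolicy element is a child of indexDefaults."""
    root = ET.fromstring(xml_text)
    nested = root.findall("./indexDefaults/mergePolicy")
    everywhere = root.findall(".//mergePolicy")
    return len(everywhere) > 0 and len(nested) == len(everywhere)


print(merge_policy_is_nested(NESTED))     # True
print(merge_policy_is_nested(MISPLACED))  # False
```

This check only knows about indexDefaults; a config that sets the merge policy in another section would need the path adjusted. It also does not resolve xinclude, so included files would have to be checked after they are merged.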

Re: OOM errors and -XX:OnOutOfMemoryError flag not working on solr?

2011-09-21 Thread Jason Toy
I am running the sun version: java version 1.6.0_26 Java(TM) SE Runtime Environment (build 1.6.0_26-b03) Java HotSpot(TM) 64-Bit Server VM (build 20.1-b02, mixed mode) I get multiple Out of memory exceptions looking at my application and the solr logs, but my script doesn't get called the first

Re: Example setting TieredMergePolicy for Solr 3.3 or 3.4?

2011-09-21 Thread Michael Sokolov
I wonder if config-file validation would be helpful here :) I posted a patch in SOLR-1758 once. -Mike On 9/21/2011 6:22 PM, Michael Ryan wrote: I think the problem is that the mergePolicy config needs to be inside of the indexDefaults config, rather than after it as you have. -Michael

RE: NRT and commit behavior

2011-09-21 Thread Tirthankar Chatterjee
Okay, but is there a threshold for the index size, the total number of docs in the index, or the size of physical memory beyond which sharding should be considered? I am trying to find the winning combination. Tirthankar -Original Message- From: Erick Erickson [mailto:erickerick...@gmail.com]

Re: Two unrelated questions

2011-09-21 Thread tamanjit.bin...@yahoo.co.in
For *1* I have faced similar issues, and have realized that it has more to do with the data I am trying to index. In some cases when I run even a full-import with DIH, unless it's a flat table that I am trying to index, there are often issues at the data end when I try to get joins and then index