Re: synonyms.txt: different results on admin and on site..

2011-09-08 Thread deniz
You are right about wildcards and analysis... so is there any way of putting wildcards through analysis? - Smart, but he doesn't work... If he worked, he would do it...

Re: Using multivalued field in map function

2011-09-08 Thread tkamphuis
Well, I'd like to do the following: I've got a website full of blogposts and every blogpost has an owner, this owner is referred to through his/her id. For example: BloggerId = 123. It's also possible that the blog has multiple co-writers, which are also referred to by their BloggerId but these

Re: Indexing Lotus Notes database using API

2011-09-08 Thread Tobias Berg
Hi again, After reading a bit more, IBM no longer supports the JDBC driver for Lotus Notes. Instead the Notes.jar API is recommended. So I'll go with that, as Oleg suggested. 2011/9/6 Tobias Berg > Thanks Jan, > > I will look into using the JDBC driver. > > /Tobias > > > 2011/9/5 Jan Høydahl >

Re: pagination with grouping

2011-09-08 Thread Bill Bell
There are 2 use cases: 1. rows=10 means 10 groups. 2. rows=10 means 10 results (regardless of groups). I thought there was a total number of groups (ngroups) for case #1. I don't believe case #2 has been coded. On 9/8/11 2:22 PM, "alx...@aim.com" wrote: > > > > Hello, > >When trying to impl
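For case #1, paging by groups can be sketched as below. This is a minimal sketch, not a definitive recipe: it assumes a Solr version that supports the group.ngroups parameter, and the query "title:solr" and the field name "site" are made up for illustration. The helper names are hypothetical too.

```python
import math
from urllib.parse import urlencode


def grouped_page_params(query, group_field, page, groups_per_page=10):
    """Build request parameters for one page of grouped results.

    With grouping on, start and rows are counted in groups, not documents.
    """
    return {
        "q": query,
        "group": "true",
        "group.field": group_field,
        "group.ngroups": "true",  # ask Solr to report the total number of groups
        "start": (page - 1) * groups_per_page,
        "rows": groups_per_page,
    }


def page_count(ngroups, groups_per_page=10):
    """Number of pages needed to show ngroups groups."""
    return math.ceil(ngroups / groups_per_page)


# Hypothetical query and field name, third page of ten groups each.
params = grouped_page_params("title:solr", "site", page=3)
print(urlencode(params))
print(page_count(95))  # 95 groups at 10 per page
```

The last page is then derived from the ngroups value in the response rather than from the document count, which is why this only covers case #1.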

Re: MMapDirectory failed to map a 23G compound index segment

2011-09-08 Thread Lance Norskog
Do you need to use the compound format? On Thu, Sep 8, 2011 at 3:57 PM, Rich Cariens wrote: > I should add some more context: > > 1. the problem index included several cfs segment files that were around > 4.7G, and > 2. I'm running four SOLR instances on the same box, all of which have >

Re: Solr variable substitution question

2011-09-08 Thread Chris Hostetter
: Is there an easy way without writing custom response writers to accept a : parameter on the request URL and then substitute that value in the response? there's nothing built in to support this using the syntax you are asking about, but one approach to consider instead of writing a custom r

Solr variable substitution question

2011-09-08 Thread Garrick Plagge
Is there an easy way without writing custom response writers to accept a parameter on the request URL and then substitute that value in the response? For example, if an index field is set up and a value like the following is added to a field: http://www.google.com/?partner={$partner} If

Re: FastVectorHighlighter with wildcard queries

2011-09-08 Thread Koji Sekiguchi
(11/09/09 6:16), Rahul Warawdekar wrote: Hi, I am currently evaluating the FastVectorHighlighter in a Solr search based project and have a couple of questions 1. Is there any specific reason why the FastVectorHighlighter does not provide support for multiterm (wildcard) queries? 2. What are the

Re: MMapDirectory failed to map a 23G compound index segment

2011-09-08 Thread Rich Cariens
I should add some more context: 1. the problem index included several cfs segment files that were around 4.7G, and 2. I'm running four SOLR instances on the same box, all of which have similar problem indexes. A colleague thought perhaps I was bumping up against my 256,000 open files

Re: any docs on using the GeoHashField?

2011-09-08 Thread Chris Hostetter
: I would think I could index a lat,lon pair into a GeoHashField (that : works) and then retrieve the field value to see the computed geohash. ... : What am I missing - how can I retrieve the hash? I don't think it's designed to work that way. GeoHashField provides GeoHash based search

FunctionQueryNode pipeline?

2011-09-08 Thread mtraynham
I'm wondering if anyone has attempted this. The general idea is to have a FunctionQueryNode, Builder and Pipeline, where functions appear as foo(param) within the query string. Further down the pipeline, Solr has provided functionality for handling ValueSources. The only issue I have encountered

Re: solr 1.4 facet.limit behaviour in merging from several shards

2011-09-08 Thread Chris Hostetter
: When shooting a distributed query, we use facet.limit=1000. Then the merging : SOLR combines the results. We also use facet.zeros=false to ensure returning : only non-zero facet entries. : The issue that we found is that there was a gap in time in the final results : list (reverse sorted by date

Re: MMapDirectory failed to map a 23G compound index segment

2011-09-08 Thread Rich Cariens
FWIW I optimized the index down to a single segment and now I have no trouble opening an MMapDirectory on that index, even though the 23G cfs segment file remains. On Thu, Sep 8, 2011 at 4:27 PM, Rich Cariens wrote: > Thanks for the response. "free -g" reports: > > total used

FastVectorHighlighter with wildcard queries

2011-09-08 Thread Rahul Warawdekar
Hi, I am currently evaluating the FastVectorHighlighter in a Solr search based project and have a couple of questions 1. Is there any specific reason why the FastVectorHighlighter does not provide support for multiterm (wildcard) queries? 2. What are the other constraints when using FastVectorHig

Re: no result with q

2011-09-08 Thread Chris Hostetter
: q=49 OR * I don't think that query means what you think it means... : > Is there a way to tell Solr to only use fq and neglect q if no answer? Or : > am I doomed to send first request, realize I've got no answer and then send : > a second request? what do you mean by "no answer" ? I *think*

Re: High facet.limit (with only 2-3 actual facets) -> Massive bandwidth consumption in DistributedSearch

2011-09-08 Thread Frederik Kraus
In our case it's clearly the wrong tradeoff :) I'm going to patch our Solr for now, but either - a config option - a facet.whatever param - or reversing the tradeoff should be done in my eyes. Am Donnerstag, 8. September 2011 um 22:34 schrieb Yonik Seeley: > So this is bad if you have a hi

Re: High facet.limit (with only 2-3 actual facets) -> Massive bandwidth consumption in DistributedSearch

2011-09-08 Thread Yonik Seeley
On Thu, Sep 8, 2011 at 4:18 PM, Frederik Kraus wrote: >  Now that is quite interesting indeed and sounds like a bug to me. Including > facets with a count of 0 we have a few 100k which then apparently get > transferred. hmhmhm > > Can anyone with more knowledge of the facet component maybe chime

Re: MMapDirectory failed to map a 23G compound index segment

2011-09-08 Thread Rich Cariens
Thanks for the response. "free -g" reports:

                   total  used  free  shared  buffers  cached
Mem:                 141    95    46       0        0      93
-/+ buffers/cache:           2   139
Swap:                  3     0     3

2011/9/7 François Schiettecatte >

pagination with grouping

2011-09-08 Thread alxsss
Hello, When trying to implement pagination as in the case without grouping I see two issues. 1. with rows=10 solr feed displays 10 groups not 10 results 2. there is no total number of results with grouping to show the last page. In detail: 1. I need to display only 10 results in one page.

Re: High facet.limit (with only 2-3 actual facets) -> Massive bandwidth consumption in DistributedSearch

2011-09-08 Thread Frederik Kraus
Now that is quite interesting indeed and sounds like a bug to me. Including facets with a count of 0 we have a few 100k which then apparently get transferred. hmhmhm Can anyone with more knowledge of the facet component maybe chime in why the mincount is removed? Am Donnerstag, 8. September

RE: High facet.limit (with only 2-3 actual facets) -> Massive bandwidth consumption in DistributedSearch

2011-09-08 Thread Michael Ryan
> yep - facet.mincount=1 Yeah, I've run into this same issue, though I never looked too closely into it. What is happening is that the facet.mincount parameter is removed when the query is made to the shards, so each shard is returning about 3 facet values, most of them with a count of 0. I
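The effect described here can be illustrated with a toy merge step. This is only a sketch of the general idea, not Solr's actual distributed faceting code: it assumes each shard returns a plain term-to-count mapping with zero-count terms included, and the function name and sample data are invented.

```python
from collections import Counter


def merge_shard_facets(shard_responses, mincount=1, limit=100):
    """Sum per-shard facet counts, then apply mincount and limit once, at merge time."""
    totals = Counter()
    for counts in shard_responses:
        totals.update(counts)  # adds counts; zero-count terms are still carried along
    kept = [(term, n) for term, n in totals.items() if n >= mincount]
    kept.sort(key=lambda item: (-item[1], item[0]))  # highest count first, then by term
    return kept[:limit]


# Invented sample data: two shards, each sending zero-count terms over the wire.
shard_a = {"red": 2, "green": 0, "blue": 0}
shard_b = {"red": 1, "green": 0, "blue": 3}

transferred = sum(len(r) for r in (shard_a, shard_b))
merged = merge_shard_facets([shard_a, shard_b])
print(transferred)  # 6 entries sent by the shards
print(merged)       # [('blue', 3), ('red', 3)]
```

Only two values survive the merge, yet six entries had to be transferred. With a high facet.limit and many zero-count terms per shard, that gap is where the bandwidth would go.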

Re: High facet.limit (with only 2-3 actual facets) -> Massive bandwidth consumption in DistributedSearch

2011-09-08 Thread Frederik Kraus
yep - facet.mincount=1 Am Donnerstag, 8. September 2011 um 21:37 schrieb Michael Ryan: > Are you using facet.mincount in the query? > > -Michael

RE: High facet.limit (with only 2-3 actual facets) -> Massive bandwidth consumption in DistributedSearch

2011-09-08 Thread Michael Ryan
Are you using facet.mincount in the query? -Michael

Re: Using multivalued field in map function

2011-09-08 Thread Emmanuel Espina
Function queries don't work with multivalued fields. http://wiki.apache.org/solr/FunctionQuery#Vector_Functions You'll have to think of another way of doing that. What do you want to achieve with that map? Regards Emmanuel 2011/9/8 tkamphuis > Hi, > > I'm working on implementing Solr in a proje

High facet.limit (with only 2-3 actual facets) -> Massive bandwidth consumption in DistributedSearch

2011-09-08 Thread Frederik Kraus
Hi guys, I've just experienced an odd issue today with the following setup: Test 1: 20 Shards facet.limit=2 (returned facets 2-3) about 5-6MB network traffic Resp Time ca 3sec Test 2: 20 Shards facet.limit=100 (returned facets 2-3) only a few kb network traffic Resp Time ca 0.02sec Does

Re: how to do sorting on no. of highlighting in solr

2011-09-08 Thread lboutros
Hi, it is possible to create a new similarity class which returns the term occurrences. You have to disable IDF (just return 1), normalization and co. Then you have to declare it in your schema: http://wiki.apache.org/solr/SchemaXml#Similarity http://wiki.apache.org/solr/SolrPlugins#Similarity

Solr and internationalization

2011-09-08 Thread Stéphane Campinas
Hi, I am working on a plugin for Solr, and in this project, a QueryNodeException [1] is thrown in the case of an error while parsing the query. However, I receive the following exception in such a case: HTTP Status 500 - Can't find bundle for base name org.apache.lucene.queryParser.core

Re: can indexing information stored in db rather than filesystem?

2011-09-08 Thread Walter Underwood
Lucene isn't even close to a DBMS. The batch commits make it less of a database than Berkeley DB. Lucene is a search engine. wunder On Sep 8, 2011, at 8:54 AM, Jaeger, Jay - DOT wrote: > If you think about it, Lucene (upon which Solr is build) *is* a kind of DBMS > - just not an RDBMS. Afte

where do i find xsd schemas for MultiCore Solr responses

2011-09-08 Thread Anirudha Jadhav
Hi, I am using a separate task to monitor Solr instances. Where do I find XSD schemas for MultiCore Solr responses in order to correctly parse them? thanks, Ani eg. XML " 016 core0multicore\core0\multicore\core0\data\2011-09-07T21:36:53.864Z 67193861001313764498766 falsetruefalse

RE: can indexing information stored in db rather than filesystem?

2011-09-08 Thread Jaeger, Jay - DOT
If you think about it, Lucene (upon which Solr is built) *is* a kind of DBMS - just not an RDBMS. After all, in the end, a DBMS stores its stuff in files, too. If you then turned around and mapped the stuff that Solr does into database tables, you would lose all of the performance advantages t

Re: can indexing information stored in db rather than filesystem?

2011-09-08 Thread Gora Mohanty
On Thu, Sep 8, 2011 at 6:57 PM, kiran.bodigam wrote: > SOLR will keep the indexed information in directory called > index(${DATA_DIR}/index) after clicking the commit from admin console and > also we can override  the existing index files. similarly can we keep (or) > store these  index related in

Using multivalued field in map function

2011-09-08 Thread tkamphuis
Hi, I'm working on implementing Solr in a project and right now I'm stuck on a specific search including an arr field. The thing is: I'd like to search sub-id's on an object, these sub-id's are stored in a multivalue field, e.g.: 12272 12304 12306 The query (or part of the query) that

RE: Spellcheck

2011-09-08 Thread Jaeger, Jay - DOT
> " Following up from your message on the Nutch list. If q=*:* is showing you > empty elements, no fields are getting indexed." I don't think that is correct. I believe that the correct statement would be no fields are getting *** stored ***. If the fields were not getting indexed, they woul

Re: How to write this query?

2011-09-08 Thread crisfromnova
You can try this: q=key:value1^8 key:value2^4 key:value3^2. It should work.
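A boosted query of this shape can be assembled programmatically, using field:value^boost for every clause. The sketch below is only an illustration: the helper name boosted_query is made up, and the field and values are the placeholders from the message above.

```python
from urllib.parse import urlencode


def boosted_query(field, boosts):
    """Join field:value^boost clauses with spaces, one clause per (value, boost) pair."""
    return " ".join(f"{field}:{value}^{boost}" for value, boost in boosts)


q = boosted_query("key", [("value1", 8), ("value2", 4), ("value3", 2)])
print(q)  # key:value1^8 key:value2^4 key:value3^2

# URL-encode the query before putting it on the request line.
print(urlencode({"q": q}))
```

How the space-separated clauses are combined (OR or AND) depends on the default operator configured for the query parser.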

How to order results by word position???

2011-09-08 Thread crisfromnova
Hi, I have a problem with Solr search. If I search for "vitamin" I receive: 1 - arrca MULTIVITAMIN FRUCHTSAFTBÄRCHEN 2 - VITAMIN E-KAPSELN NAT. 400 How can I configure the fieldtype or the query to order the documents found by the position of the word in them? ("VITAMIN E-KAPSELN NAT. 400" befo

Re: synonyms.txt: different results on admin and on site..

2011-09-08 Thread François Schiettecatte
Wildcard terms are not analyzed, so your synonyms.txt may come into play here, have you checked the analysis for deniz* ? François On Sep 7, 2011, at 10:08 PM, deniz wrote: > well yea you are right... i realised that lack of detail issue here... so > here it comes... > > > This is from my sche

Re: no result with q

2011-09-08 Thread elisabeth benoit
ok, I guess I found how q=49 OR * 2011/9/8 elisabeth benoit > > Hello, > > I have a query > > /select?&q=49&q.alt=*:*&fq=NAME_ANALYZED:decorasol AND WAY_ANALYZED:rue > charonne AND (TOWN_ANALYZED:paris OR > DEPARTMENT_ANALYZED:paris)&rows=50&fl=*,score& > > returning no answer because of the q=

Re: grouping by alpha-numeric field

2011-09-08 Thread Erick Erickson
First, this is a very sparse problem statement, please review: http://wiki.apache.org/solr/UsingMailingLists But my guess is that you perhaps changed your schema and didn't re-index completely. Use admin/schema browser to look at the field and see whether the values in the field are what you expe

Re: Reading results from FieldCollapsing

2011-09-08 Thread Erick Erickson
Glad you resolved it. I've spent more time than I'd like to admit trying to get an IDE to play nice! Erick On Wed, Sep 7, 2011 at 9:53 AM, Sowmya V.B. wrote: > Hi Erick > > It was indeed an eclipse issue! Thanks for saying that. > I cleaned the project, restarted eclipse and built again...and I

no result with q

2011-09-08 Thread elisabeth benoit
Hello, I have a query /select?&q=49&q.alt=*:*&fq=NAME_ANALYZED:decorasol AND WAY_ANALYZED:rue charonne AND (TOWN_ANALYZED:paris OR DEPARTMENT_ANALYZED:paris)&rows=50&fl=*,score& returning no answer because of the q=49 parameter. The query /select?&q=&q.alt=*:*&fq=NAME_ANALYZED:decorasol AND WA

Re: Spellcheck

2011-09-08 Thread tamanjit.bin...@yahoo.co.in
In the RequestHandler spellCheckCompRH you have to make changes as follows: explicit textSpell false true 5 spellcheck This hopefully should do it. Also I would suggest that for the time being (i.e. till you ge

Re: Unable to generate trace

2011-09-08 Thread Bernd Fehling
How about using jmap or jvisualvm? Or even connecting with eclipse to the process for live analysis? Am 08.09.2011 11:07, schrieb Rohit: Nope not getting anything here also. Regards, Rohit -Original Message- From: Jerry Li [mailto:zongjie...@gmail.com] Sent: 08 September 2011 08:09

Re: Sorting groups by numFound group size

2011-09-08 Thread Martijn v Groningen
No, as far as I know sorting by group count isn't planned. You can create an issue in Jira where future development of this feature can be tracked. On 7 September 2011 23:54, bobsolr wrote: > Hi Martijn, > > Thanks for the reply. Unfortunately I can't reference the group size using > a > functio

RE: Unable to generate trace

2011-09-08 Thread Rohit
Nope not getting anything here also. Regards, Rohit -Original Message- From: Jerry Li [mailto:zongjie...@gmail.com] Sent: 08 September 2011 08:09 To: solr-user@lucene.apache.org Subject: Re: Unable to generate trace what about kill -3 PID command? On Thu, Sep 8, 2011 at 4:06 PM, Rohit

Re: Unable to generate trace

2011-09-08 Thread Jerry Li
what about kill -3 PID command? On Thu, Sep 8, 2011 at 4:06 PM, Rohit wrote: > Hi, > > > > I am running solr in tomcat on a linux machine, my solr hangs after about 40 > hrs, I wanted to generate the dump and analyse the logs. But the command > kill -QUIT PID doesn't seem to be doing anything. >

Unable to generate trace

2011-09-08 Thread Rohit
Hi, I am running solr in tomcat on a linux machine, my solr hangs after about 40 hrs, I wanted to generate the dump and analyse the logs. But the command kill -QUIT PID doesn't seem to be doing anything. How can I generate a dump otherwise to see, why solr hangs? Regards, Rohit
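Note that kill -QUIT and kill -3 send the same signal (SIGQUIT), and the JVM prints the resulting thread dump to its standard output rather than to the application logs; under Tomcat that is usually catalina.out. As an alternative, the JDK's jstack tool can be run against the process. The sketch below is a hedged illustration that assumes jstack is on the PATH and is run as the same user as the Solr JVM. The helper names are hypothetical.

```python
import os
import signal
import subprocess


def jstack_command(pid, jstack_path="jstack"):
    """Command line for a thread dump with lock details (jstack -l <pid>)."""
    return [jstack_path, "-l", str(pid)]


def capture_thread_dump(pid, jstack_path="jstack"):
    """Run jstack and return the dump as text; raises if jstack fails."""
    result = subprocess.run(
        jstack_command(pid, jstack_path),
        capture_output=True,
        text=True,
        check=True,
    )
    return result.stdout


def request_dump_via_signal(pid):
    """Equivalent of kill -3 / kill -QUIT: the dump goes to the JVM's stdout."""
    os.kill(pid, signal.SIGQUIT)


# 12345 is a placeholder process id.
print(" ".join(jstack_command(12345)))  # jstack -l 12345
```

Capturing the dump through jstack avoids having to find where the servlet container redirects standard output.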

Re: Spellcheck

2011-09-08 Thread Danicela nutch
I already tried with field type = "textSpell" but it didn't work, I replaced it though. I found this code : default false false 1 spellcheck I put this in solrconfig.xml and used the query you gave me, there is the build line (like before) but I don't have any

Re: Question Query Detection Strategies?

2011-09-08 Thread tamanjit.bin...@yahoo.co.in
If you have meta-data available on what is actually searched on your site, you can use Mahout to get keywords, and then define your Solr setup accordingly. This might help: https://cwiki.apache.org/MAHOUT/collocations.html Alternatively, you can parse your query (either at client end or server end