facet total score instead of total count

2010-07-28 Thread Bharat Jain
Hi, I have a requirement where I want to sum up the scores of the faceted fields. This will decide the relevancy for us. Is there a way to do it on a facet field? Basically, instead of giving the count of records for a facet field, I would like to have the total sum of scores for those records. Any

logic required for newbie

2010-07-28 Thread Jonty Rhods
Hi All, I am very new and learning solr. I have 10 column like following in table 1. id 2. name 3. user_id 4. location 5. country 6. landmark1 7. landmark2 8. landmark3 9. landmark4 10. landmark5 when user search for landmark then I want to return only one landmark which match. Rest of the

Re: question about relevance

2010-07-28 Thread Bharat Jain
Well you are correct Erik that this is a database-ish thing try to achieve in solr and unfortunately the sin :) had been committed by somebody else :) and now we are running into relevancy issues. Let me try to state the problem more casually. 1. There are user records of type A, B, C etc.

Re: Any tips/guidelines to tuning the Solr/Lucene performance in a master/slave/sharding environment

2010-07-28 Thread Tommaso Teofili
Hi, I think the starting point should be : http://wiki.apache.org/solr/SolrPerformanceFactors For example you could start playing with the mergeFactor parameter. My 2 cents, Tommaso 2010/7/27 Chengyang atreey...@163.com How to reduce the index files size, decrease the sync time between each

Re: Extracting PDF text/comment/callout/typewriter boxes with Solr CELL/Tika/PDFBox

2010-07-28 Thread Tommaso Teofili
I attached a patch for Solr 1.4.1 release on https://issues.apache.org/jira/browse/SOLR-1902 that made things work for me. This strange behaviour for me was due to the fact that I copied the patched jars and war inside the dist directory but forgot to update the war inside the example/webapps

Re: SpatialSearch: sorting by distance

2010-07-28 Thread Pavel Minchenkov
Does anybody know if this feature works correctly? Or I'm doing something wrong? 2010/7/27 Pavel Minchenkov char...@gmail.com Hi, I'm trying to sort by distance like this: sort=dist(2,lat,lon,55.755786,37.617633) asc In general results are sorted, but some documents are not in right
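One way to check the ordering by hand is to recompute the distances on the client. The sketch below assumes that dist(2,lat,lon,55.755786,37.617633) is the plain Euclidean (2-norm) distance between the stored point and the reference point; the documents and their coordinates are made up for illustration and do not come from the thread.

```python
import math

def euclidean_dist(lat, lon, ref_lat, ref_lon):
    # 2-norm distance in raw degree space (assumed to mirror dist(2, ...)).
    return math.hypot(lat - ref_lat, lon - ref_lon)

def sort_by_distance(docs, ref_lat, ref_lon):
    # Ascending order, like "sort=dist(...) asc".
    return sorted(
        docs,
        key=lambda d: euclidean_dist(d["lat"], d["lon"], ref_lat, ref_lon),
    )

# Hypothetical documents; only the reference point comes from the message.
docs = [
    {"id": "a", "lat": 59.93, "lon": 30.33},
    {"id": "b", "lat": 55.75, "lon": 37.62},
    {"id": "c", "lat": 56.32, "lon": 44.00},
]

ordered = sort_by_distance(docs, 55.755786, 37.617633)
print([d["id"] for d in ordered])
```

If the order returned by Solr differs from a recomputation like this, the field values actually indexed for lat and lon are probably the first thing to inspect. Note that a Euclidean distance over degrees is not a great-circle distance, so "nearest" here may not match geographic intuition.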

Re: Integration Problem

2010-07-28 Thread Jörg Wißmeier
Nobody out there who can help me with this problem? I need to edit the result of the javabin writer (adding the results from the webservice). I hope it is possible to do that. thanks in advance. Am Mo 26.07.2010 10:25 schrieb Jörg Wißmeier : Hi everybody, since a while i'm working with solr

solr log file rotation

2010-07-28 Thread Christos Constantinou
Hi all, I am running a Solr 1.4 instance on FreeBSD that generates large log files in very short periods. I used /etc/newsyslog to configure log file rotation, however once the log file is rotated then Solr doesn't write logs to the new file. I'm wondering if there is a way to let Solr know

Re: Spellchecking and frequency

2010-07-28 Thread dan sutton
Hi Mark, Thanks for that info looks very interesting, would be great to see your code. Out of interest did you use the dictionary and the phonetic file? Did you see better results with both? In regards to the secondary part to check the corpus for matching suggestions, would another way to do

Re: Indexing Problem: Where's my data?

2010-07-28 Thread Chantal Ackermann
make sure to set stored=true on every field you expect to be returned in your results for later display. Chantal

Re: DIH : SQL query (sub-entity) is executed although variable is not set (null or empty list)

2010-07-28 Thread Chantal Ackermann
Hi Lance! On Wed, 2010-07-28 at 02:31 +0200, Lance Norskog wrote: Should this go into the trunk, or does it only solve problems unique to your use case? The solution is generic but is an extension of XPathEntityProcessor because I didn't want to touch the solr.war. This way I can deploy the

Solr using 1500 threads - is that normal?

2010-07-28 Thread Christos Constantinou
Hi, Solr seems to be crashing after a JVM exception that new threads cannot be created. I am writing in hope of advice from someone that has experienced this before. The exception that is causing the problem is: Exception in thread btpool0-5 java.lang.OutOfMemoryError: unable to create new

Re: Strange search

2010-07-28 Thread stockii
try to delete solr.SnowballPorterFilterFactory from your analyzerchain. i had similar problems by using german SnowballPorterFilterFactory

SolrJ Response + JSON

2010-07-28 Thread MitchK
Hello community, I need to transform SolrJ - responses into JSON, after some computing on those results by another application has finished. I can not do those computations on the Solr - side. So, I really have to translate SolrJ's output into JSON. Any experiences how to do so without

Get unique values

2010-07-28 Thread Rafal Bluszcz Zawadzki
Hi, In my schema I have (inter alia) fields CollectionID and CollectionName. These two values always match together, which means that for every value of CollectionID there is a matching value of CollectionName. I am interested in a query which allows me to get unique values of CollectionID with

Highlighted match snippets highlight non-matched words (such as 0.1 and 0.2)

2010-07-28 Thread Jon Cram
Hi, I'm observing some strange highlighted words in field value snippets returned from Solr when matched term highlighting (http://wiki.apache.org/solr/HighlightingParameters) is enabled. In some cases, highlighted field value snippets contain highlighted words that are not matches: -

Re: clustering component

2010-07-28 Thread Stanislaw Osinski
The patch should also work with trunk, but I haven't verified it yet. I've just added a patch against solr trunk to https://issues.apache.org/jira/browse/SOLR-1804. S.

Show elevated Result Differently

2010-07-28 Thread Vishal.Arora
I want to show elevated Result Different from others is there any way to do this

Re: SolrJ Response + JSON

2010-07-28 Thread Mark Allan
I think you should just be able to add wt=json to the end of your query (or change whatever the existing wt parameter is in your URL). Mark On 28 Jul 2010, at 12:54 pm, MitchK wrote: Hello community, I need to transform SolrJ - responses into JSON, after some computing on those results

SolrJ Response + JSON

2010-07-28 Thread MitchK
Hello , Second try to send a mail to the mailing list... I need to translate SolrJ's response into JSON-response. I can not query Solr directly, because I need to do some math with the responsed data, before I show the results to the client. Any experiences how to translate SolrJ's response

Re: SolrJ Response + JSON

2010-07-28 Thread Mark Allan
On 28 Jul 2010, at 2:08 pm, MitchK wrote: Second try to send a mail to the mailing list... Your first attempt got through as well. Here's my original response. I think you should just be able to add wt=json to the end of your query (or change whatever the existing wt parameter is in your

Re: SolrJ Response + JSON

2010-07-28 Thread Markus Jelsma
Hi, I got a response to your e-mail in my box 30 minutes ago. Anyway, enable the JSONResponseWriter, if you haven't already, and query with wt=json. Can't get much easier. Cheers, On Wednesday 28 July 2010 15:08:26 MitchK wrote: Hello , Second try to send a mail to the mailing list...

Re: SolrJ Response + JSON

2010-07-28 Thread MitchK
Thank you Markus, Mark. Seems to be a problem with Nabble, not with the mailing list. Sorry. I can create a JSON-response, when I query Solr directly. But I mean, that I query Solr through a SolrJ-client (CommonsHttpSolrServer). That means my queries look a little bit like that:

RE: Extracting PDF text/comment/callout/typewriter boxes with Solr CELL/Tika/PDFBox

2010-07-28 Thread David Thibault
Yesterday I did get this working with version 4.0 from trunk. I haven't fully tested it yet, but the content doesn't come through blank anymore, so that's good. Would it be more stable to stick with 1.4.1 and your patch to get to Tika 0.8, or to stick with the 4.0 trunk version? Best, Dave

Re: Extracting PDF text/comment/callout/typewriter boxes with Solr CELL/Tika/PDFBox

2010-07-28 Thread Alessandro Benedetti
In my opinion, the 1.4.1 version with the Patch is more Stable. Until 4.0 will be released 2010/7/28 David Thibault dthiba...@esperion.com Yesterday I did get this working with version 4.0 from trunk. I haven't fully tested it yet, but the content doesn't come through blank anymore, so

Re: SolrJ Response + JSON

2010-07-28 Thread Chantal Ackermann
You could use org.apache.solr.handler.JsonLoader. That one uses org.apache.noggit.JSONParser internally. I've used the JacksonParser with Spring. http://json.org/ lists parsers for different programming languages. Cheers, Chantal On Wed, 2010-07-28 at 15:08 +0200, MitchK wrote: Hello ,

RE: Solr 3.1 and ExtractingRequestHandler resulting in blank content

2010-07-28 Thread David Thibault
If you don't store the content then you can't do highlighting, right? Also, don't you just have to switch the text field to say stored=true in your schema to store the text? I don't understand why you're differentiating the behavior of ExtractingRequestHandler from the behavior of Solr in

RE: Extracting PDF text/comment/callout/typewriter boxes with Solr CELL/Tika/PDFBox

2010-07-28 Thread David Thibault
Thanks, I'll try that then. I kind of figured that'd be the answer, but after fighting with Solr ExtractingRequestHandler for 2 days I also just wanted to be done with it once it started working with 4.0...=) However, stability would be better in the long run. Best, Dave -Original

Re: logic required for newbie

2010-07-28 Thread rajini maski
you can index each of these field separately... field1- Id field2- name field3-user_id field4-country. field7- landmark While querying you can specify q=Landmark9 This will return you results.. And if you want only particular fields in output.. use the fl parameter in query... like

Re: Extracting PDF text/comment/callout/typewriter boxes with Solr CELL/Tika/PDFBox

2010-07-28 Thread Tommaso Teofili
This was my same feeling :-) and so I went for the trunk to have things working quickly, but I also have to consider which one is the best version since I am going to deploy it in the near future in an enterprise environment and choosing the best version is an important step. I am quite new to Solr

simple question from a newbie

2010-07-28 Thread Nguyen, Vincent (CDC/OSELS/NCPHI) (CTR)
Hi, I'm new to Solr and have a rather dumb question. I want to do a query that returns all the Titles that start with a certain letter. For example I have these titles: Results of in-mine research in support Cancer Reports State injury indicators report Cancer Reports Indexed dermal

Re: SolrJ Response + JSON

2010-07-28 Thread MitchK
Thank you, Chantal. I have looked at this one: http://www.json.org/java/index.html This seems to be an easy-to-understand-implementation. However, I am wondering how to determine whether a SolrDocument's field is multiValued or not. The JSONResponseWriter of Solr looks at the

display solr result in JSP

2010-07-28 Thread Ma, Xiaohui (NIH/NLM/LHC) [C]
I am new for solr. Just got example xml file index and search by following solr tutorial. I wonder how I can get the search result display in a JSP. I really appreciate any suggestions you can give. Thanks so much, Xiaohui

Re: SolrJ Response + JSON

2010-07-28 Thread Chantal Ackermann
Hi Mitch On Wed, 2010-07-28 at 16:38 +0200, MitchK wrote: Thank you, Chantal. I have looked at this one: http://www.json.org/java/index.html This seems to be an easy-to-understand-implementation. However, I am wondering how to determine whether a SolrDocument's field is multiValued or

Re: logic required for newbie

2010-07-28 Thread Jonty Rhods
Hi thanks for reply.. Actually requirement is different (sorry if I am unable to clarify in first mail). basically following are the fields name in schema as well: 1. id 2. name 3. user_id 4. location 5. country 6. landmark1 7. landmark2 8. landmark3 9. landmark4 10. landmark5 which

Re: display solr result in JSP

2010-07-28 Thread Ranveer
Hi, very simple to display value in jsp. if you are using solrj then simply store value in bean from java class and can display. same thing you can do in servlet too.. get the solr server response and return in bean or can display directly(in servlet). hope you will able to do. regards

Re: simple question from a newbie

2010-07-28 Thread Ranveer
I think you using wild-card search or should use wild-card search. but first of all please provide the schema and configuration file for more details. regards Ranveer On Wednesday 28 July 2010 07:51 PM, Nguyen, Vincent (CDC/OSELS/NCPHI) (CTR) wrote: Hi, I'm new to Solr and have a rather

RE: Indexing Problem: Where's my data?

2010-07-28 Thread Michael Griffiths
Thanks - but my schema.xml is not recognizing field names specified in the data-config.xml. For example - and I just tested this now - if I have in my data-config.xml: <field column="product_id" name="pid" /> And then in my schema.xml: <field name="pid" type="int" indexed="true" stored="true" required="true"

Re: Spellchecking and frequency

2010-07-28 Thread Jonathan Rochkind
I therefore wrote an implementation of SolrSpellChecker that wraps jazzy, the java aspell library. I also extended the SpellCheckComponent to take the matrix of suggested words and query the corpus to find the first combination of suggestions which returned a match. This works well for my use

Re: Is there a cache for a query?

2010-07-28 Thread Moazzam Khan
As far as I know all searches get cached at least for some time. I am not sure about field collapse results being cached. - Moazzam http://moazzam-khan.com On Mon, Jul 26, 2010 at 9:48 PM, Li Li fancye...@gmail.com wrote: I want a cache to cache all result of a query(all steps including

Re: SolrJ Response + JSON

2010-07-28 Thread MitchK
Hi Chantal, thank you for the feedback. I did not see the wood for the trees! The SolrDocument's javadoc says the following: http://lucene.apache.org/solr/api/org/apache/solr/common/SolrDocument.html |*getFieldValue
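The same idea can be sketched outside SolrJ. The following is a Python analogy, not the SolrJ API: it assumes each document has already been turned into a plain dict in which a multi-valued field holds a list (or tuple) and a single-valued field holds a scalar. The function names and the sample document are hypothetical.

```python
import json

def is_multi_valued(value):
    # Assumption: multi-valued fields arrive as a list or tuple of values.
    return isinstance(value, (list, tuple))

def docs_to_json(docs):
    # Emit multi-valued fields as JSON arrays and everything else as scalars.
    normalised = [
        {name: list(v) if is_multi_valued(v) else v for name, v in doc.items()}
        for doc in docs
    ]
    return json.dumps({"docs": normalised}, sort_keys=True)

# Hypothetical document: "tags" is multi-valued, "id" is not.
sample = [{"id": "1", "tags": ("solr", "json")}]
encoded = docs_to_json(sample)
print(encoded)
```

Inspecting the runtime type of the value only tells you what one document contains. A multi-valued field that happens to hold a single value may still need to be written as an array if the client expects a consistent shape, which is why the schema is the more reliable source.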

RE: simple question from a newbie

2010-07-28 Thread Nguyen, Vincent (CDC/OSELS/NCPHI) (CTR)
I think I got it to work. If I do a wildcard search using the dc3.title field it seems to work fine (dc3.title:c*). The dc.title:c* returns every title that has a word in it that starts with 'c', which isn't exactly what I wanted. I'm guessing it's because of the type=caseInsensitiveSort.
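The difference between the two fields can be illustrated without Solr. The sketch below assumes that one field is matched as a single lowercased string while the other is split into word tokens before matching; the three titles are taken loosely from the earlier message, and where one title ends and the next begins is an assumption.

```python
def prefix_match_whole(title, prefix):
    # Untokenized behaviour: the prefix must start the whole title.
    return title.lower().startswith(prefix.lower())

def prefix_match_tokens(title, prefix):
    # Tokenized behaviour: the prefix may start any word in the title.
    return any(tok.startswith(prefix.lower()) for tok in title.lower().split())

titles = [
    "Results of in-mine research in support",
    "Cancer Reports",
    "State injury indicators report",
]

whole = [t for t in titles if prefix_match_whole(t, "r")]
tokens = [t for t in titles if prefix_match_tokens(t, "r")]
print(whole)
print(tokens)
```

With the prefix "r", the whole-string match keeps only the title that begins with "Results", while the token match returns all three titles because each contains some word starting with "r".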

Solr 1.4.1 field collapse

2010-07-28 Thread Moazzam Khan
Hi guys, I read somewhere that Solr 1.4.1 has field collapse support by default (without patching it) but I haven't been able to confirm it. Is this true? - Moazzam

Re: slave index is bigger than master index

2010-07-28 Thread Muneeb Ali
Well I do have disk limitations too, and thats why I think slave nodes died, when replicating data from master node. (as it was just adding on top of existing index files). :: What do you mean here? Optimizing is too CPU expensive? What I meant by avoid playing around with slave nodes is that

Re: slave index is bigger than master index

2010-07-28 Thread Muneeb Ali
In solrconfig.xml, these two lines control that. Maybe they need to be increased. <str name="httpConnTimeout">5000</str> <str name="httpReadTimeout">1</str> Where do I add those in solrconfig? These lines don't seem to be present in the example solrconfig file... -- View this message in

How do NOT queries work?

2010-07-28 Thread Kaan Meralan
I wonder how do NOT queries work. Is it a pass on the result set and filtering out the NOT property or something like that? Also is there anybody who does some performance checks on NOT queries? I want to know whether there is a significant performance degradation or not when you have NOT in a

RE: display solr result in JSP

2010-07-28 Thread Ma, Xiaohui (NIH/NLM/LHC) [C]
Thanks so much for your reply. I don't have much experience at JSP. I found tag library, and am trying to use <xsltlib:apply xml="<%= url.getContent().toString() %>" xsl="/xsl/result.xsl"/> . Unfortunately I didn't get it work. Would you please give me more information? I really appreciate your

Re: Total number of terms in an index?

2010-07-28 Thread Jason Rutherglen
Tom, The total number of terms... Ah well, not a big deal, however yes the flex branch does expose this so we can show this in Solr at some point, hopefully outside of Solr's Luke impl. On Tue, Jul 27, 2010 at 9:27 AM, Burton-West, Tom tburt...@umich.edu wrote: Hi Jason, Are you looking for

RE: Extracting PDF text/comment/callout/typewriter boxes with Solr CELL/Tika/PDFBox

2010-07-28 Thread David Thibault
Tommasso, I used your patch and tried it with the 1.4.1 solr.war from a fresh 1.4.1 distribution, and it still gave me that NoSuchMethodError. However, when I tried it with the newly-patched-and-compiled apache-solr-1.4.2-dev.war file it works. I think I tried that before and it didn't work.

Re: Total number of terms in an index?

2010-07-28 Thread Jonathan Rochkind
At first I was thinking the TermsComponent might give you this, but oddly it seems not to. http://wiki.apache.org/solr/TermsComponent

RE: How to 'filter' facet results

2010-07-28 Thread Nagelberg, Kallin
ManBearPig is still a threat. -Kallin Nagelberg -Original Message- From: Jonathan Rochkind [mailto:rochk...@jhu.edu] Sent: Tuesday, July 27, 2010 7:44 PM To: solr-user@lucene.apache.org Subject: RE: How to 'filter' facet results Is there a way to tell Solr to only return a specific

Problem with field collapsing

2010-07-28 Thread Moazzam Khan
Hi All, Whenever I use field collapse, the numFound attribute contains exactly as many rows as I put in rows parameter instead of returning total number of documents that matched the query. Is there a way to rectify this? Thanks, Moazzam

Re: SolrCore has a large number of SolrIndexSearchers retained in infoRegistry

2010-07-28 Thread skommuri
Hi, It didn't seem like it improved the situation. The same exception stack traces are found. I have explicitly defined the index readers to be reopened by specifying in the solrconfig.xml The exception occurs when the remote cores are being searched. I am attaching the exceptions in a text

Re: Using Solr to perform range queries in Dspace

2010-07-28 Thread Chris Hostetter
: I'm trying to use dspace to search across a range of index created and stored : using Dsindexer.java class. I have seen where Solr can be use to perform I've never heard of Dsindexer.java but since this is the first result google returns...

Know which terms are in a document

2010-07-28 Thread Max Lynch
I would like to search against my index, and then *know* which of a set of given terms were found in each document. For example, let's say I want to show articles with the word pizza or cake in them, but would like to be able to say which of those two was found. I might use this to handle the

Re: Show elevated Result Differently

2010-07-28 Thread Erick Erickson
Please expand on what this means, it's quite vague. You might review: http://wiki.apache.org/solr/UsingMailingLists Best Erick On Wed, Jul 28, 2010 at 8:43 AM, Vishal.Arora vis...@value-one.com wrote: I want to show elevated Result Different from others is there any way to do this -- View

Re: simple question from a newbie

2010-07-28 Thread Erick Erickson
What is the query you submit (don't forget debugQuery=on)? In particular, what field are you sorting on? But yes, if you're searching on a tokenized field, you'll get matches on all tokens in that field. Which are probably single words. And no matter how you sort, you're still getting documents

Re: Solr using 1500 threads - is that normal?

2010-07-28 Thread dc tech
1,500 threads seems extreme by any standards so there is something happening in your install. Even with appservers for web apps, typically 100 would be a fair # of threads. On 7/28/10, Christos Constantinou ch...@simpleweb.co.uk wrote: Hi, Solr seems to be crashing after a JVM exception that

Re: Solr using 1500 threads - is that normal?

2010-07-28 Thread Erick Erickson
Your commits are very suspect. How often are you making changes to your index? Do you have autocommit on? Do you commit when updating each document? Committing too often and consequently firing off warmup queries is the first place I'd look. But I agree with dc tech, 1,500 is wy more than I

Re: WordDelimiterFilter and phrase queries?

2010-07-28 Thread Chris Hostetter
: pos token offset : 1 3 0-1 : 2 diphenyl 2-10 : 3 propanoic 11-20 : 3 diphenylpropanoic 2-20 : Say someone enters the query string 3-diphenylpropanoic : : The query parser I'm using transforms this into a phrase query and the : indexed form is missed because based the positions of the terms '3'

Re: Scoring Search for autocomplete

2010-07-28 Thread Chris Hostetter
You weren't really clear on how you are generating your autocomplete results -- ie: via TermsComponent on your main index? or via a search on a custom index where each document is a word to suggested? Assuming the later, then the approach you describe below sounds good to me, but it doesn't

Help with schema design

2010-07-28 Thread Pramod Goyal
Hi, I have a use case where i get a document and a list of events that has happened on the document. For example First document: Some text content Events: Event TypeEvent By Event Time Update Pramod 06062010 2:30:00 Update Raj 06062010 2:30:00

Is solr able to merge index on different nodes

2010-07-28 Thread Chengyang
When I want to create a large index, can I split the index across different nodes and then merge all the indexes onto one node? Any further suggestions for this case?

Re: logic required for newbie

2010-07-28 Thread rajini maski
First of all I hope that in schema you have mentioned for fields indexed=true and stored=true... Next if you have done so... and now just search as q=landmark:piza... you will get one result set only.. Note : There is one constraint about applying analyzers and tokenizers... IF you apply white

Re: SolrJ Response + JSON

2010-07-28 Thread rajini maski
Yeah right... This query will do it http://localhost:8090/solr/select/?q=*:*&version=2.2&start=0&rows=10&indent=on&wt=json This will do your work... This is more like using xsl transformation supported by solr..:) Regards, Rajani Maski On Wed, Jul 28, 2010 at 6:24 PM, Mark Allan
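A query URL like the one above is easy to get wrong when the parameters are glued together by hand. As a minimal sketch, assuming the host, port, and path from the message, the standard library can assemble it; build_select_url is a hypothetical helper name, not part of Solr or SolrJ.

```python
from urllib.parse import urlencode

def build_select_url(base_url, **params):
    # Join parameters with "&"; keep "*" and ":" readable so q=*:* stays as is.
    return base_url + "?" + urlencode(params, safe="*:")

url = build_select_url(
    "http://localhost:8090/solr/select/",
    q="*:*",
    version="2.2",
    start=0,
    rows=10,
    indent="on",
    wt="json",  # asks Solr for the JSON response writer
)
print(url)
```

This only covers the case where Solr is queried over HTTP directly. It does not help when the results come back through a SolrJ client and have to be serialized afterwards, which is the situation MitchK describes.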