Re: Read past EOF error due to broken connection

2011-06-22 Thread pravesh
First commit and then try the search again. You can also use Lucene's CheckIndex tool to check and fix your index (it may remove some corrupt segments from your index). Thanx Pravesh -- View this message in context:

Problem in accessing a variable's changed value outside of if block in javascript code

2011-06-22 Thread Romi
$("#submit").click(function(){
  var query = getquerystring(); // get the query string entered by user
  // get the JSON response from solr server
  var newquery = query;
  $.getJSON("http://192.168.1.9:8983/solr/db/select/?wt=json&start=0&rows=100&q=" + query + "&json.wrf=?;",

Re: Solr Clustering For Multiple Pages

2011-06-22 Thread nilay....@gmail.com
Thanks a lot. I was thinking I was not doing it the correct way. - Regards Nilay Tiwari -- View this message in context: http://lucene.472066.n3.nabble.com/Solr-Clustering-For-Multiple-Pages-tp3085507p3094379.html Sent from the Solr - User mailing list archive at Nabble.com.

Re: Solr Clustering For Multiple Pages

2011-06-22 Thread nilay....@gmail.com
Can you please tell me how I can apply a filter to clustered data in Solr? Currently I am storing the docid and topic name in a Map, getting the ids by topic from the Map, and then passing them to Solr joined by OR conditions. Is there any other way to do this? - Regards Nilay Tiwari -- View this

Re: Parse solr json object

2011-06-22 Thread lee carroll
Try this mailing list http://docs.jquery.com/Discussion or this doc http://api.jquery.com/jQuery.each/ On 21 June 2011 07:32, Romi romijain3...@gmail.com wrote: Hi, for enabling highlighting i want to parse json object. for readability i included xml format of that json object. please tell me

Re: Solr Clustering For Multiple Pages

2011-06-22 Thread Stanislaw Osinski
I don't quite follow, I must admit. Maybe it's faceting you're after? http://wiki.apache.org/solr/SolrFacetingOverview Staszek On Wed, Jun 22, 2011 at 08:40, nilay@gmail.com nilay@gmail.com wrote: Can you please tell me how can i apply filter in cluster data in Solr ? Currently i

strange utf-8 problem

2011-06-22 Thread ramires
I use Solr 4 trunk to index some sites with Nutch 1-2-rc4. When I try to index 300k documents with Solr 4 I get an error, but with Solr 1.4.1 there is no such problem. I installed Solr 4 on tomcat5,6 and jetty7,8 and there is no change. I use apache-solr-core-1.4.0.jar

Re: MultiValued facet behavior question

2011-06-22 Thread Dennis de Boer
Hi Bill, yes, you absolutely do make sense. I posted the exact same question to this mailing list (subject: faceting on multivalued fields), but got no response out of it. A friend of mine is now helping out. I hope someone on the list can give us some advice. I'll post our findings to this

Re: MultiValued facet behavior question

2011-06-22 Thread lee carroll
Can your front-end app normalize the q parameter, either with a drop-down or a type-ahead derived from the values in the specialties field? That way q will match value(s) in your facet results. I'm not sure what you are trying to achieve though, so maybe I'm off the mark. On 22 June 2011 04:37,

Re: MultiValued facet behavior question

2011-06-22 Thread lee carroll
Oh sorry forgot to also type: Often facet fields are not stemmed or heavily analysed. The facet values are from the index. On 22 June 2011 08:21, lee carroll lee.a.carr...@googlemail.com wrote: Can your front end app normalize the q parameter. Either with a drop down or a type a head derived

Re: MultiValued facet behavior question

2011-06-22 Thread Michael Kuhlmann
On 22.06.2011 05:37, Bill Bell wrote: It can get more complicated. Here is another example: q=cardiology&defType=dismax&qf=specialties (Cardiology and cardiologist are stems)... But I don't really know which value in Cardiologist match perfectly. Again, I only want it to return:

Re: Problem with field collapsing of patched Solr 1.4

2011-06-22 Thread Thalaiselvam
Hi, I am using Solr collapse. It works perfectly with the default sorting (score), but when we give more than one field in sort with pagination, it throws incorrect results. Could anyone help to solve this? Thanks in advance... Regards Thalaiselvam N -- View this message in context:

Re: MultiValued facet behavior question

2011-06-22 Thread Bill Bell
Here is an example using exampledocs and trunk 4.0: http://localhost:8983/solr/select/?q=cat:%22hard%20drive%22&version=2.2&start=0&rows=10&indent=on&facet=true&facet.field=cat&facet.query={!lucene}cat:%22hard%20drive%22&facet.mincount=1 Results: <result name="response" numFound="2" start="0"> Etc <lst

Re: MultiValued facet behavior question

2011-06-22 Thread Bill Bell
You can type q=cardiology and match on cardiologist. If stemming did not work you can just add a synonym: cardiology,cardiologist. But that is not the issue. The issue is around multiValue fields and facets. You would expect a user who is searching on the multiValued field to match on some values

Re: MultiValued facet behavior question

2011-06-22 Thread lee carroll
Hi Bill, can you explain a little bit more around why you need this. Knowing the motivation might suggest a different solution not just involving faceting. On 22 June 2011 08:49, Bill Bell billnb...@gmail.com wrote: You can type q=cardiology and match on cardiologist. If stemming did not work

Re: MultiValued facet behavior question

2011-06-22 Thread Dennis de Boer
Hi Bill, as far as I understand now, with the help of my friend, you can't. Multivalued fields don't work that way. You can however always filter the facet results manually in the JSP. You know what the user chose as a facet. The issue I ran into is when you have additional facet fields. For
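The manual filtering described above could be done anywhere on the client side. Here is a minimal sketch in Python rather than JSP, assuming the facet counts arrive in Solr's default JSON layout (a flat list alternating value and count). The function name `filter_facet_values` and the sample data are hypothetical, not taken from the thread.

```python
def filter_facet_values(flat_facets, selected):
    """Keep only the facet values the user actually selected.

    flat_facets: flat list such as ["Cardiologist", 12, "Family Doctor", 5],
                 as returned under facet_counts/facet_fields/<field> in JSON.
    selected:    iterable of facet values chosen by the user.
    """
    # Pair every value (even index) with its count (odd index).
    pairs = zip(flat_facets[0::2], flat_facets[1::2])
    # Compare case-insensitively, since user input may differ in case.
    wanted = {value.lower() for value in selected}
    return [(value, count) for value, count in pairs if value.lower() in wanted]


if __name__ == "__main__":
    facets = ["Cardiologist", 12, "Family Doctor", 5, "Internist", 3]
    print(filter_facet_values(facets, ["cardiologist"]))
```

This only hides values in the display. The counts themselves are still computed by Solr over all values of the multivalued field.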

Re: MultiValued facet behavior question

2011-06-22 Thread Michael Kuhlmann
On 22.06.2011 09:49, Bill Bell wrote: You can type q=cardiology and match on cardiologist. If stemming did not work you can just add a synonym: cardiology,cardiologist Okay, synonyms are the only way I can think of a realistic match. Stemming won't work on a facet field; you wouldn't get

Understanding query explain information

2011-06-22 Thread Alexander Ramos Jardim
Hi guys, I have some doubts about how to correctly understand the debugQuery output. I have a field named itemName in my index. This is a text field, just that. When I run a simple query, ?q=itemName:iPad , I end up with the following query result. Simply trying to understand why these strings

Re: char sets accepted via xml

2011-06-22 Thread Tom Gross
Hi, I also have this issue with Solr 3.2.0. It is probably this: https://issues.apache.org/jira/browse/SOLR-2381 Tom On 06/15/2011 02:09 PM, Mark Cunningham wrote: Hi, If you submit information to solr using xml, does the server assume you're using unicode encoded in utf8? And does it accept

Conflict in wildcard query and spellchecker in solr search

2011-06-22 Thread Romi
Using Solr search, when I search for rin* it runs a wildcard query and I get the result for ring, but when I search for Rin* it runs the spellchecker and then gives the result for ring. Why so? Please explain. - Thanks & Regards Romi -- View this message in context:

Re: Conflict in wildcard query and spellchecker in solr search

2011-06-22 Thread Markus Jelsma
Wildcard queries are not analyzed. Lowercase your query beforehand. On Wednesday 22 June 2011 14:08:48 Romi wrote: Using solr search when i search for rin* it run wildcard query and i get the result for ring but when i search for Rin* it run spellchecker and then gives the result for ring. why

Re: upgrading to Tika 0.9 on Solr 1.4.1

2011-06-22 Thread Surendra
Hi Chris, Andreas, I have upgraded to Solr 3.2 ... everything seems fine now. I will have to integrate this into my application and observe whether there are any further issues... again thanks for your patience and time... --Surendra

Re: MultiValued facet behavior question

2011-06-22 Thread lee carroll
Hi Bill, So that part works. Then when I output the facet, I need a different behavior than the default. I need The facet to only output the value that matches (scored) - NOT ALL VALUES in the multiValued field. I think it makes sense? Why do you need this ? If your use case is faceted

Re: Conflict in wildcard query and spellchecker in solr search

2011-06-22 Thread Romi
<fieldType name="text" class="solr.TextField" positionIncrementGap="100">
  <analyzer type="index">
    <tokenizer class="solr.WhitespaceTokenizerFactory"/>
    <filter class="solr.StopFilterFactory" ignoreCase="true" words="stopwords.txt"/>
    <filter class="solr.WordDelimiterFilterFactory"

Search is taking long-long time.

2011-06-22 Thread Mohammad Shariq
I am running two Solr shards. I have indexed 100 million docs in each shard (each is 50 GB and only 'id' is stored). My searches have become very slow, taking around 2-3 seconds. Below is my query: http://solrHost1:8080/solr/select?shards=solrHost1:8080/solr,solrHost2:8080/solr&q=

Re: Conflict in wildcard query and spellchecker in solr search

2011-06-22 Thread Markus Jelsma
No, wildcard queries are not analyzed. They are not _passed_ through your analyzers. If you lowercase at index-time, you must lowercase it outside of Solr before sending a query. On Wednesday 22 June 2011 14:35:12 Romi wrote: * fieldType name=text class=solr.TextField positionIncrementGap=100

Re: MultiValued facet behavior question

2011-06-22 Thread Dennis de Boer
Hi Lee, since I have the same problem, I might as well try to answer this question. You want this behaviour to make things clear for your users. If they select cardiologists, does it make sense to also show family doctors as a facet value to the user? The same thing goes for the facets that are

Re: Conflict in wildcard query and spellchecker in solr search

2011-06-22 Thread Romi
How can I lowercase the query outside of Solr before sending it? - Thanks & Regards Romi -- View this message in context: http://lucene.472066.n3.nabble.com/Conflict-in-wildcard-query-and-spellchecker-in-solr-search-tp3095198p3095345.html Sent from the Solr - User mailing list archive at
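One possible answer, sketched in Python under stated assumptions: since wildcard terms bypass the analyzers, the client can lowercase them before building the request URL. The helper names `lowercase_wildcard_terms` and `build_select_url` and the base URL are hypothetical, and the sketch assumes terms are separated by whitespace and that the field was lowercased at index time. It leaves field names and non-wildcard terms (such as AND/OR) untouched.

```python
from urllib.parse import urlencode


def lowercase_wildcard_terms(query):
    """Lowercase only the terms that contain a wildcard (* or ?)."""
    terms = []
    for term in query.split():
        if "*" in term or "?" in term:
            # Preserve a leading "field:" prefix; field names are case-sensitive.
            field, sep, value = term.partition(":")
            term = field + sep + value.lower() if sep else term.lower()
        terms.append(term)
    return " ".join(terms)


def build_select_url(base_url, user_query):
    """Build a select URL with the normalized query, URL-encoded."""
    params = {"q": lowercase_wildcard_terms(user_query), "wt": "json"}
    return base_url + "?" + urlencode(params)


if __name__ == "__main__":
    print(build_select_url("http://localhost:8983/solr/select", "Rin*"))
```

The same idea carries over to a browser client: apply the equivalent of `lower()` to the wildcard term before appending it to the request URL.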

Re: Understanding query explain information

2011-06-22 Thread lee carroll
Hi are you using synonyms ? On 22 June 2011 10:30, Alexander Ramos Jardim alexander.ramos.jar...@gmail.com wrote: Hi guys, I am getting some doubts about how to correctly understand the debugQuery output. I have a field named itemName in my index. This is a text field, just that. When I

Re: MultiValued facet behavior question

2011-06-22 Thread Mike Sokolov
On 06/22/2011 04:01 AM, Dennis de Boer wrote: Hi Bill, as far as I understood now, with the help of my friend, you can't. Multivalued fields don't work that way. You can however always filter the facet results manually in the JSP. You know what the user chose as a facet. Yes - that is the

Tika Jax-RS and DIH

2011-06-22 Thread Tod
Mattmann, Chris A (388J) chris.a.mattmann at jpl.nasa.gov writes: Hi Jo, You may consider checking out Tika trunk, where we recently have a Tika JAX-RS web service [1] committed as part of the tika-server module. You could probably wire DIH into it and accomplish the same thing.

Re: Search is taking long-long time.

2011-06-22 Thread Ahmet Arslan
I am running two solrShards. I have indexed 100 million docs in each shard ( each are 50 GB and only 'id' is stored). My search have became very slow. Its taking around 2-3 seconds. below is my query : http://solrHost1:8080/solr/select?shards=solrHost1:8080/solr,solrHost2:8080/solr&q=

Re: MultiValued facet behavior question

2011-06-22 Thread lee carroll
Hi Dennis, I think maybe I just disagree. You're not showing facet counts for Cardiologists and Family Doctors independently. The Family Doctor count will be all Family Doctors who are also Cardiologists. This allows users to further filter Cardiologists who are also Family Doctors. (this could be

Re: MultiValued facet behavior question

2011-06-22 Thread Dennis de Boer
Well, the use case is rather simple. It is not a use case but more a user experience. If I have a list of values I can facet on, for example: A B C D E, and I click on B, does it make sense for the user to display B C E after the selection? Just because items in B are C and E items as well? As

Re: Search is taking long-long time.

2011-06-22 Thread Mohammad Shariq
This is how my 'time' field looks in the schema: <field name="time" type="tint" indexed="true" stored="false"/> Also, I am doing frequent updates to Solr (every 5 minutes). On 22 June 2011 18:41, Ahmet Arslan iori...@yahoo.com wrote: I am running two solrShards. I have indexed 100 million docs in

RE: MultiValued facet behavior question

2011-06-22 Thread Bob Sandiford
Hi, Bill (and others). I post this for what it's worth - it's a very specialized resolution we wrote to a similar requirement that may help with your (and similar) requirements. Caveats abound [1] We're running 3.1. We wanted to be able to return facets which matched on our actual search,

Re: upgrading to Tika 0.9 on Solr 1.4.1

2011-06-22 Thread Mattmann, Chris A (388J)
Glad it worked out! Cheers, Chris On Jun 22, 2011, at 5:14 AM, Surendra wrote: Hi Chris ,Andreas I have upgraded to solr 3.2 ... everything seems fine now. I will have to integrate this to my application and observe if any further issues...again thanks for your patience and time...

Re: Weird optimize performance degradation

2011-06-22 Thread Santiago Bazerque
Thanks for your answers Erick & Mohammad! I'll get back to the list if I have more specific info about this issue, so far the index is performing normally again. Best, Santiago On Mon, Jun 20, 2011 at 9:29 AM, Erick Erickson erickerick...@gmail.com wrote: Hmmm, that is odd, anyone else want to

SEVERE: java.lang.NoSuchFieldError: core Solr branch3.x

2011-06-22 Thread Markus Jelsma
Hi, Today's checkout (Solr Specification Version: 3.4.0.2011.06.22.16.10.08) produces the exception below on start up. The same exception with a very similar stack trace occurs when committing and adding. Example schema and docs will reproduce the error. Jun 22, 2011 4:11:57 PM

size of synonyms.txt

2011-06-22 Thread Bernd Fehling
While trying some synonyms.txt files I noticed a huge increase of heap usage. synonyms_1.txt -- 6645 lines (2826104 bytes in size) results in 66364 entries in SynonymMap with 730MB heap usage. Startup time about 2 minutes. synonyms_2.txt -- 6645 lines (5384884 bytes in size) results in 115168

Re: rename a core to same name of existing core

2011-06-22 Thread Stefan Matheis
Koji, the Description on http://wiki.apache.org/solr/CoreAdmin#CREATE is: *quote* If a core with the same name exists, while the new created core is initalizing, the old one will continue to accept requests. Once it has finished, all new request will go to the new core, and the old core will be

Re: commit time and lock

2011-06-22 Thread Ranveer
Dear all, Kindly help me.. thanks On Tuesday 21 June 2011 11:46 AM, Jonty Rhods wrote: I am using solrj to index the data. I have around 5 docs indexed. As at the time of commit due to lock server stop giving response so I was calculating commit time: double starttemp =

Re: Read past EOF error due to broken connection

2011-06-22 Thread Anuj Kumar
Hi Pravesh, Thanks for your reply. I tried both the approaches- Commit fails with this exception- Exception in thread main org.apache.solr.common.SolrException: Severe errors in solr configuration. Check your log files for more detailed information on what may be wrong. If you want solr to

Re: MultiValued facet behavior question

2011-06-22 Thread Mike Sokolov
We always remove the facet filter when faceting: in other words, for a good user experience, you generally want to show facets based on the query excluding any restriction based on the facets. So in your example (facet B selected), we would continue to show *all* facets. Only if you performed
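Solr's tag/exclude local params are one way to get this behaviour: the selected value is sent as a tagged filter query, and the facet field excludes that tag so its counts ignore the selection. Whether this matches the setup described above is an assumption; the sketch below is in Python, and the function name `multiselect_facet_params`, the tag name `sel`, and the field and values are hypothetical.

```python
from urllib.parse import urlencode


def multiselect_facet_params(query, field, selected_value, tag="sel"):
    """Build request params that filter on a facet value but still
    compute that field's facet counts as if the filter were absent."""
    # Escape backslashes and double quotes so the value is a valid phrase.
    escaped = selected_value.replace("\\", "\\\\").replace('"', '\\"')
    return [
        ("q", query),
        ("facet", "true"),
        # Tagged filter query: restricts the result set.
        ("fq", '{!tag=%s}%s:"%s"' % (tag, field, escaped)),
        # Facet on the same field, excluding the tagged filter from counts.
        ("facet.field", "{!ex=%s}%s" % (tag, field)),
    ]


if __name__ == "__main__":
    params = multiselect_facet_params("cardiology", "specialties", "Cardiologist")
    print("http://localhost:8983/solr/select?" + urlencode(params))
```

With this shape, the result list is narrowed to the selected value while the facet list keeps showing every value of the field, so the user can switch selections without first clearing the filter.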

Re: rename a core to same name of existing core

2011-06-22 Thread Koji Sekiguchi
Stefan, I guess, same handling for other actions, like rename. I agree. Thank you for the pointer! koji (11/06/22 23:16), Stefan Matheis wrote: Koji, the Description on http://wiki.apache.org/solr/CoreAdmin#CREATE is: *quote* If a core with the same name exists, while the new created core

response time for pdf indexing

2011-06-22 Thread libnova
Hi! We are using Zend Search based on Lucene. Our queries on indexed PDF documents take longer than 2 seconds. We want to change to Solr to try to solve this problem. i. Can anyone tell me the response time for queries on PDF documents in Solr? ii. Can anyone tell me some strategies to

Re: Exception using Analyze from the Solr Admin app

2011-06-22 Thread karthik
Any help on this would be really appreciated. I just set up a brand new installation of Solr and still got this exception. I can see that this has something to do with the classpath, but I am not able to figure out exactly what is causing this issue. -- karthik On Mon, Jun 13, 2011 at 4:23 PM,

[Announce] Solr 3.2 with RankingAlgorithm

2011-06-22 Thread Nagendra Nagarajayya
Hi! I would like to announce the availability of Solr 3.2 with RankingAlgorithm. Please download and give the new version a try. This version of RankingAlgorithm exposes a lucene compatible api so almost all of the Solr features should work as it is. Note: NRT support will be available by

Re: ampersand, dismax, combining two fields, one of which is keywordTokenizer

2011-06-22 Thread Jonathan Rochkind
Yeah, I see your points. It's complicated. I'm not sure either. But the thing is: in order to use a feature like that you'd have to really think hard about the query analysis of your fields, and which ones will produce which tokens in which situations You need to think really hard about

Re: Exception using Analyze from the Solr Admin app

2011-06-22 Thread Stefan Matheis
Karthik, could you attach/pastebin your schema and also the text you're trying to analyze? Regards Stefan On Wed, Jun 22, 2011 at 5:29 PM, karthik kmoha...@gmail.com wrote: any help on this would be really appreciated. i just setup a totally brand new setup of solr still got this exception

Re: MultiValued facet behavior question

2011-06-22 Thread Darren Govoni
How is that different from doing a field search and just counting the results? If you only want the facet of the searched term (input), then why not just combine that with the result count and use that? Facets are more useful when you _don't_ know the distribution of values across a result set

Re: size of synonyms.txt

2011-06-22 Thread Robert Muir
On Wed, Jun 22, 2011 at 10:14 AM, Bernd Fehling bernd.fehl...@uni-bielefeld.de wrote: While trying some synonyms.txt files I noticed a huge increase of heap usage. synonyms_1.txt -- 6645 lines (2826104 bytes in size) results in 66364 entries in SynonymMap with 730MB heap usage. Startup time

Re: size of synonyms.txt

2011-06-22 Thread Darren Govoni
I once tried to load wordnet synsets as a synonym file and it was prohibitively slow and unusable. fyi. On 06/22/2011 12:23 PM, Robert Muir wrote: On Wed, Jun 22, 2011 at 10:14 AM, Bernd Fehling bernd.fehl...@uni-bielefeld.de wrote: While trying some synonyms.txt files I noticed a huge

Re: MultiValued facet behavior question

2011-06-22 Thread Jonathan Rochkind
Okay, so since you put cardiologist in the 'q', you only want facet values that have 'cardiologist' (or 'Cardiologist') to show up in the facet list. In general, there's no good way to do that. But. If you want to do some client-side processing before you submit the query to Solr, and on

Re: size of synonyms.txt

2011-06-22 Thread Bernd Fehling
On Wed, Jun 22, 2011 at 10:14 AM, Bernd Fehling bernd.fehl...@uni-bielefeld.de wrote: While trying some synonyms.txt files I noticed a huge increase of heap usage. synonyms_1.txt -- 6645 lines (2826104 bytes in size) results in 66364 entries in SynonymMap with 730MB heap usage.

RE: response time for pdf indexing

2011-06-22 Thread Steven A Rowe
Hi Rode, Have you seen http://wiki.apache.org/solr/SolrPerformanceFactors ? Steve -Original Message- From: Rode González (libnova) [mailto:r...@libnova.es] Sent: Wednesday, June 22, 2011 11:30 AM To: solr-user@lucene.apache.org Cc: dan...@silvereme.com; Gonzalo Iglesias; Leo;

Re: Exception using Analyze from the Solr Admin app

2011-06-22 Thread karthik
Thanks for offering to help, Stefan. I just resolved the issue. It was some crazy thing within Tomcat (I still need to find out what it was). I just backed up my old Tomcat installation, created a new instance of Tomcat, and deployed my Solr installation in there; everything started working fine

Re: MultiValued facet behavior question

2011-06-22 Thread Darren Govoni
Yeah, I agree with that last statement. It seems to me that the use case where it _might_ matter is where you have a query for MORE than one. q=cardiologist OR family and in that case, it MIGHT be useful to separate the facets in a XOR sense where you don't get cross-pollution. But the

Re: MultiValued facet behavior question

2011-06-22 Thread Gino Rodrigues
An interesting live scenario for this matter: http://www.bondfaro.com.br/ (Brazilian site) The query ipad returns results spread across many categories (links on the left, teasers in the center). The Tablet category (facet) is one of them. The query tablet does exactly the same as clicking

Re: SEVERE: java.lang.NoSuchFieldError: core Solr branch3.x

2011-06-22 Thread Yonik Seeley
I just tried branch_3x and couldn't reproduce this. Looks like maybe there is something wrong with your build, or some old class files left over somewhere being picked up. -Yonik http://www.lucidimagination.com On Wed, Jun 22, 2011 at 10:15 AM, Markus Jelsma markus.jel...@openindex.io wrote:

sorting by termfreq on trunk doesn't work?

2011-06-22 Thread Jason Toy
I am trying to use sorting by the termfreq function using the trunk code since termfreq was added in the 4.0 code base. I run this query: http://127.0.0.1:8983/solr/select/?q=librarian&sort=termfreq(all_lists_text,librarian)%20desc but I get: HTTP ERROR 500 Problem accessing /solr/select/.

Re: sorting by termfreq on trunk doesn't work?

2011-06-22 Thread Yonik Seeley
Thanks for the problem report. It turns out we didn't check for a null pointer when there were no terms in a field for a segment. I've just committed a fix to trunk. -Yonik http://www.lucidimagination.com On Wed, Jun 22, 2011 at 10:28 PM, Jason Toy jason...@gmail.com wrote: I am trying to

Re: Search is taking long-long time.

2011-06-22 Thread pravesh
Were your searches always slow, or only since you made some changes at the index/config/schema level? Is it due to the 5-minute index updates? Are you warming your searches? Thanx Pravesh -- View this message in context: http://lucene.472066.n3.nabble.com/Search-is-taking-long-long-time-tp3095306p3098552.html

Query time noun, verb boosting

2011-06-22 Thread Pooja Verlani
Hi, At the query time, I want to make the lucene query such that it should boost only the noun from the query or some concept existing in the index. Are there any possibilities or any possible ideas that can be worked around? Regards, Pooja

Re: Query time noun, verb boosting

2011-06-22 Thread Anshum
What do you mean by 'noun or some concept'? It would be better if you could give a more concrete example. For detecting parts of speech you could use a lot of libraries, but I didn't understand the part about boosting terms from the index. -- Anshum Gupta http://ai-cafe.blogspot.com On Thu, Jun 23, 2011

Re: Query time noun, verb boosting

2011-06-22 Thread Pooja Verlani
Hi, Say for example, for a query like manmohan singh dancing, I would prefer to make the nouns a compulsory condition for the search, while any verb isn't important for me: I would prefer to get results for manmohan singh and not for dancing. If I can extract noun verb or can get to know that in my