Re: Solr 4.0 UI issue
Thank you for reply, I didn't get you. What do you mean by saying: so you may check the (raw) log output ... Please help! Thank you in advance Tom Greece -- View this message in context: http://lucene.472066.n3.nabble.com/Solr-4-0-UI-issue-tp3993286p3993427.html Sent from the Solr - User mailing list archive at Nabble.com.
Problem while indexing XML file with special characters represented uuml
Dear community, I am experiencing strange problem while trying to index / to import XML document to SOLR via DataImportHandler. The XML document contains some special characters (e.g. german ü) that are represented as XML entities uuml; or auml;. There is also DTD file that defines these entities (!ENTITY uuml#252; ) (I tried to use dtd file as well as to include the DTD definition to the xml itself). After I start the import command full-import, the import process throws an exception as soon as it tries to parse uuml;: Un declared general entity uuml. Did anyone already face such a problem? best regards, Michael My data-config for importing is: dataConfig dataSource type=FileDataSource encoding=ISO-8859-1 / document !-- stream should be true since huge xml document is being parsed -- entity name=article processor=XPathEntityProcessor stream=true forEach=/dblp/article url=documents/dblp.xml field column=keyxpath=/dblp/article/@key / field column=title xpath=/dblp/article/title / /entity /document /dataConfig The XML file looks e.g. like this: ?xml version=1.0 encoding=ISO-8859-1? !DOCTYPE dblp [ !ENTITY uuml#252; !-- small u, dieresis or umlaut mark -- ] dblp article key=journals/fm/Riccardi09 mdate=2011-10-27 authorMarco Riccardi/author titleSolution of Cubic and Quartic Equations.uuml;/title pages117-122/pages year2009/year volume17/volume journalFormalized Mathematics/journal number1-4/number eehttp://dx.doi.org/10.2478/v10037-009-0012-z/eeurldb/journals/fm/fm17.html#Riccardi09/url /article/dblp The stack-trace is: 05.07.2012 17:37:19 org.apache.solr.update.processor.LogUpdateProcessor finish INFO: {deleteByQuery=*:*,add=[persons/Codd71a, persons/Hall74]} 0 1 05.07.2012 17:37:19 org.apache.solr.common.SolrException log SCHWERWIEGEND: Full Import failed:java.lang.RuntimeException: java.lang.RuntimeE xception: org.apache.solr.handler.dataimport.DataImportHandlerException: Parsing failed for xml, url:documents/dblp.xml rows processed in this xml:2 last row in this xml:{title=Common Subexpression Identification in General Algebraic System s., $forEach=/dblp/article, key=persons/Hall74} Processing Document # 3 at org.apache.solr.handler.dataimport.DocBuilder.execute(DocBuilder.java :264) at org.apache.solr.handler.dataimport.DataImporter.doFullImport(DataImpo rter.java:375) at org.apache.solr.handler.dataimport.DataImporter.runCmd(DataImporter.j ava:445) at org.apache.solr.handler.dataimport.DataImporter$1.run(DataImporter.ja va:426) Caused by: java.lang.RuntimeException: org.apache.solr.handler.dataimport.DataIm portHandlerException: Parsing failed for xml, url:documents/dblp.xml rows proces sed in this xml:2 last row in this xml:{title=Common Subexpression Identificatio n in General Algebraic Systems., $forEach=/dblp/article, key=persons/Hall74} Pro cessing Document # 3 at org.apache.solr.handler.dataimport.DocBuilder.buildDocument(DocBuilde r.java:621) at org.apache.solr.handler.dataimport.DocBuilder.doFullDump(DocBuilder.j ava:327) at org.apache.solr.handler.dataimport.DocBuilder.execute(DocBuilder.java :225) ... 3 more Caused by: org.apache.solr.handler.dataimport.DataImportHandlerException: Parsin g failed for xml, url:documents/dblp.xml rows processed in this xml:2 last row i n this xml:{title=Common Subexpression Identification in General Algebraic Syste ms., $forEach=/dblp/article, key=persons/Hall74} Processing Document # 3 at org.apache.solr.handler.dataimport.DataImportHandlerException.wrapAnd Throw(DataImportHandlerException.java:72) at org.apache.solr.handler.dataimport.XPathEntityProcessor$3.next(XPathE ntityProcessor.java:504) at org.apache.solr.handler.dataimport.XPathEntityProcessor$3.next(XPathE ntityProcessor.java:517) at org.apache.solr.handler.dataimport.EntityProcessorBase.getNext(Entity ProcessorBase.java:120) at org.apache.solr.handler.dataimport.XPathEntityProcessor.fetchNextRow( XPathEntityProcessor.java:225) at org.apache.solr.handler.dataimport.XPathEntityProcessor.nextRow(XPath EntityProcessor.java:204) at org.apache.solr.handler.dataimport.EntityProcessorWrapper.pullRow(Ent ityProcessorWrapper.java:330) at org.apache.solr.handler.dataimport.EntityProcessorWrapper.nextRow(Ent ityProcessorWrapper.java:296) at org.apache.solr.handler.dataimport.DocBuilder.buildDocument(DocBuilde r.java:683) at org.apache.solr.handler.dataimport.DocBuilder.buildDocument(DocBuilde r.java:619) ... 5 more Caused by: java.lang.RuntimeException: com.ctc.wstx.exc.WstxParsingException: Un declared general entity uuml at [row,col {unknown-source}]: [26,42] at
Re: Solr 4.0 UI issue
restart ur browser and solr if u r using example and do check u haven't removed the line requestHandler name=/admin/ class=solr.admin. AdminHandlers /” from solr config same worked for me. On Fri, Jul 6, 2012 at 1:21 PM, anarchos78 rigasathanasio...@hotmail.comwrote: Thank you for reply, I didn't get you. What do you mean by saying: so you may check the (raw) log output ... Please help! Thank you in advance Tom Greece -- View this message in context: http://lucene.472066.n3.nabble.com/Solr-4-0-UI-issue-tp3993286p3993427.html Sent from the Solr - User mailing list archive at Nabble.com. -- Thanks Regards Sachin Aggarwal 7760502772
Better (and valid) Spellcheck in combination with other parameters with at least one occurance
Hi, I am trying to implement solr search with spellcheck. - My current seach works like this - I have some specific criteria for every search query. i.e. If I am hitting search with q=restaurants , I also pass another param say c=mumbai. So solr returns me restaurants in mumbai. Now I want to implement spellcheck, but at the same time I also want to make sure that whatever results my spellcheck is providing, are valid (means have at least one occurance in combination with my other param as in c=city). I am not able to decide how to achieve that. i.e. If I pass say hangry to solr with spell check then spell check retuerns few suggestions like hungry, angry However I want to suggest user only those suggestions which have hitcounts in my data with common field as in c=mumbai. Otherwise what happens is - solr returns me some suggestion words hungry , angry and if they dont have ny records with combination of city, it returns no result, which is bad user experience. so ideally Solr should return me suggestion for only those words which have at least 1 count for that suggestion with mumbai. I am currently firing 2 queries to luscene in order to achieve this Query one - spellcheck - this gives me word suggestns Query two - with given words in spell check i run facet query to get occurance counts to check which ones are valid. But this seems like unnecessary over head (Query two - with facet takes too long to respond as well). And I am trying to find more optimized way to do this. Can anyone suggest me how to do that ? thanks, ndesai -- View this message in context: http://lucene.472066.n3.nabble.com/Better-and-valid-Spellcheck-in-combination-with-other-parameters-with-at-least-one-occurance-tp3993484.html Sent from the Solr - User mailing list archive at Nabble.com.
Re: solr-4.0 and high cpu usage [SOLVED]
Thanks for bringing closure on this, it'll probably help others too! Erick On Thu, Jul 5, 2012 at 12:28 PM, Anatoli Matuskova anatoli.matusk...@gmail.com wrote: Found why! On Solr 1.4 dismax param mm defaults to 1 if not specified, which is equivalent to AND. On Solr 4.0 if mm is not specified, the default operator is used, which defaults to OR. That made return much more results for each query I was running, increasing the response time and the CPU usage. -- View this message in context: http://lucene.472066.n3.nabble.com/solr-4-0-and-high-cpu-usage-tp3993187p3993275.html Sent from the Solr - User mailing list archive at Nabble.com.
How to Request several docs ?
Dear Solr users, I would like to request/get several docs indexed by solr with only one request. I have a schema.xml where my field PN is the key field (unique key), I have more than 80M docs in my index. I have a list of PN that I want to get and I don't want to do one request by PN and I think it's not clean to do PN1 or PN2 or PN3 or . Is it possible to do a request with PN1, PN2, PN3, etc... PNn Where n can be 20 or 30 by example. Thanks a lot, Bruno
Re: Solr 4.0 UI issue
Didn't helped -- View this message in context: http://lucene.472066.n3.nabble.com/Solr-4-0-UI-issue-tp3993286p3993507.html Sent from the Solr - User mailing list archive at Nabble.com.
Re: How to Request several docs ?
Le 6 juil. 2012 à 15:43, Bruno Mannina a écrit : I have a list of PN that I want to get and I don't want to do one request by PN and I think it's not clean to do PN1 or PN2 or PN3 or . I've always done this so. paul
RE: Better (and valid) Spellcheck in combination with other parameters with at least one occurance
If you're using Solr3.1 or higher, you can do this. See http://wiki.apache.org/solr/SpellCheckComponent#spellcheck.collate . Here's a summary: - specify spellcheck.collate=true to get a re-written query made from the individual word suggestions. - specify spellcheck.maxCollationTries to something 0 (10, perhaps) to have it try the collation possibilities against the index before returning them to the user. This means that all collations returned will be guaranteed to return hits. - specify spellcheck.collateExtendedResults=true if you want hit counts for the collation queries and also details on which original word was replaced by which new word. - specify spellcheck.maxCollations to something 1 if you want to get more than 1 collation returned. If on 4.0-alpha: - maybe specify spellcheck.collateParam.mm=100%, if your original query had a very low mm value. This will make the collations returned more meaningful to the user. - maybe specify spellcheck.alternativeTermCount to something 0 if you want the spellchecker to consider that the user might have misspelled words even though the misspelling occurs somewhere in the index. James Dyer E-Commerce Systems Ingram Content Group (615) 213-4311 -Original Message- From: ninaddesai82 [mailto:desai.ni...@gmail.com] Sent: Friday, July 06, 2012 6:04 AM To: solr-user@lucene.apache.org Subject: Better (and valid) Spellcheck in combination with other parameters with at least one occurance Hi, I am trying to implement solr search with spellcheck. - My current seach works like this - I have some specific criteria for every search query. i.e. If I am hitting search with q=restaurants , I also pass another param say c=mumbai. So solr returns me restaurants in mumbai. Now I want to implement spellcheck, but at the same time I also want to make sure that whatever results my spellcheck is providing, are valid (means have at least one occurance in combination with my other param as in c=city). I am not able to decide how to achieve that. i.e. If I pass say hangry to solr with spell check then spell check retuerns few suggestions like hungry, angry However I want to suggest user only those suggestions which have hitcounts in my data with common field as in c=mumbai. Otherwise what happens is - solr returns me some suggestion words hungry , angry and if they dont have ny records with combination of city, it returns no result, which is bad user experience. so ideally Solr should return me suggestion for only those words which have at least 1 count for that suggestion with mumbai. I am currently firing 2 queries to luscene in order to achieve this Query one - spellcheck - this gives me word suggestns Query two - with given words in spell check i run facet query to get occurance counts to check which ones are valid. But this seems like unnecessary over head (Query two - with facet takes too long to respond as well). And I am trying to find more optimized way to do this. Can anyone suggest me how to do that ? thanks, ndesai -- View this message in context: http://lucene.472066.n3.nabble.com/Better-and-valid-Spellcheck-in-combination-with-other-parameters-with-at-least-one-occurance-tp3993484.html Sent from the Solr - User mailing list archive at Nabble.com.
Re: Solr facet multiple constraint
What does doesn't work mean? returning no results? Not returning facets? returning incorrect facet counts? You might review: http://wiki.apache.org/solr/UsingMailingLists Best Erick On Fri, Jul 6, 2012 at 1:37 AM, davidbougearel david.bougea...@smile-benelux.com wrote: Well thanks for your answer, in fact i've written what the QueryResponse return as the solr query here is my real solr query before use the executeQuery : q=service%3A1+AND+publicationstatus%3ALIVEsort=publishingdate+descfq=%7B%21ex%3Ddt%7D%28%28%28user%3A10%29%29%29facet.field=%7B%21tag%3Ddt%7Duserfacet=truefacet.mincount=1 which is the same as my first post without the 'wt=javabin' and instead of commas. Could you please see if there is something wrong for you ? Best regards, David. -- View this message in context: http://lucene.472066.n3.nabble.com/Solr-facet-multiple-constraint-tp3992974p3993408.html Sent from the Solr - User mailing list archive at Nabble.com.
Re: Better (and valid) Spellcheck in combination with other parameters with at least one occurance
I happen to remember this JIRA: https://issues.apache.org/jira/browse/SOLR-2462 be a bit careful if you use collate with 3.1 or 3.2... Best Erick On Fri, Jul 6, 2012 at 10:36 AM, Dyer, James james.d...@ingrambook.com wrote: If you're using Solr3.1 or higher, you can do this. See http://wiki.apache.org/solr/SpellCheckComponent#spellcheck.collate . Here's a summary: - specify spellcheck.collate=true to get a re-written query made from the individual word suggestions. - specify spellcheck.maxCollationTries to something 0 (10, perhaps) to have it try the collation possibilities against the index before returning them to the user. This means that all collations returned will be guaranteed to return hits. - specify spellcheck.collateExtendedResults=true if you want hit counts for the collation queries and also details on which original word was replaced by which new word. - specify spellcheck.maxCollations to something 1 if you want to get more than 1 collation returned. If on 4.0-alpha: - maybe specify spellcheck.collateParam.mm=100%, if your original query had a very low mm value. This will make the collations returned more meaningful to the user. - maybe specify spellcheck.alternativeTermCount to something 0 if you want the spellchecker to consider that the user might have misspelled words even though the misspelling occurs somewhere in the index. James Dyer E-Commerce Systems Ingram Content Group (615) 213-4311 -Original Message- From: ninaddesai82 [mailto:desai.ni...@gmail.com] Sent: Friday, July 06, 2012 6:04 AM To: solr-user@lucene.apache.org Subject: Better (and valid) Spellcheck in combination with other parameters with at least one occurance Hi, I am trying to implement solr search with spellcheck. - My current seach works like this - I have some specific criteria for every search query. i.e. If I am hitting search with q=restaurants , I also pass another param say c=mumbai. So solr returns me restaurants in mumbai. Now I want to implement spellcheck, but at the same time I also want to make sure that whatever results my spellcheck is providing, are valid (means have at least one occurance in combination with my other param as in c=city). I am not able to decide how to achieve that. i.e. If I pass say hangry to solr with spell check then spell check retuerns few suggestions like hungry, angry However I want to suggest user only those suggestions which have hitcounts in my data with common field as in c=mumbai. Otherwise what happens is - solr returns me some suggestion words hungry , angry and if they dont have ny records with combination of city, it returns no result, which is bad user experience. so ideally Solr should return me suggestion for only those words which have at least 1 count for that suggestion with mumbai. I am currently firing 2 queries to luscene in order to achieve this Query one - spellcheck - this gives me word suggestns Query two - with given words in spell check i run facet query to get occurance counts to check which ones are valid. But this seems like unnecessary over head (Query two - with facet takes too long to respond as well). And I am trying to find more optimized way to do this. Can anyone suggest me how to do that ? thanks, ndesai -- View this message in context: http://lucene.472066.n3.nabble.com/Better-and-valid-Spellcheck-in-combination-with-other-parameters-with-at-least-one-occurance-tp3993484.html Sent from the Solr - User mailing list archive at Nabble.com.
Re: Better (and valid) Spellcheck in combination with other parameters with at least one occurance
anyone ?? -- View this message in context: http://lucene.472066.n3.nabble.com/Better-and-valid-Spellcheck-in-combination-with-other-parameters-with-at-least-one-occurance-tp3993484p3993498.html Sent from the Solr - User mailing list archive at Nabble.com.
Re: How to Request several docs ?
Hi Bruno, Check this http://localhost:8983/solr/select?q=uniqueKey:(pin1 pin2) On Fri, Jul 6, 2012 at 7:13 PM, Bruno Mannina bmann...@free.fr wrote: Dear Solr users, I would like to request/get several docs indexed by solr with only one request. I have a schema.xml where my field PN is the key field (unique key), I have more than 80M docs in my index. I have a list of PN that I want to get and I don't want to do one request by PN and I think it's not clean to do PN1 or PN2 or PN3 or . Is it possible to do a request with PN1, PN2, PN3, etc... PNn Where n can be 20 or 30 by example. Thanks a lot, Bruno
Re: Multi-thread UpdateProcessor
Okay, why do you think this idea is not worth to look at? On Fri, Jul 6, 2012 at 12:53 AM, Mikhail Khludnev mkhlud...@griddynamics.com wrote: Hello, Most times when single thread streaming http://wiki.apache.org/solr/Solrj#Streaming_documents_for_an_update is used I saw lack of cpu utilization at Solr server. Resonable motivation is utilize more threads to index faster, but it requires more complicated client side. I propose to employ special update processor which can fork the stream processing onto many threads. If you like it pls vote for https://issues.apache.org/jira/browse/SOLR-3585 . Regards -- Sincerely yours Mikhail Khludnev Tech Lead Grid Dynamics http://www.griddynamics.com mkhlud...@griddynamics.com -- Sincerely yours Mikhail Khludnev Tech Lead Grid Dynamics http://www.griddynamics.com mkhlud...@griddynamics.com
Re: How to Request several docs ?
Hi Gheeta, Sorry but I don't understand, I suppose uniqueKey for me is pn field and (pin1 pin2) are values. but if it's that, it not works. I do: http://localhost:8983/solr/select/?q=pn:%28EP100A1%20FR2963608A1%29version=2.2start=0rows=10indent=on no error but no result ! Le 06/07/2012 18:35, geetha anjali a écrit : Hi Bruno, Check this http://localhost:8983/solr/select?q=uniqueKey:(pin1 pin2) On Fri, Jul 6, 2012 at 7:13 PM, Bruno Mannina bmann...@free.fr wrote: Dear Solr users, I would like to request/get several docs indexed by solr with only one request. I have a schema.xml where my field PN is the key field (unique key), I have more than 80M docs in my index. I have a list of PN that I want to get and I don't want to do one request by PN and I think it's not clean to do PN1 or PN2 or PN3 or . Is it possible to do a request with PN1, PN2, PN3, etc... PNn Where n can be 20 or 30 by example. Thanks a lot, Bruno
Re: Better (and valid) Spellcheck in combination with other parameters with at least one occurance
Be a little patient, we're all volunteers here. On Fri, Jul 6, 2012 at 8:55 AM, ninaddesai82 desai.ni...@gmail.com wrote: anyone ?? -- View this message in context: http://lucene.472066.n3.nabble.com/Better-and-valid-Spellcheck-in-combination-with-other-parameters-with-at-least-one-occurance-tp3993484p3993498.html Sent from the Solr - User mailing list archive at Nabble.com.
Re: How to Request several docs ?
It should. So, add debugQuery=on and examine the results. Also, we should see the field definition and results of the above. You might have your default operator set to AND rather than OR (adding the debug param should make this plain).. If your pn is a string type, then it will be case sensitive. I'm assuming that you've tried it with a single value rather than two, right? The admin/schema browser link will allow you to examine the actual contents of the field in your index, you might look at your pn field to see what the values look like there, and if they're what you expect. Consider reviewing: http://wiki.apache.org/solr/UsingMailingLists, you haven't provided much information to go on Best Erick On Fri, Jul 6, 2012 at 12:43 PM, Bruno Mannina bmann...@free.fr wrote: Hi Gheeta, Sorry but I don't understand, I suppose uniqueKey for me is pn field and (pin1 pin2) are values. but if it's that, it not works. I do: http://localhost:8983/solr/select/?q=pn:%28EP100A1%20FR2963608A1%29version=2.2start=0rows=10indent=on no error but no result ! Le 06/07/2012 18:35, geetha anjali a écrit : Hi Bruno, Check this http://localhost:8983/solr/select?q=uniqueKey:(pin1 pin2) On Fri, Jul 6, 2012 at 7:13 PM, Bruno Mannina bmann...@free.fr wrote: Dear Solr users, I would like to request/get several docs indexed by solr with only one request. I have a schema.xml where my field PN is the key field (unique key), I have more than 80M docs in my index. I have a list of PN that I want to get and I don't want to do one request by PN and I think it's not clean to do PN1 or PN2 or PN3 or . Is it possible to do a request with PN1, PN2, PN3, etc... PNn Where n can be 20 or 30 by example. Thanks a lot, Bruno
Re: How to Request several docs ?
Le 06/07/2012 19:26, Erick Erickson a écrit : It should. So, add debugQuery=on and examine the results. Also, we should see the field definition and results of the above. You might have your default operator set to AND rather than OR (adding the debug param should make this plain).. ok I not yet tested but solrQueryParser defaultOperator=AND/ ;o) If your pn is a string type, then it will be case sensitive. I'm assuming that you've tried it with a single value rather than two, right? yes with one I have result, I take care about Upper/Lower case The admin/schema browser link will allow you to examine the actual contents of the field in your index, you might look at your pn field to see what the values look like there, and if they're what you expect. Consider reviewing: http://wiki.apache.org/solr/UsingMailingLists, you haven't provided much information to go on sorry, I will be more explicit if I have a new message. Best Erick On Fri, Jul 6, 2012 at 12:43 PM, Bruno Mannina bmann...@free.fr wrote: Hi Gheeta, Sorry but I don't understand, I suppose uniqueKey for me is pn field and (pin1 pin2) are values. but if it's that, it not works. I do: http://localhost:8983/solr/select/?q=pn:%28EP100A1%20FR2963608A1%29version=2.2start=0rows=10indent=on no error but no result ! Le 06/07/2012 18:35, geetha anjali a écrit : Hi Bruno, Check this http://localhost:8983/solr/select?q=uniqueKey:(pin1 pin2) On Fri, Jul 6, 2012 at 7:13 PM, Bruno Mannina bmann...@free.fr wrote: Dear Solr users, I would like to request/get several docs indexed by solr with only one request. I have a schema.xml where my field PN is the key field (unique key), I have more than 80M docs in my index. I have a list of PN that I want to get and I don't want to do one request by PN and I think it's not clean to do PN1 or PN2 or PN3 or . Is it possible to do a request with PN1, PN2, PN3, etc... PNn Where n can be 20 or 30 by example. Thanks a lot, Bruno
Re: How to Request several docs ?
Le 06/07/2012 19:37, Bruno Mannina a écrit : Le 06/07/2012 19:26, Erick Erickson a écrit : It should. So, add debugQuery=on and examine the results. Also, we should see the field definition and results of the above. You might have your default operator set to AND rather than OR (adding the debug param should make this plain).. ok I not yet tested but solrQueryParser defaultOperator=AND/ ;o) Changed, Tested and results are here ! Thanks a lot !
Re: Regression of JIRA 1826?
A little more information on this. I tinkered a bit with the schema and it appears to be related to WordDelimiterFilterFactory and splitOnCaseChange being true, or at least this setting being set exhibits the issue. Also I am using the edismax query parser. Again any ideas/help would be greatly appreciated. On Fri, Jul 6, 2012 at 1:40 AM, Jamie Johnson jej2...@gmail.com wrote: I just upgraded to trunk to try to fix an issue I was having with the highlighter described in JIRA 1826, but it appears that this issue still exists on trunk. I'm running the following query subject:ztest* subject is a text field (not multivalued) and the return in highlighting is emZTest/emForemZTestForJamie/em the actual stored value is ZTestForJamie. Is anyone else experiencing this?
Re: Boosting the score of the whole documents
: I would like to give a boost to the whole documents as I index them. I am : sending to solr the xml in the form: : : adddoc boost=2.0/doc/add : : But it does't seem to alter the search scores in any way. I would expect http://wiki.apache.org/solr/SolrRelevancyFAQ#How_can_I_increase_the_score_for_specific_documents http://wiki.apache.org/solr/UpdateXmlMessages#Optional_attributes_on_.22doc.22 http://lucene.apache.org/core/3_6_0/api/core/org/apache/lucene/search/Similarity.html#formula_norm -Hoss
Re: Multi-thread UpdateProcessor
Mikhail, you have my +1 and a jira comment :) // Dmitry On Fri, Jul 6, 2012 at 7:41 PM, Mikhail Khludnev mkhlud...@griddynamics.com wrote: Okay, why do you think this idea is not worth to look at? On Fri, Jul 6, 2012 at 12:53 AM, Mikhail Khludnev mkhlud...@griddynamics.com wrote: Hello, Most times when single thread streaming http://wiki.apache.org/solr/Solrj#Streaming_documents_for_an_update is used I saw lack of cpu utilization at Solr server. Resonable motivation is utilize more threads to index faster, but it requires more complicated client side. I propose to employ special update processor which can fork the stream processing onto many threads. If you like it pls vote for https://issues.apache.org/jira/browse/SOLR-3585 . Regards -- Sincerely yours Mikhail Khludnev Tech Lead Grid Dynamics http://www.griddynamics.com mkhlud...@griddynamics.com -- Sincerely yours Mikhail Khludnev Tech Lead Grid Dynamics http://www.griddynamics.com mkhlud...@griddynamics.com -- Regards, Dmitry Kan
Grouping and Stats
Hello - I’m not sure If this is an appropriate use for Solr, but I want to stay away from a typical DB store for high availability reasons. I am storing documents that may have a common value for a field we’ll call “category”. In another field there will be an integer field we’ll call “rating”. I would like to group the documents on the “category” field and display the average “rating” per group. The stats component lets me get the avg rating, but when I collapse the results into groups it gives me the average for the entire collection, rather than for the specific group. Am I going about this wrong? Is it possible to get the desired outcome with a single query? I’d appreciate any insight! Thank you, Jeremy Branham Software Engineer http://LinkedIn.com/in/JeremyBranham http://jeremybranham.wordpress.com/ http://Zeroth.biz
Nrt and caching
Sorry I'm a bit new to the nrt stuff in solr but I'm trying to understand the implications of frequent commits and cache rebuilding and auto warming. What are the best practices surrounding nrt searching and caches and query performance. Thanks! Amit