Re: Solr 4.0 UI issue

2012-07-06 Thread anarchos78
Thank you for reply,

I didn't get you. What do you mean by saying: so you may check the (raw)
log output ... Please help!
Thank you in advance
Tom
Greece

--
View this message in context: 
http://lucene.472066.n3.nabble.com/Solr-4-0-UI-issue-tp3993286p3993427.html
Sent from the Solr - User mailing list archive at Nabble.com.


Problem while indexing XML file with special characters represented uuml

2012-07-06 Thread Michael Belenki
Dear community,

I am experiencing strange problem while trying to index / to import XML
document to SOLR via DataImportHandler. The XML document contains some
special characters (e.g. german ü) that are represented as XML entities
uuml; or auml;. There is also DTD file that defines these entities
(!ENTITY uuml#252; ) (I tried to use dtd file as well as to
include the DTD definition to the xml itself). After I start the import
command full-import, the import process throws an exception as soon as it
tries to parse uuml;: Un
declared general entity uuml. Did anyone already face such a problem? 

best regards,

Michael


My data-config for importing is:


dataConfig
dataSource type=FileDataSource encoding=ISO-8859-1 /
document
!--  stream should be true since huge xml document is being 
parsed --
entity name=article
processor=XPathEntityProcessor
stream=true
forEach=/dblp/article
url=documents/dblp.xml


field column=keyxpath=/dblp/article/@key /
field column=title xpath=/dblp/article/title /


   /entity
/document
/dataConfig

The XML file looks e.g. like this:

?xml version=1.0 encoding=ISO-8859-1?

!DOCTYPE dblp [

!ENTITY uuml#252; !-- small u, dieresis or umlaut mark --
]
dblp

article key=journals/fm/Riccardi09 mdate=2011-10-27
authorMarco Riccardi/author
titleSolution of Cubic and Quartic Equations.uuml;/title
pages117-122/pages
year2009/year
volume17/volume

journalFormalized Mathematics/journal

number1-4/number
eehttp://dx.doi.org/10.2478/v10037-009-0012-z/eeurldb/journals/fm/fm17.html#Riccardi09/url
/article/dblp

The stack-trace is:

05.07.2012 17:37:19 org.apache.solr.update.processor.LogUpdateProcessor
finish
INFO: {deleteByQuery=*:*,add=[persons/Codd71a, persons/Hall74]} 0 1
05.07.2012 17:37:19 org.apache.solr.common.SolrException log
SCHWERWIEGEND: Full Import failed:java.lang.RuntimeException:
java.lang.RuntimeE
xception: org.apache.solr.handler.dataimport.DataImportHandlerException:
Parsing
 failed for xml, url:documents/dblp.xml rows processed in this xml:2 last
row in
 this xml:{title=Common Subexpression Identification in General Algebraic
System
s., $forEach=/dblp/article, key=persons/Hall74} Processing Document # 3
at
org.apache.solr.handler.dataimport.DocBuilder.execute(DocBuilder.java
:264)
at
org.apache.solr.handler.dataimport.DataImporter.doFullImport(DataImpo
rter.java:375)
at
org.apache.solr.handler.dataimport.DataImporter.runCmd(DataImporter.j
ava:445)
at
org.apache.solr.handler.dataimport.DataImporter$1.run(DataImporter.ja
va:426)
Caused by: java.lang.RuntimeException:
org.apache.solr.handler.dataimport.DataIm
portHandlerException: Parsing failed for xml, url:documents/dblp.xml rows
proces
sed in this xml:2 last row in this xml:{title=Common Subexpression
Identificatio
n in General Algebraic Systems., $forEach=/dblp/article,
key=persons/Hall74} Pro
cessing Document # 3
at
org.apache.solr.handler.dataimport.DocBuilder.buildDocument(DocBuilde
r.java:621)
at
org.apache.solr.handler.dataimport.DocBuilder.doFullDump(DocBuilder.j
ava:327)
at
org.apache.solr.handler.dataimport.DocBuilder.execute(DocBuilder.java
:225)
... 3 more
Caused by: org.apache.solr.handler.dataimport.DataImportHandlerException:
Parsin
g failed for xml, url:documents/dblp.xml rows processed in this xml:2 last
row i
n this xml:{title=Common Subexpression Identification in General Algebraic
Syste
ms., $forEach=/dblp/article, key=persons/Hall74} Processing Document # 3
at
org.apache.solr.handler.dataimport.DataImportHandlerException.wrapAnd
Throw(DataImportHandlerException.java:72)
at
org.apache.solr.handler.dataimport.XPathEntityProcessor$3.next(XPathE
ntityProcessor.java:504)
at
org.apache.solr.handler.dataimport.XPathEntityProcessor$3.next(XPathE
ntityProcessor.java:517)
at
org.apache.solr.handler.dataimport.EntityProcessorBase.getNext(Entity
ProcessorBase.java:120)
at
org.apache.solr.handler.dataimport.XPathEntityProcessor.fetchNextRow(
XPathEntityProcessor.java:225)
at
org.apache.solr.handler.dataimport.XPathEntityProcessor.nextRow(XPath
EntityProcessor.java:204)
at
org.apache.solr.handler.dataimport.EntityProcessorWrapper.pullRow(Ent
ityProcessorWrapper.java:330)
at
org.apache.solr.handler.dataimport.EntityProcessorWrapper.nextRow(Ent
ityProcessorWrapper.java:296)
at
org.apache.solr.handler.dataimport.DocBuilder.buildDocument(DocBuilde
r.java:683)
at
org.apache.solr.handler.dataimport.DocBuilder.buildDocument(DocBuilde
r.java:619)
... 5 more
Caused by: java.lang.RuntimeException:
com.ctc.wstx.exc.WstxParsingException: Un
declared general entity uuml
 at [row,col {unknown-source}]: [26,42]
at

Re: Solr 4.0 UI issue

2012-07-06 Thread Sachin Aggarwal
restart ur browser and solr if u r using example and do check u haven't
removed the line
requestHandler name=/admin/ class=solr.admin.
AdminHandlers /”

from solr config same worked for me.

On Fri, Jul 6, 2012 at 1:21 PM, anarchos78 rigasathanasio...@hotmail.comwrote:

 Thank you for reply,

 I didn't get you. What do you mean by saying: so you may check the (raw)
 log output ... Please help!
 Thank you in advance
 Tom
 Greece

 --
 View this message in context:
 http://lucene.472066.n3.nabble.com/Solr-4-0-UI-issue-tp3993286p3993427.html
 Sent from the Solr - User mailing list archive at Nabble.com.




-- 

Thanks  Regards

Sachin Aggarwal
7760502772


Better (and valid) Spellcheck in combination with other parameters with at least one occurance

2012-07-06 Thread ninaddesai82
Hi,
I am trying to implement solr search with spellcheck. - 

My current seach works like this - I have some specific criteria for every
search query. i.e. If I am hitting search with q=restaurants , I also pass
another param say c=mumbai. So solr returns me restaurants in mumbai.

Now I want to implement spellcheck, but at the same time I also want to make
sure that whatever results my spellcheck is providing, are valid (means have
at least one occurance in combination with my other param as in c=city). I
am not able to decide how to achieve that.

i.e. 
If I pass say hangry to solr with spell check then spell check retuerns
few suggestions like hungry, angry
However I want to suggest user only those suggestions which have hitcounts
in my data with common field as in c=mumbai.
Otherwise what happens is - solr returns me some suggestion words hungry ,
angry and if they dont have ny records with combination of city, it returns
no result, which is bad user experience.

so ideally Solr should return me suggestion for only those words which have
at least 1 count for that suggestion with mumbai. 

I am currently firing 2 queries to luscene in order to achieve this 
Query one - spellcheck - this gives me word suggestns
Query two - with given words in spell check i run facet query to get
occurance counts to check which ones are valid.
But this seems like unnecessary over head (Query two - with facet takes too
long to respond as well). And I am trying to find more optimized way to do
this.

Can anyone suggest me how to do that ?

thanks,
ndesai

--
View this message in context: 
http://lucene.472066.n3.nabble.com/Better-and-valid-Spellcheck-in-combination-with-other-parameters-with-at-least-one-occurance-tp3993484.html
Sent from the Solr - User mailing list archive at Nabble.com.


Re: solr-4.0 and high cpu usage [SOLVED]

2012-07-06 Thread Erick Erickson
Thanks for bringing closure on this, it'll probably help others too!

Erick

On Thu, Jul 5, 2012 at 12:28 PM, Anatoli Matuskova
anatoli.matusk...@gmail.com wrote:
 Found why!
 On Solr 1.4 dismax param mm defaults to 1 if not specified, which is
 equivalent to AND. On Solr 4.0 if mm is not specified, the default operator
 is used, which defaults to OR. That made return much more results for each
 query I was running, increasing the response time and the CPU usage.


 --
 View this message in context: 
 http://lucene.472066.n3.nabble.com/solr-4-0-and-high-cpu-usage-tp3993187p3993275.html
 Sent from the Solr - User mailing list archive at Nabble.com.


How to Request several docs ?

2012-07-06 Thread Bruno Mannina

Dear Solr users,

I would like to request/get several docs indexed by solr with only one 
request.


I have a schema.xml where my field PN is the key field (unique key), I 
have more than 80M docs in my index.


I have a list of PN that I want to get and I don't want to do one 
request by PN and I think it's not clean to do

PN1 or PN2 or PN3 or .

Is it possible to do a request with PN1, PN2, PN3, etc... PNn
Where n can be 20 or 30 by example.

Thanks a lot,
Bruno


Re: Solr 4.0 UI issue

2012-07-06 Thread anarchos78
Didn't helped

--
View this message in context: 
http://lucene.472066.n3.nabble.com/Solr-4-0-UI-issue-tp3993286p3993507.html
Sent from the Solr - User mailing list archive at Nabble.com.


Re: How to Request several docs ?

2012-07-06 Thread Paul Libbrecht
Le 6 juil. 2012 à 15:43, Bruno Mannina a écrit :

 I have a list of PN that I want to get and I don't want to do one request by 
 PN and I think it's not clean to do
 PN1 or PN2 or PN3 or .

I've always done this so.

paul

RE: Better (and valid) Spellcheck in combination with other parameters with at least one occurance

2012-07-06 Thread Dyer, James
If you're using Solr3.1 or higher, you can do this.  See 
http://wiki.apache.org/solr/SpellCheckComponent#spellcheck.collate .  Here's a 
summary:

- specify spellcheck.collate=true to get a re-written query made from the 
individual word suggestions.

- specify spellcheck.maxCollationTries to something 0 (10, perhaps) to have 
it try the collation possibilities against the index before returning them to 
the user.  This means that all collations returned will be guaranteed to return 
hits.  

- specify spellcheck.collateExtendedResults=true if you want hit counts for 
the collation queries and also details on which original word was replaced by 
which new word.

- specify spellcheck.maxCollations to something 1 if you want to get more 
than 1 collation returned.  

If on 4.0-alpha:

- maybe specify spellcheck.collateParam.mm=100%, if your original query had a 
very low mm value.  This will make the collations returned more meaningful to 
the user.

- maybe specify spellcheck.alternativeTermCount to something 0 if you want 
the spellchecker to consider that the user might have misspelled words even 
though the misspelling occurs somewhere in the index.

James Dyer
E-Commerce Systems
Ingram Content Group
(615) 213-4311


-Original Message-
From: ninaddesai82 [mailto:desai.ni...@gmail.com] 
Sent: Friday, July 06, 2012 6:04 AM
To: solr-user@lucene.apache.org
Subject: Better (and valid) Spellcheck in combination with other parameters 
with at least one occurance

Hi,
I am trying to implement solr search with spellcheck. - 

My current seach works like this - I have some specific criteria for every
search query. i.e. If I am hitting search with q=restaurants , I also pass
another param say c=mumbai. So solr returns me restaurants in mumbai.

Now I want to implement spellcheck, but at the same time I also want to make
sure that whatever results my spellcheck is providing, are valid (means have
at least one occurance in combination with my other param as in c=city). I
am not able to decide how to achieve that.

i.e. 
If I pass say hangry to solr with spell check then spell check retuerns
few suggestions like hungry, angry
However I want to suggest user only those suggestions which have hitcounts
in my data with common field as in c=mumbai.
Otherwise what happens is - solr returns me some suggestion words hungry ,
angry and if they dont have ny records with combination of city, it returns
no result, which is bad user experience.

so ideally Solr should return me suggestion for only those words which have
at least 1 count for that suggestion with mumbai. 

I am currently firing 2 queries to luscene in order to achieve this 
Query one - spellcheck - this gives me word suggestns
Query two - with given words in spell check i run facet query to get
occurance counts to check which ones are valid.
But this seems like unnecessary over head (Query two - with facet takes too
long to respond as well). And I am trying to find more optimized way to do
this.

Can anyone suggest me how to do that ?

thanks,
ndesai

--
View this message in context: 
http://lucene.472066.n3.nabble.com/Better-and-valid-Spellcheck-in-combination-with-other-parameters-with-at-least-one-occurance-tp3993484.html
Sent from the Solr - User mailing list archive at Nabble.com.


Re: Solr facet multiple constraint

2012-07-06 Thread Erick Erickson
What does doesn't work mean? returning no results? Not returning facets?
returning incorrect facet counts?

You might review:
http://wiki.apache.org/solr/UsingMailingLists

Best
Erick

On Fri, Jul 6, 2012 at 1:37 AM, davidbougearel
david.bougea...@smile-benelux.com wrote:
 Well thanks for your answer, in fact i've written what the QueryResponse
 return as the solr query here is my real solr query before use the
 executeQuery :

 q=service%3A1+AND+publicationstatus%3ALIVEsort=publishingdate+descfq=%7B%21ex%3Ddt%7D%28%28%28user%3A10%29%29%29facet.field=%7B%21tag%3Ddt%7Duserfacet=truefacet.mincount=1

 which is the same as my first post without the 'wt=javabin' and  instead of
 commas.

 Could you please see if there is something wrong for you ?

 Best regards,

 David.

 --
 View this message in context: 
 http://lucene.472066.n3.nabble.com/Solr-facet-multiple-constraint-tp3992974p3993408.html
 Sent from the Solr - User mailing list archive at Nabble.com.


Re: Better (and valid) Spellcheck in combination with other parameters with at least one occurance

2012-07-06 Thread Erick Erickson
I happen to remember this JIRA: https://issues.apache.org/jira/browse/SOLR-2462

be a bit careful if you use collate with 3.1 or 3.2...

Best
Erick

On Fri, Jul 6, 2012 at 10:36 AM, Dyer, James james.d...@ingrambook.com wrote:
 If you're using Solr3.1 or higher, you can do this.  See 
 http://wiki.apache.org/solr/SpellCheckComponent#spellcheck.collate .  Here's 
 a summary:

 - specify spellcheck.collate=true to get a re-written query made from the 
 individual word suggestions.

 - specify spellcheck.maxCollationTries to something 0 (10, perhaps) to 
 have it try the collation possibilities against the index before returning 
 them to the user.  This means that all collations returned will be guaranteed 
 to return hits.

 - specify spellcheck.collateExtendedResults=true if you want hit counts for 
 the collation queries and also details on which original word was replaced by 
 which new word.

 - specify spellcheck.maxCollations to something 1 if you want to get more 
 than 1 collation returned.

 If on 4.0-alpha:

 - maybe specify spellcheck.collateParam.mm=100%, if your original query had 
 a very low mm value.  This will make the collations returned more 
 meaningful to the user.

 - maybe specify spellcheck.alternativeTermCount to something 0 if you want 
 the spellchecker to consider that the user might have misspelled words even 
 though the misspelling occurs somewhere in the index.

 James Dyer
 E-Commerce Systems
 Ingram Content Group
 (615) 213-4311


 -Original Message-
 From: ninaddesai82 [mailto:desai.ni...@gmail.com]
 Sent: Friday, July 06, 2012 6:04 AM
 To: solr-user@lucene.apache.org
 Subject: Better (and valid) Spellcheck in combination with other parameters 
 with at least one occurance

 Hi,
 I am trying to implement solr search with spellcheck. -

 My current seach works like this - I have some specific criteria for every
 search query. i.e. If I am hitting search with q=restaurants , I also pass
 another param say c=mumbai. So solr returns me restaurants in mumbai.

 Now I want to implement spellcheck, but at the same time I also want to make
 sure that whatever results my spellcheck is providing, are valid (means have
 at least one occurance in combination with my other param as in c=city). I
 am not able to decide how to achieve that.

 i.e.
 If I pass say hangry to solr with spell check then spell check retuerns
 few suggestions like hungry, angry
 However I want to suggest user only those suggestions which have hitcounts
 in my data with common field as in c=mumbai.
 Otherwise what happens is - solr returns me some suggestion words hungry ,
 angry and if they dont have ny records with combination of city, it returns
 no result, which is bad user experience.

 so ideally Solr should return me suggestion for only those words which have
 at least 1 count for that suggestion with mumbai.

 I am currently firing 2 queries to luscene in order to achieve this
 Query one - spellcheck - this gives me word suggestns
 Query two - with given words in spell check i run facet query to get
 occurance counts to check which ones are valid.
 But this seems like unnecessary over head (Query two - with facet takes too
 long to respond as well). And I am trying to find more optimized way to do
 this.

 Can anyone suggest me how to do that ?

 thanks,
 ndesai

 --
 View this message in context: 
 http://lucene.472066.n3.nabble.com/Better-and-valid-Spellcheck-in-combination-with-other-parameters-with-at-least-one-occurance-tp3993484.html
 Sent from the Solr - User mailing list archive at Nabble.com.


Re: Better (and valid) Spellcheck in combination with other parameters with at least one occurance

2012-07-06 Thread ninaddesai82
anyone ??

--
View this message in context: 
http://lucene.472066.n3.nabble.com/Better-and-valid-Spellcheck-in-combination-with-other-parameters-with-at-least-one-occurance-tp3993484p3993498.html
Sent from the Solr - User mailing list archive at Nabble.com.


Re: How to Request several docs ?

2012-07-06 Thread geetha anjali
Hi Bruno,
Check this
http://localhost:8983/solr/select?q=uniqueKey:(pin1 pin2)



On Fri, Jul 6, 2012 at 7:13 PM, Bruno Mannina bmann...@free.fr wrote:

 Dear Solr users,

 I would like to request/get several docs indexed by solr with only one
 request.

 I have a schema.xml where my field PN is the key field (unique key), I
 have more than 80M docs in my index.

 I have a list of PN that I want to get and I don't want to do one request
 by PN and I think it's not clean to do
 PN1 or PN2 or PN3 or .

 Is it possible to do a request with PN1, PN2, PN3, etc... PNn
 Where n can be 20 or 30 by example.

 Thanks a lot,
 Bruno



Re: Multi-thread UpdateProcessor

2012-07-06 Thread Mikhail Khludnev
Okay, why do you think this idea is not worth to look at?

On Fri, Jul 6, 2012 at 12:53 AM, Mikhail Khludnev 
mkhlud...@griddynamics.com wrote:

 Hello,

 Most times when single thread streaming
 http://wiki.apache.org/solr/Solrj#Streaming_documents_for_an_update is
 used I saw lack of cpu utilization at Solr server. Resonable motivation is
 utilize more threads to index faster, but it requires more complicated
  client side.
 I propose to employ special update processor which can fork the stream
 processing onto many threads. If you like it pls vote for
 https://issues.apache.org/jira/browse/SOLR-3585 .

 Regards

 --
 Sincerely yours
 Mikhail Khludnev
 Tech Lead
 Grid Dynamics

 http://www.griddynamics.com
  mkhlud...@griddynamics.com




-- 
Sincerely yours
Mikhail Khludnev
Tech Lead
Grid Dynamics

http://www.griddynamics.com
 mkhlud...@griddynamics.com


Re: How to Request several docs ?

2012-07-06 Thread Bruno Mannina

Hi Gheeta,

Sorry but I don't understand, I suppose uniqueKey for me is pn field
and (pin1 pin2) are values. but if it's that, it not works.

I do:
http://localhost:8983/solr/select/?q=pn:%28EP100A1%20FR2963608A1%29version=2.2start=0rows=10indent=on

no error but no result !


Le 06/07/2012 18:35, geetha anjali a écrit :

Hi Bruno,
Check this
http://localhost:8983/solr/select?q=uniqueKey:(pin1 pin2)



On Fri, Jul 6, 2012 at 7:13 PM, Bruno Mannina bmann...@free.fr wrote:


Dear Solr users,

I would like to request/get several docs indexed by solr with only one
request.

I have a schema.xml where my field PN is the key field (unique key), I
have more than 80M docs in my index.

I have a list of PN that I want to get and I don't want to do one request
by PN and I think it's not clean to do
PN1 or PN2 or PN3 or .

Is it possible to do a request with PN1, PN2, PN3, etc... PNn
Where n can be 20 or 30 by example.

Thanks a lot,
Bruno






Re: Better (and valid) Spellcheck in combination with other parameters with at least one occurance

2012-07-06 Thread Erick Erickson
Be a little patient, we're all volunteers here.

On Fri, Jul 6, 2012 at 8:55 AM, ninaddesai82 desai.ni...@gmail.com wrote:
 anyone ??

 --
 View this message in context: 
 http://lucene.472066.n3.nabble.com/Better-and-valid-Spellcheck-in-combination-with-other-parameters-with-at-least-one-occurance-tp3993484p3993498.html
 Sent from the Solr - User mailing list archive at Nabble.com.


Re: How to Request several docs ?

2012-07-06 Thread Erick Erickson
It should. So, add debugQuery=on and examine the results. Also, we should
see the field definition and results of the above. You might have
your default operator set to AND rather than OR (adding the debug param
should make this plain)..

If your pn is a string type, then it will be case sensitive. I'm
assuming that you've
tried it with a single value rather than two, right?

The admin/schema browser link will allow you to examine the actual contents of
the field in your index, you might look at your pn field to see what
the values look
like there, and if they're what you expect.

Consider reviewing: http://wiki.apache.org/solr/UsingMailingLists, you
haven't provided much information to go on

Best
Erick

On Fri, Jul 6, 2012 at 12:43 PM, Bruno Mannina bmann...@free.fr wrote:
 Hi Gheeta,

 Sorry but I don't understand, I suppose uniqueKey for me is pn field
 and (pin1 pin2) are values. but if it's that, it not works.

 I do:
 http://localhost:8983/solr/select/?q=pn:%28EP100A1%20FR2963608A1%29version=2.2start=0rows=10indent=on

 no error but no result !


 Le 06/07/2012 18:35, geetha anjali a écrit :

 Hi Bruno,
 Check this
 http://localhost:8983/solr/select?q=uniqueKey:(pin1 pin2)



 On Fri, Jul 6, 2012 at 7:13 PM, Bruno Mannina bmann...@free.fr wrote:

 Dear Solr users,

 I would like to request/get several docs indexed by solr with only one
 request.

 I have a schema.xml where my field PN is the key field (unique key), I
 have more than 80M docs in my index.

 I have a list of PN that I want to get and I don't want to do one request
 by PN and I think it's not clean to do
 PN1 or PN2 or PN3 or .

 Is it possible to do a request with PN1, PN2, PN3, etc... PNn
 Where n can be 20 or 30 by example.

 Thanks a lot,
 Bruno





Re: How to Request several docs ?

2012-07-06 Thread Bruno Mannina

Le 06/07/2012 19:26, Erick Erickson a écrit :

It should. So, add debugQuery=on and examine the results. Also, we should
see the field definition and results of the above. You might have
your default operator set to AND rather than OR (adding the debug param
should make this plain)..

ok I not yet tested but

solrQueryParser defaultOperator=AND/

;o)


If your pn is a string type, then it will be case sensitive. I'm
assuming that you've
tried it with a single value rather than two, right?

yes with one I have result, I take care about Upper/Lower case


The admin/schema browser link will allow you to examine the actual contents of
the field in your index, you might look at your pn field to see what
the values look
like there, and if they're what you expect.

Consider reviewing: http://wiki.apache.org/solr/UsingMailingLists, you
haven't provided much information to go on

sorry, I will be more explicit if I have a new message.


Best
Erick

On Fri, Jul 6, 2012 at 12:43 PM, Bruno Mannina bmann...@free.fr wrote:

Hi Gheeta,

Sorry but I don't understand, I suppose uniqueKey for me is pn field
and (pin1 pin2) are values. but if it's that, it not works.

I do:
http://localhost:8983/solr/select/?q=pn:%28EP100A1%20FR2963608A1%29version=2.2start=0rows=10indent=on

no error but no result !


Le 06/07/2012 18:35, geetha anjali a écrit :


Hi Bruno,
Check this
http://localhost:8983/solr/select?q=uniqueKey:(pin1 pin2)



On Fri, Jul 6, 2012 at 7:13 PM, Bruno Mannina bmann...@free.fr wrote:


Dear Solr users,

I would like to request/get several docs indexed by solr with only one
request.

I have a schema.xml where my field PN is the key field (unique key), I
have more than 80M docs in my index.

I have a list of PN that I want to get and I don't want to do one request
by PN and I think it's not clean to do
PN1 or PN2 or PN3 or .

Is it possible to do a request with PN1, PN2, PN3, etc... PNn
Where n can be 20 or 30 by example.

Thanks a lot,
Bruno










Re: How to Request several docs ?

2012-07-06 Thread Bruno Mannina

Le 06/07/2012 19:37, Bruno Mannina a écrit :

Le 06/07/2012 19:26, Erick Erickson a écrit :
It should. So, add debugQuery=on and examine the results. Also, we 
should

see the field definition and results of the above. You might have
your default operator set to AND rather than OR (adding the debug param
should make this plain)..

ok I not yet tested but

solrQueryParser defaultOperator=AND/

;o) 

Changed, Tested and results are here !

Thanks a lot !



Re: Regression of JIRA 1826?

2012-07-06 Thread Jamie Johnson
A little more information on this.

I tinkered a bit with the schema and it appears to be related to
WordDelimiterFilterFactory and splitOnCaseChange being true, or at
least this setting being set exhibits the issue.

Also I am using the edismax query parser.  Again any ideas/help would
be greatly appreciated.

On Fri, Jul 6, 2012 at 1:40 AM, Jamie Johnson jej2...@gmail.com wrote:
 I just upgraded to trunk to try to fix an issue I was having with the
 highlighter described in JIRA 1826, but it appears that this issue
 still exists on trunk.  I'm running the following query

 subject:ztest*

 subject is a text field (not multivalued) and the return in highlighting is

 emZTest/emForemZTestForJamie/em

 the actual stored value is ZTestForJamie.  Is anyone else experiencing this?


Re: Boosting the score of the whole documents

2012-07-06 Thread Chris Hostetter

: I would like to give a boost to the whole documents as I index them. I am
: sending to solr the xml in the form:
: 
: adddoc boost=2.0/doc/add
: 
: But it does't seem to alter the search scores in any way. I would expect

http://wiki.apache.org/solr/SolrRelevancyFAQ#How_can_I_increase_the_score_for_specific_documents
http://wiki.apache.org/solr/UpdateXmlMessages#Optional_attributes_on_.22doc.22

http://lucene.apache.org/core/3_6_0/api/core/org/apache/lucene/search/Similarity.html#formula_norm


-Hoss


Re: Multi-thread UpdateProcessor

2012-07-06 Thread Dmitry Kan
Mikhail,

you have my +1 and a jira comment :)

// Dmitry

On Fri, Jul 6, 2012 at 7:41 PM, Mikhail Khludnev mkhlud...@griddynamics.com
 wrote:

 Okay, why do you think this idea is not worth to look at?

 On Fri, Jul 6, 2012 at 12:53 AM, Mikhail Khludnev 
 mkhlud...@griddynamics.com wrote:

  Hello,
 
  Most times when single thread streaming
  http://wiki.apache.org/solr/Solrj#Streaming_documents_for_an_update is
  used I saw lack of cpu utilization at Solr server. Resonable motivation
 is
  utilize more threads to index faster, but it requires more complicated
   client side.
  I propose to employ special update processor which can fork the stream
  processing onto many threads. If you like it pls vote for
  https://issues.apache.org/jira/browse/SOLR-3585 .
 
  Regards
 
  --
  Sincerely yours
  Mikhail Khludnev
  Tech Lead
  Grid Dynamics
 
  http://www.griddynamics.com
   mkhlud...@griddynamics.com
 
 


 --
 Sincerely yours
 Mikhail Khludnev
 Tech Lead
 Grid Dynamics

 http://www.griddynamics.com
  mkhlud...@griddynamics.com




-- 
Regards,

Dmitry Kan


Grouping and Stats

2012-07-06 Thread Jeremy Branham
Hello -
I’m not sure If this is an appropriate use for Solr, but I want to stay away 
from a typical DB store for high availability reasons.

I am storing documents that may have a common value for a field we’ll call 
“category”.
In another field there will be an integer field we’ll call “rating”.

I would like to group the documents on the “category” field and display the 
average “rating” per group.

The stats component lets me get the avg rating, but when I collapse the results 
into groups it gives me the average for the entire collection, rather than for 
the specific group.

Am I going about this wrong?
Is it possible to get the desired outcome with a  single query?

I’d appreciate any insight!
Thank you,

Jeremy Branham
Software Engineer
http://LinkedIn.com/in/JeremyBranham 
http://jeremybranham.wordpress.com/ 
http://Zeroth.biz

Nrt and caching

2012-07-06 Thread Amit Nithian
Sorry I'm a bit new to the nrt stuff in solr but I'm trying to understand
the implications of frequent commits and cache rebuilding and auto warming.
What are the best practices surrounding nrt searching and caches and query
performance.

Thanks!
Amit