Re: Accent Characters

2012-05-30 Thread Sami Siren
Vicente,

Are you using CommonsHttpSolrServer or HttpSolrServer? If the latter
then you are probably hitting this:
https://issues.apache.org/jira/browse/SOLR-3375

The remedy is to use CommonshHttpSolrServer.

--
 Sami Siren

On Thu, May 31, 2012 at 7:52 AM, Vicente Couto  wrote:
> Hello, Jack.
>
> Yeah, I'm screwed up.
>
> Well, the documents are indexed with the accents.
> I started a new clean solr 3.6 configuration, with as few changes as
> possible; I'm running two cores, one for English and another one for French.
> Here is where I am now: If I try to run queries by using solrJ, it does some
> sort of encoding. For example, I can see into the logs that if I run one
> query looking for "pré", I got
>
> INFO: [coreFR] webapp=/solr path=/select
> params={fl=*,score&q=content:pré&hl.fl=content&hl.maxAnalyzedChars=10&hl=true}
> hits=0 status=0 QTime=0
>
> And I can't see any results. If I try by using encoding to UTF-8 it's not
> works.
> But if I simply put http calls into the browser address bar, for example, it
> works perfectly!
> So, how can I "tell" solrJ to not encode the queries?
>
> Thank you
>
> --
> View this message in context: 
> http://lucene.472066.n3.nabble.com/Accent-Characters-tp3985931p3986970.html
> Sent from the Solr - User mailing list archive at Nabble.com.


Re: solr 1.3 Multicores and maxboolean clause

2012-05-30 Thread Sujatha Arun
Thanks Jack .

In which case the template cores would be ones that would be initialized
first and we need to take care of this  on template configs .

Also I notices that when  we remove the core1 and core0 and try to create a
new webapp without any core and empty solr.xml and try to create a new core
,we get an error and core is not created.

Regards
Sujatha

On Thu, May 31, 2012 at 12:40 AM, Jack Krupansky wrote:

> As per the source code, Solr only sets the BooleanQuery clause limit on
> the very first core load. It ignores any the setting on subsequent core
> loads, including a reload of the initial core.
>
> SolrCore.java: "// only change the BooleanQuery maxClauseCount once for
> ALL cores..."
>
> The cores should get loaded in the order they appear in solr.xml, although
> I don't know if that is a written, contractual guarantee.
>
> As the CoreAdmin wiki page says, "Workaround, set maxBooleanClauses to the
> greatest value desired in *all* cores".
>
> See:
> http://wiki.apache.org/solr/**CoreAdmin#Known_Issues
>
> The wiki is wrong when it says "Whichever Solr core initializes last will
> win the setting of the solrconfig.xml's maxBooleanClauses value." The first
> core to be loaded wins. Or, maybe the source code is wrong. Either way, a
> correction is needed.
>
> -- Jack Krupansky
>
> -Original Message- From: Sujatha Arun
> Sent: Wednesday, May 30, 2012 1:30 PM
> To: solr-user@lucene.apache.org
> Subject: solr 1.3 Multicores and maxboolean clause
>
>
> Hello,
>
> The solrcore Wiki says that "Lucene's
> BooleanQuery
> >**maxClauseCount
>
> is a static variable, making it a single value across the
> entire JVM. Whichever Solr core initializes last will win the setting of
> the solrconfig.xml's maxBooleanClauses value. Workaround, set
> maxBooleanClauses to the greatest value desired in *all* cores."
>
> Now what I see is that even if any one core* has a smaller value for
> maxboolean clause* ,the smaller one is taken into effect and not the last
> core which is created.
>
> *Some questions*
>
>
>  1. What is the order for initialization of the cores on a server
>
>  restart,I don't see this info in the logs?
>  2. When i change the maxboolean clause on one cores and reload the core
>
>  ,it is not effected ?Does this require Tomcat restart?why?
>  3. The default cores core0 and core1 that comes in the example multicore
>
>  setup does not have this value set in them as it has minimum configuration
>  ,does this affect the value in other cores if I use that as default?
>
> Regards,
> Sujatha
>


Re: A few random questions about solr queries.

2012-05-30 Thread santamaria2
A wee bit of clarification on the 2nd question. I meant relative performance,
ie. would it be much slower to facet over 20 facet.queries & 10 facet.fields
compared to say, 4 facet.queries & facet.fields. I wonder if this makes
sense...

So... is a bump improper etiquette here? >_>

--
View this message in context: 
http://lucene.472066.n3.nabble.com/A-few-random-questions-about-solr-queries-tp3986562p3986977.html
Sent from the Solr - User mailing list archive at Nabble.com.


Re: Accent Characters

2012-05-30 Thread Jack Krupansky

This might be related:

https://issues.apache.org/jira/browse/SOLR-443

It suggests setting an HTTP header: Content-Type: 
application/x-www-form-urlencoded; charset=UTF-8


-- Jack Krupansky

-Original Message- 
From: Vicente Couto

Sent: Thursday, May 31, 2012 12:52 AM
To: solr-user@lucene.apache.org
Subject: Re: Accent Characters

Hello, Jack.

Yeah, I'm screwed up.

Well, the documents are indexed with the accents.
I started a new clean solr 3.6 configuration, with as few changes as
possible; I'm running two cores, one for English and another one for French.
Here is where I am now: If I try to run queries by using solrJ, it does some
sort of encoding. For example, I can see into the logs that if I run one
query looking for "pré", I got

INFO: [coreFR] webapp=/solr path=/select
params={fl=*,score&q=content:pré&hl.fl=content&hl.maxAnalyzedChars=10&hl=true}
hits=0 status=0 QTime=0

And I can't see any results. If I try by using encoding to UTF-8 it's not
works.
But if I simply put http calls into the browser address bar, for example, it
works perfectly!
So, how can I "tell" solrJ to not encode the queries?

Thank you

--
View this message in context: 
http://lucene.472066.n3.nabble.com/Accent-Characters-tp3985931p3986970.html
Sent from the Solr - User mailing list archive at Nabble.com. 



Re: Accent Characters

2012-05-30 Thread Vicente Couto
Hello, Jack.

Yeah, I'm screwed up.

Well, the documents are indexed with the accents.
I started a new clean solr 3.6 configuration, with as few changes as
possible; I'm running two cores, one for English and another one for French.
Here is where I am now: If I try to run queries by using solrJ, it does some
sort of encoding. For example, I can see into the logs that if I run one
query looking for "pré", I got

INFO: [coreFR] webapp=/solr path=/select
params={fl=*,score&q=content:pré&hl.fl=content&hl.maxAnalyzedChars=10&hl=true}
hits=0 status=0 QTime=0 

And I can't see any results. If I try by using encoding to UTF-8 it's not
works.
But if I simply put http calls into the browser address bar, for example, it
works perfectly!
So, how can I "tell" solrJ to not encode the queries?

Thank you

--
View this message in context: 
http://lucene.472066.n3.nabble.com/Accent-Characters-tp3985931p3986970.html
Sent from the Solr - User mailing list archive at Nabble.com.


Poll: What do you use for Solr performance monitoring?

2012-05-30 Thread Otis Gospodnetic
Hi,

Super quick poll:  What do you use for Solr performance monitoring?
Vote here: 
http://blog.sematext.com/2012/05/30/poll-what-do-you-use-for-solr-performance-monitoring/


I'm collecting data for my Berlin Buzzwords talk that will touch on Solr, so 
your votes will be greatly appreciated!

Thanks,
Otis


Re: Logging in Embedded SolrServer - What a nightmare.

2012-05-30 Thread solruser489
For anyone else who is still having this issue, the following may help. The
embedded SOLR server uses sl4j for logging, which is a facade over other
logging frameworks. It achieves this by looking for a 'binding' jar for the
implementation framework in the classpath. In my case I had the 'simple'
binding (slf4j-simple.jar) in my classpath, which is a non-configurable
simple console logging implementation binding that logs everything from INFO
up. Simply remove this jar from your class path to default to the NOP
logger, or replace it with another logging framework binding jar (such as
slf4j-log4j.jar) and configure it as per that framework's requirements.

Hope this helps!

--
View this message in context: 
http://lucene.472066.n3.nabble.com/Logging-in-Embedded-SolrServer-What-a-nightmare-tp485539p3986953.html
Sent from the Solr - User mailing list archive at Nabble.com.


Re: Solr Caches

2012-05-30 Thread Chris Hostetter

: FilterCache:
...
: So if a query contains two fq params, it will create two separate entries
: for each of these fq params. The value of each entry is the list of ids of
: all documents across the index that match the corresponding fq param. Each
: entry is independent of any other entry.
...
strictly speaking think of the cache values as a "set of (doc) ids" not a 
"list of (doc) ids" ... list implies order, and there is none in the 
filterCache.

: A minimum size for filterCache could be (total number of fields * avg
: number of unique values per field) ? Is this correct ? I have not enabled

you could do that ... but it would probably be overkill.  you really only 
need to worry about hte # of fields users will be filtering on, and even 
then only the values people will be filtering on.  if you are using 
facet.method=enum for a field, then you might wnat to ensure it's big 
enough for all the unique values on every (facet) field so you don't get 
evicitions in a single request ... but the facet.method=fc is a lot more 
efficent in most cases.

it's "ok" for unpopular queries to get evicted from the cache(s), so don't 
worry about it too much -- the best way to pick a size for your caches is 
to pick a size and then test, if you get lots of evictions, and you have 
ram to spare: go bigger.  if you get no evicitions, and have a low hit 
rate and want the ram for other things: go smaller.

: QueryResultsCache:
...
: q=Status:Active&fq=Org:Apache&fq=Version:13, it will create one entry that
: contains list of ids of documents that match this full query. Is this

correct.

: documentCache:
...
: correct ? For sizing, SolrWiki states that "*The size for the documentCache
: should always be greater than  * *".
: Why do we need the max_concurrent_queries parameter here ? Is it when
: max_results is much lesser than numDocs ? In my case, a q=*:*search is done

max_results in that context is max_results per request ... ie: the "rows" 
param.  The point is that you don't want a single request have to fetch 
the same document from the index twice because it got a cache miss due to 
a concurrent request evicted that doc from the documentCache.


-Hoss


Re: how to read fieldValueCacheStatistics

2012-05-30 Thread Chris Hostetter

: When I read fieldValueCache statistics I have something that looks like
: 
: item_ABC_FACET :
: 
{field=ABC_FACET,memSize=4224,tindexSize=32,time=92,phase1=92,nTerms=0,bigTerms=0,termInstances=0,uses=11}
: 
: 
: is there a doc somewhere that explains what are

...technically that's one stat, showing you and "UnInvertedField" 
instance in the cache (that's the string-ification of that 
UnInvertedField)

the specifics of what those numbers mean are definitely what i would 
consider "expert level" ... off the top of my head the only ones i am 
fairly sure of are:

memSize - how many bytes of ram it's using
time - how long it took to build 
nTerms - number of unique terms in that field
bigTerms - number of "big" terms, ie: terms that have such a high docFreq, 
they weren't un-inverted because it would be too ineffectient.

In general, this level of detail is the kind of thing where you should 
probably review the code.


-Hoss


Re: Tips on creating a custom QueryCache?

2012-05-30 Thread Chris Hostetter

: that describes concurrency in Solr. The short question is, for such a
: cache, do I need to worry about concurrent access (I'm guessing that the
: firstSearcher QuerySenderListener process would be
: single-threaded/non-concurrent, and thus writes would never be an issue -
: is this correct?) - e.g. for my case, would I back the "NoReplacementCache"
: with a HashMap or ? The bigger question is: what are the parallel task
: execution paths in Solr and under what conditions are they possible?

During the warming phase(s), no one has access to the cache but the 
stuff doing the warming (the cache regenerator, and later the searcher 
listeners) which are single threaded, but that's really an implementation 
detail. after that the SolrCache is used by any concurrent threads that 
deal with that SolrIndexSearcher instance.

In general you should assume that gets/puts on a SolrCache will be 
concurrent -- in your case the puts are going to be No-Ops, so you relaly 
just need to worry about hte gets, so a HashMap should work fine.



-Hoss


Re: How to show Synopsis for textual description field

2012-05-30 Thread Lance Norskog
This is called "Document Summarization" and there is a lot of
literature on the topic. None of it is implemented for Solr. All of
them require some preprocessing so the summaries need to be stored
during indexing. There is some natural-language processing and some
math processing in most summarizers.

On Wed, May 30, 2012 at 7:20 AM, srini  wrote:
> HI All,
>
> I have text filed which contains description for product. When user search
> for any word, I would like to show the synopsis for this field( which could
> be 2-3 lines). Can anyone give some hint?
>
>
> Thanks In Advance!!!
> Srini
>
> --
> View this message in context: 
> http://lucene.472066.n3.nabble.com/How-to-show-Synopsis-for-textual-description-field-tp3986843.html
> Sent from the Solr - User mailing list archive at Nabble.com.



-- 
Lance Norskog
goks...@gmail.com


Re: Highlight the search word in results

2012-05-30 Thread Jack Krupansky
1. Please be specific as to what is not working. Is a highlighting section 
returned in the results? Is the field missing? Or what?

2. hl.fragsize is in characters. Something like 300 is more realistic.
3. Your "f.DESCRIPTION.hl.snippets=5" is missing its XML tags. I don't think 
you need this anyway.

4. hl.alternateField takes a field name, not a field type.
5. Use /browse in the example schema after indexing books.json and set 
&wt=xml to see the raw results which has the highlighting. That is what you 
will be looking for in your results.


To index books.json, cd to the exampledocs directory and do this:

curl http://localhost:8983/solr/update/json?commit=true --data-binary 
@books.json -H 'Content-type:application/json'


For example, here is a test query:

http://localhost:8983/solr/browse/?debugQuery=true&wt=xml&q=the

After the results, you should see a section that starts like this:


 
   
 The Lightning Thief
   
 
 
   
 The Sea of Monsters
   


The "em" is the highlighting of the search terms.

If you don't see that section, you are probably missing the highlight search 
component - see solrconfig in example.


Get comfortable using the example and then replicate the features in your 
own handler.


-- Jack Krupansky

-Original Message- 
From: srini

Sent: Wednesday, May 30, 2012 6:03 PM
To: solr-user@lucene.apache.org
Subject: Highlight the search word in results

I am using solr/browse and I am able to get results for search word. But I
want to highlight the search word in results. I tried below settings in
solr-config.xml and it did not work out. Any ideas?

Any thing I am missing here!!!

solr-config.xml



  on
   DESCRIPTION
f.DESCRIPTION.hl.snippets=5
10
   text_ws2



schema.xml


 
   
   
 
   






--
View this message in context: 
http://lucene.472066.n3.nabble.com/Highlight-the-search-word-in-results-tp3986935.html
Sent from the Solr - User mailing list archive at Nabble.com. 



Re: Difference between textfield and strfield

2012-05-30 Thread Jack Krupansky
I should have said that you need to have both a text and string copy of the 
field. Make sure one of them is "stored" and do a copyField to the other 
field. Then your main query can be against the text field and the sort 
function query against the string field.


-- Jack Krupansky

-Original Message- 
From: Gau

Sent: Wednesday, May 30, 2012 6:23 PM
To: solr-user@lucene.apache.org
Subject: Re: Difference between textfield and strfield

I cannot move from textfield to strfield, since I am using synonym 
expansion.

Is there anything we can do on textfield itself

--
View this message in context: 
http://lucene.472066.n3.nabble.com/Difference-between-textfield-and-strfield-tp3986916p3986938.html
Sent from the Solr - User mailing list archive at Nabble.com. 



Re: Difference between textfield and strfield

2012-05-30 Thread Gau
I cannot move from textfield to strfield, since I am using synonym expansion.
Is there anything we can do on textfield itself

--
View this message in context: 
http://lucene.472066.n3.nabble.com/Difference-between-textfield-and-strfield-tp3986916p3986938.html
Sent from the Solr - User mailing list archive at Nabble.com.


Re: Difference between textfield and strfield

2012-05-30 Thread Jack Krupansky

Take a look at "strdist" and see whether that does enough of what you want:

See:
http://wiki.apache.org/solr/FunctionQuery#strdist

Some discussion here:
http://lucene.472066.n3.nabble.com/can-t-use-strdist-as-functionquery-td1023390.html

And these would need to be "string" fields, not "text" since it is using the 
full, literal character string value.


-- Jack Krupansky

-Original Message- 
From: Gau

Sent: Wednesday, May 30, 2012 5:23 PM
To: solr-user@lucene.apache.org
Subject: Re: Difference between textfield and strfield

Well the I do not have phrases for synonym expansion. So it does work well.
The synonym expansion is done at query time. And since i am just searching
against the first name field, tf, idf and other ranking parameters do not
make sense, hence their weight has been initialized to 1. So after applying
synonym expansion I am getting results in random word format.

the Results are perfect just that they are not ordered by Levenstein
distance of the original query.

So the use case is
if use enters query ab
it gets expanded at query time to abc,abxy,aberfg
And I get results for ab, abc, abxy, aberfg.
But I want them to be sorted by Levenstein distance from the original query
(ab)
So order shoud be
ab
abc
abxy
aberfg

.. !
TextField makes this even more difficult? Any other suggestions?
Spellcheckers? Ngrams?


--
View this message in context: 
http://lucene.472066.n3.nabble.com/Difference-between-textfield-and-strfield-tp3986916p3986928.html
Sent from the Solr - User mailing list archive at Nabble.com. 



Re: Difference between textfield and strfield

2012-05-30 Thread Gau
Well the I do not have phrases for synonym expansion. So it does work well.
The synonym expansion is done at query time. And since i am just searching
against the first name field, tf, idf and other ranking parameters do not
make sense, hence their weight has been initialized to 1. So after applying
synonym expansion I am getting results in random word format. 

the Results are perfect just that they are not ordered by Levenstein
distance of the original query.

So the use case is 
if use enters query ab
it gets expanded at query time to abc,abxy,aberfg
And I get results for ab, abc, abxy, aberfg.
But I want them to be sorted by Levenstein distance from the original query
(ab)
So order shoud be 
ab
abc
abxy
aberfg

.. ! 
TextField makes this even more difficult? Any other suggestions?
Spellcheckers? Ngrams?


--
View this message in context: 
http://lucene.472066.n3.nabble.com/Difference-between-textfield-and-strfield-tp3986916p3986928.html
Sent from the Solr - User mailing list archive at Nabble.com.


Re: Difference between textfield and strfield

2012-05-30 Thread Jack Krupansky
A text field is a sequence of terms that has been tokenized while a string 
field is a single term (although it can also be multivalued.)


Punctuation and spacing is ignored for text fields. Text tends to be 
lowercased, stemmed, and even stop words removed. You tend to search text 
using a handful of keywords whose exact order is not required, although 
quoted phrases can be used as well. Fuzzy queries can be done on individual 
terms (words). Wildcards as well.


String fields are literal character strings with all punctuation, spacing, 
and case preserved. Anything other than exact match is done using wildcards, 
although I suppose fuzzy query should work as well.


String fields are useful for facets and filter queries or display.

Text fields are useful for keyword search.

Synonyms are a token filtering, which applies to text fields, not string 
fields.


A fuzzy query would not work properly for a synonym expansion in which some 
of the terms are phrases, but should otherwise work for a text field term.


If you have a specific case that fails, please provide the details.

-- Jack Krupansky

-Original Message- 
From: Gau

Sent: Wednesday, May 30, 2012 4:39 PM
To: solr-user@lucene.apache.org
Subject: Difference between textfield and strfield

Hi,

Can anyone explain me the basic pros and cons between textfield and
strfield. I am trying to use Levenstein distance on textfield, but it seems
that it can only be applied on the strfield. So my question is whats the
difference between the 2 and what are the radical advantages of one over the
other

Currently I have the text field defined for first_name and i apply synonym
expansion at query time to this field.

--
View this message in context: 
http://lucene.472066.n3.nabble.com/Difference-between-textfield-and-strfield-tp3986916.html
Sent from the Solr - User mailing list archive at Nabble.com. 



Difference between textfield and strfield

2012-05-30 Thread Gau
Hi,

 Can anyone explain me the basic pros and cons between textfield and
strfield. I am trying to use Levenstein distance on textfield, but it seems
that it can only be applied on the strfield. So my question is whats the
difference between the 2 and what are the radical advantages of one over the
other

Currently I have the text field defined for first_name and i apply synonym
expansion at query time to this field.

--
View this message in context: 
http://lucene.472066.n3.nabble.com/Difference-between-textfield-and-strfield-tp3986916.html
Sent from the Solr - User mailing list archive at Nabble.com.


Re: solr 1.3 Multicores and maxboolean clause

2012-05-30 Thread Jack Krupansky
As per the source code, Solr only sets the BooleanQuery clause limit on the 
very first core load. It ignores any the setting on subsequent core loads, 
including a reload of the initial core.


SolrCore.java: "// only change the BooleanQuery maxClauseCount once for ALL 
cores..."


The cores should get loaded in the order they appear in solr.xml, although I 
don't know if that is a written, contractual guarantee.


As the CoreAdmin wiki page says, "Workaround, set maxBooleanClauses to the 
greatest value desired in *all* cores".


See:
http://wiki.apache.org/solr/CoreAdmin#Known_Issues

The wiki is wrong when it says "Whichever Solr core initializes last will 
win the setting of the solrconfig.xml's maxBooleanClauses value." The first 
core to be loaded wins. Or, maybe the source code is wrong. Either way, a 
correction is needed.


-- Jack Krupansky

-Original Message- 
From: Sujatha Arun

Sent: Wednesday, May 30, 2012 1:30 PM
To: solr-user@lucene.apache.org
Subject: solr 1.3 Multicores and maxboolean clause

Hello,

The solrcore Wiki says that "Lucene's
BooleanQuerymaxClauseCount
is a static variable, making it a single value across the
entire JVM. Whichever Solr core initializes last will win the setting of
the solrconfig.xml's maxBooleanClauses value. Workaround, set
maxBooleanClauses to the greatest value desired in *all* cores."

Now what I see is that even if any one core* has a smaller value for
maxboolean clause* ,the smaller one is taken into effect and not the last
core which is created.

*Some questions*


  1. What is the order for initialization of the cores on a server
  restart,I don't see this info in the logs?
  2. When i change the maxboolean clause on one cores and reload the core
  ,it is not effected ?Does this require Tomcat restart?why?
  3. The default cores core0 and core1 that comes in the example multicore
  setup does not have this value set in them as it has minimum 
configuration

  ,does this affect the value in other cores if I use that as default?

Regards,
Sujatha 



Re: Tips on creating a custom QueryCache?

2012-05-30 Thread Walter Underwood
On May 30, 2012, at 11:44 AM, Aaron Daubman wrote:

>  The bigger question is: what are the parallel task
> execution paths in Solr and under what conditions are they possible?

I'd go with the general servlet rules, where everything is assumed to have 
concurrent access.

wunder
--
Walter Underwood
wun...@wunderwood.org





Re: Tips on creating a custom QueryCache?

2012-05-30 Thread Aaron Daubman
Hoss,


: 1) Any recommendations on which best to sub-class? I'm guessing, for this
> : scenario with "rare" batch puts and no evictions, I'd be looking for get
> : performance. This will also be on a box with many CPUs - so I wonder if
> the
> : older LRUCache would be preferable?
>
> i suspect you are correct ... the entire point of the other caches is
> dealingwith faster replacement, so you really don't care.
>
> You might even find it worth while to write your own
> "NoReplacementCache" from scratch backed by a HashMap (instead of the
> LinkedHashMap used in LRUCache)


I really like this idea (roll-your-own cache using a simple HashMap).
However, as much searching as I've done, I've come up short on anything
that describes concurrency in Solr. The short question is, for such a
cache, do I need to worry about concurrent access (I'm guessing that the
firstSearcher QuerySenderListener process would be
single-threaded/non-concurrent, and thus writes would never be an issue -
is this correct?) - e.g. for my case, would I back the "NoReplacementCache"
with a HashMap or ? The bigger question is: what are the parallel task
execution paths in Solr and under what conditions are they possible?

Thanks again,
 Aaron


solr 1.3 Multicores and maxboolean clause

2012-05-30 Thread Sujatha Arun
Hello,

The solrcore Wiki says that "Lucene's
BooleanQuerymaxClauseCount
is a static variable, making it a single value across the
entire JVM. Whichever Solr core initializes last will win the setting of
the solrconfig.xml's maxBooleanClauses value. Workaround, set
maxBooleanClauses to the greatest value desired in *all* cores."

Now what I see is that even if any one core* has a smaller value for
maxboolean clause* ,the smaller one is taken into effect and not the last
core which is created.

*Some questions*


   1. What is the order for initialization of the cores on a server
   restart,I don't see this info in the logs?
   2. When i change the maxboolean clause on one cores and reload the core
   ,it is not effected ?Does this require Tomcat restart?why?
   3. The default cores core0 and core1 that comes in the example multicore
   setup does not have this value set in them as it has minimum configuration
   ,does this affect the value in other cores if I use that as default?

Regards,
Sujatha


Re: Multicore Issue - Server Restart

2012-05-30 Thread Sujatha Arun
solr 1.3

Regards
Sujatha

On Wed, May 30, 2012 at 8:26 PM, Siva Kommuri  wrote:

> Hi Sujatha,
>
> Which version of Solr are you using?
>
> Best Wishes,
> Siva
>
> On Wed, May 30, 2012 at 12:22 AM, Sujatha Arun 
> wrote:
>
> > Yes ,that is correct.
> >
> > Regards
> > Sujatha
> >
> > On Tue, May 29, 2012 at 7:23 PM, lboutros  wrote:
> >
> > > Hi Suajtha,
> > >
> > > each webapps has its own solr home ?
> > >
> > > Ludovic.
> > >
> > > -
> > > Jouve
> > > France.
> > > --
> > > View this message in context:
> > >
> >
> http://lucene.472066.n3.nabble.com/Multicore-Issue-Server-Restart-tp3986516p3986602.html
> > > Sent from the Solr - User mailing list archive at Nabble.com.
> > >
> >
>


Re: Multicore Issue - Server Restart

2012-05-30 Thread Siva Kommuri
Hi Sujatha,

Which version of Solr are you using?

Best Wishes,
Siva

On Wed, May 30, 2012 at 12:22 AM, Sujatha Arun  wrote:

> Yes ,that is correct.
>
> Regards
> Sujatha
>
> On Tue, May 29, 2012 at 7:23 PM, lboutros  wrote:
>
> > Hi Suajtha,
> >
> > each webapps has its own solr home ?
> >
> > Ludovic.
> >
> > -
> > Jouve
> > France.
> > --
> > View this message in context:
> >
> http://lucene.472066.n3.nabble.com/Multicore-Issue-Server-Restart-tp3986516p3986602.html
> > Sent from the Solr - User mailing list archive at Nabble.com.
> >
>


Re: how to reduce the result size to 2-3 lines and expand based on user interest

2012-05-30 Thread Jack Krupansky

The commonly used term is "snippet" or "snippet highlighting."

Read:
http://wiki.apache.org/solr/HighlightingParameters

Try that approach.

To expand, execute a fresh query for the specific document but with 
highlighting disabled or with highlighted parameters that provide a larger 
snippet size.


-- Jack Krupansky

-Original Message- 
From: srini

Sent: Wednesday, May 30, 2012 10:13 AM
To: solr-user@lucene.apache.org
Subject: Re: how to reduce the result size to 2-3 lines and expand based on 
user interest


Here is the correct technical term to put. "synopsis". Yes all I wanted to
do is show synopsis for the text description field I have.

Thanks
Srini

--
View this message in context: 
http://lucene.472066.n3.nabble.com/how-to-reduce-the-result-size-to-2-3-lines-and-expand-based-on-user-interest-tp3985692p3986840.html
Sent from the Solr - User mailing list archive at Nabble.com. 



Re: solr limits

2012-05-30 Thread Jack Krupansky
In theory, yes. A single core/index is limited by the Lucene limit of 2.14 
billion documents (document number is non-negative and represented as a 
signed Java "int"), but you can use shards to get a multiple of that. You 
can put 50 to 250 million (maybe more, depending on the data) documents in a 
single shard. You also will want replication of each shard for query 
capacity.


Take a look at Mark Miller's writeup:
http://www.lucidimagination.com/content/scaling-lucene-and-solr

-- Jack Krupansky

-Original Message- 
From: Sachin Aggarwal

Sent: Wednesday, May 30, 2012 1:13 AM
To: solr-user@lucene.apache.org
Subject: solr limits

Hello,


How much solr scales, can it go beyond 10 billion

--

Thanks & Regards

Sachin Aggarwal
7204609353 



How to show Synopsis for textual description field

2012-05-30 Thread srini
HI All,

I have text filed which contains description for product. When user search
for any word, I would like to show the synopsis for this field( which could
be 2-3 lines). Can anyone give some hint?


Thanks In Advance!!!
Srini

--
View this message in context: 
http://lucene.472066.n3.nabble.com/How-to-show-Synopsis-for-textual-description-field-tp3986843.html
Sent from the Solr - User mailing list archive at Nabble.com.


Re: how to reduce the result size to 2-3 lines and expand based on user interest

2012-05-30 Thread srini
 Here is the correct technical term to put. "synopsis". Yes all I wanted to
do is show synopsis for the text description field I have.

Thanks
Srini

--
View this message in context: 
http://lucene.472066.n3.nabble.com/how-to-reduce-the-result-size-to-2-3-lines-and-expand-based-on-user-interest-tp3985692p3986840.html
Sent from the Solr - User mailing list archive at Nabble.com.


Re: Clarification requested on query results.

2012-05-30 Thread Sohail Aboobaker
Never mind, I figured it out after sending the email:

http://localhost:8983/solr/select?q=level2categories:%22Artists\|Direct%20from%20the%20Artist%22&group=true&group.field=groupby&group.main=true&facet=true&group.facet=true&facet.field=level1categories&facet.field=level2categories&rows=0

I needed to enclose the value in double quotes and also have escape the |
sign with \.

Thanks.


Clarification requested on query results.

2012-05-30 Thread Sohail Aboobaker
Hi,

We have a schema that contains categories (multi valued) for different
levels. Each level of categories are indexed in a different field. Each
category entry contains its parent in its value. The level is separated by
a "|" symbol. We are seeing following results for a faceted query:

http://localhost:8983/solr/select?q=level1categories:Artists&group=true&group.field=groupby&group.main=true&facet=true&group.facet=true&facet.field=level1categories&facet.field=level2categories&rows=0

In above, level1category is queried for "Artists" and we get following
counts:


   

 
 
  4761
   0
   0
   
  
   265

   198
   14
   10
   
 
  
   

The count for level2categories --> Artists|Direct from the Artist  is 265.

When I filter on leve2categories:Artists|Direct from the Artist, I receive
776 results. I am using following query:

http://localhost:8983/solr/select?q=level2categories:Artists|Direct%20from%20the%20Artist&group=true&group.field=groupby&group.main=true&facet=true&group.facet=true&facet.field=level1categories&facet.field=level2categories&rows=0

or this

http://localhost:8983/solr/select?q=level2categories:Artists\|Direct%20from%20the%20Artist&group=true&group.field=groupby&group.main=true&facet=true&group.facet=true&facet.field=level1categories&facet.field=level2categories&rows=0

I was expecting that it would return 265 as row count but I get following
counts as response:




 
 
   776
   0
   0
  
  
   198
   180
   0
   0
  


What am I doing incorrectly? What is it filtering on to come to 776 total
rows?

Regards,
Sohail


difference between Katta and SolrCloud (replicator factor)

2012-05-30 Thread Jamel ESSOUSSI
Hi,

I would like to know the difference between Katta and SolrCloud in the
distributed context ? And can we specify in SolrCloud the minimum number of
copies of a document maintained by the cluster (replicator factor).

Best Regards

-- Jamel ESSOUSSI

--
View this message in context: 
http://lucene.472066.n3.nabble.com/difference-between-Katta-and-SolrCloud-replicator-factor-tp3986791.html
Sent from the Solr - User mailing list archive at Nabble.com.


Re: Multicore Issue - Server Restart

2012-05-30 Thread Sujatha Arun
Yes ,that is correct.

Regards
Sujatha

On Tue, May 29, 2012 at 7:23 PM, lboutros  wrote:

> Hi Suajtha,
>
> each webapps has its own solr home ?
>
> Ludovic.
>
> -
> Jouve
> France.
> --
> View this message in context:
> http://lucene.472066.n3.nabble.com/Multicore-Issue-Server-Restart-tp3986516p3986602.html
> Sent from the Solr - User mailing list archive at Nabble.com.
>