Solr-Lucene compatibility

2012-02-19 Thread youngestachiever
I have an app the writes lucene indexes and is based on lucene 2.3.0.

Can I read those indexes using solr 3.5.0 and perform a distributed search?
Or should I use a lower version of solr, so that the index reader is
compatible with the index writer.

Would be great if somebody with prior exposure can advice!

--
View this message in context: 
http://lucene.472066.n3.nabble.com/Solr-Lucene-compatibility-tp3760009p3760009.html
Sent from the Solr - User mailing list archive at Nabble.com.


Solr-Lucene compatibility

2012-02-19 Thread youngestachiever
I have an app the writes lucene indexes and is based on lucene 2.3.0. 

Can I read those indexes using solr 3.5.0 and perform a distributed search?
Or should I use a lower version of solr, so that the index reader is
compatible with the index writer. 

Would be great if somebody with prior exposure can advice!

--
View this message in context: 
http://lucene.472066.n3.nabble.com/Solr-Lucene-compatibility-tp3760022p3760022.html
Sent from the Solr - User mailing list archive at Nabble.com.


Re: Basic SOLR help needed

2012-02-19 Thread pravesh
>>When I do a query using the Admin tool:
>>INST_NAME:KENTUCKY TECH PADUCAH  (There is a docment in the db that meets
this INST_NAME exactly) 

Try using this way:
INST_NAME:(KENTUCKY TECH PADUCAH)
This way all the 3 terms would be searched in the field INST_NAME, otherwise
only the first term KENTUCKY is searched in the INST_NAME and rest terms
like TECH and PADUCAH are searched in your default search field

Regds
Pravesh

--
View this message in context: 
http://lucene.472066.n3.nabble.com/Basic-SOLR-help-needed-tp3759855p375.html
Sent from the Solr - User mailing list archive at Nabble.com.


RE: Basic SOLR help needed

2012-02-19 Thread jwgildea
I want to be able to search against INST_NAME using keywords, stemming,
synonyms, etc...  I think I need to split on white space.  Don't I?
Thanks for the quick response!

 



--
View this message in context: 
http://lucene.472066.n3.nabble.com/Basic-SOLR-help-needed-tp3759855p3759998.html
Sent from the Solr - User mailing list archive at Nabble.com.

Re: Solr edismax clarification

2012-02-19 Thread Indika Tantrigoda
Hi,
Thanks for the response.

Below is the field type and schema definition.

 
  
  
  
  
  
  
  
  
  
  
  
  
  
  

   

This is the request handler.



 edismax
 explicit
 *:*
 0.01
 
title^2 description^1.0
 
 pow(100,boost_factor)



Here is the output from debugQuery=on

BoostedQuery(boost(+(((title:boston^2.0 | description:boston)~0.01
(title:public^2.0 | description:public)~0.01 (title:library^2.0 |
description:library)~0.01)~3),pow(const(100.0),int(boost_factor

stopwords_en.txt is inlcuded for stopworods. i.e. the list of stopwords
that come with Solr.

The full query string is
http://localhost:/solr/select/?fl=title&rows=13&qt=general_search&q=boston+public+library&start=0&debugQuery=on

I use SolrJ in my application to connect to Solr.

As per the query string shown above if I search for Boston public library I
do not get a document that has the title Boston.

Thanks.

On 17 February 2012 15:55, O. Klein  wrote:

>
> Indika Tantrigoda wrote
> >
> > Hi All,
> >
> > I am using edismax SearchHandler in my search and I have some issues in
> > the
> > search results. As I understand if the "defaultOperator" is set to OR the
> > search query will be passed as  -> The OR quick OR brown OR fox
> > implicitly.
> >
> >
>
> Did you also remove "mm"? If not  "defaultOperator" is ignored and it
> follows "mm" settings.
>
> http://wiki.apache.org/solr/DisMaxQParserPlugin#mm_.28Minimum_.27Should.27_Match.29
>
> --
> View this message in context:
> http://lucene.472066.n3.nabble.com/Solr-edismax-clarification-tp3751013p3753260.html
> Sent from the Solr - User mailing list archive at Nabble.com.
>


Basic SOLR help needed

2012-02-19 Thread jwgildea
I have created a SOLR DB that appears to be working in general.   Here are
the fields from my schema:


















I loaded 2 documents

When I do a query using the Admin tool:
INST_NAME:KENTUCKY TECH PADUCAH  (There is a docment in the that meets this
query exactly)

It appears to only look at the first keyword "NATIONAL"

If I break the query up:
INST_NAME:NATIONAL 
INST_NAME:PARK 
INST_NAME:COMMUNITY 
INST_NAME:COLLEGE

The query works as expected.  The matching doc has a score of 7.996526, and
there are a bunch of similar matches with lower scores

Help.  I'm sure it's something basic.

Thanks





--
View this message in context: 
http://lucene.472066.n3.nabble.com/Basic-SOLR-help-needed-tp3759855p3759855.html
Sent from the Solr - User mailing list archive at Nabble.com.


Re: Development inside or outside of Solr?

2012-02-19 Thread Oleg Tikhonov
Hi,

I cannot say about two mentioned approaches however take a look at Tika CLI
with --language option.

Hope it helps,

Oleg


On Mon, Feb 20, 2012 at 4:44 AM, bing  wrote:

> Hi, all,
>
> I am deploying a multicore solr server runing on Tomcat, where I want to
> achieve language detection during index/query.
>
> Solr3.5.0 has a wrapped Tika API that can do language detection. Currently,
> the default behavior of Solr3.5.0 is, every time I index a document, and at
> mean time Solr call Tika API to give the result of language detection, i.e.
> index and detection happens at the same time. However, I hope I can have
> the
> language detection result first, and then I decide which core to put the
> document, i.e. detection happens before index.
>
> There seems that I need to do development in either of the following ways:
>
> 1. I might need to do revision of Solr itself, change the default behavior
> of Solr;
> 2. Or I might write a Java client outside Solr, call the client through
> server (JSP maybe) in index/query.
>
> Can anyone meeting with similar conditions give some suggestions about the
> advantages and disad of the two approaches? Any other alternatives? Thank
> you.
>
>
> Best
> Bing
>
>
>
> --
> View this message in context:
> http://lucene.472066.n3.nabble.com/Development-inside-or-outside-of-Solr-tp3759680p3759680.html
> Sent from the Solr - User mailing list archive at Nabble.com.
>


Re: Basic SOLR help needed

2012-02-19 Thread tamanjit.bin...@yahoo.co.in
Hi,
I guess the behavior is because of the field type of the field INST_NAME.
Try changing its type to string for getting the desired results.
text_en_splitting splits on whitespace.

--
View this message in context: 
http://lucene.472066.n3.nabble.com/Basic-SOLR-help-needed-tp3759855p3759866.html
Sent from the Solr - User mailing list archive at Nabble.com.


Development inside or outside of Solr?

2012-02-19 Thread bing
Hi, all, 

I am deploying a multicore solr server runing on Tomcat, where I want to
achieve language detection during index/query. 

Solr3.5.0 has a wrapped Tika API that can do language detection. Currently,
the default behavior of Solr3.5.0 is, every time I index a document, and at
mean time Solr call Tika API to give the result of language detection, i.e.
index and detection happens at the same time. However, I hope I can have the
language detection result first, and then I decide which core to put the
document, i.e. detection happens before index. 

There seems that I need to do development in either of the following ways:

1. I might need to do revision of Solr itself, change the default behavior
of Solr; 
2. Or I might write a Java client outside Solr, call the client through
server (JSP maybe) in index/query. 

Can anyone meeting with similar conditions give some suggestions about the
advantages and disad of the two approaches? Any other alternatives? Thank
you. 


Best 
Bing  



--
View this message in context: 
http://lucene.472066.n3.nabble.com/Development-inside-or-outside-of-Solr-tp3759680p3759680.html
Sent from the Solr - User mailing list archive at Nabble.com.


Git repo

2012-02-19 Thread Mark Diggory
Is there a git repo location that mirrors apache svn repos for solr?

Cheers,
Mark


-- 
[image: @mire Inc.]
*Mark Diggory *(Schedule a
Meeting
)
*2888 Loker Avenue East, Suite 305, Carlsbad, CA. 92010*
*Esperantolaan 4, Heverlee 3001, Belgium*
http://www.atmire.com


Re: Removing empty dynamic fields from a Solr 1.4 index

2012-02-19 Thread Erick Erickson
When you say that they're still around, what
do you mean? They're listed in the schema browser?
You can search on them? ???

But in general, there's no penalty for having these
stay around, they're not consuming resources
in any documents, so I don't think you need to
worry from performance perspective.

Although it is distracting to see them still there..

Best
Erick

On Fri, Feb 17, 2012 at 6:50 AM, Andrew Ingram
 wrote:
> Hi all
>
> (Note: this question is cross-posted on stackoverflow: 
> http://stackoverflow.com/questions/9327542/removing-empty-dynamic-fields-from-a-solr-1-4-index)
>
> I have a Solr index that uses quite a few dynamic fields. I've recently 
> changed my code to reduce the amount of data we index with Solr, 
> significantly reducing the number of dynamic fields that are in use.
>
> I've reindexed my data, and the doc count (as displayed in the admin schema 
> browser) for the old fields has dropped to zero. But I'm confused as to why 
> the fields still exist. I've done an optimize, and restarted the server, but 
> I can't find any information on whether there's a way to get these fields to 
> disappear.
>
> Am I now stuck with these fields unless I recreate the index from scratch? 
> We're talking about a significant reduction in fields (about 200 -> 30), and 
> I'm worried about the performance impact of keeping them floating around.
>
> Thanks,
> Andrew Ingram


Re: Payload and exact search - 2

2012-02-19 Thread Erick Erickson
As far as I know, you're on the right track. Note that it isn't important
that the payload filter be the first, just that nothing that splits the tokens
up on your delimit character (pipe symbol) come before it.

Like I said, payloads are a bit of a mystery to me, so don't take my
word for gospel here!

Best
Erick

On Sun, Feb 19, 2012 at 9:54 AM, leonardo2  wrote:
> Thank's for your reply,
> so, if I apply the  encoder="identity"/> as first filter in the chain, it shoud works
> In this new configuration, the first filter in the chain intercept the
> payload. It manages and removes the payload info and then the subsequent
> filters are applied to the clear text: is it right?
>
> Leonardo
>
> --
> View this message in context: 
> http://lucene.472066.n3.nabble.com/Payload-and-exact-search-2-tp3750355p3758152.html
> Sent from the Solr - User mailing list archive at Nabble.com.


Re: Payload and exact search - 2

2012-02-19 Thread leonardo2
Thank's for your reply,
so, if I apply the  as first filter in the chain, it shoud works 
In this new configuration, the first filter in the chain intercept the
payload. It manages and removes the payload info and then the subsequent
filters are applied to the clear text: is it right?

Leonardo

--
View this message in context: 
http://lucene.472066.n3.nabble.com/Payload-and-exact-search-2-tp3750355p3758152.html
Sent from the Solr - User mailing list archive at Nabble.com.


Re: Solritas: Modify $content in layout.vm

2012-02-19 Thread remi tassing
Yeah, that works for now. I'll check that §content thing later on.

Thanks man!

Remi

On Sunday, February 19, 2012, Erik Hatcher  wrote:
> Unfortunately things a bit messy in there because others have tried to
make a kitchen sink of things in there, but as I said, it all starts with
browse.vm and then follow any #parse's from there.  You'll see browse.vm
#parse's "facets.vm", and in there you'll see how it then #parse's to those
various pieces you mention, so you can just remove, say, the
#parse('cluster.vm') in there (and subsequently clean it up and actually
remove the now unused cluster.vm file if you like).  And so on.
>
>Erik
>
>
> On Feb 18, 2012, at 10:23 , remi tassing wrote:
>
>> Yes, I'm using the example configuration (Solr-3.4).
>>
>> What I'm trying to do is to remove the menus on the left side ("Query
>> Facets", "Range Facets", "Clusters"), and the "boost by price" button.
I'm
>> not using them for now and they're kind of distracting.
>>
>> Thanks, again, in advance!
>>
>> Remi
>>
>> On Fri, Feb 17, 2012 at 11:56 PM, Erik Hatcher wrote:
>>
>>> $content is output of the main template rendered.
>>>
>>> To modify what is generated into $content, modify the main template or
the
>>> sub-#parsed templates (which is what you've discovered, looks like)
that is
>>> rendered (browse.vm, perhaps, if you're using the default example
setup).
>>> The main template that is rendered is specified as v.template (in the
>>> /browse handler definition in solrconfig.xml, again if you're using the
>>> example configuration).
>>>
>>> Does that help?  If not, let us know what you're trying to do exactly.
>>>
>>>   Erik
>>>
>>>
>>>
>>>
>>> On Feb 16, 2012, at 23:06 , remi tassing wrote:
>>>
 Hi all,

 How do we modify the "$content" variable in the layout.vm file? I could
 managed to change other stuff in doc.vm or header.vm but not this one.

 Is there any tutorial on this?

 Remi
>>>
>>>
>
>