Re: Is it possible to use sub-fields or multivalued fields for boosting?

2011-05-04 Thread findbestopensource
Hello deniz,

You could create a new field, say FullName, which is a combination of
firstname and surname (for example, populated via copyField). Search on both
the new field and location, but give the new field query a higher boost.
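
A minimal sketch of this suggestion, assuming the dismax query parser and its
qf parameter are used to weight the fields. The boost values (5.0 and 1.0),
the base URL, and the helper name are illustrative assumptions, not taken from
this thread.

```python
from urllib.parse import urlencode


def build_boosted_query(user_input, solr_base="http://localhost:8983/solr/select"):
    """Build a select URL that searches FullName and location, boosting FullName.

    The boosts and the base URL are hypothetical; tune them for your schema.
    """
    params = {
        "defType": "dismax",                 # dismax lets qf carry per-field boosts
        "q": user_input,
        "qf": "FullName^5.0 location^1.0",   # name matches weigh more than location
    }
    return solr_base + "?" + urlencode(params)


if __name__ == "__main__":
    print(build_boosted_query("London"))
```

urlencode takes care of escaping the caret and the space inside qf.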

Regards
Aditya
www.findbestopensource.com



On Thu, May 5, 2011 at 9:21 AM, deniz  wrote:

> okay... let me make the situation more clear... I am trying to create an
> universal field which includes information about users like firstname,
> surname, gender, location etc. When I enter something e.g London, I would
> like to match any users having 'London' in any field firstname, surname or
> location. But if it matches name or surname, I would like to give a higher
> weight.
>
> so my question is... is it possible to have sub-fields? like
> 
>   blabla
>   blabla
>   blabla
>   blabla
> 
>
> or any other ideas for implementing such feature?
>
>
>
> --
> View this message in context:
> http://lucene.472066.n3.nabble.com/Is-it-possible-to-use-sub-fields-or-multivalued-fields-for-boosting-tp2901992p2901992.html
> Sent from the Solr - User mailing list archive at Nabble.com.
>


copyField

2011-05-04 Thread deniz
Another question:
if I define different fields with different boosts, then copy them into
another field and search using this universal field, will the boosts
still be applied?

--
View this message in context: 
http://lucene.472066.n3.nabble.com/copyField-tp2902242p2902242.html
Sent from the Solr - User mailing list archive at Nabble.com.


Re: Field names with a period (.)

2011-05-04 Thread Gora Mohanty
On Thu, May 5, 2011 at 5:08 AM, Leonardo Souza  wrote:
> Hi guys,
>
> Can i have a field name with a period(.) ?
> Like in *file.size*

Cannot find now where this is documented, but from what I remember it is
recommended to use only characters A-Z, a-z, 0-9, and underscore (_) in
field names, and some special characters are known to cause problems.
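
As a rough illustration of the recommendation above, a check like the
following could be run over candidate field names before they go into a
schema. The function name is hypothetical, and the rule encodes only the
conservative character set mentioned here, not an official Solr validation.

```python
import re

# Only A-Z, a-z, 0-9 and underscore, as recommended above.
SAFE_FIELD_NAME = re.compile(r"[A-Za-z0-9_]+")


def is_conservative_field_name(name):
    """Return True if the name uses only the conservative character set."""
    return SAFE_FIELD_NAME.fullmatch(name) is not None


if __name__ == "__main__":
    for candidate in ("file_size", "file.size"):
        print(candidate, is_conservative_field_name(candidate))
```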

Regards,
Gora


Re: Is it possible to build Solr as a maven project?

2011-05-04 Thread Gabriele Kahlout
Thank you so much for this gem, David!

I still don't manage to build though:
$ svn update
At revision 1099684.

$ mvn clean

$ mvn -N -Pbootstrap install

[INFO]

[INFO] BUILD FAILURE
[INFO]

[INFO] Total time: 8.234s
[INFO] Finished at: Thu May 05 07:21:34 CEST 2011
[INFO] Final Memory: 12M/81M
[INFO]

[ERROR] Failed to execute goal
org.apache.maven.plugins:maven-install-plugin:2.3.1:install-file
(install-solr-noggit) on project lucene-solr-grandparent: Error installing
artifact 'org.apache.solr:solr-noggit:jar': Failed to install artifact
org.apache.solr:solr-noggit:jar:4.0-SNAPSHOT:
/Users/simpatico/debug/solr4/solr/lib/apache-solr-noggit-r944541.jar (No
such file or directory) -> [Help 1]


On Thu, May 5, 2011 at 12:02 AM, Smiley, David W.  wrote:

> Hi folks. What you're supposed to do is run:
>
> mvn -N -Pbootstrap install
>
> as the very first one-time only step.  It copies several custom jar files
> into your local repository. From then on you can build like normally with
> maven.
>
> ~ David Smiley
> Author: http://www.packtpub.com/solr-1-4-enterprise-search-server/
>
>
> On May 4, 2011, at 2:36 PM, Gabriele Kahlout wrote:
>
> > but it doesn't build.
> >
> > Now, I've checked out solr4 from the trunk and tried to build the maven
> > project there, but it fails downloading berkleydb:
> >
> > BUILD FAILURE
> > 
> > Total time: 1:07.367s
> > Finished at: Wed May 04 20:33:29 CEST 2011
> > Final Memory: 24M/81M
> > 
> > Failed to execute goal on project lucene-bdb: Could not resolve
> dependencies
> > for project org.apache.lucene:lucene-bdb:jar:4.0-SNAPSHOT: Failure to
> find
> > com.sleepycat:berkeleydb:jar:4.7.25 in
> > http://download.carrot2.org/maven2/ was cached in the local repository,
> > resolution will not be reattempted until
> > the update interval of carrot2.org has elapsed or updates are forced ->
> > [Help 1]
> >
> >
> > I looked up to get the jar on my own but I didn't find a 4.7.25 version,
> the
> > latest on oracle website (java edition) is 4.1. Where can i download this
> > maven dependency from?
> >
> > On Wed, May 4, 2011 at 1:26 PM, Gabriele Kahlout
> > wrote:
> >
> >> It worked after checking out the dev-tools folder. Thank you!
> >>
> >>
> >> On Wed, May 4, 2011 at 1:20 PM, lboutros  wrote:
> >>
> >>> 
> >>>  >>> description="Copy Maven POMs from dev-tools/maven/ to their
> >>> target
> >>> locations">
> >>>   
> >>> 
> >>> 
> >>>   
> >>> 
> >>> 
> >>>   
> >>> 
> >>>
> >>
> >>
> >>
> >> --
> >> Regards,
> >> K. Gabriele
> >>
> >> --- unchanged since 20/9/10 ---
> >> P.S. If the subject contains "[LON]" or the addressee acknowledges the
> >> receipt within 48 hours then I don't resend the email.
> >> subject(this) ∈ L(LON*) ∨ ∃x. (x ∈ MyInbox ∧ Acknowledges(x, this) ∧
> >> time(x) < Now + 48h) ⇒ ¬resend(I, this).
> >>
> >> If an email is sent by a sender that is not a trusted contact or the
> email
> >> does not contain a valid code then the email is not received. A valid
> code
> >> starts with a hyphen and ends with "X".
> >> ∀x. x ∈ MyInbox ⇒ from(x) ∈ MySafeSenderList ∨ (∃y. y ∈ subject(x) ∧ y ∈
> >> L(-[a-z]+[0-9]X)).
> >>
> >>
> >
> >
> > --
> > Regards,
> > K. Gabriele
> >
>
>
>
>
>
>


-- 
Regards,
K. Gabriele



OverlappingFileLockException when concurrent commits in solr

2011-05-04 Thread nitesh nandy
Hello,

I'm using Solr version 1.4.0 with Tomcat 6. I have 2 Solr instances running as
2 different web apps with separate data folders. My application requires
frequent commits from multiple clients. I've noticed that when more than one
client tries to commit at the same time, OverlappingFileLockException errors
start to appear. Can anything be done to rectify this problem? Please find
the error log below. Thanks

---
HTTP Status 500 - null

java.nio.channels.OverlappingFileLockException
    at sun.nio.ch.FileChannelImpl$SharedFileLockTable.checkList(FileChannelImpl.java:1215)
    at sun.nio.ch.FileChannelImpl$SharedFileLockTable.add(FileChannelImpl.java:1117)
    at sun.nio.ch.FileChannelImpl.tryLock(FileChannelImpl.java:923)
    at java.nio.channels.FileChannel.tryLock(FileChannel.java:978)
    at org.apache.lucene.store.NativeFSLock.obtain(NativeFSLockFactory.java:233)
    at org.apache.lucene.store.Lock.obtain(Lock.java:73)
    at org.apache.lucene.index.IndexWriter.init(IndexWriter.java:1550)
    at org.apache.lucene.index.IndexWriter.<init>(IndexWriter.java:1407)
    at org.apache.solr.update.SolrIndexWriter.<init>(SolrIndexWriter.java:190)
    at org.apache.solr.update.UpdateHandler.createMainIndexWriter(UpdateHandler.java:98)
    at org.apache.solr.update.DirectUpdateHandler2.openWriter(DirectUpdateHandler2.java:173)
    at org.apache.solr.update.DirectUpdateHandler2.addDoc(DirectUpdateHandler2.java:220)
    at org.apache.solr.update.processor.RunUpdateProcessor.processAdd(RunUpdateProcessorFactory.java:61)
    at org.apache.solr.handler.XMLLoader.processUpdate(XMLLoader.java:139)
    at org.apache.solr.handler.XMLLoader.load(XMLLoader.java:69)
    at org.apache.solr.handler.ContentStreamHandlerBase.handleRequestBody(ContentStreamHandlerBase.java:54)
    at org.apache.solr.handler.RequestHandlerBase.handleRequest(RequestHandlerBase.java:131)
    at org.apache.solr.core.SolrCore.execute(SolrCore.java:1317)
    at org.apache.solr.servlet.SolrDispatchFilter.execute(SolrDispatchFilter.java:338)
    at org.apache.solr.servlet.SolrDispatchFilter.doFilter(SolrDispatchFilter.java:241)
    at org.apache.catalina.core.ApplicationFilterChain.internalDoFilter(ApplicationFilterChain.java:235)
    at org.apache.catalina.core.ApplicationFilterChain.doFilter(ApplicationFilterChain.java:206)
    at org.apache.catalina.core.StandardWrapperValve.invoke(StandardWrapperValve.java:233)
    at org.apache.catalina.core.StandardContextValve.invoke(StandardContextValve.java:191)
    at org.apache.catalina.core.StandardHostValve.invoke(StandardHostValve.java:127)
    at org.apache.catalina.valves.ErrorReportValve.invoke(ErrorReportValve.java:102)
    at org.apache.catalina.core.StandardEngineValve.invoke(StandardEngineValve.java:109)
    at org.apache.catalina.connector.CoyoteAdapter.service(CoyoteAdapter.java:298)
    at org.apache.coyote.http11.Http11Processor.process(Http11Processor.java:859)
    at org.apache.coyote.http11.Http11Protocol$Http11ConnectionHandler.process(Http11Protocol.java:588)
    at org.apache.tomcat.util.net.JIoEndpoint$Worker.run(JIoEndpoint.java:489)
    at java.lang.Thread.run(Thread.java:636)
type Status
reportmessage null


Is it possible to use sub-fields or multivalued fields for boosting?

2011-05-04 Thread deniz
Okay, let me make the situation clearer. I am trying to create a
universal field which includes information about users, like firstname,
surname, gender, location, etc. When I enter something, e.g. London, I would
like to match any users having 'London' in any of the fields firstname,
surname or location. But if it matches firstname or surname, I would like to
give it a higher weight.

so my question is... is it possible to have sub-fields? like

   blabla
   blabla
   blabla
   blabla


or any other ideas for implementing such feature?



--
View this message in context: 
http://lucene.472066.n3.nabble.com/Is-it-possible-to-use-sub-fields-or-multivalued-fields-for-boosting-tp2901992p2901992.html
Sent from the Solr - User mailing list archive at Nabble.com.


Re: Do boosts on values in multivalued fields still get consolidated?

2011-05-04 Thread Neil Hooey
If I have a document with:
{ id: 1, sentences: "hello world|5.0_goodbye|2.3_this is a sentence|2.8" }

How would I get those payloads to take effect on the tokens separated by
"_"?

How do you write a query to use those payloads?
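
To make the data layout in the question concrete, here is a small sketch that
only splits the example value into (text, weight) pairs. It says nothing about
how Solr itself tokenizes or scores payloads; the function name and the choice
of "_" and "|" as separators simply mirror the example document above.

```python
def parse_weighted_sentences(value, item_sep="_", weight_sep="|"):
    """Split 'text|weight_text|weight' into a list of (text, weight) pairs."""
    pairs = []
    for item in value.split(item_sep):
        # rpartition keeps any earlier '|' characters inside the text part.
        text, _, weight = item.rpartition(weight_sep)
        pairs.append((text, float(weight)))
    return pairs


if __name__ == "__main__":
    doc_value = "hello world|5.0_goodbye|2.3_this is a sentence|2.8"
    for text, weight in parse_weighted_sentences(doc_value):
        print(weight, text)
```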

On Wed, May 4, 2011 at 22:26, Otis Gospodnetic
wrote:

> Hi Neil,
>
> I think payloads is the way to go.  Index-time boosting is not per term.
>
> Otis
> 
> Sematext :: http://sematext.com/ :: Solr - Lucene - Nutch
> Lucene ecosystem search :: http://search-lucene.com/
>
>
>
> - Original Message 
> > From: Neil Hooey 
> > To: solr-user@lucene.apache.org
> > Sent: Wed, May 4, 2011 9:36:24 PM
> > Subject: Do boosts on values in multivalued fields still get
> consolidated?
> >
> > Kapil Chhabra indicates on his blog that if you boost a value in  a
> > multivalued field during index time, the boosts are consolidated  for
> > every field, and the individual values are lost.
> >
> > Here's the  link:
> > http://blog.kapilchhabra.com/2008/01/solr-index-time-boost-facts-2
> >
> > This  post is from 2008-01-20, but it still seems to be true in Solr 3.1.
> >
> > Has  this behaviour been fixed in future versions of Solr, or are there
> > plans to  fix it?
> >
> > In general, when a user searches for a document, I'd like  to
> > arbitrarily weight each keyword for that document during index  time.
> >
> > For example if they searched for "q=keywords:monkey", and got these
> documents:
> > keywords: [ monkey, ape, chimp, garage ]
> > keywords: [ monkey,  cloud, food, door ]
> >
> > I'd like to have boosts recorded like this, at index  time, based on
> > keyword co-relevance:
> > keywords: [ monkey:50, ape:50,  chimp:50, garage:0.1 ]
> > keywords: [ monkey:1, cloud:1, food:1, door:1  ]
> >
> > Since, in the first document, the word "monkey" is clearly related  to
> > "ape" and "chimp", but "garage" is not. Similarly in the  second
> > document, none of the keywords are really related to each other  at
> > all.
> >
> > I see a couple of potential solutions to this problem, in the  absence
> > of boosts for multivalued fields:
> > 1. Turn keyword lists into  strings, and use payloads: "monkey|50,
> > ape|50, chimp|50, garage|0.1"
> > 2.  Use dynamic fields of the form: keyword_*: keyword_monkey,
> > keyword_ape, ...  and boost those fields.
> >
> > Are those solutions feasible, or are there better  solutions to this
> problem?
> >
> > - Neil
> >
>


Re: Do boosts on values in multivalued fields still get consolidated?

2011-05-04 Thread Otis Gospodnetic
Hi Neil,

I think payloads is the way to go.  Index-time boosting is not per term.

Otis

Sematext :: http://sematext.com/ :: Solr - Lucene - Nutch
Lucene ecosystem search :: http://search-lucene.com/



- Original Message 
> From: Neil Hooey 
> To: solr-user@lucene.apache.org
> Sent: Wed, May 4, 2011 9:36:24 PM
> Subject: Do boosts on values in multivalued fields still get consolidated?
> 
> Kapil Chhabra indicates on his blog that if you boost a value in  a
> multivalued field during index time, the boosts are consolidated  for
> every field, and the individual values are lost.
> 
> Here's the  link:
> http://blog.kapilchhabra.com/2008/01/solr-index-time-boost-facts-2
> 
> This  post is from 2008-01-20, but it still seems to be true in Solr 3.1.
> 
> Has  this behaviour been fixed in future versions of Solr, or are there
> plans to  fix it?
> 
> In general, when a user searches for a document, I'd like  to
> arbitrarily weight each keyword for that document during index  time.
> 
> For example if they searched for "q=keywords:monkey", and got these  
documents:
> keywords: [ monkey, ape, chimp, garage ]
> keywords: [ monkey,  cloud, food, door ]
> 
> I'd like to have boosts recorded like this, at index  time, based on
> keyword co-relevance:
> keywords: [ monkey:50, ape:50,  chimp:50, garage:0.1 ]
> keywords: [ monkey:1, cloud:1, food:1, door:1  ]
> 
> Since, in the first document, the word "monkey" is clearly related  to
> "ape" and "chimp", but "garage" is not. Similarly in the  second
> document, none of the keywords are really related to each other  at
> all.
> 
> I see a couple of potential solutions to this problem, in the  absence
> of boosts for multivalued fields:
> 1. Turn keyword lists into  strings, and use payloads: "monkey|50,
> ape|50, chimp|50, garage|0.1"
> 2.  Use dynamic fields of the form: keyword_*: keyword_monkey,
> keyword_ape, ...  and boost those fields.
> 
> Are those solutions feasible, or are there better  solutions to this problem?
> 
> - Neil
> 


Re: Formatted date/time in long field and javabinRW exception

2011-05-04 Thread Chris Hostetter

: That sounds quite reasonable indeed. But i don't understand why Solr doesn't 
: throw an exception when i actually index a string in a long fieldType while i 
: do remember getting some number formatting exception when pushing strings to 
: an integer fieldType.
: 
: With the current setup i can send a properly formatted date to a long 
: fieldType, which should, in my opinion, punish me with an exception.

I must have misunderstood your initial email -- I thought you said you 
built the index externally (using Nutch) and were only *reading* it with 
Solr.

When indexing, the same rules apply: field types only attempt to parse the 
data coming in if they need to, and they skip it for performance when they 
don't.

Field types like solr.TrieIntField and solr.TrieLongField will complain if 
you feed them an arbitrary string, because they parse their input into an 
actual numeric object for special binary encoding.

If you use the legacy solr.IntField or solr.LongField (which don't support 
ranges and don't do any special encoding), these don't parse the input at 
all -- they just index the raw string.
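
The difference described above can be mimicked with a toy sketch. These two
functions are hypothetical stand-ins, not Solr code: one parses its input the
way a Trie field has to, the other passes the raw string through untouched the
way the legacy types do.

```python
def index_parsing_long(raw):
    """Stand-in for a field type that must parse its input (Trie-style).

    Raises ValueError for anything that is not a valid integer.
    """
    return int(raw)


def index_legacy_long(raw):
    """Stand-in for a legacy field type: the raw string is stored as-is."""
    return raw


if __name__ == "__main__":
    date_string = "2011-05-04T00:00:00Z"
    print(index_legacy_long(date_string))   # accepted without complaint
    try:
        index_parsing_long(date_string)
    except ValueError as err:
        print("rejected:", err)
```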










-Hoss


Do boosts on values in multivalued fields still get consolidated?

2011-05-04 Thread Neil Hooey
Kapil Chhabra indicates on his blog that if you boost a value in a
multivalued field during index time, the boosts are consolidated for
every field, and the individual values are lost.

Here's the link:
http://blog.kapilchhabra.com/2008/01/solr-index-time-boost-facts-2

This post is from 2008-01-20, but it still seems to be true in Solr 3.1.

Has this behaviour been fixed in future versions of Solr, or are there
plans to fix it?

In general, when a user searches for a document, I'd like to
arbitrarily weight each keyword for that document during index time.

For example if they searched for "q=keywords:monkey", and got these documents:
keywords: [ monkey, ape, chimp, garage ]
keywords: [ monkey, cloud, food, door ]

I'd like to have boosts recorded like this, at index time, based on
keyword co-relevance:
keywords: [ monkey:50, ape:50, chimp:50, garage:0.1 ]
keywords: [ monkey:1, cloud:1, food:1, door:1 ]

Since, in the first document, the word "monkey" is clearly related to
"ape" and "chimp", but "garage" is not. Similarly in the second
document, none of the keywords are really related to each other at
all.

I see a couple of potential solutions to this problem, in the absence
of boosts for multivalued fields:
1. Turn keyword lists into strings, and use payloads: "monkey|50,
ape|50, chimp|50, garage|0.1"
2. Use dynamic fields of the form: keyword_*: keyword_monkey,
keyword_ape, ... and boost those fields.
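
The two representations listed above could be produced from one per-document
weight table, sketched here. The helper names are hypothetical, and this only
builds the values that would be sent for indexing; it does not configure any
payload or dynamic-field handling in Solr.

```python
def to_payload_string(weights):
    """Option 1: one string of 'keyword|weight' items, comma separated."""
    return ", ".join(f"{keyword}|{weight:g}" for keyword, weight in weights.items())


def to_dynamic_fields(weights, prefix="keyword_"):
    """Option 2: one dynamic field per keyword, mapped to its boost."""
    return {prefix + keyword: weight for keyword, weight in weights.items()}


if __name__ == "__main__":
    weights = {"monkey": 50, "ape": 50, "chimp": 50, "garage": 0.1}
    print(to_payload_string(weights))
    print(to_dynamic_fields(weights))
```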

Are those solutions feasible, or are there better solutions to this problem?

- Neil


Re: why query chinese character with bracket become phrase query by default?

2011-05-04 Thread Ahmet Arslan

Please see Robert's two solutions (autoGeneratePhraseQueries or PositionFilter) 
http://search-lucene.com/m/imED32mqqyp1/

--- On Thu, 5/5/11, cyang2010  wrote:

> From: cyang2010 
> Subject: why query chinese character with bracket become phrase query by 
> default?
> To: solr-user@lucene.apache.org
> Date: Thursday, May 5, 2011, 3:21 AM
> Hi,
> 
> In solr admin query full interface page, the following
> query with english
> become term query according to debug :
> 
> title_en_US: (blood red)
> 
> 
> title_en_US: (blood
> red)
> title_en_US: (blood
> red)
> title_en_US:blood
> title_en_US:red
> title_en_US:blood
> title_en_US:red
> 
> 
> However, using the same syntax with two chinese terms, the
> query result into
> a phrase query:
> 
> title_zh_CN: (我活)
> 
> 
> title_zh_CN:
> (我活)
> title_zh_CN:
> (我活)
> PhraseQuery(title_zh_CN:"我
> 活")
> title_zh_CN:"我
> 活"
> 
> 
> I do have different tokenizer/filter for those two
> different fields.   
> title_en_US is using all those common english specific
> tokenizer, while
> title_zh_CN uses
> solr.ChineseTokenizerFactory.   
> 
> I don't think those tokenizer determin whether things
> within bracket become
> term queries or phrase queries.
> 
> I really need to blindly pass user-input text to a solr
> field without doing
> any parsing, and hope it is all doing term query for each
> term contained in
> the search text.
> 
> How do i achieve that?
> 
> Thanks,
> 
> 
> cy
> 
> --
> View this message in context: 
> http://lucene.472066.n3.nabble.com/why-query-chinese-character-with-bracket-become-phrase-query-by-default-tp2901542p2901542.html
> Sent from the Solr - User mailing list archive at
> Nabble.com.
>


Solr SpellcheckComponent issue

2011-05-04 Thread Siddharth Powar
Hi,

I have been using the Solr SpellCheckComponent. One of my requirements is
that if a user types something like "add", Solr would return "adidas". To
get something like this, I used EdgeNGramFilterFactory and applied it to
the fields that I am indexing. So for adidas I will have something like "a",
"ad", "adi", "adid"... Correct me if I'm wrong, but shouldn't the distance
algorithm used internally match adidas with this approach?
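
To reason about this, it may help to look at the numbers. The sketch below
assumes a plain Levenshtein edit distance (the spellchecker's actual distance
measure and accuracy threshold depend on configuration) and generates edge
n-grams by hand rather than through Solr's filter.

```python
def edge_ngrams(term, min_len=1):
    """Return all prefixes of term, shortest first."""
    return [term[:i] for i in range(min_len, len(term) + 1)]


def levenshtein(a, b):
    """Classic dynamic-programming edit distance between two strings."""
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        curr = [i]
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            curr.append(min(prev[j] + 1,          # deletion
                            curr[j - 1] + 1,      # insertion
                            prev[j - 1] + cost))  # substitution
        prev = curr
    return prev[-1]


if __name__ == "__main__":
    grams = edge_ngrams("adidas")
    print(grams)
    for gram in grams:
        print(gram, levenshtein("add", gram))
```

Under these assumptions "add" is not itself one of the n-grams; the closest
ones ("ad" and "adi") are one edit away, while the full term "adidas" is three
edits away.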


Thanks,
Sid


Re: How to preserve filelist / commit-points after master restart

2011-05-04 Thread Maduranga Kannangara
If it helps, these are filelist output before and after restarting master on
a sample setup:

Before restarting master:
---

{indexSize=113.82 KB,
indexPath=C:\JavaStuff\Solr\replication\solrhome\master\data\index,
commits=[{indexVersion=1252480003511,generation=107,filelist=[_35.fdx,
_35.frq, _35.tii, _35.fdt, _35.tis, segments_2z, _35.fnm,

_35.prx]}]
,isMaster=true,isSlave=false,indexVersion=1252480003511,generation=107}

http://localhost:8081/master/replication
Poll Interval null
Local Index Index Version: 1252480003511, Generation: 107
 Location: C:\JavaStuff\Solr\replication\solrhome\repeater\data\index
Size: 57.29 KB
 Config Files To Replicate: schema.xml,stopwords.txt,synonyms.txt
Trigger Replication On: [optimize, startup]
 Times Replicated Since Startup: 25
Previous Replication Done At: Wed May 04 10:51:01 EST 2011
 Config Files Replicated At: null
Config Files Replicated: null
Times Config Files Replicated Since Startup: null

After restarting master:
---

{indexSize=113.82 KB,
indexPath=C:\JavaStuff\Solr\replication\solrhome\master\data\index,
commits=[],
isMaster=true,isSlave=false,indexVersion=1252480003512,generation=108}

Master http://localhost:8081/master/replication
Poll Interval null
Local Index Index Version: 1252480003511, Generation: 107
 Location: C:\JavaStuff\Solr\replication\solrhome\repeater\data\index
Size: 57.29 KB
 Config Files To Replicate: schema.xml,stopwords.txt,synonyms.txt
Trigger Replication On: [optimize, startup]
 Times Replicated Since Startup: 25
Previous Replication Done At: Wed May 04 10:51:01 EST 2011
 Config Files Replicated At: null
Config Files Replicated: null
Times Config Files Replicated Since Startup: null

Hope someone would be able to help.

Thanks




On Wed, May 4, 2011 at 3:46 PM, Maduranga Kannangara wrote:

> Hi All,
>
> We use Solr 1.4.1. Single core setup with a repeater (for QA) and a few
> slaves (for Production).
>
> Master will index many sources and make data ready. Once all data is
> "ready-for-production", optimization will take place. On master
> "replicateAfter" is set to "optimize". (Subsequently on repeater
> replicateAfter=commit,startup). We do not want to use
> replicateAfter=startup,optimize on master as that would release bad data. As
> you can see, a bunch of sources should fit together to be able to release a
> sensible product. So we use "replicateAfter=optimize" to denote data is now
> okay to move to the next level.
>
> The problem is when master is restarted the filelist command on
> ReplicationHandler returns nothing and replication will not take place until
> another optimise command is done to master.
>
> How can I preserve the "optimized" state (or filelist or commit-points, not
> sure what keyword to use..) even after a master restart so that slaves can
> carry on from there. (I saw the mail thread Yonik has answered:
> "Replication filelist command failure on container restart", but I am trying
> to figure out if its possible to persist this file-list or
> indexDeletionPolicy or whatever that state -- please correct me on that
> and sorry for my layman language)
>
> We have too many master indexes setup in this way, therefore its not a good
> idea for us to run optimize or have replicateAfter=startup on each index as
> that will reduce the data quality or possible level-of-automation.
>
> Any solution to work around or fix this issue is highly appreciated.
>
> Thanks in advance
> Madu
>
>
>


why query chinese character with bracket become phrase query by default?

2011-05-04 Thread cyang2010
Hi,

In the Solr admin full query interface page, the following query with English
terms becomes a term query according to the debug output:

title_en_US: (blood red)


title_en_US: (blood red)
title_en_US: (blood red)
title_en_US:blood title_en_US:red
title_en_US:blood title_en_US:red


However, using the same syntax with two Chinese terms, the query results in
a phrase query:

title_zh_CN: (我活)


title_zh_CN: (我活)
title_zh_CN: (我活)
PhraseQuery(title_zh_CN:"我 活")
title_zh_CN:"我 活"


I do have different tokenizers/filters for those two different fields.
title_en_US uses the common English-specific tokenizers, while
title_zh_CN uses solr.ChineseTokenizerFactory.

I don't think those tokenizers determine whether things within brackets become
term queries or phrase queries.

I really need to blindly pass user-input text to a Solr field without doing
any parsing, and have it run a term query for each term contained in
the search text.

How do i achieve that?

Thanks,


cy

--
View this message in context: 
http://lucene.472066.n3.nabble.com/why-query-chinese-character-with-bracket-become-phrase-query-by-default-tp2901542p2901542.html
Sent from the Solr - User mailing list archive at Nabble.com.


Field names with a period (.)

2011-05-04 Thread Leonardo Souza
Hi guys,

Can i have a field name with a period(.) ?
Like in *file.size*

thanks!

[ ]'s
Leonardo da S. Souza
 °v°   Linux user #375225
 /(_)\   http://counter.li.org/
 ^ ^


RE: Solr Terms and Date field issues

2011-05-04 Thread Viswa S

Erik,

I suspected the same, and set up a test instance to reproduce this. The date 
field I used is set up to capture indexing time; in other words, the schema has a 
default value of "NOW". However, I have reproduced this issue with fields which 
do not have defaults too.

On the second one, I did a delete->commit (with expungeDeletes=true) and then an 
optimize. All other fields show updated terms except the date fields. I have 
also double-checked to see if the Luke handler has any different terms, and it 
did not.


Thanks
Viswa


> Date: Wed, 4 May 2011 08:17:39 -0400
> Subject: Re: Solr Terms and Date field issues
> From: erickerick...@gmail.com
> To: solr-user@lucene.apache.org
> 
> Hmmm, this *looks* like you've changed your schema without
> re-indexing all your data so you're getting old (string?) values in
> that field, but that's just a guess. If this is really happening on a
> clean index it's a problem.
> 
> I'm also going to guess that you're not really deleting the documents
> you think. Are you committing after the deletes?
> 
> Best
> Erick
> 
> On Wed, May 4, 2011 at 2:18 AM, Viswa S  wrote:
> >
> > Hello,
> >
> > The terms query for a date field seems to get populated with some weird 
> > dates, many of these dates (1970,2009,2011-04-23) are not present in the 
> > indexed data.  Please see sample data below
> >
> > I also notice that a delete and optimize does not remove the relevant terms 
> > for date fields, the string fields seems work fine.
> >
> > Thanks
> > Viswa
> >
> > Results from Terms component:
> >
> >
> > 3479
> >
> > 3479
> >
> > 3479
> >
> > 3479
> >
> > 3479
> >
> > 3479
> >
> > 3479
> >
> > 3479
> >
> > 3479
> >
> > 265
> >
> >
> > Result from facet component, rounded by seconds.:
> >
> > 
> > 1
> >
> > 1148
> >
> > 2333
> >
> > +1SECOND
> >
> > 2011-05-03T06:14:14Z
> >
> > 2011-05-04T06:14:14Z
> >
  

Re: How do you index multiple documents in JSON?

2011-05-04 Thread Neil Hooey
I found out how to do it, but you have to have duplicate "add" keys in
a JSON object, which isn't easily serializable from a hash in most
languages.
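
A sketch of the workaround, assuming the update body has the shape
{"add": {"doc": ...}, "add": {"doc": ...}} as described above. Because a
Python dict cannot hold two "add" keys, the outer object is assembled by hand;
the helper name is hypothetical.

```python
import json


def build_multi_add_body(docs):
    """Return a JSON object string with one "add" key per document.

    The outer object is concatenated manually because a dict would keep
    only the last "add" entry.
    """
    parts = ['"add": ' + json.dumps({"doc": doc}) for doc in docs]
    return "{" + ", ".join(parts) + "}"


if __name__ == "__main__":
    body = build_multi_add_body([{"id": "1"}, {"id": "2"}])
    print(body)
    # A normal parse silently collapses the duplicates to the last one:
    print(json.loads(body))
```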

I reported an issue here:
https://issues.apache.org/jira/browse/SOLR-2496

Please vote for it if you agree.


On Wed, May 4, 2011 at 3:00 PM, Neil Hooey  wrote:
> How do you add multiple documents to Solr in JSON in a single request?
>
> In XML, I can just send this:
> 
>    1
>    2
> 
>
> There is an example on this page:
> http://wiki.apache.org/solr/UpdateJSON
>
> But it doesn't demonstrate how to send more than one document.
>
> Thanks,
>
> - Neil
>


Re: Is it possible to build Solr as a maven project?

2011-05-04 Thread Smiley, David W.
Hi folks. What you're supposed to do is run:

mvn -N -Pbootstrap install 

as the very first, one-time-only step.  It copies several custom jar files into 
your local repository. From then on you can build normally with Maven.

~ David Smiley
Author: http://www.packtpub.com/solr-1-4-enterprise-search-server/


On May 4, 2011, at 2:36 PM, Gabriele Kahlout wrote:

> but it doesn't build.
> 
> Now, I've checked out solr4 from the trunk and tried to build the maven
> project there, but it fails downloading berkleydb:
> 
> BUILD FAILURE
> 
> Total time: 1:07.367s
> Finished at: Wed May 04 20:33:29 CEST 2011
> Final Memory: 24M/81M
> 
> Failed to execute goal on project lucene-bdb: Could not resolve dependencies
> for project org.apache.lucene:lucene-bdb:jar:4.0-SNAPSHOT: Failure to find
> com.sleepycat:berkeleydb:jar:4.7.25 in
> http://download.carrot2.org/maven2/ was cached in the local repository,
> resolution will not be reattempted until
> the update interval of carrot2.org has elapsed or updates are forced ->
> [Help 1]
> 
> 
> I looked up to get the jar on my own but I didn't find a 4.7.25 version, the
> latest on oracle website (java edition) is 4.1. Where can i download this
> maven dependency from?
> 
> On Wed, May 4, 2011 at 1:26 PM, Gabriele Kahlout
> wrote:
> 
>> It worked after checking out the dev-tools folder. Thank you!
>> 
>> 
>> On Wed, May 4, 2011 at 1:20 PM, lboutros  wrote:
>> 
>>> 
>>> >> description="Copy Maven POMs from dev-tools/maven/ to their
>>> target
>>> locations">
>>>   
>>> 
>>> 
>>>   
>>> 
>>> 
>>>   
>>> 
>>> 
>> 
>> 
>> 
>> --
>> Regards,
>> K. Gabriele
>> 
>> 
>> 
> 
> 
> -- 
> Regards,
> K. Gabriele
> 







Re: Bulk update via filter query

2011-05-04 Thread Ahmet Arslan
> That won't work. External file fields
> are currently only usable within
> function queries, according to the Javadocs

Yes, you are right, only function queries.
However, he can dump the ids of the white-colored items to a text file in the
following format:

12278=20.0
9984=20.0

issue a commit and filter out these sold-out items at query time by
&fq={!frange l=1 u=30}externalFileFieldName

Though not sure this is what he is after.
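
A minimal sketch of the two pieces described above: producing the id=value lines for the external file and building the frange filter parameter. The helper names are hypothetical, and where the file has to be placed (as far as I recall, a file named external_<fieldname> in the index data directory) is an assumption to verify against the ExternalFileField Javadocs.

```python
def external_file_lines(doc_ids, value):
    """Format one 'id=value' line per document, as in the example above."""
    return ["%s=%s" % (doc_id, value) for doc_id in doc_ids]


def frange_filter(field_name, lower, upper):
    """Build the fq parameter value for a function range filter."""
    return "{!frange l=%s u=%s}%s" % (lower, upper, field_name)


# Ids and value taken from the example in the message above.
lines = external_file_lines([12278, 9984], 20.0)
print("\n".join(lines))
print(frange_filter("externalFileFieldName", 1, 30))
```

Note that frange keeps the documents whose function value lies between l and u, so whether the listed items end up selected or excluded depends on the bounds chosen and on the field's default value.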



Re: Bulk update via filter query

2011-05-04 Thread Ahmet Arslan
> This could work. Are there
> search/index performance drawbacks when using it?

I am not using this feature in production, but it is the only way that I know
to update a field without re-indexing the whole document.

If you can give us more details about your use case, others can suggest
different things.



mlt.qf with multiple fields

2011-05-04 Thread Luke Crouch
I'm trying to use MoreLikeThis handler and mlt.qf to boost certain fields:

/solr/mlt?q=id:1&mlt.fl=body_title,text&mlt.qf=body_title^20.0+text^1.0&mlt.mintf=1

Looks like this has been an outstanding issue:

http://lucene.472066.n3.nabble.com/Querying-multiple-fields-with-the-MoreLikeThis-handler-and-mlt-fl-td501767.html

Is this fixed or documented somewhere?

-L


JsonUpdateRequestHandler

2011-05-04 Thread Justine Mathews
Hi,

When I add the JSON request handler as below for update in solrconfig.xml


I am getting the following error. Version: apache-solr-1.4.1. Could you please
help?

Error is shown below,


Check your log files for more detailed information on what may be wrong.

If you want solr to continue after configuration errors, change:

<abortOnConfigurationError>false</abortOnConfigurationError>

in solrconfig.xml

-
org.apache.solr.common.SolrException: Error loading class 'solr.JsonUpdateRequestHandler'
    at org.apache.solr.core.SolrResourceLoader.findClass(SolrResourceLoader.java:375)
    at org.apache.solr.core.SolrCore.createInstance(SolrCore.java:413)
    at org.apache.solr.core.SolrCore.createRequestHandler(SolrCore.java:449)
    at org.apache.solr.core.RequestHandlers.initHandlersFromConfig(RequestHandlers.java:152)
    at org.apache.solr.core.SolrCore.<init>(SolrCore.java:556)
    at org.apache.solr.core.CoreContainer$Initializer.initialize(CoreContainer.java:137)
    at org.apache.solr.servlet.SolrDispatchFilter.init(SolrDispatchFilter.java:83)
    at org.mortbay.jetty.servlet.FilterHolder.doStart(FilterHolder.java:99)
    at org.mortbay.component.AbstractLifeCycle.start(AbstractLifeCycle.java:40)
    at org.mortbay.jetty.servlet.ServletHandler.initialize(ServletHandler.java:594)
    at org.mortbay.jetty.servlet.Context.startContext(Context.java:139)
    at org.mortbay.jetty.webapp.WebAppContext.startContext(WebAppContext.java:1218)
    at org.mortbay.jetty.handler.ContextHandler.doStart(ContextHandler.java:500)
    at org.mortbay.jetty.webapp.WebAppContext.doStart(WebAppContext.java:448)
    at org.mortbay.component.AbstractLifeCycle.start(AbstractLifeCycle.java:40)
    at org.mortbay.jetty.handler.HandlerCollection.doStart(HandlerCollection.java:147)
    at org.mortbay.jetty.handler.ContextHandlerCollection.doStart(ContextHandlerCollection.java:161)
    at org.mortbay.component.AbstractLifeCycle.start(AbstractLifeCycle.java:40)
    at org.mortbay.jetty.handler.HandlerCollection.doStart(HandlerCollection.java:147)
    at org.mortbay.component.AbstractLifeCycle.start(AbstractLifeCycle.java:40)
    at org.mortbay.jetty.handler.HandlerWrapper.doStart(HandlerWrapper.java:117)
    at org.mortbay.jetty.Server.doStart(Server.java:210)
    at org.mortbay.component.AbstractLifeCycle.start(AbstractLifeCycle.java:40)
    at org.mortbay.xml.XmlConfiguration.main(XmlConfiguration.java:929)
    at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
    at sun.reflect.NativeMethodAccessorImpl.invoke(Unknown Source)
    at sun.reflect.DelegatingMethodAccessorImpl.invoke(Unknown Source)
    at java.lang.reflect.Method.invoke(Unknown Source)
    at org.mortbay.start.Main.invokeMain(Main.java:183)
    at org.mortbay.start.Main.start(Main.java:497)
    at org.mortbay.start.Main.main(Main.java:115)
Caused by: java.lang.ClassNotFoundException: solr.JsonUpdateRequestHandler
    at java.net.URLClassLoader$1.run(Unknown Source)
    at java.security.AccessController.doPrivileged(Native Method)
    at java.net.URLClassLoader.findClass(Unknown Source)
    at java.lang.ClassLoader.loadClass(Unknown Source)
    at java.net.FactoryURLClassLoader.loadClass(Unknown Source)
    at java.lang.ClassLoader.loadClass(Unknown Source)
    at java.lang.Class.forName0(Native Method)
    at java.lang.Class.forName(Unknown Source)
    at org.apache.solr.core.SolrResourceLoader.findClass(SolrResourceLoader.java:359)
    ... 30 more
RequestURI=/solr/


--
Regards,
Justine K Mathews, MCSD.NET
Mob: +44-(0) 7795268546
http://www.justinemathews.com
http://uk.linkedin.com/in/justinemathews



Re: Is it possible to build Solr as a maven project?

2011-05-04 Thread lboutros
I opened and built only the projects I needed in NetBeans, i.e. Solr Core, Solr
Search Server, Solrj, Lucene Core, etc.

But with the given library you should be able to get to the next step.

Ludovic.

-
Jouve
France.
--
View this message in context: 
http://lucene.472066.n3.nabble.com/Is-it-possible-to-build-Solr-as-a-maven-project-tp2898068p2900363.html
Sent from the Solr - User mailing list archive at Nabble.com.


Re: Is it possible to build Solr as a maven project?

2011-05-04 Thread Gabriele Kahlout
Neither do I, but I was running mvn install. What do you do?

On Wed, May 4, 2011 at 9:11 PM, lboutros  wrote:

> I do not build this part, I don't need it.
>
> The lib was present in the branch_3x branch, but is not there anymore.
>
> You can download it here :
>
>
> http://search.lucidimagination.com/search/out?u=http%3A%2F%2Fdownloads.osafoundation.org%2Fdb%2Fdb-4.7.25.jar
>
> You have to install it locally.
>
> Ludovic.
>
> 2011/5/4 Gabriele Kahlout [via Lucene] <
> ml-node+2900157-349704242-383...@n3.nabble.com>
>
> > but it doesn't build.
> >
> > Now, I've checked out solr4 from the trunk and tried to build the maven
> > project there, but it fails downloading berkeleydb:
> >
> > BUILD FAILURE
> > 
> > Total time: 1:07.367s
> > Finished at: Wed May 04 20:33:29 CEST 2011
> > Final Memory: 24M/81M
> > 
> > Failed to execute goal on project lucene-bdb: Could not resolve
> > dependencies
> > for project org.apache.lucene:lucene-bdb:jar:4.0-SNAPSHOT: Failure to
> find
> > com.sleepycat:berkeleydb:jar:4.7.25 in
> > http://download.carrot2.org/maven2/ was cached in the local repository,
> > resolution will not be reattempted until
> > the update interval of carrot2.org has elapsed or updates are forced ->
> > [Help 1]
> >
> >
> > I looked up to get the jar on my own but I didn't find a 4.7.25 version,
> > the
> > latest on oracle website (java edition) is 4.1. Where can i download this
> > maven dependency from?
> >
> > On Wed, May 4, 2011 at 1:26 PM, Gabriele Kahlout
> > <[hidden email]<
> http://user/SendEmail.jtp?type=node&node=2900157&i=0&by-user=t>>wrote:
> >
> >
> >
>
>
> -
> Jouve
> France.
> --
> View this message in context:
> http://lucene.472066.n3.nabble.com/Is-it-possible-to-build-Solr-as-a-maven-project-tp2898068p2900323.html
> Sent from the Solr - User mailing list archive at Nabble.com.
>



-- 
Regards,
K. Gabriele



Re: Is it possible to build Solr as a maven project?

2011-05-04 Thread lboutros
I do not build this part, I don't need it.

The lib was present in the branch_3x branch, but is not there anymore.

You can download it here :

http://search.lucidimagination.com/search/out?u=http%3A%2F%2Fdownloads.osafoundation.org%2Fdb%2Fdb-4.7.25.jar

You have to install it locally.

Ludovic.

2011/5/4 Gabriele Kahlout [via Lucene] <
ml-node+2900157-349704242-383...@n3.nabble.com>

> but it doesn't build.
>
> Now, I've checked out solr4 from the trunk and tried to build the maven
> project there, but it fails downloading berkeleydb:
>
> BUILD FAILURE
> 
> Total time: 1:07.367s
> Finished at: Wed May 04 20:33:29 CEST 2011
> Final Memory: 24M/81M
> 
> Failed to execute goal on project lucene-bdb: Could not resolve
> dependencies
> for project org.apache.lucene:lucene-bdb:jar:4.0-SNAPSHOT: Failure to find
> com.sleepycat:berkeleydb:jar:4.7.25 in
> http://download.carrot2.org/maven2/ was cached in the local repository,
> resolution will not be reattempted until
> the update interval of carrot2.org has elapsed or updates are forced ->
> [Help 1]
>
>
> I looked up to get the jar on my own but I didn't find a 4.7.25 version,
> the
> latest on oracle website (java edition) is 4.1. Where can i download this
> maven dependency from?
>
> On Wed, May 4, 2011 at 1:26 PM, Gabriele Kahlout
> <[hidden 
> email]>wrote:
>
>
>


-
Jouve
France.
--
View this message in context: 
http://lucene.472066.n3.nabble.com/Is-it-possible-to-build-Solr-as-a-maven-project-tp2898068p2900323.html
Sent from the Solr - User mailing list archive at Nabble.com.
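
"Install it locally" presumably means registering the downloaded jar in the local Maven repository. A minimal sketch, assuming the jar was saved as db-4.7.25.jar and using the coordinates from the build error (com.sleepycat:berkeleydb:jar:4.7.25); the helper name is hypothetical, and install:install-file is the standard Maven goal for this.

```python
import subprocess


def install_file_command(jar_path, group_id, artifact_id, version):
    """Build the argument list for Maven's install:install-file goal."""
    return [
        "mvn", "install:install-file",
        "-Dfile=" + jar_path,
        "-DgroupId=" + group_id,
        "-DartifactId=" + artifact_id,
        "-Dversion=" + version,
        "-Dpackaging=jar",
    ]


# Coordinates copied from the "Failure to find" line in the build output.
cmd = install_file_command("db-4.7.25.jar", "com.sleepycat", "berkeleydb", "4.7.25")
print(" ".join(cmd))

# To actually run it (requires mvn on the PATH and the jar in the current directory):
# subprocess.check_call(cmd)
```

Because the failed lookup was cached, the next build may also need updates forced (for example with mvn -U) before Maven retries the resolution.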

How do you index multiple documents in JSON?

2011-05-04 Thread Neil Hooey
How do you add multiple documents to Solr in JSON in a single request?

In XML, I can just send this:

1
2


There is an example on this page:
http://wiki.apache.org/solr/UpdateJSON

But it doesn't demonstrate how to send more than one document.

Thanks,

- Neil
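
One possible shape for such a request body, sketched under a loud assumption: that the update handler accepts a JSON object with one "add" command per document (repeated keys), which is how I recall the UpdateJSON wiki page describing it. Behaviour may differ between Solr versions, so it is worth checking afterwards that every document was actually indexed. The helper name is hypothetical.

```python
import json


def build_add_body(docs):
    """Return JSON text containing one "add" command per document.

    Python dicts cannot hold duplicate keys, so the object is assembled
    by hand from individually serialized commands.
    """
    commands = ['"add": ' + json.dumps({"doc": doc}) for doc in docs]
    return "{" + ", ".join(commands) + "}"


body = build_add_body([{"id": "1"}, {"id": "2"}])
print(body)
```

The resulting text would be sent as the POST body to the JSON update handler with a JSON content type.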


Re: howto add custom QueryParser stage or filter

2011-05-04 Thread Erick Erickson
Hmmm, I'll have to defer this to people who understand it better.. Siiiggghh...

Erick

On Wed, May 4, 2011 at 9:56 AM, Bernd Fehling
 wrote:
> Hi Erick,
>
> On 04.05.2011 14:30, Erick Erickson wrote:
>>
>> Sure, all you have to do is derive from "the right class". See:
>> http://wiki.apache.org/solr/SolrPlugins#QParserPlugin
>>
>> But this'll be tricky since you have to get at the proper colons in your
>> example, and not remove the ones that delimit fields
>>
>> Might it be easier to clean the search terms in the app before
>> assembling the Solr query?
>
> There are many applications using my index and more to come.
> I thought about making the QueryParser more "foolproof" somehow,
> maybe via regex or filtering. But you are right, it is very tricky.
>
> Actually the number of "illegal" queries is very low.
> Maybe it is possible to catch an exception somewhere and give
> a clean 0 hits result page back instead of an error page or dump.
> Do you have any idea how to implement this?
>
> Would be pleased to have some kind of QueryValidator which first
> parses/validates the query and if the query is valid starts the search,
> otherwise refuses the search and gives 0 hits with status "illegal query".
> Just a rough idea.
>
> Regards
> Bernd
>
>>
>> Best
>> Erick
>>
>> On Wed, May 4, 2011 at 6:32 AM, Bernd Fehling
>>   wrote:
>>>
>>> Dear list,
>>>
>>> is it possible to add a custom QueryParser stage to solr
>>> or add a custom query filter?
>>>
>>> My aim is to filter out reserved characters from query terms,
>>> like ":" within a query term.
>>>
>>> query=text:(:foo AND bar)
>>> query=text:(foo AND b:ar)
>>>
>>> Regards
>>> Bernd
>>>
>


Re: Is it possible to build Solr as a maven project?

2011-05-04 Thread Gabriele Kahlout
but it doesn't build.

Now, I've checked out solr4 from the trunk and tried to build the maven
project there, but it fails downloading berkeleydb:

BUILD FAILURE

Total time: 1:07.367s
Finished at: Wed May 04 20:33:29 CEST 2011
Final Memory: 24M/81M

Failed to execute goal on project lucene-bdb: Could not resolve dependencies
for project org.apache.lucene:lucene-bdb:jar:4.0-SNAPSHOT: Failure to find
com.sleepycat:berkeleydb:jar:4.7.25 in
http://download.carrot2.org/maven2/ was cached in the local repository,
resolution will not be reattempted until
the update interval of carrot2.org has elapsed or updates are forced ->
[Help 1]


I tried to find the jar on my own but I didn't find a 4.7.25 version; the
latest on the Oracle website (Java edition) is 4.1. Where can I download this
Maven dependency from?

On Wed, May 4, 2011 at 1:26 PM, Gabriele Kahlout
wrote:

> It worked after checking out the dev-tools folder. Thank you!
>
>
> On Wed, May 4, 2011 at 1:20 PM, lboutros  wrote:
>
>>  
>>  >  description="Copy Maven POMs from dev-tools/maven/ to their
>> target
>> locations">
>>
>>  
>>  
>>
>>  
>>  
>>
>>  
>>
>
>
>
> --
> Regards,
> K. Gabriele
>
>
>


-- 
Regards,
K. Gabriele



Re: Bulk update via filter query

2011-05-04 Thread simon
That won't work. External file fields are currently only usable within
function queries, according to the Javadocs

On Wed, May 4, 2011 at 12:16 PM, Rih  wrote:

> This could work. Are there search/index performance drawbacks when using
> it?
>
>
> On Mon, May 2, 2011 at 6:22 PM, Ahmet Arslan  wrote:
>
> >
> >
> > Is there an efficient way to update multiple documents with common values
> > (e.g. color = white)? An example would be to mark all white-colored items
> > as
> > sold-out.
> >
> >
> >
> http://lucene.apache.org/solr/api/org/apache/solr/schema/ExternalFileField.html can be
> an option.
> >
> >
>


Re: Patch problems solr 1.4 - solr-2010

2011-05-04 Thread Shawn Heisey

On 5/4/2011 8:50 AM, Dyer, James wrote:

> I also should mention that solr-2010 is incorporated in Solr 3.1, so if you can
> upgrade you won't need a patch.  Note, however, that you will still want to
> apply the fix in solr-2462 regardless of the version as this fix hasn't been
> committed anywhere.


Found and fixed a problem with the patch against 3.1.  I have a fresh 
checkout of 3.1, with the 3x patch for SOLR-2469 applied.  The patch 
currently in SOLR-2462 will not apply cleanly.  I increased fuzz and 
ignored whitespace, which allowed it to apply, but then the build failed.


[javac] /opt/ncindex/src/lucene_solr_3_1/solr/src/java/org/apache/solr/spelling/SpellCheckCollator.java:80: cannot find symbol
[javac] symbol  : variable params
[javac] location: class org.apache.solr.spelling.SpellCheckCollator
[javac] params.set(CommonParams.TIME_ALLOWED, 50);


The fuzz and whitespace options added the new line to the wrong place -- 
before params was even defined/declared (whatever Java calls that).  I 
moved it to the right location according to the patch, and now it works.


I'll try to come up with a patch that works against 3.1 to put in the 
issue.  If that doesn't work out, I'll at least put a comment in it.


Thanks,
Shawn



Re: An error I can't manage to fix: java.lang.NoClassDefFoundError: org/apache/solr/search/QParserPlugin

2011-05-04 Thread Gavin Engel
I have a couple of <lib> directives in my solrconfig.xml:




Both of those should work, as far as I know.  Those are pointing to 2
different folders, and both have a copy of my jar file in them.  Yet, for
some reason Solr doesn't see them.  I'd love to know why.

On Wed, May 4, 2011 at 5:13 AM, Erick Erickson wrote:

> This is pretty fragile, the Jetty work directories come and go.
>
> I predict it will keep disappearing and/or you'll go through this same
> hassle
> next time you re-install or move to a new machine or...
>
> You *should* be able to just remove that directory entirely and still start
> w/o
> copying the jar.
>
> Are you absolutely sure you have a <lib> directive in your solrconfig.xml
> file that paths to a directory that has your spatial jar in it?
>
> Best
> Erick
>
> On Tue, May 3, 2011 at 8:42 PM, Gavin Engel  wrote:
> > Oh, I apparently figured out how to get the jar file to load, so problem
> is
> > solved I suppose.
> >
> > The fix seems very odd to me, but I got it from a comment on the SSP 2
> blog
> > page (
> >
> http://blog.jteam.nl/2009/08/03/geo-location-search-with-solr-and-lucene/comment-page-1/#comment-4774
> > ):
> >
> > The solution, for those of you getting the NoClassDefFoundError exception
> > thrown, is to put the jar file in your example directory, under:
> > solr/work/Jetty_0_0_0_0_8983_solr.war__solr__k1kf17/webapp/WEB-INF/lib/
> >
> >
> >
> >
> >
> > I created that odd directory structure first, copied in the jar, and
> started
> > Jetty.  It looked like the jar was deleted, so I re-copied the jar into
> it.
> >  The second time around, everything seems to have worked.
> >
> > I am lost as to why it's looking in that strange folder structure for the
> jar
> > file, instead of ./lib or ./solr/lib.
> >
> >
> >
> > On Tue, May 3, 2011 at 4:35 PM, Markus Jelsma <
> markus.jel...@openindex.io>wrote:
> >
> >> Where did you store the jar? Is it in a directory Solr looks for libs?
> >> Depending on your distro or set up there can be different places to
> store
> >> the
> >> jar. The easiest solution is to put it in a dir where other Solr libs
> are
> >> found or in a dir that you configured in a <lib> directive in
> solrconfig.
> >>
> >> > Hello all,
> >> >
> >> > I've been trying to add the Spatial Search Plugin to my Solr 1.4.1
> setup,
> >> >
> >> > and I get this error:
> >> > > java.lang.NoClassDefFoundError: org/apache/solr/search/QParserPlugin
> >> > > at java.lang.ClassLoader.defineClass1(Native Method)
> >> > >
> >> > >  at java.lang.ClassLoader.defineClass(ClassLoader.java:634)
> >> > >
> >> > > at
> >> > >
> java.security.SecureClassLoader.defineClass(SecureClassLoader.java:142)
> >> > >
> >> > >  at java.net.URLClassLoader.defineClass(URLClassLoader.java:277)
> >> > >
> >> > > ...
> >> > > Caused by: java.lang.ClassNotFoundException:
> >> > > org.apache.solr.search.QParserPlugin
> >> > > at java.net.URLClassLoader$1.run(URLClassLoader.java:217)
> >> > >
> >> > >  at java.security.AccessController.doPrivileged(Native Method)
> >> > >
> >> > > at java.net.URLClassLoader.findClass(URLClassLoader.java:205)
> >> > >
> >> > >  at java.lang.ClassLoader.loadClass(ClassLoader.java:321)
> >> > >
> >> > > at java.lang.ClassLoader.loadClass(ClassLoader.java:266)
> >> > >
> >> > >  ... 50 more
> >> >
> >> > I've been trying my best with the developers' documentation, but I am
> >> still
> >> > stuck on the install phase of SSP 2.0.  I wonder if there are users of
> >> SSP
> >> > 2 that can help me troubleshoot this, please?
> >> >
> >> > -Gavin
> >>
> >
>


Re: Bulk update via filter query

2011-05-04 Thread Rih
This could work. Are there search/index performance drawbacks when using it?


On Mon, May 2, 2011 at 6:22 PM, Ahmet Arslan  wrote:

>
>
> Is there an efficient way to update multiple documents with common values
> (e.g. color = white)? An example would be to mark all white-colored items
> as
> sold-out.
>
>
> http://lucene.apache.org/solr/api/org/apache/solr/schema/ExternalFileField.html can
> be an option.
>
>


RE: Patch problems solr 1.4 - solr-2010

2011-05-04 Thread Dyer, James
I also should mention that solr-2010 is incorporated in Solr 3.1, so if you can 
upgrade you won't need a patch.  Note, however, that you will still want to 
apply the fix in solr-2462 regardless of the version as this fix hasn't been 
committed anywhere.

James Dyer
E-Commerce Systems
Ingram Content Group
(615) 213-4311


-Original Message-
From: roySolr [mailto:royrutten1...@gmail.com] 
Sent: Wednesday, May 04, 2011 6:53 AM
To: solr-user@lucene.apache.org
Subject: Patch problems solr 1.4 - solr-2010

Hello,

I want to patch my solr installation(1.4.1) with
solr-2010.(https://issues.apache.org/jira/browse/SOLR-2010)

I need this feature:
Only return collations that are guaranteed to result in hits if re-queried 

Now I try the following command:

wget https://issues.apache.org/jira/secure/attachment/12457683/SOLR-2010_141.patch -O - | patch -p0 --dry-run

I get the following error message:

--13:51:35-- 
https://issues.apache.org/jira/secure/attachment/12457683/SOLR-2010_141.patch
Resolving issues.apache.org... 140.211.11.121
Connecting to issues.apache.org|140.211.11.121|:443... connected.
HTTP request sent, awaiting response... 200 OK
Length: 55846 (55K) [text/x-patch]
Saving to: `STDOUT'

100%[>]
55,846  59.3K/s   in 0.9s

13:51:37 (59.3 KB/s) - `-' saved [55846/55846]

(Stripping trailing CRs from patch.)
patching file src/common/org/apache/solr/common/params/SpellingParams.java
Hunk #1 FAILED at 81.
1 out of 1 hunk FAILED -- saving rejects to file
src/common/org/apache/solr/common/params/SpellingParams.java.rej
(Stripping trailing CRs from patch.)
patching file
src/java/org/apache/solr/handler/component/SpellCheckComponent.java
Hunk #1 FAILED at 24.
Hunk #2 FAILED at 141.
Hunk #3 FAILED at 155.
Hunk #4 FAILED at 214.
Hunk #5 FAILED at 252.
Hunk #6 FAILED at 262.
6 out of 6 hunks FAILED -- saving rejects to file
src/java/org/apache/solr/handler/component/SpellCheckComponent.java.rej
(Stripping trailing CRs from patch.)
patching file src/java/org/apache/solr/spelling/PossibilityIterator.java
(Stripping trailing CRs from patch.)
patching file src/java/org/apache/solr/spelling/RankedSpellPossibility.java
(Stripping trailing CRs from patch.)
patching file src/java/org/apache/solr/spelling/SpellCheckCollation.java
(Stripping trailing CRs from patch.)
patching file src/java/org/apache/solr/spelling/SpellCheckCollator.java
(Stripping trailing CRs from patch.)
patching file src/java/org/apache/solr/spelling/SpellCheckCorrection.java
(Stripping trailing CRs from patch.)
patching file
src/solrj/org/apache/solr/client/solrj/response/SpellCheckResponse.java
Hunk #1 FAILED at 31.
Hunk #2 FAILED at 45.
Hunk #3 FAILED at 108.
Hunk #4 FAILED at 210.
4 out of 4 hunks FAILED -- saving rejects to file
src/solrj/org/apache/solr/client/solrj/response/SpellCheckResponse.java.rej
(Stripping trailing CRs from patch.)
patching file
src/test/org/apache/solr/client/solrj/response/TestSpellCheckResponse.java
Hunk #1 FAILED at 23.
Hunk #2 FAILED at 109.
2 out of 2 hunks FAILED -- saving rejects to file
src/test/org/apache/solr/client/solrj/response/TestSpellCheckResponse.java.rej
(Stripping trailing CRs from patch.)
patching file src/test/org/apache/solr/spelling/SpellCheckCollatorTest.java
(Stripping trailing CRs from patch.)
patching file
src/test/org/apache/solr/spelling/SpellPossibilityIteratorTest.java
(Stripping trailing CRs from patch.)
patching file src/test/test-files/solr/conf/schema.xml
Hunk #1 FAILED at 19.
Hunk #2 FAILED at 50.
Hunk #3 FAILED at 100.
Hunk #4 FAILED at 408.
Hunk #5 FAILED at 427.
Hunk #6 FAILED at 453.
Hunk #7 FAILED at 535.
7 out of 7 hunks FAILED -- saving rejects to file
src/test/test-files/solr/conf/schema.xml.rej
(Stripping trailing CRs from patch.)
patching file src/test/test-files/solr/conf/solrconfig.xml
Hunk #1 FAILED at 29.
Hunk #2 FAILED at 116.
Hunk #3 FAILED at 340.
Hunk #4 FAILED at 396.
4 out of 4 hunks FAILED -- saving rejects to file
src/test/test-files/solr/conf/solrconfig.xml.rej


--
View this message in context: 
http://lucene.472066.n3.nabble.com/Patch-problems-solr-1-4-solr-2010-tp2898443p2898443.html
Sent from the Solr - User mailing list archive at Nabble.com.


RE: Patch problems solr 1.4 - solr-2010

2011-05-04 Thread Dyer, James
I just did a clean check out on the 1.4.1 branch and then applied the latest 
(10/22/2010) version of SOLR-2010_141.patch and it applied cleanly.  

I noticed from the listing you sent that patch strips trailing CRs from the
text of any "new" files.  Maybe it's not doing this for you on files that need
to be modified?  So maybe try running "dos2unix" on the patch file before
trying to apply it.  That said, I use Eclipse/Subclipse to interact with svn,
so I've not tried applying this patch from a command line.
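
Where dos2unix is not available, the same line-ending conversion can be sketched in a few lines. This is only an illustration of the suggestion above, with hypothetical helper names; whether CRLF line endings are really what makes the hunks fail is an assumption.

```python
def strip_carriage_returns(data):
    """Convert CRLF line endings to LF in a bytes object."""
    return data.replace(b"\r\n", b"\n")


def normalize_patch_file(path):
    """Rewrite the file at 'path' in place with LF line endings."""
    with open(path, "rb") as handle:
        data = handle.read()
    with open(path, "wb") as handle:
        handle.write(strip_carriage_returns(data))


# Example on an in-memory snippet; call normalize_patch_file() on a saved patch file.
print(strip_carriage_returns(b"--- a/File.java\r\n+++ b/File.java\r\n"))
```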

I also want to bring to your attention that we found a nasty bug when we went 
to production that can cause OOMs if a user enters a query with several 
misspelled words.  See SOLR-2462 for a description of the problem and a patch.  
This patch will not apply to 1.4.1 cleanly, but if you tell your svn client to 
ignore 1 leading path name segment, you can get it to apply the change to 
"PossibilityIterator.java", which is all that is necessary to fix the bug.

James Dyer
E-Commerce Systems
Ingram Content Group
(615) 213-4311



Re: howto add custom QueryParser stage or filter

2011-05-04 Thread Bernd Fehling

Hi Erick,

On 04.05.2011 14:30, Erick Erickson wrote:
> Sure, all you have to do is derive from "the right class". See:
> http://wiki.apache.org/solr/SolrPlugins#QParserPlugin
>
> But this'll be tricky since you have to get at the proper colons in your
> example, and not remove the ones that delimit fields
>
> Might it be easier to clean the search terms in the app before
> assembling the Solr query?

There are many applications using my index and more to come.
I thought about making the QueryParser more "foolproof" somehow,
maybe via regex or filtering. But you are right, it is very tricky.

Actually the number of "illegal" queries is very low.
Maybe it is possible to catch an exception somewhere and give
a clean 0 hits result page back instead of an error page or dump.
Do you have any idea how to implement this?

Would be pleased to have some kind of QueryValidator which first
parses/validates the query and if the query is valid starts the search,
otherwise refuses the search and gives 0 hits with status "illegal query".
Just a rough idea.

Regards
Bernd

> Best
> Erick
>
> On Wed, May 4, 2011 at 6:32 AM, Bernd Fehling
>   wrote:
>> Dear list,
>>
>> is it possible to add a custom QueryParser stage to solr
>> or add a custom query filter?
>>
>> My aim is to filter out reserved characters from query terms,
>> like ":" within a query term.
>>
>> query=text:(:foo AND bar)
>> query=text:(foo AND b:ar)
>>
>> Regards
>> Bernd
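
The alternative of cleaning the search terms in the application before assembling the Solr query could look roughly like this. It is a sketch only: it escapes the characters the Lucene query syntax treats as special inside each user-supplied term, so that only the colon added by the application delimits the field. The function names and the way the query is assembled are hypothetical.

```python
# Characters with special meaning in the Lucene query syntax.
SPECIAL_CHARS = set('\\+-!():^[]"{}~*?|&')


def escape_term(term):
    """Backslash-escape every special character in a single user term."""
    return "".join("\\" + ch if ch in SPECIAL_CHARS else ch for ch in term)


def build_query(field, terms, operator="AND"):
    """Assemble field:(t1 OP t2 ...) from already separated user terms."""
    escaped = [escape_term(term) for term in terms]
    return "%s:(%s)" % (field, (" %s " % operator).join(escaped))


# The two problem queries from the original question.
print(build_query("text", [":foo", "bar"]))
print(build_query("text", ["foo", "b:ar"]))
```

Escaping keeps the stray colon as part of the term instead of silently dropping it; stripping the characters would be a one-line change if that is preferred.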



Re: Deprication warnings in Solr log

2011-05-04 Thread Ward Bekker
That attribute is not defined. Is it required?

Regards,

Ward

On May 4, 2011, at 3:11 PM, lboutros wrote:

> did you update this part in your solrconfig.xml ?
> 
> <luceneMatchVersion>LUCENE_31</luceneMatchVersion>
> 
> Ludovic.
> 
> 
> -
> Jouve
> France.
> --
> View this message in context: 
> http://lucene.472066.n3.nabble.com/Deprication-warnings-in-Solr-log-tp2898163p2898749.html
> Sent from the Solr - User mailing list archive at Nabble.com.



Re: Deprication warnings in Solr log

2011-05-04 Thread lboutros
did you update this part in your solrconfig.xml ?

<luceneMatchVersion>LUCENE_31</luceneMatchVersion>

Ludovic.


-
Jouve
France.

Re: Deprication warnings in Solr log

2011-05-04 Thread Ward Bekker
Hi Erick,

I've removed the old indexes and rebuilt them, but I'm still getting the
deprecation warnings.

Regards,

Ward

On May 4, 2011, at 2:23 PM, Erick Erickson wrote:

> What this is saying is that your index was created with a 2.x format.
> That format
> is supported in 3.x, but will NOT be supported in 4.x.
> 
> So re-index your data with a 3.x Solr and this should go away...
> 
> Best
> Erick
> 
> On Wed, May 4, 2011 at 6:20 AM, Ward Bekker  wrote:
>> Hello,
>> 
>> I just upgraded to 3.1.  After this the solr.log is showing deprecation 
>> warnings (see below).
>> 
>> What can I do about this?
>> 
>> Regards,
>> 
>> Ward
>> 
>> -
>> 
>> WARNING: WhitespaceTokenizerFactory is using deprecated LUCENE_24 emulation. 
>> You should at some point declare and reindex to at least 3.0, because 2.x 
>> emulation is deprecated and will be removed in 4.0
>> May 3, 2011 12:26:48 PM org.apache.solr.util.plugin.AbstractPluginLoader load
>> INFO: created : org.apache.solr.analysis.WordDelimiterFilterFactory
>> May 3, 2011 12:26:48 PM org.apache.solr.util.plugin.AbstractPluginLoader load
>> INFO: created : org.apache.solr.analysis.LowerCaseFilterFactory
>> May 3, 2011 12:26:48 PM org.apache.solr.analysis.BaseTokenStreamFactory 
>> assureMatchVersion
>> WARNING: LowerCaseFilterFactory is using deprecated LUCENE_24 emulation. You 
>> should at some point declare and reindex to at least 3.0, because 2.x 
>> emulation is deprecated and will be removed in 4.0
>> May 3, 2011 12:26:48 PM org.apache.solr.util.plugin.AbstractPluginLoader load
>> INFO: created : org.apache.solr.analysis.RemoveDuplicatesTokenFilterFactory
>> May 3, 2011 12:26:48 PM org.apache.solr.util.plugin.AbstractPluginLoader load
>> INFO: created : org.apache.solr.analysis.HTMLStripCharFilterFactory
>> May 3, 2011 12:26:48 PM org.apache.solr.util.plugin.AbstractPluginLoader load
>> INFO: created : org.apache.solr.analysis.WhitespaceTokenizerFactory
>> May 3, 2011 12:26:48 PM org.apache.solr.analysis.BaseTokenStreamFactory 
>> assureMatchVersion
>> WARNING: WhitespaceTokenizerFactory is using deprecated LUCENE_24 emulation. 
>> You should at some point declare and reindex to at least 3.0, because 2.x 
>> emulation is deprecated and will be removed in 4.0
>> May 3, 2011 12:26:48 PM org.apache.solr.util.plugin.AbstractPluginLoader load
>> INFO: created : org.apache.solr.analysis.WordDelimiterFilterFactory
>> May 3, 2011 12:26:48 PM org.apache.solr.util.plugin.AbstractPluginLoader load
>> INFO: created : org.apache.solr.analysis.LowerCaseFilterFactory
>> May 3, 2011 12:26:48 PM org.apache.solr.analysis.BaseTokenStreamFactory 
>> assureMatchVersion
>> WARNING: LowerCaseFilterFactory is using deprecated LUCENE_24 emulation. You 
>> should at some point declare and reindex to at least 3.0, because 2.x 
>> emulation is deprecated and will be removed in 4.0



Re: SOLR 3.1 and JSON capabilities

2011-05-04 Thread Erick Erickson
Have you looked here?
http://wiki.apache.org/solr/SolJSON

If so, what parts are you having trouble with?
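
For indexing (as opposed to JSON responses), here is a rough sketch of
what a JSON add request might look like from a client. It assumes a JSON
update handler is registered at /update/json, as I believe it is in the
3.1 example solrconfig.xml, and that the field names (id and title here,
both made up) exist in your schema.xml; build_json_update is a
hypothetical helper, not a Solr API.

```python
import json
import urllib.request


def build_json_update(solr_url, doc, commit=True):
    """Build (but do not send) a POST request that adds one document."""
    payload = {"add": {"doc": doc}}
    url = solr_url.rstrip("/") + "/update/json"
    if commit:
        url += "?commit=true"
    return urllib.request.Request(
        url,
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )


# Assumed base URL and made-up field values, for illustration only.
req = build_json_update("http://localhost:8983/solr",
                        {"id": "doc1", "title": "Hello"})
# urllib.request.urlopen(req)  # uncomment to actually send the request
```

Fields that are not defined in schema.xml (and not covered by a dynamic
field) should be rejected, so the schema still decides what can be indexed.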

Best
Erick

On Wed, May 4, 2011 at 8:11 AM, pgaur  wrote:
> I would like to get more details about the JSON capabilities provided by SOLR
> 3.1.
>
> Does this really mean that, going by the schema.xml, I could just send a JSON
> document with values for the fields defined in schema.xml and the
> document gets indexed in SOLR?
>
> Any help would be highly appreciated.
>
> --
> View this message in context: 
> http://lucene.472066.n3.nabble.com/SOLR-3-1-and-JSON-capabilities-tp2898507p2898507.html
> Sent from the Solr - User mailing list archive at Nabble.com.
>


Re: howto add custom QueryParser stage or filter

2011-05-04 Thread Erick Erickson
Sure, all you have to do is derive from "the right class". See:
http://wiki.apache.org/solr/SolrPlugins#QParserPlugin

But this'll be tricky since you have to get at the proper colons in your
example, and not remove the ones that delimit fields

Might it be easier to clean the search terms in the app before
assembling the Solr query?
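
A minimal sketch of such app-side cleaning, assuming the application
still holds the field name and the user's terms separately before it
builds the query string. It escapes rather than removes the reserved
characters; escape_term and build_query are hypothetical helper names,
not part of Solr.

```python
import re

# Characters that have special meaning in the Lucene query syntax.
_SPECIAL = re.compile(r'([\\+\-!():^\[\]"{}~*?|&;])')


def escape_term(term):
    """Backslash-escape every reserved character inside a single term."""
    return _SPECIAL.sub(r"\\\1", term)


def build_query(field, terms, op="AND"):
    """Assemble field:(t1 OP t2 ...) from raw user terms.

    The colon after the field name is added here, after escaping, so it
    is never confused with a colon typed by the user.
    """
    cleaned = [escape_term(t) for t in terms if t]
    joined = (" " + op + " ").join(cleaned)
    return field + ":(" + joined + ")"


print(build_query("text", [":foo", "bar"]))
print(build_query("text", ["foo", "b:ar"]))
```

Because escaping happens per term, the field-delimiting colon never needs
to be told apart from the stray ones, which is the tricky part on the
server side.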

Best
Erick

On Wed, May 4, 2011 at 6:32 AM, Bernd Fehling
 wrote:
> Dear list,
>
> is it possible to add a custom QueryParser stage to solr
> or add a custom query filter?
>
> My aim is to filter out reserved characters from query terms,
> like ":" within a query term.
>
> query=text:(:foo AND bar)
> query=text:(foo AND b:ar)
>
> Regards
> Bernd
>


Re: Deprication warnings in Solr log

2011-05-04 Thread Erick Erickson
What this is saying is that your index was created with a 2.x format.
That format
is supported in 3.x, but will NOT be supported in 4.x.

So re-index your data with a 3.x Solr and this should go away...

Best
Erick

On Wed, May 4, 2011 at 6:20 AM, Ward Bekker  wrote:
> Hello,
>
> I just upgraded to 3.1.  After this the solr.log is showing deprecation 
> warnings (see below).
>
> What can I do about this?
>
> Regards,
>
> Ward
>
> -
>
> WARNING: WhitespaceTokenizerFactory is using deprecated LUCENE_24 emulation. 
> You should at some point declare and reindex to at least 3.0, because 2.x 
> emulation is deprecated and will be removed in 4.0
> May 3, 2011 12:26:48 PM org.apache.solr.util.plugin.AbstractPluginLoader load
> INFO: created : org.apache.solr.analysis.WordDelimiterFilterFactory
> May 3, 2011 12:26:48 PM org.apache.solr.util.plugin.AbstractPluginLoader load
> INFO: created : org.apache.solr.analysis.LowerCaseFilterFactory
> May 3, 2011 12:26:48 PM org.apache.solr.analysis.BaseTokenStreamFactory 
> assureMatchVersion
> WARNING: LowerCaseFilterFactory is using deprecated LUCENE_24 emulation. You 
> should at some point declare and reindex to at least 3.0, because 2.x 
> emulation is deprecated and will be removed in 4.0
> May 3, 2011 12:26:48 PM org.apache.solr.util.plugin.AbstractPluginLoader load
> INFO: created : org.apache.solr.analysis.RemoveDuplicatesTokenFilterFactory
> May 3, 2011 12:26:48 PM org.apache.solr.util.plugin.AbstractPluginLoader load
> INFO: created : org.apache.solr.analysis.HTMLStripCharFilterFactory
> May 3, 2011 12:26:48 PM org.apache.solr.util.plugin.AbstractPluginLoader load
> INFO: created : org.apache.solr.analysis.WhitespaceTokenizerFactory
> May 3, 2011 12:26:48 PM org.apache.solr.analysis.BaseTokenStreamFactory 
> assureMatchVersion
> WARNING: WhitespaceTokenizerFactory is using deprecated LUCENE_24 emulation. 
> You should at some point declare and reindex to at least 3.0, because 2.x 
> emulation is deprecated and will be removed in 4.0
> May 3, 2011 12:26:48 PM org.apache.solr.util.plugin.AbstractPluginLoader load
> INFO: created : org.apache.solr.analysis.WordDelimiterFilterFactory
> May 3, 2011 12:26:48 PM org.apache.solr.util.plugin.AbstractPluginLoader load
> INFO: created : org.apache.solr.analysis.LowerCaseFilterFactory
> May 3, 2011 12:26:48 PM org.apache.solr.analysis.BaseTokenStreamFactory 
> assureMatchVersion
> WARNING: LowerCaseFilterFactory is using deprecated LUCENE_24 emulation. You 
> should at some point declare and reindex to at least 3.0, because 2.x 
> emulation is deprecated and will be removed in 4.0


Re: full-import called simultaneously for multiple core.

2011-05-04 Thread Erick Erickson
Hmmm, Can you provide more details? I know of no reason this
isn't working...

Best
Erick

On Wed, May 4, 2011 at 3:27 AM, Kannan  wrote:
> I could see a "required field missing" exception for a few docs; apart from
> that I could not see any other exception.
>
>
> --
> View this message in context: 
> http://lucene.472066.n3.nabble.com/full-import-called-simultaneously-for-multiple-core-tp2894606p2897746.html
> Sent from the Solr - User mailing list archive at Nabble.com.
>


Re: Using lowercase as field type

2011-05-04 Thread Erick Erickson
OK, what is your proof that they're not overwriting? Because the
deleted documents are still in the index, and looking at, say,
terms will show them until an optimize is done.

The deleted copies won't be shown in search results etc, but
the underlying data is still in the index.

If that's not the problem, let's see your schema definitions for the
field in question, and what you're seeing that you believe shows
the documents not being overwritten.

You might review this page:
http://wiki.apache.org/solr/UsingMailingLists

Best
Erick

On Wed, May 4, 2011 at 2:22 AM, Isan Fulia  wrote:
> I want multiple documents with the same unique key to overwrite each other, but
> they are not overwriting because the unique key uses the lowercase field type
>
> On 4 May 2011 11:45, Markus Jelsma  wrote:
>
>> So those multiple documents overwrite each other? In that case, your data is
>> not suited for a lowercased docID. I'd recommend not doing any analysis on
>> the docID to prevent such headaches.
>>
>> > Hi ,
>> >
>> > My schema has a field of type lowercase (for applying the lowercase
>> > filter factory) that is the unique key. But it's no longer behaving as a
>> > unique key. Multiple documents with the same value for the unique key are
>> > getting indexed.
>> > Does anyone know why this is happening, or is it that a field of type
>> > lowercase cannot be the unique key?
>>
>
>
>
> --
> Thanks & Regards,
> Isan Fulia.
>


Re: Solr Terms and Date field issues

2011-05-04 Thread Erick Erickson
Hmmm, this *looks* like you've changed your schema without
re-indexing all your data so you're getting old (string?) values in
that field, but that's just a guess. If this is really happening on a
clean index it's a problem.

I'm also going to guess that you're not really deleting the documents
you think. Are you committing after the deletes?

Best
Erick

On Wed, May 4, 2011 at 2:18 AM, Viswa S  wrote:
>
> Hello,
>
> The terms query for a date field seems to get populated with some weird 
> dates, many of these dates (1970,2009,2011-04-23) are not present in the 
> indexed data.  Please see sample data below
>
> I also notice that a delete and optimize does not remove the relevant terms
> for date fields; the string fields seem to work fine.
>
> Thanks
> Viswa
>
> Results from Terms component:
>
>
> 3479
>
> 3479
>
> 3479
>
> 3479
>
> 3479
>
> 3479
>
> 3479
>
> 3479
>
> 3479
>
> 265
>
>
> Result from facet component, rounded by seconds.:
>
> 
> 1
>
> 1148
>
> 2333
>
> +1SECOND
>
> 2011-05-03T06:14:14Z
>
> 2011-05-04T06:14:14Z
>


Re: An error I can't manage to fix: java.lang.NoClassDefFoundError: org/apache/solr/search/QParserPlugin

2011-05-04 Thread Erick Erickson
This is pretty fragile, the Jetty work directories come and go.

I predict it will keep disappearing and/or you'll go through this same hassle
next time you re-install or move to a new machine or...

You *should* be able to just remove that directory entirely and still start w/o
copying the jar.

Are you absolutely sure you have a <lib> directive in your solrconfig.xml
file that points to a directory that has your spatial jar in it?

Best
Erick

On Tue, May 3, 2011 at 8:42 PM, Gavin Engel  wrote:
> Oh, I apparently figured out how to get the jar file to load, so problem is
> solved I suppose.
>
> The fix seems very odd to me, but I got it from a comment on the SSP 2 blog
> page (
> http://blog.jteam.nl/2009/08/03/geo-location-search-with-solr-and-lucene/comment-page-1/#comment-4774
> ):
>
> The solution, for those of you getting the NoClassDefFoundError exception
> thrown, is to put the jar file in your example directory, under:
> solr/work/Jetty_0_0_0_0_8983_solr.war__solr__k1kf17/webapp/WEB-INF/lib/
>
>
>
>
>
> I created that odd directory structure first, copied in the jar, and started
> Jetty.  It looked like the jar was deleted, so I re-copied the jar into it.
>  The second time around, everything seems to have worked.
>
> I am lost as to why it's looking in that strange folder structure for the jar
> file, instead of ./lib or ./solr/lib.
>
>
>
> On Tue, May 3, 2011 at 4:35 PM, Markus Jelsma 
> wrote:
>
>> Where did you store the jar? Is it in a directory Solr looks for libs?
>> Depending on your distro or setup there can be different places to store
>> the jar. The easiest solution is to put it in a dir where other Solr libs are
>> found or in a dir that you configured in a <lib> directive in solrconfig.
>>
>> > Hello all,
>> >
>> > I've been trying to add the Spatial Search Plugin to my Solr 1.4.1 setup,
>> >
>> > and I get this error:
>> > > java.lang.NoClassDefFoundError: org/apache/solr/search/QParserPlugin
>> > > at java.lang.ClassLoader.defineClass1(Native Method)
>> > >
>> > >  at java.lang.ClassLoader.defineClass(ClassLoader.java:634)
>> > >
>> > > at
>> > > java.security.SecureClassLoader.defineClass(SecureClassLoader.java:142)
>> > >
>> > >  at java.net.URLClassLoader.defineClass(URLClassLoader.java:277)
>> > >
>> > > ...
>> > > Caused by: java.lang.ClassNotFoundException:
>> > > org.apache.solr.search.QParserPlugin
>> > > at java.net.URLClassLoader$1.run(URLClassLoader.java:217)
>> > >
>> > >  at java.security.AccessController.doPrivileged(Native Method)
>> > >
>> > > at java.net.URLClassLoader.findClass(URLClassLoader.java:205)
>> > >
>> > >  at java.lang.ClassLoader.loadClass(ClassLoader.java:321)
>> > >
>> > > at java.lang.ClassLoader.loadClass(ClassLoader.java:266)
>> > >
>> > >  ... 50 more
>> >
>> > I've been trying my best with the developers' documentation, but I am
>> > still stuck on the install phase of SSP 2.0.  I wonder if there are users
>> > of SSP 2 that can help me troubleshoot this, please?
>> >
>> > -Gavin
>>
>


SOLR 3.1 and JSON capabilities

2011-05-04 Thread pgaur
I would like to get more details about the JSON capabilities provided by SOLR
3.1.

Does this really mean that, going by the schema.xml, I could just send a JSON
document with values for the fields defined in schema.xml and the
document gets indexed in SOLR?

Any help would be highly appreciated.



Re: Replicaiton Fails with Unreachable error when master host is responding.

2011-05-04 Thread Erick Erickson
Thanks for bringing closure here. Problems like this drive me crazy,
especially when the solution is really simple, but hard to figure out!
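
A quick way to catch this class of problem before it reaches replication
is to check host names against the letters-digits-hyphen rule. The
sketch below is an illustration under assumptions: is_valid_hostname is a
made-up helper, and it does not claim to reproduce what the Java
networking classes do internally.

```python
import re

# One DNS label: 1 to 63 characters, letters/digits/hyphens only,
# and no hyphen at the start or end.
_LABEL = re.compile(r"[A-Za-z0-9](?:[A-Za-z0-9-]{0,61}[A-Za-z0-9])?")


def is_valid_hostname(host):
    """Return True if every dot-separated label follows the LDH rule."""
    if not host or len(host) > 253:
        return False
    if host.endswith("."):
        host = host[:-1]  # a single trailing dot (fully qualified) is fine
    return all(_LABEL.fullmatch(label) for label in host.split("."))


print(is_valid_hostname("solr-master-01_dev.la.bo"))  # underscore in a label
print(is_valid_hostname("solr-master-01-dev.la.bo"))  # hyphen instead
```

Running a check like this over the masterUrl hosts in a config would flag
the name from this thread, while the hyphenated variant passes.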

Erick

On Wed, May 4, 2011 at 1:14 AM, Jed Glazner  wrote:
> So it turns out that it's the host names.  According to the DNS RFC, underscores
> are not valid in host names. Most DNS servers now support them, but strictly
> speaking they are not in the RFC.  So there must be something in the underlying
> Java classes that borks when using underscores in host names, though I
> didn't see anything in the stack trace that indicated an invalid host name
> exception. That was most likely the issue, though.  Once I changed the host name
> to the master's IP address, replication worked great.  So I'm working with our
> IT to remove underscores from our host names.
>
> Just thought I would post my answer here in case anyone else had that issue.
>
> Thanks.
>
> Jed.
>
> On 04/28/2011 02:03 PM, Mike Sokolov wrote:
>>
>> No clue. Try wireshark to gather more data?
>>
>> On 04/28/2011 02:53 PM, Jed Glazner wrote:
>>>
>>> Anybody?
>>>
>>> On 04/27/2011 01:51 PM, Jed Glazner wrote:

 Hello All,

 I'm having a very strange problem that I just can't figure out. The
 slave is not able to replicate from the master, even though the master
 is reachable from the slave machine.  I can telnet to the port it's
 running on, I can use text based browsers to navigate the master from
 the slave. I just don't understand why it won't replicate.  The admin
 screen gives me an Unreachable in the status, and in the log there is an
 exception thrown.  Details below:

 BACKGROUND:

 OS: Arch Linux
 Solr Version: svn revision 1096983 from
 https://svn.apache.org/repos/asf/lucene/dev/branches/branch_3x/
 No custom plugins, just whatever came with the version above.
 Java Setup:

 java version "1.6.0_22"
 OpenJDK Runtime Environment (IcedTea6 1.10)
 (ArchLinux-6.b22_1.10-1-x86_64)
 OpenJDK 64-Bit Server VM (build 19.0-b09, mixed mode)

 We have 3 cores running, all 3 cores are not able to replicate.

 The admin on the slave shows  the Master as
 http://solr-master-01_dev.la.bo:8983/solr/music/replication  -
 *Unreachable*
 Replication def on the slave:

    <requestHandler name="/replication" class="solr.ReplicationHandler">
      <lst name="slave">
        <str name="masterUrl">http://solr-master-01_dev.la.bo:8983/solr/music/replication</str>
        <str name="pollInterval">00:15:00</str>
      </lst>
    </requestHandler>

 Replication def on the master:

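    <requestHandler name="/replication" class="solr.ReplicationHandler">
      <lst name="master">
        <str name="replicateAfter">commit</str>
        <str name="replicateAfter">startup</str>
        <str name="confFiles">schema.xml,stopwords.txt</str>
      </lst>
    </requestHandler>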

 Below is the log start to finish for replication attempts, note that it
 says connection refused, however, I can telnet to 8983 from the slave to
 the master, so I know it's up and reachable from the slave:

 telnet solr-master-01_dev.la.bo 8983
 Trying 172.12.65.58...
 Connected to solr-master-01_dev.la.bo.
 Escape character is '^]'.

 I double checked the master to make sure that it didn't have replication
 turned off, and it doesn't.  So I should be able to replicate but it
 can't.  I just don't know what else to check.  The log from the slave is
 below.

 Apr 27, 2011 7:39:45 PM org.apache.solr.request.SolrQueryResponse
 WARNING: org.apache.solr.request.SolrQueryResponse is deprecated. Please
 use the corresponding class in org.apache.solr.response
 Apr 27, 2011 7:39:45 PM org.apache.commons.httpclient.HttpMethodDirector
 executeWithRetry
 INFO: I/O exception (java.net.ConnectException) caught when processing
 request: Connection refused
 Apr 27, 2011 7:39:45 PM org.apache.commons.httpclient.HttpMethodDirector
 executeWithRetry
 INFO: Retrying request
 Apr 27, 2011 7:39:45 PM org.apache.commons.httpclient.HttpMethodDirector
 executeWithRetry
 INFO: I/O exception (java.net.ConnectException) caught when processing
 request: Connection refused
 Apr 27, 2011 7:39:45 PM org.apache.commons.httpclient.HttpMethodDirector
 executeWithRetry
 INFO: Retrying request
 Apr 27, 2011 7:39:45 PM org.apache.commons.httpclient.HttpMethodDirector
 executeWithRetry
 INFO: I/O exception (java.net.ConnectException) caught when processing
 request: Connection refused
 Apr 27, 2011 7:39:45 PM org.apache.commons.httpclient.HttpMethodDirector
 executeWithRetry
 INFO: Retrying request
 Apr 27, 2011 7:39:45 PM org.apache.solr.handler.ReplicationHandler
 getReplicationDetails
 WARNING: Exception while invoking 'details' method for replication on
 master
 java.net.ConnectException: Connection refused
       at java.net.PlainSocketImpl.socketConnect(Native Method)
       at

 java.net.AbstractPlainSocketImpl.doConnect(AbstractPlainSocketImpl.java:327)
       at

 java.net.AbstractPl

Re: How to take differential backup of Solr Index

2011-05-04 Thread Erick Erickson
I'm not sure what you're asking here, can you clarify? A search
machine that replicates from an indexer is just a Solr server,
search requests are handled like any other server.

If you're asking about how to configure replication, see:
http://wiki.apache.org/solr/SolrReplication#Slave

Best
Erick

On Wed, May 4, 2011 at 1:01 AM, Gaurav Shingala
 wrote:
>
> How can we configure a query server in Solr using the replication feature?
>
> Thanks,
> Gaurav
>
>> Date: Mon, 2 May 2011 22:05:33 -0700
>> Subject: Re: How to take differential backup of Solr Index
>> From: goks...@gmail.com
>> To: solr-user@lucene.apache.org
>>
>> The Replication feature does this. If you configure a query server as
>> a 'backup' server, it downloads changes but does not read them.
>>
>> On Mon, May 2, 2011 at 9:56 PM, Gaurav Shingala
>>  wrote:
>> >
>> > Hi,
>> >
>> > Is there any way to take differential backup of Solr Index?
>> >
>> > Thanks,
>> > Gaurav
>> >
>> >
>>
>>
>>
>> --
>> Lance Norskog
>> goks...@gmail.com
>


Patch problems solr 1.4 - solr-2010

2011-05-04 Thread roySolr
Hello,

I want to patch my solr installation(1.4.1) with
solr-2010.(https://issues.apache.org/jira/browse/SOLR-2010)

I need this feature:
Only return collations that are guaranteed to result in hits if re-queried 

Now I am trying the following command:

wget https://issues.apache.org/jira/secure/attachment/12457683/SOLR-2010_141.patch -O - | patch -p0 --dry-run

I get the following error message:

--13:51:35-- 
https://issues.apache.org/jira/secure/attachment/12457683/SOLR-2010_141.patch
Resolving issues.apache.org... 140.211.11.121
Connecting to issues.apache.org|140.211.11.121|:443... connected.
HTTP request sent, awaiting response... 200 OK
Length: 55846 (55K) [text/x-patch]
Saving to: `STDOUT'

100%[>]
55,846  59.3K/s   in 0.9s

13:51:37 (59.3 KB/s) - `-' saved [55846/55846]

(Stripping trailing CRs from patch.)
patching file src/common/org/apache/solr/common/params/SpellingParams.java
Hunk #1 FAILED at 81.
1 out of 1 hunk FAILED -- saving rejects to file
src/common/org/apache/solr/common/params/SpellingParams.java.rej
(Stripping trailing CRs from patch.)
patching file
src/java/org/apache/solr/handler/component/SpellCheckComponent.java
Hunk #1 FAILED at 24.
Hunk #2 FAILED at 141.
Hunk #3 FAILED at 155.
Hunk #4 FAILED at 214.
Hunk #5 FAILED at 252.
Hunk #6 FAILED at 262.
6 out of 6 hunks FAILED -- saving rejects to file
src/java/org/apache/solr/handler/component/SpellCheckComponent.java.rej
(Stripping trailing CRs from patch.)
patching file src/java/org/apache/solr/spelling/PossibilityIterator.java
(Stripping trailing CRs from patch.)
patching file src/java/org/apache/solr/spelling/RankedSpellPossibility.java
(Stripping trailing CRs from patch.)
patching file src/java/org/apache/solr/spelling/SpellCheckCollation.java
(Stripping trailing CRs from patch.)
patching file src/java/org/apache/solr/spelling/SpellCheckCollator.java
(Stripping trailing CRs from patch.)
patching file src/java/org/apache/solr/spelling/SpellCheckCorrection.java
(Stripping trailing CRs from patch.)
patching file
src/solrj/org/apache/solr/client/solrj/response/SpellCheckResponse.java
Hunk #1 FAILED at 31.
Hunk #2 FAILED at 45.
Hunk #3 FAILED at 108.
Hunk #4 FAILED at 210.
4 out of 4 hunks FAILED -- saving rejects to file
src/solrj/org/apache/solr/client/solrj/response/SpellCheckResponse.java.rej
(Stripping trailing CRs from patch.)
patching file
src/test/org/apache/solr/client/solrj/response/TestSpellCheckResponse.java
Hunk #1 FAILED at 23.
Hunk #2 FAILED at 109.
2 out of 2 hunks FAILED -- saving rejects to file
src/test/org/apache/solr/client/solrj/response/TestSpellCheckResponse.java.rej
(Stripping trailing CRs from patch.)
patching file src/test/org/apache/solr/spelling/SpellCheckCollatorTest.java
(Stripping trailing CRs from patch.)
patching file
src/test/org/apache/solr/spelling/SpellPossibilityIteratorTest.java
(Stripping trailing CRs from patch.)
patching file src/test/test-files/solr/conf/schema.xml
Hunk #1 FAILED at 19.
Hunk #2 FAILED at 50.
Hunk #3 FAILED at 100.
Hunk #4 FAILED at 408.
Hunk #5 FAILED at 427.
Hunk #6 FAILED at 453.
Hunk #7 FAILED at 535.
7 out of 7 hunks FAILED -- saving rejects to file
src/test/test-files/solr/conf/schema.xml.rej
(Stripping trailing CRs from patch.)
patching file src/test/test-files/solr/conf/solrconfig.xml
Hunk #1 FAILED at 29.
Hunk #2 FAILED at 116.
Hunk #3 FAILED at 340.
Hunk #4 FAILED at 396.
4 out of 4 hunks FAILED -- saving rejects to file
src/test/test-files/solr/conf/solrconfig.xml.rej




Re: Is it possible to build Solr as a maven project?

2011-05-04 Thread Gabriele Kahlout
It worked after checking out the dev-tools folder. Thank you!

On Wed, May 4, 2011 at 1:20 PM, lboutros  wrote:

>  
>description="Copy Maven POMs from dev-tools/maven/ to their target
> locations">
>
>  
>  
>
>  
>  
>
>  
>



-- 
Regards,
K. Gabriele

--- unchanged since 20/9/10 ---
P.S. If the subject contains "[LON]" or the addressee acknowledges the
receipt within 48 hours then I don't resend the email.
subject(this) ∈ L(LON*) ∨ ∃x. (x ∈ MyInbox ∧ Acknowledges(x, this) ∧ time(x)
< Now + 48h) ⇒ ¬resend(I, this).

If an email is sent by a sender that is not a trusted contact or the email
does not contain a valid code then the email is not received. A valid code
starts with a hyphen and ends with "X".
∀x. x ∈ MyInbox ⇒ from(x) ∈ MySafeSenderList ∨ (∃y. y ∈ subject(x) ∧ y ∈
L(-[a-z]+[0-9]X)).


Re: Is it possible to build Solr as a maven project?

2011-05-04 Thread lboutros
ok, this is part of my build.xml (from the svn repository) :

  
  

  
  

  
  

  

Ludovic.

-
Jouve
France.


Re: Is it possible to build Solr as a maven project?

2011-05-04 Thread Gabriele Kahlout
On Wed, May 4, 2011 at 1:11 PM, lboutros  wrote:

> Oops,
>
> sorry, this was not the target I used (this one should work too, but...),
> the one I used is get-maven-poms. That will just create pom files and copy
> them to their right target locations.
>

I don't have a "get-maven-poms" target in my build script.


> I'm using netbeans and I'm using the plugin "Automatic Projects" to do
> everything inside the IDE.
>
> Which version of Solr are you using ?
>

the official latest: 3.1

Maybe I can copy-paste from the build script you are using?


>
> Ludovic.
>
> 2011/5/4 Gabriele Kahlout [via Lucene] <
> ml-node+2898211-2124746009-383...@n3.nabble.com>
>
> > generate-maven-artifacts:
> >[mkdir] Created dir: /Users/simpatico/SOLR_HOME/build/maven
> >[mkdir] Created dir: /Users/simpatico/SOLR_HOME/dist/maven
> > [copy] Copying 1 file to
> > /Users/simpatico/SOLR_HOME/build/maven/src/maven
> > [artifact:install-provider] Installing provider:
> > org.apache.maven.wagon:wagon-ssh:jar:1.0-beta-2
> >
> > *BUILD FAILED*
> > /Users/simpatico/SOLR_HOME/*build.xml:800*: The following error occurred
> > while executing this line:
> > /Users/simpatico/SOLR_HOME/common-build.xml:274: artifact:deploy doesn't
> > support the "uniqueVersion" attribute
> >
> >
> > *build.xml:800: * > pom.xml="src/maven/solr-parent-pom.xml.template"/>
> >
> > removed the "uniqueVersion" attribute:
> >
> > generate-maven-artifacts:
> > [artifact:install-provider] Installing provider:
> > org.apache.maven.wagon:wagon-ssh:jar:1.0-beta-2
> > [artifact:deploy] Deploying to
> file:///Users/simpatico/SOLR_HOME/dist/maven
> >
> > [artifact:deploy] [INFO] Retrieving previous build number from remote
> > [artifact:deploy] [INFO] Retrieving previous metadata from remote
> > [artifact:deploy] [INFO] Uploading repository metadata for: 'artifact
> > org.apache.solr:solr-parent'
> > [artifact:deploy] [INFO] Retrieving previous metadata from remote
> > [artifact:deploy] [INFO] Uploading repository metadata for: 'snapshot
> > org.apache.solr:solr-parent:1.4.2-SNAPSHOT'
> >  [copy] Copying 1 file to /Users/simpatico/SOLR_HOME/build/maven/lib
> > [artifact:install-provider] Installing provider:
> > org.apache.maven.wagon:wagon-ssh:jar:1.0-beta-2
> > [artifact:deploy] Deploying to
> file:///Users/simpatico/SOLR_HOME/dist/maven
> >
> > [artifact:deploy] [INFO] Retrieving previous build number from remote
> > [artifact:deploy] [INFO] Retrieving previous metadata from remote
> > [artifact:deploy] [INFO] Uploading repository metadata for: 'artifact
> > org.apache.solr:solr-commons-csv'
> > [artifact:deploy] [INFO] Retrieving previous metadata from remote
> > [artifact:deploy] [INFO] Uploading project information for
> solr-commons-csv
> >
> > 1.4.2-SNAPSHOT
> > [artifact:deploy] [INFO] Retrieving previous metadata from remote
> > [artifact:deploy] [INFO] Uploading repository metadata for: 'snapshot
> > org.apache.solr:solr-commons-csv:1.4.2-SNAPSHOT'
> >  [copy] Copying 1 file to
> > /Users/simpatico/SOLR_HOME/build/maven/contrib/dataimporthandler
> > [artifact:install-provider] Installing provider:
> > org.apache.maven.wagon:wagon-ssh:jar:1.0-beta-2
> >
> > BUILD FAILED
> > /Users/simpatico/SOLR_HOME/build.xml:809: The following error occurred
> > while
> > executing this line:
> > */Users/simpatico/SOLR_HOME/common-build.xml:274: artifact:deploy doesn't
> > support the nested "attach" element*
> >
> >
>
>
> -
> Jouve
> France.
> --
> View this message in context:
> http://lucene.472066.n3.nabble.com/Is-it-possible-to-build-Solr-as-a-maven-project-tp2898068p2898315.html
> Sent from the Solr - User mailing list archive at Nabble.com.
>



-- 
Regards,
K. Gabriele



Re: Is it possible to build Solr as a maven project?

2011-05-04 Thread lboutros
Oops,

sorry, this was not the target I used (this one should work too, but...),
the one I used is get-maven-poms. That will just create pom files and copy
them to their right target locations.

I'm using netbeans and I'm using the plugin "Automatic Projects" to do
everything inside the IDE.

Which version of Solr are you using ?

Ludovic.

2011/5/4 Gabriele Kahlout [via Lucene] <
ml-node+2898211-2124746009-383...@n3.nabble.com>

> generate-maven-artifacts:
>[mkdir] Created dir: /Users/simpatico/SOLR_HOME/build/maven
>[mkdir] Created dir: /Users/simpatico/SOLR_HOME/dist/maven
> [copy] Copying 1 file to
> /Users/simpatico/SOLR_HOME/build/maven/src/maven
> [artifact:install-provider] Installing provider:
> org.apache.maven.wagon:wagon-ssh:jar:1.0-beta-2
>
> *BUILD FAILED*
> /Users/simpatico/SOLR_HOME/*build.xml:800*: The following error occurred
> while executing this line:
> /Users/simpatico/SOLR_HOME/common-build.xml:274: artifact:deploy doesn't
> support the "uniqueVersion" attribute
>
>
> *build.xml:800: * pom.xml="src/maven/solr-parent-pom.xml.template"/>
>
> removed the "uniqueVersion" attribute:
>
> generate-maven-artifacts:
> [artifact:install-provider] Installing provider:
> org.apache.maven.wagon:wagon-ssh:jar:1.0-beta-2
> [artifact:deploy] Deploying to file:///Users/simpatico/SOLR_HOME/dist/maven
>
> [artifact:deploy] [INFO] Retrieving previous build number from remote
> [artifact:deploy] [INFO] Retrieving previous metadata from remote
> [artifact:deploy] [INFO] Uploading repository metadata for: 'artifact
> org.apache.solr:solr-parent'
> [artifact:deploy] [INFO] Retrieving previous metadata from remote
> [artifact:deploy] [INFO] Uploading repository metadata for: 'snapshot
> org.apache.solr:solr-parent:1.4.2-SNAPSHOT'
>  [copy] Copying 1 file to /Users/simpatico/SOLR_HOME/build/maven/lib
> [artifact:install-provider] Installing provider:
> org.apache.maven.wagon:wagon-ssh:jar:1.0-beta-2
> [artifact:deploy] Deploying to file:///Users/simpatico/SOLR_HOME/dist/maven
>
> [artifact:deploy] [INFO] Retrieving previous build number from remote
> [artifact:deploy] [INFO] Retrieving previous metadata from remote
> [artifact:deploy] [INFO] Uploading repository metadata for: 'artifact
> org.apache.solr:solr-commons-csv'
> [artifact:deploy] [INFO] Retrieving previous metadata from remote
> [artifact:deploy] [INFO] Uploading project information for solr-commons-csv
>
> 1.4.2-SNAPSHOT
> [artifact:deploy] [INFO] Retrieving previous metadata from remote
> [artifact:deploy] [INFO] Uploading repository metadata for: 'snapshot
> org.apache.solr:solr-commons-csv:1.4.2-SNAPSHOT'
>  [copy] Copying 1 file to
> /Users/simpatico/SOLR_HOME/build/maven/contrib/dataimporthandler
> [artifact:install-provider] Installing provider:
> org.apache.maven.wagon:wagon-ssh:jar:1.0-beta-2
>
> BUILD FAILED
> /Users/simpatico/SOLR_HOME/build.xml:809: The following error occurred
> while
> executing this line:
> */Users/simpatico/SOLR_HOME/common-build.xml:274: artifact:deploy doesn't
> support the nested "attach" element*
>
>


-
Jouve
France.
--
View this message in context: 
http://lucene.472066.n3.nabble.com/Is-it-possible-to-build-Solr-as-a-maven-project-tp2898068p2898315.html
Sent from the Solr - User mailing list archive at Nabble.com.

Re: Is it possible to build Solr as a maven project?

2011-05-04 Thread Gabriele Kahlout
generate-maven-artifacts:
   [mkdir] Created dir: /Users/simpatico/SOLR_HOME/build/maven
   [mkdir] Created dir: /Users/simpatico/SOLR_HOME/dist/maven
[copy] Copying 1 file to
/Users/simpatico/SOLR_HOME/build/maven/src/maven
[artifact:install-provider] Installing provider:
org.apache.maven.wagon:wagon-ssh:jar:1.0-beta-2

*BUILD FAILED*
/Users/simpatico/SOLR_HOME/*build.xml:800*: The following error occurred
while executing this line:
/Users/simpatico/SOLR_HOME/common-build.xml:274: artifact:deploy doesn't
support the "uniqueVersion" attribute


*build.xml:800: *

removed the "uniqueVersion" attribute:

generate-maven-artifacts:
[artifact:install-provider] Installing provider:
org.apache.maven.wagon:wagon-ssh:jar:1.0-beta-2
[artifact:deploy] Deploying to file:///Users/simpatico/SOLR_HOME/dist/maven
[artifact:deploy] [INFO] Retrieving previous build number from remote
[artifact:deploy] [INFO] Retrieving previous metadata from remote
[artifact:deploy] [INFO] Uploading repository metadata for: 'artifact
org.apache.solr:solr-parent'
[artifact:deploy] [INFO] Retrieving previous metadata from remote
[artifact:deploy] [INFO] Uploading repository metadata for: 'snapshot
org.apache.solr:solr-parent:1.4.2-SNAPSHOT'
 [copy] Copying 1 file to /Users/simpatico/SOLR_HOME/build/maven/lib
[artifact:install-provider] Installing provider:
org.apache.maven.wagon:wagon-ssh:jar:1.0-beta-2
[artifact:deploy] Deploying to file:///Users/simpatico/SOLR_HOME/dist/maven
[artifact:deploy] [INFO] Retrieving previous build number from remote
[artifact:deploy] [INFO] Retrieving previous metadata from remote
[artifact:deploy] [INFO] Uploading repository metadata for: 'artifact
org.apache.solr:solr-commons-csv'
[artifact:deploy] [INFO] Retrieving previous metadata from remote
[artifact:deploy] [INFO] Uploading project information for solr-commons-csv
1.4.2-SNAPSHOT
[artifact:deploy] [INFO] Retrieving previous metadata from remote
[artifact:deploy] [INFO] Uploading repository metadata for: 'snapshot
org.apache.solr:solr-commons-csv:1.4.2-SNAPSHOT'
 [copy] Copying 1 file to
/Users/simpatico/SOLR_HOME/build/maven/contrib/dataimporthandler
[artifact:install-provider] Installing provider:
org.apache.maven.wagon:wagon-ssh:jar:1.0-beta-2

BUILD FAILED
/Users/simpatico/SOLR_HOME/build.xml:809: The following error occurred while
executing this line:
*/Users/simpatico/SOLR_HOME/common-build.xml:274: artifact:deploy doesn't
support the nested "attach" element*

On Wed, May 4, 2011 at 11:50 AM, lboutros  wrote:
> In the ant script there is a target to generate maven's artifacts.
>
> After that, you will be able to open the project as a standard maven
> project.
>
> Ludovic.
>
> 2011/5/4 Gabriele Kahlout [via Lucene] <
> ml-node+2898068-621882422-383...@n3.nabble.com>
>
>> Hello,
>>
>> I'm trying to modify Solr and I think debugging will be very useful to
>> understand what's going on. Hence I'd like to use an IDE (NetBeans)
>> which automatically supports Maven projects. I see under src/maven
>> that there are templates but I'm not sure how to use them to mavenize
>> the build/project. There is nothing on the Wiki. I've seen issue SOLR-19
>> and some older messages on the mailing list too.
>>
>> Any instructions?
>>
>>
>> --
>> Regards,
>> K. Gabriele
>>
>> --- unchanged since 20/9/10 ---
>> P.S. If the subject contains "[LON]" or the addressee acknowledges the
>> receipt within 48 hours then I don't resend the email.
>> subject(this) ∈ L(LON*) ∨ ∃x. (x ∈ MyInbox ∧ Acknowledges(x, this) ∧
>> time(x) < Now + 48h) ⇒ ¬resend(I, this).
>>
>> If an email is sent by a sender that is not a trusted contact or the
>> email does not contain a valid code then the email is not received. A
>> valid code starts with a hyphen and ends with "X".
>> ∀x. x ∈ MyInbox ⇒ from(x) ∈ MySafeSenderList ∨ (∃y. y ∈ subject(x) ∧ y
>> ∈ L(-[a-z]+[0-9]X)).
>>
>>
>
>
> -
> Jouve
> France.
> --
> View this message in context:
> http://lucene.472066.n3.nabble.com/Is-it-possible-to-build-Solr-as-a-maven-project-tp2898068p2898084.html
> Sent from the Solr - User mailing list archive at Nabble.com.



-- 
Regards,
K. Gabriele

--- unchanged since 20/9/10 ---
P.S. If the subject contains "[LON]" or the addressee acknowledges the
receipt within 48 hours then I don't resend the email.
subject(this) ∈ L(LON*) ∨ ∃x. (x ∈ MyInbox ∧ Acknowledges(x, this) ∧ time(x)
< Now + 48h) ⇒ ¬resend(I, this).

If an email is sent by a sender that is not a trusted conta

howto add custom QueryParser stage or filter

2011-05-04 Thread Bernd Fehling

Dear list,

is it possible to add a custom QueryParser stage to solr
or add a custom query filter?

My aim is to filter out reserved characters from query terms,
like ":" within a query term.

query=text:(:foo AND bar)
query=text:(foo AND b:ar)

Regards
Bernd


Re: stemming for English

2011-05-04 Thread Dmitry Kan
Hi Robert,

Have you seen *any* growth?

We once added a copy field to support leading wildcards, and our index
doubled (or something close to it).
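
For reference, the two-field setup discussed in this thread could look roughly
like the schema.xml fragment below. It is only a sketch: the field and type
names (body, body_exact, text_stemmed, text_exact) are hypothetical, and the
analyzer chains are an assumption, not our actual configuration. The stemmed
field serves ordinary keyword queries, while the unstemmed copy is the one to
run wildcard queries against.

```xml
<!-- schema.xml sketch; all field and type names here are hypothetical -->
<types>
  <!-- stemmed type for ordinary keyword search -->
  <fieldType name="text_stemmed" class="solr.TextField" positionIncrementGap="100">
    <analyzer>
      <tokenizer class="solr.WhitespaceTokenizerFactory"/>
      <filter class="solr.LowerCaseFilterFactory"/>
      <filter class="solr.PorterStemFilterFactory"/>
    </analyzer>
  </fieldType>
  <!-- unstemmed type, so wildcard terms are matched against the literal tokens -->
  <fieldType name="text_exact" class="solr.TextField" positionIncrementGap="100">
    <analyzer>
      <tokenizer class="solr.WhitespaceTokenizerFactory"/>
      <filter class="solr.LowerCaseFilterFactory"/>
    </analyzer>
  </fieldType>
</types>

<fields>
  <field name="body" type="text_stemmed" indexed="true" stored="true"/>
  <!-- search-only copy: indexed but not stored -->
  <field name="body_exact" type="text_exact" indexed="true" stored="false"/>
</fields>

<!-- index the same source text into the unstemmed field as well -->
<copyField source="body" dest="body_exact"/>
```

Leaving the copy unstored means only its indexed terms add to the index size,
which may be part of why the growth people observe differs from case to case.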

On Tue, May 3, 2011 at 9:24 PM, Robert Petersen  wrote:

> From what I have seen, adding a second field with the same terms as the
> first does *not* double your index size at all.
>
> -Original Message-
> From: Dmitry Kan [mailto:dmitry@gmail.com]
> Sent: Tuesday, May 03, 2011 4:06 AM
> To: solr-user@lucene.apache.org
> Subject: Re: stemming for English
>
> Yes, Ludovic. So effectively the index gets doubled. Given the volume of
> data we store, we consider very carefully the cases where doubling the
> index is a must.
>
> Dmitry
>
> On Tue, May 3, 2011 at 1:08 PM, lboutros  wrote:
>
> > Dmitry,
> >
> > I don't know of any way to keep both stemming and consistent wildcard
> > support in the same field.
> > To me, you have to create 2 different fields.
> >
> > Ludovic.
> >
> > 2011/5/3 Dmitry Kan [via Lucene] <
> > ml-node+2893628-993677979-383...@n3.nabble.com>
> >
> > > Hi Ludovic,
> > >
> > > That's an option we had before we decided to go for full-blown
> > > support of wildcards.
> > >
> > > Do you know of a way to keep both stemming and consistent wildcard
> > > support in the same field?
> > >
> > > Dmitry
> > >
> > >
> >
> >
> > -
> > Jouve
> > France.
> > --
> > View this message in context:
> > http://lucene.472066.n3.nabble.com/stemming-for-English-tp2893599p2893652.html
> > Sent from the Solr - User mailing list archive at Nabble.com.
> >
>
>
>
> --
> Regards,
>
> Dmitry Kan
>



-- 
Regards,

Dmitry Kan


Deprecation warnings in Solr log

2011-05-04 Thread Ward Bekker
Hello,

I just upgraded to 3.1.  After this the solr.log is showing deprecation 
warnings (see below).

What can I do about this? 

Regards,

Ward

-

WARNING: WhitespaceTokenizerFactory is using deprecated LUCENE_24 emulation. 
You should at some point declare and reindex to at least 3.0, because 2.x 
emulation is deprecated and will be removed in 4.0
May 3, 2011 12:26:48 PM org.apache.solr.util.plugin.AbstractPluginLoader load
INFO: created : org.apache.solr.analysis.WordDelimiterFilterFactory
May 3, 2011 12:26:48 PM org.apache.solr.util.plugin.AbstractPluginLoader load
INFO: created : org.apache.solr.analysis.LowerCaseFilterFactory
May 3, 2011 12:26:48 PM org.apache.solr.analysis.BaseTokenStreamFactory 
assureMatchVersion
WARNING: LowerCaseFilterFactory is using deprecated LUCENE_24 emulation. You 
should at some point declare and reindex to at least 3.0, because 2.x emulation 
is deprecated and will be removed in 4.0
May 3, 2011 12:26:48 PM org.apache.solr.util.plugin.AbstractPluginLoader load
INFO: created : org.apache.solr.analysis.RemoveDuplicatesTokenFilterFactory
May 3, 2011 12:26:48 PM org.apache.solr.util.plugin.AbstractPluginLoader load
INFO: created : org.apache.solr.analysis.HTMLStripCharFilterFactory
May 3, 2011 12:26:48 PM org.apache.solr.util.plugin.AbstractPluginLoader load
INFO: created : org.apache.solr.analysis.WhitespaceTokenizerFactory
May 3, 2011 12:26:48 PM org.apache.solr.analysis.BaseTokenStreamFactory 
assureMatchVersion
WARNING: WhitespaceTokenizerFactory is using deprecated LUCENE_24 emulation. 
You should at some point declare and reindex to at least 3.0, because 2.x 
emulation is deprecated and will be removed in 4.0
May 3, 2011 12:26:48 PM org.apache.solr.util.plugin.AbstractPluginLoader load
INFO: created : org.apache.solr.analysis.WordDelimiterFilterFactory
May 3, 2011 12:26:48 PM org.apache.solr.util.plugin.AbstractPluginLoader load
INFO: created : org.apache.solr.analysis.LowerCaseFilterFactory
May 3, 2011 12:26:48 PM org.apache.solr.analysis.BaseTokenStreamFactory 
assureMatchVersion
WARNING: LowerCaseFilterFactory is using deprecated LUCENE_24 emulation. You 
should at some point declare and reindex to at least 3.0, because 2.x emulation 
is deprecated and will be removed in 4.0
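
These warnings typically appear when solrconfig.xml does not declare a Lucene
match version, in which case the analysis factories fall back to 2.4
emulation. A minimal sketch of the declaration follows, assuming a config
carried over from an older release; LUCENE_31 matches the 3.1 upgrade
mentioned above, and the rest of the file is elided. As the warning itself
says, a reindex is needed afterwards, since analysis behaviour may change.

```xml
<config>
  <!-- Tells the analysis factories which Lucene version's behaviour to use.
       Without it, Solr 3.1 falls back to LUCENE_24 emulation and logs the
       warnings shown above. -->
  <luceneMatchVersion>LUCENE_31</luceneMatchVersion>

  <!-- ... rest of the existing configuration unchanged ... -->
</config>
```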

Re: UIMA analysisEngine path

2011-05-04 Thread Tommaso Teofili
Hello Barry,
the main AnalysisEngine descriptor defined inside the <analysisEngine>
element should be inside one of the jars imported with the <lib> elements.
At the moment it cannot be taken from expanded directories, but it should be
easy (and indeed useful) to support that by modifying the
OverridingParamsAEProvider class [1] at line 57.
Hope this helps,
Tommaso

[1] :
http://svn.apache.org/viewvc/lucene/dev/tags/lucene_solr_3_1/solr/contrib/uima/src/main/java/org/apache/solr/uima/processor/ae/OverridingParamsAEProvider.java?view=markup
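
As a rough illustration of the above, a solrconfig.xml fragment could look
like the sketch below. It assumes the 3.1 contrib layout with a uimaConfig
element; the jar name and descriptor path are hypothetical, and the other
uimaConfig children (runtime parameters, field mappings) are left out.

```xml
<!-- solrconfig.xml sketch; the jar name and descriptor path are hypothetical -->

<!-- jars of the UIMA contrib itself -->
<lib dir="../../contrib/uima/lib"/>
<!-- a jar (hypothetical) that contains /desc/MyAggregateAE.xml -->
<lib path="../../lib/my-uima-descriptors.jar"/>

<uimaConfig>
  <!-- looked up as a resource inside the jars imported above,
       not as a path on the file system -->
  <analysisEngine>/desc/MyAggregateAE.xml</analysisEngine>
  <!-- ... other uimaConfig settings omitted ... -->
</uimaConfig>
```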

2011/5/3 Barry Hathaway 

> I'm new to Solr and am trying to get it to call a UIMA aggregate analysis
> engine, without much luck.
> The null pointer exception indicates that it can't find the XML file
> associated with the engine.
> I have tried a number of combinations of a path in the
> <analysisEngine> element, but nothing
> seems to work. In addition, I've put the directory containing the
> descriptor both in the classpath
> when starting the server and in a <lib> element in solrconfig.xml. So:
>
> What "classpath" does the <analysisEngine> tag effectively search to
> locate the descriptor?
>
> Do the <lib> entries in solrconfig.xml affect this classpath?
>
> Do the engine descriptors have to be in a jar or can they be in an expanded
> directory?
>
> Thanks in advance.
>
> Barry
>
>
>
>


Re: Is it possible to build Solr as a maven project?

2011-05-04 Thread lboutros
In the ant script there is a target to generate maven's artifacts.

After that, you will be able to open the project as a standard maven
project.

Ludovic.

2011/5/4 Gabriele Kahlout [via Lucene] <
ml-node+2898068-621882422-383...@n3.nabble.com>

> Hello,
>
> I'm trying to modify Solr and I think debugging will be very useful to
> understand what's going on. Hence I'd like to use an IDE (NetBeans)
> which automatically supports Maven projects. I see under src/maven
> that there are templates but I'm not sure how to use them to mavenize
> the build/project. There is nothing on the Wiki. I've seen issue SOLR-19
> and some older messages on the mailing list too.
>
> Any instructions?
>
>
> --
> Regards,
> K. Gabriele
>
> --- unchanged since 20/9/10 ---
> P.S. If the subject contains "[LON]" or the addressee acknowledges the
> receipt within 48 hours then I don't resend the email.
> subject(this) ∈ L(LON*) ∨ ∃x. (x ∈ MyInbox ∧ Acknowledges(x, this) ∧
> time(x) < Now + 48h) ⇒ ¬resend(I, this).
>
> If an email is sent by a sender that is not a trusted contact or the
> email does not contain a valid code then the email is not received. A
> valid code starts with a hyphen and ends with "X".
> ∀x. x ∈ MyInbox ⇒ from(x) ∈ MySafeSenderList ∨ (∃y. y ∈ subject(x) ∧ y
> ∈ L(-[a-z]+[0-9]X)).
>
>
>


-
Jouve
France.
--
View this message in context: 
http://lucene.472066.n3.nabble.com/Is-it-possible-to-build-Solr-as-a-maven-project-tp2898068p2898084.html
Sent from the Solr - User mailing list archive at Nabble.com.

Is it possible to build Solr as a maven project?

2011-05-04 Thread Gabriele Kahlout
Hello,

I'm trying to modify Solr and I think debugging will be very useful to
understand what's going on. Hence I'd like to use an IDE (NetBeans)
which automatically supports Maven projects. I see under src/maven
that there are templates but I'm not sure how to use them to mavenize
the build/project. There is nothing on the Wiki. I've seen issue SOLR-19
and some older messages on the mailing list too.

Any instructions?


-- 
Regards,
K. Gabriele

--- unchanged since 20/9/10 ---
P.S. If the subject contains "[LON]" or the addressee acknowledges the
receipt within 48 hours then I don't resend the email.
subject(this) ∈ L(LON*) ∨ ∃x. (x ∈ MyInbox ∧ Acknowledges(x, this) ∧
time(x) < Now + 48h) ⇒ ¬resend(I, this).

If an email is sent by a sender that is not a trusted contact or the
email does not contain a valid code then the email is not received. A
valid code starts with a hyphen and ends with "X".
∀x. x ∈ MyInbox ⇒ from(x) ∈ MySafeSenderList ∨ (∃y. y ∈ subject(x) ∧ y
∈ L(-[a-z]+[0-9]X)).


Re: full-import called simultaneously for multiple core.

2011-05-04 Thread Kannan
I could see a "required field missing" exception for a few docs; apart from
that, I could not see any other exception.


--
View this message in context: 
http://lucene.472066.n3.nabble.com/full-import-called-simultaneously-for-multiple-core-tp2894606p2897746.html
Sent from the Solr - User mailing list archive at Nabble.com.