[jira] Commented: (SOLR-218) Support for Lucene QueryParser properties via solrconfig.xml file

2007-05-02 Thread Michael Kimsal (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-218?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12493103
 ] 

Michael Kimsal commented on SOLR-218:
-


setAllowLeadingWildcard(SolrConfig.config.getBool("query/setAllowLeadingWildcard"));

setLowercaseExpandedTerms(SolrConfig.config.getBool("query/lowerCaseExpandedTerms"));
 
=

>From what I understand, these sorts of things could likely be handled by 
>custom query parsers.  However, 
I'm voting for this because I'd like to see the ability to configure these 
items globally first, as well as 
already having the option to write custom query parsers if needed.  This 
provides an easier way to 
configure the behaviour without needing to write code or recompile anything.

> Support for Lucene QueryParser properties via solrconfig.xml file
> -
>
> Key: SOLR-218
> URL: https://issues.apache.org/jira/browse/SOLR-218
> Project: Solr
>  Issue Type: Improvement
>  Components: search
>Affects Versions: 1.1.0
>Reporter: Michael Pelz-Sherman
>
> The SolrQueryParser class, which extends Lucene's QueryParser class, does not 
> provide any way of setting the various QueryParser properties via the solr 
> config file (solrconfig.xml). These properties include:
> allowLeadingWildcard (Set to true to allow * and ? as the first character of 
> a PrefixQuery and WildcardQuery)
> dateResolution: Sets the default date resolution used by RangeQueries for 
> fields for which no specific date resolutions has been set.
> defaultOperator: Sets the boolean operator of the QueryParser.
> fuzzyMinSim: Set the minimum similarity for fuzzy queries.
> locale: Set locale used by date range parsing.
> lowercaseExpandedTerms: Whether terms of wildcard, prefix, fuzzy and range 
> queries are to be automatically lower-cased or not.
> phraseSlop: Sets the default slop for phrases.
> useOldRangeQuery: By default QueryParser uses new ConstantScoreRangeQuery in 
> preference to RangeQuery for range queries.
> This can be achieved by calling the setter methods for these properties in 
> the SolrQueryParser constructor,
>   public SolrQueryParser(IndexSchema schema, String defaultField) {
> super(defaultField == null ? schema.getDefaultSearchFieldName() : 
> defaultField, schema.getQueryAnalyzer());
> this.schema = schema;
> 
> setAllowLeadingWildcard(SolrConfig.config.getBool("query/setAllowLeadingWildcard"));
> 
> setLowercaseExpandedTerms(SolrConfig.config.getBool("query/lowerCaseExpandedTerms"));
>   }
> In addition, solr should not modify these values from the defaults provided 
> by Lucene, as it currently does by calling setLowercaseExpandedTerms(false) 
> in this method.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.



Re: javadoc location?

2007-05-02 Thread Ryan McKinley


Thanks Hoss! Its nicely back on line.
http://lucene.apache.org/solr/api/index.html

I'll run "ant javadoc" before commiting anything else with javadoc links


Chris Hostetter wrote:

found the problem in DirectSolrConnection.java, we were tickling this
bug...

http://bugs.sun.com/bugdatabase/view_bug.do?bug_id=5082928

...my gut says not to try and manually fix the site right now, and just
wait for the next nightly to take care of it -- mainly because i'm
paranoid about messing it up.  I certainly won't object if any other
committers want to do it.


: Date: Tue, 1 May 2007 10:52:16 -0700 (PDT)
: From: Chris Hostetter <[EMAIL PROTECTED]>
: Reply-To: solr-dev@lucene.apache.org
: To: Solr Dev 
: Subject: Re: javadoc location?
:
:
: : goes to:
: :   http://lucene.apache.org/solr/api/index.html  (404)
: :
: : we should probably fix that!
:
: The link is correct, that's where they are suppose to be -- the
: problem seems to be that buildingthe javadocs is currently broken
:
: When i run "ant javadoc" locally i'm getting a bunch of warnings and
: errors, the most significant seems to be...
:
:   [javadoc] Building index for all the packages and classes...
:   [javadoc] 
/home/chrish/svn/solr/src/java/org/apache/solr/util/ContentStreamBase.java:52: 
warning - @Link is an unknown tag -- same as a known tag except for case.
:   [javadoc] java.util.MissingResourceException: Can't find resource for 
bundle com.sun.tools.doclets.formats.html.resources.standard, key 
doclet.malformed_html_link_tag
:   [javadoc] at java.util.ResourceBundle.getObject(ResourceBundle.java:326)
:   [javadoc] at java.util.ResourceBundle.getString(ResourceBundle.java:286)
:   [javadoc] at 
com.sun.tools.doclets.internal.toolkit.util.MessageRetriever.getText(MessageRetriever.java:114)
:   [javadoc] at 
com.sun.tools.doclets.internal.toolkit.util.MessageRetriever.getText(MessageRetriever.java:92)
:   [javadoc] at 
com.sun.tools.doclets.internal.toolkit.util.MessageRetriever.getText(MessageRetriever.java:81)
:   [javadoc] at 
com.sun.tools.doclets.internal.toolkit.util.MessageRetriever.warning(MessageRetriever.java:290)
:   [javadoc] at 
com.sun.tools.doclets.formats.html.HtmlDocletWriter.redirectRelativeLinks(HtmlDocletWriter.java:1526)
:   [javadoc] at 
com.sun.tools.doclets.formats.html.HtmlDocletWriter.commentTagsToString(HtmlDocletWriter.java:1438)
:   [javadoc] at 
com.sun.tools.doclets.formats.html.HtmlDocletWriter.printCommentTags(HtmlDocletWriter.java:1397)
:   [javadoc] at 
com.sun.tools.doclets.formats.html.HtmlDocletWriter.printSummaryComment(HtmlDocletWriter.java:1370)
:   [javadoc] at 
com.sun.tools.doclets.formats.html.HtmlDocletWriter.printSummaryComment(HtmlDocletWriter.java:1366)
:   [javadoc] at 
com.sun.tools.doclets.formats.html.AbstractIndexWriter.printComment(AbstractIndexWriter.java:192)
:   [javadoc] at 
com.sun.tools.doclets.formats.html.AbstractIndexWriter.printDescription(AbstractIndexWriter.java:164)
:   [javadoc] at 
com.sun.tools.doclets.formats.html.AbstractIndexWriter.generateContents(AbstractIndexWriter.java:89)
:   [javadoc] at 
com.sun.tools.doclets.formats.html.SingleIndexWriter.generateIndexFile(SingleIndexWriter.java:76)
:   [javadoc] at 
com.sun.tools.doclets.formats.html.SingleIndexWriter.generate(SingleIndexWriter.java:52)
:   [javadoc] at 
com.sun.tools.doclets.formats.html.HtmlDoclet.generateOtherFiles(HtmlDoclet.java:103)
:   [javadoc] at 
com.sun.tools.doclets.internal.toolkit.AbstractDoclet.startGeneration(AbstractDoclet.java:122)
:   [javadoc] at 
com.sun.tools.doclets.internal.toolkit.AbstractDoclet.start(AbstractDoclet.java:64)
:   [javadoc] at 
com.sun.tools.doclets.formats.html.HtmlDoclet.start(HtmlDoclet.java:42)
:   [javadoc] at 
com.sun.tools.doclets.standard.Standard.start(Standard.java:23)
:   [javadoc] at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
:   [javadoc] at 
sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:39)
:   [javadoc] at 
sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:25)
:   [javadoc] at java.lang.reflect.Method.invoke(Method.java:585)
:   [javadoc] at 
com.sun.tools.javadoc.DocletInvoker.invoke(DocletInvoker.java:269)
:   [javadoc] at 
com.sun.tools.javadoc.DocletInvoker.start(DocletInvoker.java:143)
:   [javadoc] at com.sun.tools.javadoc.Start.parseAndExecute(Start.java:340)
:   [javadoc] at com.sun.tools.javadoc.Start.begin(Start.java:128)
:   [javadoc] at com.sun.tools.javadoc.Main.execute(Main.java:41)
:   [javadoc] at com.sun.tools.javadoc.Main.main(Main.java:31)
:
:
:
: -Hoss
:



-Hoss






[Fwd: [Jetty-support] Stable Release 6.1.2]

2007-05-02 Thread Ryan McKinley


should we consider including this?

I have been using jetty-6.1.2rc3 without problems


 Original Message 
Subject: [Jetty-support] Stable Release 6.1.2
Date: Wed, 02 May 2007 14:01:39 +1000
From: Greg Wilkins <[EMAIL PROTECTED]>
Reply-To: Jetty usage,  help & informal support 
<[EMAIL PROTECTED]>
To: [EMAIL PROTECTED],	"Discussion for Jetty 
development." <[EMAIL PROTECTED]>, 
[EMAIL PROTECTED]



Jetty stable release 6.1.2 is now available via http://jetty.mortbay.org

This release has had many improvements, fixes, optimizations and
stabilization over 6.1.1.

Thanks to all those who have contributed to the project and who
have tested the pre-releases and release candidates.


jetty-6.1.2 - 1 May 2007
 + Improved unavailabile handling
 + sendError resets output state
 + Fixed session invalidation error in WadiSessionManager
 + Updated Wadi to version 2.0-M3
 + Added static member definition in WadiSessionManager
 + JETTY-322 fix ajp cpong response and close handling
 + JETTY-324 fix ant plugin
 + JETTY-328 updated jboss

jetty-6.1.2rc5 - 24 April 2007
 + set default keystore for SslSocketConnector
 + removed some compile warnings
 + Allow jsp-file to be / or /*
 + JETTY-305 delayed connection destroy
 + JETTY-309 handle close in multivalue connection fields.
 + JETTY-309 force writable status of endpoints.
 + JETTY-314 fix for possible NPE in Request.isRequestedSessionIdValid

jetty-6.1.2rc4 - 19 April 2007
 + JETTY-294 Fixed authentication reset
 + JETTY-299 handle win32 paths for object naming
 + JETTY-300 removed synchronized on dispatch
 + JETTY-302 correctly parse quoted content encodings
 + JETTY-303 fixed dual reset of generator
 + JETTY-304 Fixed authentication reset

jetty-6.1.2rc3 - 16 April 2007
 + Improved performance and exclusions for TLD scanning
 + MBean properties assume writeable unless marked RO
 + refactor of SessionManager and SessionIdManager for clustering
 + Improvements to allow simple setting of Cache-Control headers
 + AJP redirects https requests correctly
 + Fixed writes of unencoded char arrays.
 + JETTY-283 Parse 206 and 304 responses in client
 + JETTY-285 enable jndi for mvn jetty:run-war and jetty:run-exploded
 + JETTY-289 fixed javax.net.ssl.SSLException on binary file upload
 + JETTY-292 Fixed error page handler error pages
 + JETTY-293 fixed NPE on fast init
 + JETTY-294 Response.reset() resets headers as well as content
 + JETTY-295 Optional support of authenticated welcome files
 + JETTY-296 Close direct content inputstreams
 + JETTY-297 Recreate tmp dir on stop/start
 + JETTY-298 Names in JMX ObjectNames for context, servlets and filters

jetty-6.1.2rc2 - 27 March 2007
 + Enable the SharedStoreContextualiser for the 
WadiSessionManager(Database stor

e for clustering)
 + AJP13 CPING request and CPONG response implemented
 + AJP13 Shutdown Request from peer implemented
 + AJP13 remoteUser, contextPath, servletPath requests implemented
 + Change some JNDI logging to debug level instead of info
 + Update jasper to glassfish tag SJSAS-9_1-B39-RC-14_Mar_2007
 + Optimized multi threaded init on startup servlets
 + Removed unneeded specialized TagLibConfiguration class from maven plugin
 + Refactor Scanner to increase code reuse with maven/ant plugins
 + Added RestFilter for PUT and DELETE from Aleksi Kallio
 + Make annotations work for maven plugin
 + JETTY-125 maven plugin: ensure test dependencies on classpath for 

sspath>
 + JETTY-246 path encode cookies rather than quote
 + JETTY-254 prevent close of jar entry by bad JVMs
 + JETTY-256 fixed isResumed and work around JVM bug
 + JETTY-258 duplicate log message in ServletHandler
 + JETTY-260 Close connector before stop
 + JETTY-262 Allow acceptor thread priority to be adjusted
 + JETTY-263 Added implementation for authorizationType Packets
 + JETTY-265 Only quote cookie values if needed
 + JETTY-266 Fix deadlock with shutdown
 + JETTY-271 ResourceHandler uses resource for MimeType mapping
 + JETTY-272 Activate and Passivate events for sessions
 + JETTY-274 Improve flushing at end of request for blocking
 + JETTY-276 Partial fix for reset/close race
 + JETTY-277 Improved ContextHandlerCollection
 + JETTY-278 Session invalidation delay until no requests
 + JETTY-278 Only unwrap one layer of ServletExceptions
 + JETTY-280 Fixed deadlock with two flushing threads
 + JETTY-284 Fixed stop connector race
 + JETTY-286 isIntegral and isConfidential methods overridden in 
SslSelectChanne

lConnector

jetty-6.1.2rc1 - 8 March 2007
 + TagLibConfiguration uses resource input stream
 + Improved handling of early close in AJP
 + add ajp connector jar to jetty-jboss sar
 + Improved Context setters for wadi support
 + fix Dump servlet to handle primitive array types
 + handle comma separated values for the Connection: header
 + Added option to allow null pathInfo within context
 + BoundedThreadPool queues rather than blocks excess jobs.
 + Support null pathInfo option for webservices deployed to jetty/jboss
 + Workarou

Custom fragmenter

2007-05-02 Thread Brian Whitman
Would love to set a custom fragmenter in Solr for highlighting. But I  
don't see a way to change the fragmenter "on the fly." Should this be  
a solrconfig/schema setting?





Re: [Fwd: [Jetty-support] Stable Release 6.1.2]

2007-05-02 Thread Otis Gospodnetic
I'm about to put Solr in Jetty 6.1.2 and can report the results next week.  
Generally speaking, I'd be for trying 6.1.2, as I've been watching their bug 
reports and I see they've slowed down a lot.

Otis
 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
Simpy -- http://www.simpy.com/  -  Tag  -  Search  -  Share

- Original Message 
From: Ryan McKinley <[EMAIL PROTECTED]>
To: solr-dev@lucene.apache.org
Sent: Wednesday, May 2, 2007 11:21:14 AM
Subject: [Fwd: [Jetty-support] Stable Release 6.1.2]


should we consider including this?

I have been using jetty-6.1.2rc3 without problems


 Original Message 
Subject: [Jetty-support] Stable Release 6.1.2
Date: Wed, 02 May 2007 14:01:39 +1000
From: Greg Wilkins <[EMAIL PROTECTED]>
Reply-To: Jetty usage,  help & informal support 
<[EMAIL PROTECTED]>
To: [EMAIL PROTECTED],"Discussion for Jetty 
development." <[EMAIL PROTECTED]>, 
[EMAIL PROTECTED]


Jetty stable release 6.1.2 is now available via http://jetty.mortbay.org

This release has had many improvements, fixes, optimizations and
stabilization over 6.1.1.

Thanks to all those who have contributed to the project and who
have tested the pre-releases and release candidates.


jetty-6.1.2 - 1 May 2007
  + Improved unavailabile handling
  + sendError resets output state
  + Fixed session invalidation error in WadiSessionManager
  + Updated Wadi to version 2.0-M3
  + Added static member definition in WadiSessionManager
  + JETTY-322 fix ajp cpong response and close handling
  + JETTY-324 fix ant plugin
  + JETTY-328 updated jboss

jetty-6.1.2rc5 - 24 April 2007
  + set default keystore for SslSocketConnector
  + removed some compile warnings
  + Allow jsp-file to be / or /*
  + JETTY-305 delayed connection destroy
  + JETTY-309 handle close in multivalue connection fields.
  + JETTY-309 force writable status of endpoints.
  + JETTY-314 fix for possible NPE in Request.isRequestedSessionIdValid

jetty-6.1.2rc4 - 19 April 2007
  + JETTY-294 Fixed authentication reset
  + JETTY-299 handle win32 paths for object naming
  + JETTY-300 removed synchronized on dispatch
  + JETTY-302 correctly parse quoted content encodings
  + JETTY-303 fixed dual reset of generator
  + JETTY-304 Fixed authentication reset

jetty-6.1.2rc3 - 16 April 2007
  + Improved performance and exclusions for TLD scanning
  + MBean properties assume writeable unless marked RO
  + refactor of SessionManager and SessionIdManager for clustering
  + Improvements to allow simple setting of Cache-Control headers
  + AJP redirects https requests correctly
  + Fixed writes of unencoded char arrays.
  + JETTY-283 Parse 206 and 304 responses in client
  + JETTY-285 enable jndi for mvn jetty:run-war and jetty:run-exploded
  + JETTY-289 fixed javax.net.ssl.SSLException on binary file upload
  + JETTY-292 Fixed error page handler error pages
  + JETTY-293 fixed NPE on fast init
  + JETTY-294 Response.reset() resets headers as well as content
  + JETTY-295 Optional support of authenticated welcome files
  + JETTY-296 Close direct content inputstreams
  + JETTY-297 Recreate tmp dir on stop/start
  + JETTY-298 Names in JMX ObjectNames for context, servlets and filters

jetty-6.1.2rc2 - 27 March 2007
  + Enable the SharedStoreContextualiser for the 
WadiSessionManager(Database stor
e for clustering)
  + AJP13 CPING request and CPONG response implemented
  + AJP13 Shutdown Request from peer implemented
  + AJP13 remoteUser, contextPath, servletPath requests implemented
  + Change some JNDI logging to debug level instead of info
  + Update jasper to glassfish tag SJSAS-9_1-B39-RC-14_Mar_2007
  + Optimized multi threaded init on startup servlets
  + Removed unneeded specialized TagLibConfiguration class from maven plugin
  + Refactor Scanner to increase code reuse with maven/ant plugins
  + Added RestFilter for PUT and DELETE from Aleksi Kallio
  + Make annotations work for maven plugin
  + JETTY-125 maven plugin: ensure test dependencies on classpath for 

  + JETTY-246 path encode cookies rather than quote
  + JETTY-254 prevent close of jar entry by bad JVMs
  + JETTY-256 fixed isResumed and work around JVM bug
  + JETTY-258 duplicate log message in ServletHandler
  + JETTY-260 Close connector before stop
  + JETTY-262 Allow acceptor thread priority to be adjusted
  + JETTY-263 Added implementation for authorizationType Packets
  + JETTY-265 Only quote cookie values if needed
  + JETTY-266 Fix deadlock with shutdown
  + JETTY-271 ResourceHandler uses resource for MimeType mapping
  + JETTY-272 Activate and Passivate events for sessions
  + JETTY-274 Improve flushing at end of request for blocking
  + JETTY-276 Partial fix for reset/close race
  + JETTY-277 Improved ContextHandlerCollection
  + JETTY-278 Session invalidation delay until no requests
  + JETTY-278 Only unwrap one layer of ServletExceptions
  + JETTY-280 Fixed deadlock with two flushing threads
  + JETTY-284 Fixed stop connector race
  + JETTY-286 isIntegral and isConf

Re: Custom fragmenter

2007-05-02 Thread Chris Hostetter

: don't see a way to change the fragmenter "on the fly." Should this be
: a solrconfig/schema setting?

yeah, it would probably make sense if there was a way to specify that in
the config ... i'm not really sure exactly how it shoudl work though since
i don't relaly use highlighting ... on one hand we could have a set of
named Fragmenters similar to the set of named request handlers and the
HighlightingUtils could access it directly based on the query params.  on
the other hand each request handler could have it's own specified list of
Fragmenter classes.

Mike's really the go-to-guy for highlighting, i'm sure he's thoughta
little bit about what a patch like this should look like.



-Hoss



[jira] Updated: (SOLR-69) PATCH:MoreLikeThis support

2007-05-02 Thread Ryan McKinley (JIRA)

 [ 
https://issues.apache.org/jira/browse/SOLR-69?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Ryan McKinley updated SOLR-69:
--

Attachment: SOLR-69-MoreLikeThisRequestHandler.patch
lucene-queries-2.1.1-dev.jar

Changed the MoreLikeThis implementation to be a standalone request handler 
rather then tacked on to standard/dismax request handlers

How are other people using this patch?  I found that i am always looking for 
things that are similar to a single document.

This is still in progress, but posting for feedback.

An example command would be:
http://localhost:8983/solr/mlt?q=apache&qt=standard&mlt=true&mlt.fl=manu,cat&mlt.mindf=1&mlt.mindf=1&fl=id,score




> PATCH:MoreLikeThis support
> --
>
> Key: SOLR-69
> URL: https://issues.apache.org/jira/browse/SOLR-69
> Project: Solr
>  Issue Type: Improvement
>  Components: search
>Reporter: Bertrand Delacretaz
>Priority: Minor
> Attachments: lucene-queries-2.0.0.jar, lucene-queries-2.1.1-dev.jar, 
> SOLR-69-MoreLikeThisRequestHandler.patch, SOLR-69.patch, SOLR-69.patch, 
> SOLR-69.patch, SOLR-69.patch
>
>
> Here's a patch that implements simple support of Lucene's MoreLikeThis class.
> The MoreLikeThisHelper code is heavily based on (hmm..."lifted from" might be 
> more appropriate ;-) Erik Hatcher's example mentioned in 
> http://www.mail-archive.com/[EMAIL PROTECTED]/msg00878.html
> To use it, add at least the following parameters to a standard or dismax 
> query:
>   mlt=true
>   mlt.fl=list,of,fields,which,define,similarity
> See the MoreLikeThisHelper source code for more parameters.
> Here are two URLs that work with the example config, after loading all 
> documents found in exampledocs in the index (just to show that it seems to 
> work - of course you need a larger corpus to make it interesting):
> http://localhost:8983/solr/select/?stylesheet=&q=apache&qt=standard&mlt=true&mlt.fl=manu,cat&mlt.mindf=1&mlt.mindf=1&fl=id,score
> http://localhost:8983/solr/select/?stylesheet=&q=apache&qt=dismax&mlt=true&mlt.fl=manu,cat&mlt.mindf=1&mlt.mindf=1&fl=id,score
> Results are added to the output like this:
> 
>   ...
>   
> 
>   
> 1.5293242
> SOLR1000
>   
> 
> 
>   
> 1.5293242
> UTF8TEST
>   
> 
>   
> I haven't tested this extensively yet, will do in the next few days. But 
> comments are welcome of course.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.



Re: Custom fragmenter

2007-05-02 Thread Mike Klaas

On 5/2/07, Brian Whitman <[EMAIL PROTECTED]> wrote:

Would love to set a custom fragmenter in Solr for highlighting. But I
don't see a way to change the fragmenter "on the fly." Should this be
a solrconfig/schema setting?


It would be nice to able to register custom formatters and
subsequently use them in queries:


   com.business.MyFormatter


...&hl.formatter=mycustom

The main reason I've hesitated in adding this kind of configuration is
that it exposes Solr's implementation details (in particular, the use
of lucene's contrib Highlighter package).  There are other options
floating out there (including a span-based one which would correctly
highlight phrase matches).

Then again, chances are that any new highlighting package would
endeavour to support old-style formatters and scorers anyway...

-Mike


[jira] Commented: (SOLR-69) PATCH:MoreLikeThis support

2007-05-02 Thread Brian Whitman (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-69?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12493185
 ] 

Brian Whitman commented on SOLR-69:
---

I've personally never understood the "more documents that don't match this 
query but are like the documents in this query" usage of SOLR-69. MLT results 
(to me) should be like any other result, except by querying by text you are 
querying by document ID.  I'm confused as to how querying by query would work 
-- if a query for 'apache' returned 10 docs, would MLT work on each one and 
generate n more docs per doc? And would the original query results get 
returned? What's the ordering?

But I do know that paging and faceting should definitely work on MLT results. 
(Ryan's patch seems to implement this but I haven't tested it.) MLT results 
should look and operate like any other results. 






> PATCH:MoreLikeThis support
> --
>
> Key: SOLR-69
> URL: https://issues.apache.org/jira/browse/SOLR-69
> Project: Solr
>  Issue Type: Improvement
>  Components: search
>Reporter: Bertrand Delacretaz
>Priority: Minor
> Attachments: lucene-queries-2.0.0.jar, lucene-queries-2.1.1-dev.jar, 
> SOLR-69-MoreLikeThisRequestHandler.patch, SOLR-69.patch, SOLR-69.patch, 
> SOLR-69.patch, SOLR-69.patch
>
>
> Here's a patch that implements simple support of Lucene's MoreLikeThis class.
> The MoreLikeThisHelper code is heavily based on (hmm..."lifted from" might be 
> more appropriate ;-) Erik Hatcher's example mentioned in 
> http://www.mail-archive.com/[EMAIL PROTECTED]/msg00878.html
> To use it, add at least the following parameters to a standard or dismax 
> query:
>   mlt=true
>   mlt.fl=list,of,fields,which,define,similarity
> See the MoreLikeThisHelper source code for more parameters.
> Here are two URLs that work with the example config, after loading all 
> documents found in exampledocs in the index (just to show that it seems to 
> work - of course you need a larger corpus to make it interesting):
> http://localhost:8983/solr/select/?stylesheet=&q=apache&qt=standard&mlt=true&mlt.fl=manu,cat&mlt.mindf=1&mlt.mindf=1&fl=id,score
> http://localhost:8983/solr/select/?stylesheet=&q=apache&qt=dismax&mlt=true&mlt.fl=manu,cat&mlt.mindf=1&mlt.mindf=1&fl=id,score
> Results are added to the output like this:
> 
>   ...
>   
> 
>   
> 1.5293242
> SOLR1000
>   
> 
> 
>   
> 1.5293242
> UTF8TEST
>   
> 
>   
> I haven't tested this extensively yet, will do in the next few days. But 
> comments are welcome of course.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.



[jira] Commented: (SOLR-69) PATCH:MoreLikeThis support

2007-05-02 Thread Ken Krugler (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-69?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12493198
 ] 

Ken Krugler commented on SOLR-69:
-

Ryan & Brian's comments above are (I think) indicative of how most people want 
to use MLT - you've got a single document, and you want to show other documents 
that are similar.

The way we deal with this is to do a query on the  field (as defined 
in the schema).

If this was the only use case, then the syntax could be something like:

[http://localhost:8983/solr/mlt?uid=xxx&mlt.fl=manu,cat&mindf=1&rows=10]

The uid parameter would implicitly be applied against the  field as 
specified in the schema.

But that's just for my use case - others may want the ability to have mlt 
results returned for the first hit result of an arbitrary query.

> PATCH:MoreLikeThis support
> --
>
> Key: SOLR-69
> URL: https://issues.apache.org/jira/browse/SOLR-69
> Project: Solr
>  Issue Type: Improvement
>  Components: search
>Reporter: Bertrand Delacretaz
>Priority: Minor
> Attachments: lucene-queries-2.0.0.jar, lucene-queries-2.1.1-dev.jar, 
> SOLR-69-MoreLikeThisRequestHandler.patch, SOLR-69.patch, SOLR-69.patch, 
> SOLR-69.patch, SOLR-69.patch
>
>
> Here's a patch that implements simple support of Lucene's MoreLikeThis class.
> The MoreLikeThisHelper code is heavily based on (hmm..."lifted from" might be 
> more appropriate ;-) Erik Hatcher's example mentioned in 
> http://www.mail-archive.com/[EMAIL PROTECTED]/msg00878.html
> To use it, add at least the following parameters to a standard or dismax 
> query:
>   mlt=true
>   mlt.fl=list,of,fields,which,define,similarity
> See the MoreLikeThisHelper source code for more parameters.
> Here are two URLs that work with the example config, after loading all 
> documents found in exampledocs in the index (just to show that it seems to 
> work - of course you need a larger corpus to make it interesting):
> http://localhost:8983/solr/select/?stylesheet=&q=apache&qt=standard&mlt=true&mlt.fl=manu,cat&mlt.mindf=1&mlt.mindf=1&fl=id,score
> http://localhost:8983/solr/select/?stylesheet=&q=apache&qt=dismax&mlt=true&mlt.fl=manu,cat&mlt.mindf=1&mlt.mindf=1&fl=id,score
> Results are added to the output like this:
> 
>   ...
>   
> 
>   
> 1.5293242
> SOLR1000
>   
> 
> 
>   
> 1.5293242
> UTF8TEST
>   
> 
>   
> I haven't tested this extensively yet, will do in the next few days. But 
> comments are welcome of course.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.



[jira] Updated: (SOLR-103) SQL Upload Plugin

2007-05-02 Thread Ryan McKinley (JIRA)

 [ 
https://issues.apache.org/jira/browse/SOLR-103?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Ryan McKinley updated SOLR-103:
---

Attachment: SOLR-103-SQLUpdateRequestHandler.patch

compiles with trunk + SOLR-193 + SOLR-139

Far from complete, but works for my immediate needs

> SQL Upload Plugin
> -
>
> Key: SOLR-103
> URL: https://issues.apache.org/jira/browse/SOLR-103
> Project: Solr
>  Issue Type: Improvement
>  Components: update
>Affects Versions: 1.2
>Reporter: Ryan McKinley
> Fix For: 1.2
>
> Attachments: SOLR-103-SQLUpdateRequestHandler.patch, 
> SOLR-103-SQLUpdateRequestHandler.patch
>
>
> Solr needs an easy way to upload lots of files directly from SQL.
> See also: SOLR-66 (CSV uploader)

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.



dynamic copyFields

2007-05-02 Thread Ryan McKinley


I'm looking for a way to copy from a dynamic field to another dynamic field.

I found this post from september:
http://www.nabble.com/copyField-to-a-dynamic-field-tf2300115.html#a6419101

Essentially, I have:
 
 

and want:
 

Any thoughts about how to implement this?

Matching the pattern looks straight forward and would not adversely 
affect the speed for anything that does not use patterns, but generating 
a dynamic field would require changing the final targetField variable in 
IndexSchema.DynamicCopy to a function.


There is a comment that says:  (line 655)
// Instead of storing a type, this could be implemented as a hierarchy
// with a virtual matches().
// Given how often a search will be done, however, speed is the overriding
// concern and I'm not sure which is faster.

Any lasting concerns?


thanks
ryan







Re: dynamic copyFields

2007-05-02 Thread Chris Hostetter

: Essentially, I have:
:   
:   
:
: and want:
:   

i haven't thought about the underlying impl at all, but from an
API/configuration standpoint one tough issue is that fact that dynamic
fields and the "source" of copyField have always been based on glob style
expressions, switching to regexes to support matching semantics would be
tricky to do while remaining backwards compatible.


-Hoss



Re: dynamic copyFields

2007-05-02 Thread Ryan McKinley

Chris Hostetter wrote:

: Essentially, I have:
:   
:   
:
: and want:
:   

i haven't thought about the underlying impl at all, but from an
API/configuration standpoint one tough issue is that fact that dynamic
fields and the "source" of copyField have always been based on glob style
expressions, switching to regexes to support matching semantics would be
tricky to do while remaining backwards compatible.



How about Mike's other suggestion:
 

this would keep the glob style for "source" and "dest", but use "regex" 
to transform a sorce -> dest




Re: dynamic copyFields

2007-05-02 Thread Mike Klaas

On 5/2/07, Ryan McKinley <[EMAIL PROTECTED]> wrote:


How about Mike's other suggestion:
  

this would keep the glob style for "source" and "dest", but use "regex"
to transform a sorce -> dest


Wow, I didn't even remember suggesting that.  I agree (with Hoss) that
backward compatibility is important, but I disagree (with myself) that
the above syntax is nice.  Outside of perl, I'm not sure how common
the s/ / / syntax is (is it used in java?)

perhaps



?

-Mike


Re: dynamic copyFields

2007-05-02 Thread Walter Underwood
That syntax is from the "ed" editor. I learned it in 1975
on Unix v6/PWB, running on a PDP-11/70. --wunder

On 5/2/07 5:04 PM, "Mike Klaas" <[EMAIL PROTECTED]> wrote:

> On 5/2/07, Ryan McKinley <[EMAIL PROTECTED]> wrote:
> 
>> How about Mike's other suggestion:
>>   
>> 
>> this would keep the glob style for "source" and "dest", but use "regex"
>> to transform a sorce -> dest
> 
> Wow, I didn't even remember suggesting that.  I agree (with Hoss) that
> backward compatibility is important, but I disagree (with myself) that
> the above syntax is nice.  Outside of perl, I'm not sure how common
> the s/ / / syntax is (is it used in java?)
> 
> perhaps
> 
> 
> 
> ?
> 
> -Mike



Re: dynamic copyFields

2007-05-02 Thread Ryan McKinley



perhaps





how about:


useRegex="true" ?

Syntax aside, the major implication is that DynamicCopy would need a 
virtual function:

 SchemaField getTargetField()

rather then direct access to a final SchemaField.  I don't have any real 
sense if that is a big deal or not, but it seems ok to me ;)



ryan











[jira] Updated: (SOLR-212) Embeddable class to call solr directly

2007-05-02 Thread Brian Whitman (JIRA)

 [ 
https://issues.apache.org/jira/browse/SOLR-212?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Brian Whitman updated SOLR-212:
---

Attachment: embeddedSolr.zip

I didn't have time to extract it from Cocoa/ObjC. Here is the xcode project 
with everything anyway -- so right now you'll need OSX to try this out. This is 
a very simple test of Ryan's SOLR-212 patch, it queries, adds a document and 
commits, all without a web server! Let me know if you have any questions. 
--brian


> Embeddable class to call solr directly
> --
>
> Key: SOLR-212
> URL: https://issues.apache.org/jira/browse/SOLR-212
> Project: Solr
>  Issue Type: Improvement
>Reporter: Ryan McKinley
> Assigned To: Ryan McKinley
>Priority: Minor
> Fix For: 1.2
>
> Attachments: embeddedSolr.zip, SOLR-212-DirectSolrConnection.patch, 
> SOLR-212-DirectSolrConnection.patch, SOLR-212-DirectSolrConnection.patch, 
> SOLR-212-DirectSolrConnection.patch, SOLR-212-DirectSolrConnection.patch
>
>
> For some embedded applications, it is useful to call solr without running an 
> HTTP server.  This class mimics the behavior you would get if you sent the 
> request through an HTTP connection.  It is designed to work nicely (ie 
> simple) with JNI
> the main function is:
> public class DirectSolrConnection 
> {
>   String request( String pathAndParams, String body ) throws Exception
>   {
> ...
>   }
> }

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.



Re: dynamic copyFields

2007-05-02 Thread Erik Hatcher


On May 2, 2007, at 7:10 PM, Ryan McKinley wrote:

and want:
 


Why even bother with regexs at all?



simply replace the * match in the source in the * position in the  
dest.  Granted it doesn't have the power of regex to morph things  
across, but maybe a simple glob/replace is all that is needed?


Erik



[jira] Created: (SOLR-224) PhoneticFilterFactory -- support Metaphone/Soundex filters

2007-05-02 Thread Ryan McKinley (JIRA)
PhoneticFilterFactory -- support Metaphone/Soundex filters
--

 Key: SOLR-224
 URL: https://issues.apache.org/jira/browse/SOLR-224
 Project: Solr
  Issue Type: New Feature
Reporter: Ryan McKinley
Priority: Minor


A simple FilterFactory to replace or inject terms encoded with commons codec 
functions:

http://jakarta.apache.org/commons/codec/api-release/org/apache/commons/codec/language/package-summary.html

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.



[jira] Updated: (SOLR-224) PhoneticFilterFactory -- support Metaphone/Soundex filters

2007-05-02 Thread Ryan McKinley (JIRA)

 [ 
https://issues.apache.org/jira/browse/SOLR-224?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Ryan McKinley updated SOLR-224:
---

Attachment: commons-codec-1.3.jar
SOLR-224-PhoneticFilterFactory.patch

A PhoneticFilterFactory that lets you choose one of:
 "DoubleMetaphone", "Metaphone", "Soundex", "RefinedSoundex"

The javadocs need some work and I'll take another cleanup pass.  There should 
also be somethign in the example schema.xml

(note: this started as LIA example code)



> PhoneticFilterFactory -- support Metaphone/Soundex filters
> --
>
> Key: SOLR-224
> URL: https://issues.apache.org/jira/browse/SOLR-224
> Project: Solr
>  Issue Type: New Feature
>Reporter: Ryan McKinley
>Priority: Minor
> Attachments: commons-codec-1.3.jar, 
> SOLR-224-PhoneticFilterFactory.patch
>
>
> A simple FilterFactory to replace or inject terms encoded with commons codec 
> functions:
> http://jakarta.apache.org/commons/codec/api-release/org/apache/commons/codec/language/package-summary.html

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.