Re: Solr logo poll

2007-04-10 Thread Waheed Mohammed
A

On Friday 06 April 2007 19:51, Yonik Seeley wrote:
> Quick poll...  Solr 1.2 release planning is underway, and a new logo
> may be a part of that.
> What "form" of logo do you prefer, A or B?  There may be further
> tweaks to these pictures, but I'd like to get a sense of what the user
> community likes.
>
> A) http://issues.apache.org/jira/secure/attachment/12349897/logo-solr-d.jpg
>
> B)
> http://issues.apache.org/jira/secure/attachment/12353535/12353535_solr-nick.gif
>
> Just respond to this thread with your preference.
>
> -Yonik

-- 
Abdul Waheed, Mohammed
Fachinformationszentrum Technik e.V.
IT - Software Development
Tel.: (+49) 69 43 08-255
Fax. (+49) 69 43 08-200
e-mail: [EMAIL PROTECTED]

Hanauer Landstr. 151-153
60314 Frankfurt am Main
Register of associations at the Frankfurt am Main district court, No. 7283
Chairman: Dr. Berthold Grützmacher
Managing Director: Peter Genth



Re: Solr logo poll

2007-04-10 Thread Paul Wootton
B


[Standings] Solr logo poll

2007-04-10 Thread Thorsten Scherler
Hi all,

I did a quick count; so far we have:
a) 21
b) 13

salu2
-- 
Thorsten Scherler thorsten.at.apache.org
Open Source Java  consulting, training and solutions



Re: embedding solr

2007-04-10 Thread Daniel Einspanjer

Ryan,

Do you have any of this code you could share?  I am currently using
Solr to perform thousands of queries in a batch, and eliminating the
HTTP overhead is something I'd love to do if it isn't complicated.  We
need several of the extra features Solr provides, which is why we are
trying to use it instead of Lucene directly.

On 4/2/07, Ryan McKinley <[EMAIL PROTECTED]> wrote:
> I have embedded solr skipping HTTP transport altogether.  It was
> remarkably easy to link directly to request handlers skipping the
> dispatch filter and using the DocList and associated data in the
> SolrQueryResponse directly.


Ideas for a relevance score that could be considered stable across multiple searches with the same query structure?

2007-04-10 Thread Daniel Einspanjer

I did a bit of research on the list for prior discussions of
normalized scores and such.  Please forgive me if I overlooked
something relevant, but I didn't see anything that is exactly what
I'm looking for.

I am building a replacement for our current text matching engine that
takes a list of documents from feed A and finds the best match for
each of those in the list of documents from feed B.  For purposes of
this example, feed A and B might have the fields:
title; director; year

The people reviewing this matching process need some way of
determining why a particular match was made other than the overall
score.  Was it because the title was a perfect match or was it because
the title wasn't that close, but the director and year were dead on?

The current idea I have for a strategy to provide this information
would be to run my query four times (n + 1, where n is the number of
scoring sections): once to find the overall best match (a regular
query), then once per section, each time grouping, requiring, and
boosting a different section of the query.  For each section query I
would then store the rank of the "best" item returned by the overall
query.  That rank can be used to indicate the relevance of that item
based on the defined criteria.

So, following the fields mentioned above, my queries would be:

The natural "overall" query:
(title:"feed A item one title"^10 (+title:feed~ +title:A~ +title:item~
+title:one~ +title:title~)) director:"Director, Feed A." (year:1974^10
year:[1972 TO 1976])

The query for title relevance:
+((title:"feed A item one title"^10 (+title:feed~ +title:A~
+title:item~ +title:one~ +title:title~)))^100 director:"Director, Feed
A." (year:1974^10 year:[1972 TO 1976])

The query for director relevance:
+(director:"Director, Feed A.")^100 (title:"feed A item one title"^10
(+title:feed~ +title:A~ +title:item~ +title:one~ +title:title~))
(year:1974^10 year:[1972 TO 1976])

The query for year relevance:
+((year:1974^10 year:[1972 TO 1976]))^100 (title:"feed A item one
title"^10 (+title:feed~ +title:A~ +title:item~ +title:one~
+title:title~)) director:"Director, Feed A."
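
The pattern in the four queries above (one overall query, plus one query per section with that section required and boosted) can be generated mechanically.  What follows is only a minimal sketch in plain Java string handling, not anything Solr or Lucene provides; the class name SectionQueries, its method names, and the shortened title clause in main are made up for illustration.

```java
import java.util.ArrayList;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

/** Hypothetical helper: builds the n + 1 query strings from named query sections. */
final class SectionQueries {

    /** The overall query: every section is optional, joined by spaces. */
    static String overall(Map<String, String> sections) {
        return String.join(" ", sections.values());
    }

    /** Requires and boosts one section; the remaining sections stay optional. */
    static String forSection(Map<String, String> sections, String required, int boost) {
        List<String> parts = new ArrayList<>();
        parts.add("+(" + sections.get(required) + ")^" + boost);
        for (Map.Entry<String, String> e : sections.entrySet()) {
            if (!e.getKey().equals(required)) {
                parts.add(e.getValue());
            }
        }
        return String.join(" ", parts);
    }

    public static void main(String[] args) {
        // LinkedHashMap keeps the sections in insertion order.
        Map<String, String> sections = new LinkedHashMap<>();
        // Title clause shortened here for readability.
        sections.put("title", "(title:\"feed A item one title\"^10 (+title:feed~ +title:A~))");
        sections.put("director", "director:\"Director, Feed A.\"");
        sections.put("year", "(year:1974^10 year:[1972 TO 1976])");

        System.out.println(overall(sections));
        for (String name : sections.keySet()) {
            System.out.println(forSection(sections, name, 100));
        }
    }
}
```

Keeping the sections in a map means adding a fourth scoring section only requires one more put() call.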

If the #1 item returned by the overall query was ranked 1 of 10 by the
title query, 3 of 10 by the director query, and 5 of 10 by the year
query, and each of those three scoring sections mapped ranks 1 through
10 to weights 1.0 down to 0.1, then I would be able to display the
following scores:
title: 1.0
director: .8
year: .6
overall: 2.4
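
The arithmetic behind those numbers, as a small sketch.  It assumes a linear mapping from rank to weight (rank 1 of 10 gives 1.0, rank 10 of 10 gives 0.1) and, as a further assumption of this sketch, scores an item that does not appear in the page at all as 0.0.  The class and method names are hypothetical.

```java
/** Hypothetical helper: turns a rank within a result page into a per-section score. */
final class RankScore {

    /** Rank 1 maps to 1.0 and the last rank maps to 1/pageSize; out-of-page ranks score 0.0. */
    static double scoreForRank(int rank, int pageSize) {
        if (rank < 1 || rank > pageSize) {
            return 0.0; // assumption: an item missing from the page contributes nothing
        }
        return (pageSize - rank + 1) / (double) pageSize;
    }

    public static void main(String[] args) {
        String[] names = {"title", "director", "year"};
        int[] ranks = {1, 3, 5}; // ranks of the overall best item in each section query
        double overall = 0.0;
        for (int i = 0; i < names.length; i++) {
            double score = scoreForRank(ranks[i], 10);
            overall += score;
            System.out.println(names[i] + ": " + score);
        }
        System.out.println("overall: " + overall);
    }
}
```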


I looked at the javadocs related to the FunctionQuery class because it
looked interesting, but the actual docs were a bit light and I wasn't
able to determine if it might help me out with this need.

Does this sound unreasonable to anyone? Is there a clearly better way
I might have overlooked?

Thank you very much for your ideas and comments,

Daniel


Sort by index order desc

2007-04-10 Thread jb

Hello,

With Lucene, instead of returning results by date, I return results in the
reverse index order (last doc added is returned first)

For example :

hits = searcher.search(luceneQuery, new Sort(new SortField(null,
SortField.DOC, true)));

How can I do that with Solr ?
Thanks.

JBC


Schema validator/debugger

2007-04-10 Thread Andrew Nagy

Does anyone have a good method of debugging a schema?
I have been struggling to get my new schema to run for the past couple 
of days and just do not see anything wrong with the XML document.


Thanks
Andrew


Re: embedding solr

2007-04-10 Thread Ryan McKinley

There is nothing particularly magic to it.  It just fills up
SolrParams directly (see any of the tests), calls the requestHandler,
and then walks through the Documents.  Something like:


 // gets the standard one
 SolrRequestHandler handler = core.getRequestHandler( "" );

 SolrQueryResponse rsp = new SolrQueryResponse();
 core.execute( handler, sreq, rsp );

 IndexReader reader = sreq.getSearcher().getReader();
 DocListAndSet response = (DocListAndSet)rsp.getValues().get( "response" );
 DocIterator iter = response.docList.iterator();
 while( iter.hasNext() ) {
   Document doc = reader.document( iter.next() );
   // ...
 }

ryan





Re: Schema validator/debugger

2007-04-10 Thread Ryan McKinley

What errors are you getting?  Are there exceptions in the log when it starts up?

On 4/10/07, Andrew Nagy <[EMAIL PROTECTED]> wrote:
> Does anyone have a good method of debugging a schema?
> I have been struggling to get my new schema to run for the past couple
> of days and just do not see anything wrong with the XML document.
>
> Thanks
> Andrew



Re: embedding solr

2007-04-10 Thread Daniel Einspanjer

That is good to hear. I guess I was overly worried when I saw your
mention of having trouble getting the field values with the correct
types.  I will be taking a look at this later this week.

Thank you very much for your prompt response.

On 4/10/07, Ryan McKinley <[EMAIL PROTECTED]> wrote:
> There is nothing particularly magic to it.


Re: SolrSharp - a C# client API for Solr

2007-04-10 Thread JimS

Thanx for the great contribution Jeff!  A hand clap to the Solr team too.


I am looking forward to using Solr and Solr# in the coming months.  Your
client is going to be a great help.

regards,
-jim


On 4/9/07, Jeff Rodenburg <[EMAIL PROTECTED]> wrote:


> All -
>
> I'm proud to announce a release of a new client API for Solr -- SolrSharp.
> SolrSharp is a C# library that abstracts the interoperation of a solr search
> server.  This is an initial release that covers the basics of working with
> Solr.  The library is very fleshed out, but the example has only implemented
> simple keyword search.  I really like the library (I'm a dogfood user, for
> sure) because I can strongly type different types of objects to search
> results.
>
> There's more forthcoming, i.e. more examples, but the basics are in place.
> Feedback always appreciated, suggestions for improvement are nice, and
> helping hands are the best.
>
> Until there's a better home for it, you can download the bits from JIRA at:
> https://issues.apache.org/jira/browse/SOLR-205
>
> cheers,
> jeff r.



Re: Sort by index order desc

2007-04-10 Thread Yonik Seeley

On 4/10/07, jb <[EMAIL PROTECTED]> wrote:
> With Lucene, instead of returning results by date, I return results in the
> reverse index order (last doc added is returned first)
>
> For example :
>
> hits = searcher.search(luceneQuery, new Sort(new SortField(null,
> SortField.DOC, true)));
>
> How can I do that with Solr ?


It's not currently supported, but it would be easy to implement.
We would just need to come up with a pseudo field-name to represent
the lucene document id, and then add support for that in the Sort
parsing code.

You could also add a timestamp field to every document (you could even
have it default to "NOW" so you wouldn't need to set it), and then
sort by that.  That does take more memory than sorting by index order
of course.

-Yonik


SOLR with Jetty 6.1.1

2007-04-10 Thread Nathan Woodhull

Solr-ites,

I posted this question to the Jetty list, but I figured I would post
it here too, since it also affects the Solr documentation.

Here at ActBlue.com I have been working on integrating Solr with our
Rails application in order to provide a faceted search interface to
our directory of candidates for elected office. Currently the
directory is implemented with SQL - we explored postgres tsearch but
decided to go with Solr for a variety of reasons (I had used it
before, and faceted browsing would be a pain to implement from scratch).

Since our platform is rebranded and used for other community
fundraising sites we need to run multiple instances of Solr within our
environment. I installed Jetty 6.1.1 and created an xml file for each
instance in the jetty contexts directory.

Unfortunately, the syntax for specifying JNDI resources seems to have
changed since the Solr docs were written. I tried the following XML,
but it does not seem to be working. The app starts up fine, but the
JNDI solr/home resource does not seem to get loaded.


http://jetty.mortbay.org/configure.dtd";>

Loading ActBlue
solr configuration.

/test
/solr/shared-solr.war


  solr/home
  /solr/actblue/entities
  false




Do any of you know the new Jetty syntax required to declare the JNDI
environment entry properly?

Thanks,

Nathan


--
Nathan Woodhull
blog: http://techfordemocracy.com/
aim: nathanwoodhull
cell: 518-207-6768


Any Parm Substitution Ideas...

2007-04-10 Thread Jim Dow
I really like the flexibility of naming request handlers to append general 
constraints / filters.

Has anyone spun thoughts around something like a "solr.ParmSubstHandler", or
any way to pass a special parameter such as
ps=0:discussions; ps=1:images; ps=2:false




...
.


  category:[0]
  category:[1]
  isadmin:[2]

...


This may be inappropriate for building into SOLR; I'm not sure, but I'm looking 
at techniques to round out the appends to be even more flexible.

If there is interest and it makes sense to a wider audience, maybe I should try 
my hand at it.
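
To make the idea concrete, here is a rough sketch of the substitution step in plain Java.  It is not an existing Solr class: the name ParamSubstitution, the "index:value" format of each ps value, and the decision to leave unknown placeholders untouched are all assumptions made for illustration.

```java
import java.util.HashMap;
import java.util.Map;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

/** Hypothetical sketch: replaces [0], [1], ... in a filter template with ps values. */
final class ParamSubstitution {

    // Matches only all-digit placeholders, so range syntax like [1972 TO 1976] is ignored.
    private static final Pattern PLACEHOLDER = Pattern.compile("\\[(\\d+)\\]");

    /** Parses values such as "0:discussions" into a map from index to replacement text. */
    static Map<String, String> parse(String... psValues) {
        Map<String, String> values = new HashMap<>();
        for (String ps : psValues) {
            int colon = ps.indexOf(':');
            if (colon > 0) {
                values.put(ps.substring(0, colon).trim(), ps.substring(colon + 1).trim());
            }
        }
        return values;
    }

    /** Substitutes every known placeholder; unknown ones are left as they are. */
    static String substitute(String template, Map<String, String> values) {
        Matcher m = PLACEHOLDER.matcher(template);
        StringBuffer out = new StringBuffer();
        while (m.find()) {
            String value = values.get(m.group(1));
            String replacement = (value != null) ? value : m.group();
            m.appendReplacement(out, Matcher.quoteReplacement(replacement));
        }
        m.appendTail(out);
        return out.toString();
    }

    public static void main(String[] args) {
        Map<String, String> values = parse("0:discussions", "1:images", "2:false");
        System.out.println(substitute("category:[0]", values));
        System.out.println(substitute("category:[1]", values));
        System.out.println(substitute("isadmin:[2]", values));
    }
}
```

A real handler would also have to decide how to escape user-supplied values before they are dropped into a query string; this sketch does not attempt that.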

Thanks...Jim Dow.




Re: Sort by index order desc

2007-04-10 Thread jb

Thanks for your answer. I will use a timestamp  while waiting.
Do you think this feature will be added in a future release ?





Re: Sort by index order desc

2007-04-10 Thread Yonik Seeley

On 4/10/07, jb <[EMAIL PROTECTED]> wrote:
> Thanks for your answer. I will use a timestamp while waiting.
> Do you think this feature will be added in a future release ?


Most likely... but if someone supplies a patch it might be even quicker :-)

-Yonik


Re: SolrSharp - a C# client API for Solr

2007-04-10 Thread Jeff Rodenburg

It will be extremely helpful to get this in the hands of others.  Like most
packages, this was built out of need.  As we get more eyes on it, I hope to
see it improve at the same rate as change in Solr.

I promised a few other additions to this set.  Here's what I'm working on:

- More content within the documentation about how to use the API.  It's
strongly object-oriented and usage requires you to put together your own set
of classes that inherit from abstract classes in the library.  The example
code does it, but it's not clear how or why you do it, so some guidance is
needed.  I should probably add a wiki entry on the Solr site as well.
- NUnit tests need to be added.  These always get complex when involving
distributed systems, but such is life.

-- jeff






Re: Any Parm Substitution Ideas...

2007-04-10 Thread Chris Hostetter

I'm not certain that i understand exactly what you are describing, but
there was some discussion a while back that may be similar...

http://issues.apache.org/jira/browse/SOLR-109

...there's not a lot in the issue itself, but the linked discussion may be
fruitful for you.

if what you are describing is the same thing then i certainly think it
would be a handy addition to SolrQueryParser and the core request
handlers.

: Has anyone spun thoughts around something like a "solr.ParmSubstHandler" or
: any way to pass maybe a special
: ps=0:discussions; ps=1:images; ps=2:false




-Hoss



Re: Schema validator/debugger

2007-04-10 Thread Andrew Nagy

Ryan McKinley wrote:
> What errors are you getting?  Are there exceptions in the log when it
> starts up?

Just a null pointer exception.

I added a few fields to my schema, and then replaced my solr war file 
with the latest build (see my message from a week ago).  It wouldn't
work, so I assumed something was wrong with the latest build.  I then
reverted back to my 1.1.0 copy and am getting these errors.  The XML is
valid, but that is all I can test.


Here is a snapshot of my logs, everything starts up fine - but then I 
click on "ping" and get a server error:


INFO: Deploying web application archive solr.war
Apr 10, 2007 11:35:23 AM org.apache.solr.servlet.SolrServlet init
INFO: SolrServlet.init()
Apr 10, 2007 11:35:23 AM org.apache.solr.servlet.SolrServlet init
INFO: No /solr/home in JNDI
Apr 10, 2007 11:35:23 AM org.apache.solr.servlet.SolrServlet init
INFO: user.dir=/office/local
Apr 10, 2007 11:35:23 AM org.apache.solr.core.Config getInstanceDir
INFO: Solr home defaulted to 'solr/'
Apr 10, 2007 11:35:23 AM org.apache.solr.core.SolrConfig initConfig
INFO: Loaded SolrConfig: solrconfig.xml
Apr 10, 2007 11:35:23 AM org.apache.solr.core.RequestHandlers 
INFO: adding requestHandler standard=solr.StandardRequestHandler
Apr 10, 2007 11:35:23 AM org.apache.solr.core.RequestHandlers 
INFO: adding requestHandler dismax=solr.DisMaxRequestHandler
Apr 10, 2007 11:35:23 AM org.apache.solr.core.RequestHandlers 
INFO: adding requestHandler partitioned=solr.DisMaxRequestHandler
Apr 10, 2007 11:35:23 AM org.apache.solr.core.RequestHandlers 
INFO: adding requestHandler instock=solr.DisMaxRequestHandler
Apr 10, 2007 11:35:23 AM org.apache.solr.core.SolrCore 
INFO: Opening new SolrCore at solr/, dataDir=solr/data
Apr 10, 2007 11:35:23 AM org.apache.solr.schema.IndexSchema readConfig
INFO: Reading Solr Schema
Apr 10, 2007 11:35:23 AM org.apache.solr.schema.IndexSchema readConfig
INFO: Schema name=example
Apr 10, 2007 11:35:23 AM org.apache.solr.schema.IndexSchema readConfig
INFO: default search field is title
Apr 10, 2007 11:35:23 AM org.apache.solr.schema.IndexSchema readConfig
INFO: query parser default operator is OR
Apr 10, 2007 11:35:23 AM org.apache.solr.schema.IndexSchema readConfig
INFO: unique key field: id
Apr 10, 2007 11:35:23 AM org.apache.solr.servlet.SolrUpdateServlet init
INFO: SolrUpdateServlet.init() done
Apr 10, 2007 11:35:24 AM org.apache.coyote.http11.Http11BaseProtocol start
INFO: Starting Coyote HTTP/1.1 on http-8081
Apr 10, 2007 11:35:24 AM org.apache.jk.common.ChannelSocket init
INFO: JK: ajp13 listening on /0.0.0.0:8009
Apr 10, 2007 11:35:24 AM org.apache.jk.server.JkMain start
INFO: Jk running ID=0 time=0/25  config=null
Apr 10, 2007 11:35:24 AM org.apache.catalina.storeconfig.StoreLoader load
INFO: Find registry server-registry.xml at classpath resource
Apr 10, 2007 11:35:24 AM org.apache.catalina.startup.Catalina start
INFO: Server startup in 2659 ms
Apr 10, 2007 11:35:40 AM org.apache.solr.core.SolrException log
SEVERE: java.lang.RuntimeException: java.lang.NullPointerException
   at org.apache.solr.core.SolrCore.getSearcher(SolrCore.java:363)
   at org.apache.solr.core.SolrCore.getSearcher(SolrCore.java:259)
   at org.apache.solr.request.SolrQueryRequestBase.getSearcher(SolrQueryRequestBase.java:152)
   at org.apache.solr.request.DisMaxRequestHandler.handleRequest(DisMaxRequestHandler.java:231)
   at org.apache.solr.core.SolrCore.execute(SolrCore.java:595)
   at org.apache.jsp.admin.ping_jsp._jspService(ping_jsp.java:75)
   at org.apache.jasper.runtime.HttpJspBase.service(HttpJspBase.java:97)
   at javax.servlet.http.HttpServlet.service(HttpServlet.java:802)
   at org.apache.jasper.servlet.JspServletWrapper.service(JspServletWrapper.java:332)
   at org.apache.jasper.servlet.JspServlet.serviceJspFile(JspServlet.java:314)
   at org.apache.jasper.servlet.JspServlet.service(JspServlet.java:264)
   at javax.servlet.http.HttpServlet.service(HttpServlet.java:802)
   at org.apache.catalina.core.ApplicationFilterChain.internalDoFilter(ApplicationFilterChain.java:252)
   at org.apache.catalina.core.ApplicationFilterChain.doFilter(ApplicationFilterChain.java:173)
   at org.apache.catalina.core.StandardWrapperValve.invoke(StandardWrapperValve.java:213)
   at org.apache.catalina.core.StandardContextValve.invoke(StandardContextValve.java:178)
   at org.apache.catalina.core.StandardHostValve.invoke(StandardHostValve.java:126)
   at org.apache.catalina.valves.ErrorReportValve.invoke(ErrorReportValve.java:105)
   at org.apache.catalina.core.StandardEngineValve.invoke(StandardEngineValve.java:107)
   at org.apache.catalina.connector.CoyoteAdapter.service(CoyoteAdapter.java:148)
   at org.apache.coyote.http11.Http11Processor.process(Http11Processor.java:869)
   at org.apache.coyote.http11.Http11BaseProtocol$Http11ConnectionHandler.processConnection(Http11BaseProtocol.java:664)
   at org.apache.tomcat.ut

Re: embedding solr

2007-04-10 Thread Chris Hostetter
:   core.execute( handler, sreq, rsp );
:
:   IndexReader reader = sreq.getSearcher().getReader();
:   DocListAndSet response = (DocListAndSet)rsp.getValues().get( "response" );
:   DocIterator iter = response.docList.iterator();
:   while( iter.hasNext() ) {
:     Document doc = reader.document( iter.next() );
:     // ...
:   }

I can imagine this might be a little easier to deal with if there was a
"no-op" QueryResponseWriter that just did this for you ... adding the
documents to a new Map in the SolrQueryRequest.getContext() so you can get
to them perhaps?




-Hoss



Re: Sort by index order desc

2007-04-10 Thread jb

It would be great, yes :-)

2007/4/10, Yonik Seeley <[EMAIL PROTECTED]>:

On 4/10/07, jb <[EMAIL PROTECTED]> wrote:
> Thanks for your answer. I will use a timestamp  while waiting.
> Do you think this feature will be added in a future release ?

Most likely... but if someone supplies a patch it might be even quicker :-)

-Yonik



Re: Field Term vector

2007-04-10 Thread Chris Hostetter

: What is Field Term Vector? how we can use this?

I don't think anything in Solr uses Term Vectors out of the box ... they
are a low level Lucene concept that may be useful to you if you are
writing a custom RequestHandler and want access to them.

-Hoss



Solr web service available?

2007-04-10 Thread alartin

Hi all,

I wonder whether there is a Solr web service available, or do I have to use
tools like Apache HttpClient to send requests and get responses? Many thanks.
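
For context, Solr answers plain HTTP GET requests on its select URL and returns XML, so any HTTP client will do.  Below is a minimal sketch using only the Java standard library; the base URL http://localhost:8983/solr and the class name SolrHttpQuery are assumptions for illustration, so adjust host, port and context path to your own install.

```java
import java.io.BufferedReader;
import java.io.IOException;
import java.io.InputStreamReader;
import java.io.UnsupportedEncodingException;
import java.net.URI;
import java.net.URLEncoder;
import java.nio.charset.StandardCharsets;

/** Hypothetical sketch: queries Solr over plain HTTP without any client library. */
final class SolrHttpQuery {

    /** Builds a select URL; the query text is URL-encoded, paging goes in start/rows. */
    static String buildSelectUrl(String baseUrl, String query, int start, int rows) {
        try {
            return baseUrl + "/select?q=" + URLEncoder.encode(query, "UTF-8")
                    + "&start=" + start + "&rows=" + rows;
        } catch (UnsupportedEncodingException e) {
            throw new IllegalStateException("UTF-8 must be supported by every JVM", e);
        }
    }

    /** Sends the GET request and returns the raw response body as a string. */
    static String fetch(String url) throws IOException {
        StringBuilder body = new StringBuilder();
        try (BufferedReader in = new BufferedReader(new InputStreamReader(
                URI.create(url).toURL().openStream(), StandardCharsets.UTF_8))) {
            String line;
            while ((line = in.readLine()) != null) {
                body.append(line).append('\n');
            }
        }
        return body.toString();
    }

    public static void main(String[] args) throws IOException {
        // Assumed base URL; change it to match your servlet container setup.
        String url = buildSelectUrl("http://localhost:8983/solr", "title:solr logo", 0, 10);
        System.out.println(url);
        // Uncomment once a Solr instance is running at that address:
        // System.out.println(fetch(url));
    }
}
```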