Re: SOLR-1316 How To Implement this autosuggest component ???

2010-03-29 Thread Andrzej Bialecki
On 2010-03-30 05:34, Andy wrote: Reading through this thread and SOLR-1316, there seems to be a lot of different ways to implement auto-complete in Solr. I've seen the mentions of: EdgeNGrams TermsComponent Faceting TST Patricia Tries RadixTree DAWG Which algorthm does SOLR-1316 implement? TST

Re: Absolutely empty resultset regardless of what I am searching for

2010-03-29 Thread MitchK
Excuse my language-correctness yesterday - I think one can see the long trying-to-fix-a-bug-work? :) Erick, I have found out how to delete the whole index - with the help of some http-requests. Afterwards I have started to reindex the data again - and Solr used the newest schema-information after

Optimize after delta-import (DIH)

2010-03-29 Thread Blargy
According to the wiki: http://wiki.apache.org/solr/DataImportHandler#Commands the delta-import command will accept the same clean, commit and optimize parameters that the full-import command takes but I am my index keeps saying its not optimized. [java] INFO: [items] webapp=/solr path=/dataimport

Re: SOLR-1316 How To Implement this autosuggest component ???

2010-03-29 Thread Andy
Reading through this thread and SOLR-1316, there seems to be a lot of different ways to implement auto-complete in Solr. I've seen the mentions of: EdgeNGrams TermsComponent Faceting TST Patricia Tries RadixTree DAWG Which algorthm does SOLR-1316 implement? TST is one. There are others mentioned

Re: Apache Lucene EuroCon Call For Participation: Prague, Czech Republic May 20 & 21, 2010

2010-03-29 Thread mbohlig
Grant, Were you going to send out the "open for registration" email as well? -Mike - Original Message From: Grant Ingersoll Cc: Lucene mailing list ; solr-user@lucene.apache.org; java-u...@lucene.apache.org; mahout-u...@lucene.apache.org; nutch-u...@lucene.apache.org; openrelevan

Re: Experiences with SOLR-1797 ?

2010-03-29 Thread Lance Norskog
There was only one report of the problem. I just read the patch and original source and it looks right; in concurrent programming these are "famous last words" :) 2010/3/29 Daniel Nowak : > Hello, > > has anyone some experiences with this patch of SOLR-1797 > (http://issues.apache.org/jira/brows

Re: Solr not returning all documents?

2010-03-29 Thread Lance Norskog
Yes, this should work. It will be very slow. There is a special hack by which you can say sort=_docid_+asc (or +desc). _docid_ is a magic field name that avoids sorting the results. Pulling documents at row # 1 million should be only a little slower than pulling documents at row #0. On Mon, Mar 2

Re: Including Tika-extracted docs in a document?

2010-03-29 Thread Lance Norskog
Look at the 'rootEntity' attribute in the DataImportHandler, both the description and the examples: http://wiki.apache.org/solr/DataImportHandler#Schema_for_the_data_config It is active for all entities. It means that you can run several operations in the outer entities, then have all of their fi

Re: multicore embedded swap / reload etc.

2010-03-29 Thread Lance Norskog
The code snippet you give tells how to access existing cores that are registered in the top-level solr.xml file. The Wiki pages tells how these cores are configured The Wiki pages also discusses dynamic operations on multiple cores. SolrJ should be able to do these as well (but I am not a SolrJ ex

Apache Lucene EuroCon Call For Participation: Prague, Czech Republic May 20 & 21, 2010

2010-03-29 Thread Grant Ingersoll
Apache Lucene EuroCon Call For Participation - Prague, Czech Republic May 20 & 21, 2010 All submissions must be received by Tuesday, April 13, 2010, 12 Midnight CET/6 PM US EDT The first European conference dedicated to Lucene and Solr is coming to Prague from May 18-21, 2010. Apache Lucene Eu

Re: Complex relational values

2010-03-29 Thread Lance Norskog
If 'item' is the unique document level, then this can be done with: unique id: your own design searchable text fields: foo_x: foo_y: bar_x: bar_y: The query becomes: foo_x:[100 TO *] AND foo_y:[500 TO *] Note that to search the other fields with dismax, and foo* with the standard query parser

Re: solr highlighting

2010-03-29 Thread Lance Norskog
No problem: wrapping and unwrapping escaped text can be very confusing. On Fri, Mar 26, 2010 at 6:31 AM, Niraj Aswani wrote: > Hi Lance, > > apologies.. please ignore my previous mail.  I'll have a look at the > PatternReplaceFilter. > > Thanks, > Niraj > > Niraj Aswani wrote: >> >> Hi Lance, >>

Re: SOLR-1316 How To Implement this autosuggest component ???

2010-03-29 Thread Lance Norskog
SOLR-1316 uses a much faster data structure (Ternary Search Tree), not a Lucene index. Using Ngram-based tools like the spellchecker, or your implementation is inherently slower. Netflix, for example, uses a dedicated TST server farm (their own implementation of TST) to do auto-complete. On Fri,

Re: Solrj doesn't tell if PDF was actually parsed by Tika

2010-03-29 Thread Lance Norskog
Thanks! You can search for the document after you index it. On Fri, Mar 26, 2010 at 1:55 AM, Abdelhamid ABID wrote: > Well done : https://issues.apache.org/jira/browse/SOLR-1847 > > meanwhile, is there any workaround ? > > On 3/26/10, Lance Norskog wrote: >> >> Please file a bug for this on th

RE: negative boost

2010-03-29 Thread Jason Chaffee
I think the key was change the tie to 0. I had it at 0.1. Getting exactly what I want now. Big thanks for the help. -Original Message- From: Jason Chaffee [mailto:jchaf...@ebates.com] Sent: Monday, March 29, 2010 5:20 PM To: solr-user@lucene.apache.org Subject: RE: negative boost I u

RE: negative boost

2010-03-29 Thread Jason Chaffee
I understand that it defeats the reason for dismax, at least the original reason for dismax. However, if I can do it this way without having to write my own handler because I need to search multiple fields and combine the results, then it is still preferable and thus another way to leverage disma

RE: negative boost

2010-03-29 Thread Chris Hostetter
: Unfortunately, my results aren't quite what I want unless I use 0 on the : second field. Instead, if something matches in all the fields it is : elevated to the top. I only want the first field match elevated to the : top and I want all first field matches to have the same weight. Next, I

how to create this highlighter behaviour

2010-03-29 Thread Joe Calderon
hello *, ive been using the highlighter and been pretty happy with its results, however theres an edge case im not sure how to fix for query: amazing grace the record matched and highlighted is amazing rendition of amazing grace is there any way to only highlight amazing grace without using phr

Re: Absolutely empty resultset regardless of what I am searching for

2010-03-29 Thread MitchK
I was using TermsComponent now to make sure, what is really indexed. Well, one title-field has got only a few terms indexed (as I have mentioned earlier: it is only saving up to 55 rows of the RDBMS), while the other fields (which are based on the same filter, but with another special-word.txt) i

RE: One item, multiple fields, and range queries

2010-03-29 Thread Steven A Rowe
Hi David, On 03/29/2010 at 4:54 PM, David Smiley (@MITRE.org) wrote: > Did you read my original message where I suggested perhaps a solution > might lie in intersecting different queries based on common multi-value > field offsets derived from matching term positions? I have no idea how > far off

RE: negative boost

2010-03-29 Thread Jason Chaffee
Unfortunately, my results aren't quite what I want unless I use 0 on the second field. Instead, if something matches in all the fields it is elevated to the top. I only want the first field match elevated to the top and I want all first field matches to have the same weight. Next, I want all

dataimporthandler multivalued dynamic fields

2010-03-29 Thread brad anderson
Greetings, I'm trying to use dataimporthandler to load values from a db and trying to put them into multivalued dynamic fields. It appears to work for the first value, but does not add all the values to the field. Here is the schema definition of the *_custom fields:

Re: Absolutely empty resultset regardless of what I am searching for

2010-03-29 Thread MitchK
Luke is responsing (now): My topTerms of synonyms got a frequency of up to 800.000 and my processed title gots a maximum frequency of 7... What the hell??? However, I can't search any of the top synonyms. I am able to search within the first 55 documents of my index. What might be wrong, when

RE: One item, multiple fields, and range queries

2010-03-29 Thread David Smiley (@MITRE.org)
Steven, The composite doc idea is an interesting avenue to a solution here that I didn't think of. What's missing is code to do the group by and then do an intersection in order to get boolean AND behavior between the addresses and primary documents, and then filter out the non-primary docum

Re: Getting /handlers from response and dynamically removing them

2010-03-29 Thread Chris Hostetter
: Also, its not possible currently to reload a handler w/o a restart correct? There are methods that can be used to dynamicly add/remove handlers from SolrCore -- but there are no built in adminstrtive commands to do so. -Hoss

Re: Getting /handlers from response and dynamically removing them

2010-03-29 Thread Jon Baer
Thanks for the qt tip, I will try that. Im building a Solr installation as a small standalone and Id like to disable everything but the /select after an import has been completed. In normal situations just the master would be setup to index and the slaves are read but in this case I need to al

Re: Absolutely empty resultset regardless of what I am searching for

2010-03-29 Thread MitchK
I was using this page: solr/admin/dataimport.jsp?handler=/dataimport To import my data from my database. I have made a few restarts of my Solr-server and I have re-imported the data a lot of times. Furthermore, I have tried to delete everything with the help of the post.jar from the tutorial. I h

Re: Absolutely empty resultset regardless of what I am searching for

2010-03-29 Thread Erick Erickson
Perhaps a silly question, but did you recreate your index after you made your schema changes? Or did you delete a bunch of documents in the meantime? Or do you have a unique key defined in your schema that is replacing documents? The fact that Luke is giving you unexpected results is a red flag tha

RE: One item, multiple fields, and range queries

2010-03-29 Thread Steven A Rowe
Hi David, On 03/29/2010 at 3:36 PM, David Smiley (@MITRE.org) wrote: > I'm not sure what to make of "or index using a heterogeneous field > schema, grouping the different doc type instances with a unique key > (the one) to form a composite doc" Lucene is schema-free - you can mix and match differ

Re: Absolutely empty resultset regardless of what I am searching for

2010-03-29 Thread MitchK
EDIT: The shown query was not the ment one,... please, excuse me, I have tested a lot and I am a little bit confused :-). The right query is, of course: select/?q=titleProcessed:life&start=0&rows=10&indent=on -- View this message in context: http://n3.nabble.com/Absolutely-empty-resultset-reg

Re: negative boost

2010-03-29 Thread Paul Libbrecht
Jason, don't you want field1^3 • field2^1 • field3^0.9 ? As written in Lucene in action, it's all multiplied. So "negative boost" means "boost under 1" (and probably elsewhere) paul PS: take the log and you get this negative. Le 29-mars-10 à 21:08, Jason Chaffee a écrit : Is it possible

Re: Absolutely empty resultset regardless of what I am searching for

2010-03-29 Thread MitchK
Hoss, thank you for your response. /select?q=*:* This returns results as expected. I have found the mistake, why introduction didn't match - a wrong copyfield. *rolleyes* However, this seems to bring more problems to the light: Now, the first few rows from my database seem to be searchable, bu

Re: field QParserPlugin - Help needed

2010-03-29 Thread Ahmet Arslan
> Could anyone please help me by directing me to some link > where I can get more details on Solr's field QParserPlugin. Additionally Chris Hostetter's explanation: http://search-lucene.com/m/ZKrXi2VX1st

RE: One item, multiple fields, and range queries

2010-03-29 Thread David Smiley (@MITRE.org)
I'm not going to index each address as its own document because the "one-side" that I have currently has loads of text and there are many addresses. Furthermore, it doesn't really address the general case of my problem statement. I'm not sure what to make of "or index using a heterogeneous field

Re: field QParserPlugin - Help needed

2010-03-29 Thread Erik Hatcher
Manas, The best you'll find is Solr's javadocs and source code itself. There's a bit on the wiki with the pointers: http://wiki.apache.org/solr/SolrPlugins#QParserPlugin Erik On Mar 29, 2010, at 3:25 PM, Nair, Manas wrote: Hello Experts, Could anyone please help me by directin

field QParserPlugin - Help needed

2010-03-29 Thread Nair, Manas
Hello Experts, Could anyone please help me by directing me to some link where I can get more details on Solr's field QParserPlugin. I would be really grateful. Thankyou all, Manas

Re: Getting /handlers from response and dynamically removing them

2010-03-29 Thread Erik Hatcher
You can get the qt parameter, at least, in your search component. What's the use case for controlling handlers enabled flag on the fly? Erik On Mar 29, 2010, at 3:02 PM, Jon Baer wrote: This is just something that seems to come up now and then ... * - Id like to write a last-compone

negative boost

2010-03-29 Thread Jason Chaffee
Is it possible to give a negative in boost in dismax? For instance, field1^3 field2^0 field3^-0.1 Thanks, Jason

Getting /handlers from response and dynamically removing them

2010-03-29 Thread Jon Baer
This is just something that seems to come up now and then ... * - Id like to write a last-component which does something specific for a particular declared handler /handler1 for example and there is no way to determine which handler it came from @ the moment (or can it?) * - It would be nice if

RE: keyword query tokenizer

2010-03-29 Thread Jason Chaffee
I didn't know the quotes would work. I thought it had to be escaped and I wasn't too fond of that because you have to unescape in the analysis phase. Using quotes doesn't seem so bad to me. -Original Message- From: Chris Hostetter [mailto:hossman_luc...@fucit.org] Sent: Monday, March 29

Re: Absolutely empty resultset regardless of what I am searching for

2010-03-29 Thread Chris Hostetter
: my analysis.jsp shows me the right results. That means, everything seems to : be parsed the right way and there are some matches. analysis.jsp can tell you that *if* a document is indexed with the current config, then what will the tokens look like -- but it doesn't know if there are any docu

RE: keyword query tokenizer

2010-03-29 Thread Chris Hostetter
: Ahh, but that is exactly what I don't want the DisjunctionMaxQuery to : do. I do not max scoring field per "word". Instead, I want it per : "phrase" which may be a single word or multiple words. then you need to quote your enitre "q" param. (or escape all the white space and meta characters)

RE: keyword query tokenizer

2010-03-29 Thread Jason Chaffee
Ahh, but that is exactly what I don't want the DisjunctionMaxQuery to do. I do not max scoring field per "word". Instead, I want it per "phrase" which may be a single word or multiple words. -Original Message- From: Chris Hostetter [mailto:hossman_luc...@fucit.org] Sent: Friday, March

Re: ReplicationHandler reports incorrect replication failures

2010-03-29 Thread Jason Rutherglen
Shawn, I was working on something very similar... Lets perhaps also create a Jira issue for this monitoring? Thanks, Jason On Fri, Mar 26, 2010 at 6:59 AM, Shawn Smith wrote: > We're using Solr 1.4 Java replication, which seems to be working > nicely.  While writing production monitors to chec

Re: jmap output help

2010-03-29 Thread Bill Au
Take a heap dump and use jhat to find out for sure. Bill On Mon, Mar 29, 2010 at 1:03 PM, Siddhant Goel wrote: > Gentle bounce > > On Sun, Mar 28, 2010 at 11:31 AM, Siddhant Goel >wrote: > > > Hi everyone, > > > > The output of "jmap -histo:live 27959 | head -30" is something like the > > follo

Re: Filter query with special character using SolrJ client

2010-03-29 Thread Indika Tantrigoda
Thank you very much for the explanation. Regards, Indika On 29 March 2010 22:28, Chris Hostetter wrote: > > : It works, thanks. Just implemented the code...:):):) > : > : Could you explain what "{!field f=yourStringField}Cameras & Photos" does. > > {!field} says that the string should be parsed

Re: jmap output help

2010-03-29 Thread Siddhant Goel
Gentle bounce On Sun, Mar 28, 2010 at 11:31 AM, Siddhant Goel wrote: > Hi everyone, > > The output of "jmap -histo:live 27959 | head -30" is something like the > following : > > num #instances #bytes class name > -- >1:448441

Re: Filter query with special character using SolrJ client

2010-03-29 Thread Chris Hostetter
: It works, thanks. Just implemented the code...:):):) : : Could you explain what "{!field f=yourStringField}Cameras & Photos" does. {!field} says that the string should be parsed using the FIeldQParser. the FieldQParser takes an 'f' local param telling it what field you want to use, and the

Re: Filter query with special character using SolrJ client

2010-03-29 Thread Indika Tantrigoda
It works, thanks. Just implemented the code...:):):) Could you explain what "{!field f=yourStringField}Cameras & Photos" does. Regards, Indika On 29 March 2010 21:55, Chris Hostetter wrote: > > : Since the names of the string fields are not predefined I might have to > : find a method to do t

Absolutely empty resultset regardless of what I am searching for

2010-03-29 Thread MitchK
Hello guys, my analysis.jsp shows me the right results. That means, everything seems to be parsed the right way and there are some matches. However, when I try this live, there are never any matched documents. When I try out to look up whether there is anything in my index, I get the expected re

Re: Filter query with special character using SolrJ client

2010-03-29 Thread Chris Hostetter
: Since the names of the string fields are not predefined I might have to : find a method to do this automatically. if the fields are strings, and you are only looking for "exact" matches (ie: you don't need any special query parser syntax) use the "field" QParser : > SolrQuery.addFilterQu

Re: Drill down a solr result set by facets

2010-03-29 Thread Dhanushka Samarakoon
Thanks Indika, that looks good. I'll look at the article. If anyone else has any good ideas please send them too. On Mon, Mar 29, 2010 at 11:09 AM, Indika Tantrigoda wrote: > Hi Dhanushka, > > Have you tried to use the filter query parameter. > Check out this article, the Applying Constraints sec

Re: Drill down a solr result set by facets

2010-03-29 Thread Indika Tantrigoda
Hi Dhanushka, Have you tried to use the filter query parameter. Check out this article, the Applying Constraints section should be helpful to you. http://www.lucidimagination.com/Community/Hear-from-the-Experts/Articles/Faceted-Search-Solr Solr Wiki link to filter query parameter http://wiki.apac

Re: Drill down a solr result set by facets

2010-03-29 Thread Dhanushka Samarakoon
Thanks for the reply. I was just giving the above as an example. Something as simple as following is also not working. /select/?q=france+fDepartmentName:History&version=2.2& So it looks like the query parameter syntax I'm using is wrong. This is the params array I'm getting from the result. 10 0

Re: Drill down a solr result set by facets

2010-03-29 Thread Tommy Chheng
Try adding quotes to your query: DepartmentName:Chemistry+fSponsor:\"US Cancer/Diabetic Research Institute\" The parser will split on whitespace Tommy Chheng Programmer and UC Irvine Graduate Student Twitter @tommychheng http://tommy.chheng.com On 3/29/10 8:49 AM, Dhanushka Samarakoon wrot

RE: One item, multiple fields, and range queries

2010-03-29 Thread Steven A Rowe
David, The standard one-to-many solution is indexing each address (the many) as its own document, and then either copy the other fields from your current schema to these documents, or index using a heterogeneous field schema, grouping the different doc type instances with a unique key (the one)

Drill down a solr result set by facets

2010-03-29 Thread Dhanushka Samarakoon
Hi, I'm trying to perform a search based on keywords and then reduce the result set based on facets that user selects. First query for a search would look like this. http://localhost:8983/solr/select/?q=cancer+stem&version=2.2&wt=php&start=&rows=10&indent=on&qt=dismax&facet=on&facet.mincount=1&fa

Re: ReplicationHandler reports incorrect replication failures

2010-03-29 Thread Shawn Smith
Thanks. I created https://issues.apache.org/jira/browse/SOLR-1853 2010/3/27 Noble Paul നോബിള്‍ नोब्ळ् : > please create a bug >

solr-trunk in production?

2010-03-29 Thread Agethle, Matthias
Hi, I need the patch SOLR-236 (field collapsing) in a production-system which currently is running on Solr 1.4. Can I switch to the trunk version (and apply the patch) without problems or is this not recommended? Matthias

Re: RejectedExecutionException when searching with DirectSolrConnection

2010-03-29 Thread Don Werve
A followup: I discovered something interesting. If I don't run Jetty in the same JVM as DirectSolrConnection, all is well. Nrr.

More like this - setting a minimum number of terms used to build queries

2010-03-29 Thread Xavier Schepler
Hey, Is there a way to make the "more like this" feature build its queries from a minimum number of interesting terms ? It looks like this component fires query with only 1 term in them. I got a lot of results that aren't similar at all with the parsed document fields. My parameters : &mlt.

Re: One item, multiple fields, and range queries

2010-03-29 Thread Lukas Kahwe Smith
On 29.03.2010, at 15:11, David Smiley (@MITRE.org) wrote: > > Sorry, I intended to design my post so that one wouldn't have to read the > thread for context but it seems I failed to do that. Don't bother reading > the thread. The use-case I'm pondering modifying Lucene/Solr to solve is > the o

Re: Delete id from a specific core

2010-03-29 Thread Erik Hatcher
Lee - Use the url parameter. ~/dev/solr/example/exampledocs: java -jar post.jar -help SimplePostTool: version 1.2 This is a simple command line tool for POSTing raw XML to a Solr port. XML data can be read from files specified as commandline args; as raw commandline arg strings; or via STDIN.

RE: One item, multiple fields, and range queries

2010-03-29 Thread David Smiley (@MITRE.org)
Sorry, I intended to design my post so that one wouldn't have to read the thread for context but it seems I failed to do that. Don't bother reading the thread. The use-case I'm pondering modifying Lucene/Solr to solve is the one-to-many problem. Imagine a document that contains multiple address

Delete id from a specific core

2010-03-29 Thread Lee Smith
Hey All From the docs deleting from an index os pretty simpl: java -Ddata=args -Dcommit=no -jar post.jar "SP2514N" How about from a specific core? Say I wanted to delete id=12344 from core 1 Hope this makes sense and is easy to answer! Regards Lee

Re: Getting solr response in HTML format : HTMLResponseWriter

2010-03-29 Thread Arnaud Garcia
2010/3/29 Arnaud Garcia > Hello everybody > > I m using NUTCH with SOLR and the result of solr searching as you know is > in XML format . > > > Because I want an HTML format for the response (like the result of NUTCH > searching result) > > so I have tried to attach the xslt steelsheet to the res

Getting solr response in HTML format : HTMLResponseWriter

2010-03-29 Thread Arnaud Garcia
Hello everybody I m using NUTCH with SOLR and the result of solr searching as you know is in XML format . Because I want an HTML format for the response (like the result of NUTCH searching result) so I have tried to attach the xslt steelsheet to the response of SOLR with passing this 2 variable

Re: How to use Payloads with Solr?

2010-03-29 Thread Grant Ingersoll
On Mar 27, 2010, at 5:31 AM, MitchK wrote: > > Hello community, > > since I have searched for a solution to get TermPositions in Solr, I became > more aware of the "payload"-features. So I decided to learn more about > payloads. > In the wiki, there is not much said about them, so I will ask

Experiences with SOLR-1797 ?

2010-03-29 Thread Daniel Nowak
Hello, has anyone some experiences with this patch of SOLR-1797 (http://issues.apache.org/jira/browse/SOLR-1797) ? Best Regards Daniel Nowak Senior Developer Rocket Internet GmbH | Saarbrücker Straße 20/21 | 10405 Berlin | Deutschland tel: +49 30 / 559 554 66 | fax: +49 30 / 559 554

field QParserPlugin - Help needed

2010-03-29 Thread Nair, Manas
Hello Experts, Could anyone please help me by directing me to some link where I can get more details on Solr's field QParserPlugin. I would be really grateful. Thankyou all, Manas

RE: Perfect Match

2010-03-29 Thread Nair, Manas
Awesome Ahmet. Thanks for the reply. It seems to work now. Thanks a ton. From: Ahmet Arslan [mailto:iori...@yahoo.com] Sent: Tue 3/23/2010 2:35 PM To: solr-user@lucene.apache.org Subject: RE: Perfect Match > Thankyou Ahmet. You were right. > artist_s:Dora is

Solr not returning all documents?

2010-03-29 Thread Adrian Pemsel
Hi, As part of our application I have written a reindex task that runs through all documents in a core one by one (using *:*, a start offset and a row limit of 1) and adds them to a new core (potentially with a new schema). However, while working well for small sets this approach somehow does not