Re: DIH doesn't handle bound namespaces?

2011-11-05 Thread Lance Norskog
Yes, the xpath thing is a custom lightweight thing for high-speed use. There is a separate full XSL processor. http://wiki.apache.org/solr/DataImportHandler#Configuration_in_data-config.xml-1 I think this lets you run real XSL on input files. I assume it lets you throw in your favorite XSL

Re: Questions about Solr's security

2011-11-05 Thread Erick Erickson
Well, one of the values of having people come at this from all different angles is that the documentation can be customized as each person sees in from a unique angle. The Wiki pages are freely-editable, it'd be great if you were to go ahead and add your perspective. Best Erick On Thu, Nov 3,

Re: Jetty logging

2011-11-05 Thread Erick Erickson
Take a close look and you should see things like: INFO: [] webapp=/solr path=/select/ params={indent=onstart=0q=*:*version=2.2rows=10} hits=19 status=0 QTime=4 which is the request as it gets into Solr. I'm not quite sure how to configure Jetty so it dumps the raw http request, but the above is

Re: Access Score in Custom Function Query

2011-11-05 Thread Erick Erickson
First, have you looked at External File Fields as a way of accessing the external info? As for the rest, you'd probably have to write a custom handler to put that data in the response packets Although the score of the first document in the list is the max-score by definition, assuming you're

Re: Highlighter showing matched query words only

2011-11-05 Thread Erick Erickson
Not that I know of. The regex shouldn't be all that expensive, do you have proof that this is a performance issue? If you don't, I'd just do the simple thing first... And probably just searching for em would be better than REs Best Erick On Thu, Nov 3, 2011 at 7:04 PM, Nikeman

Re: how to achieve google.com like results for phrase queries

2011-11-05 Thread Erick Erickson
First, the default query operator is ignored by edismax, so that's not doing anything. Why would you expect newspaper latimes to be found at all in latimes.com? What proof do you have that the two terms are even in the latimes.com document? You can look at the Query Elevation Component to force

Re: Can Solr handle large text files?

2011-11-05 Thread Erick Erickson
Sure, if you write a custom update handler. But I'm not at all sure this is ideal. You're requiring all that data to be transmitted across the wire and processed by Solr. Assuming you have more than one input source, the Solr server in the background will be handling up to N documents

Re: Solr, MultiValues and links...

2011-11-05 Thread Erick Erickson
Hmmm, MultiValues are guaranteed to be returned in the order they were inserted, so you might be able to do the linking yourself given the results. But have you considered grouping (aka field collapsing) on the ISBN number? If you indexed each record uniquely, that might do what you need. Best

Re: Comparing apples oranges?

2011-11-05 Thread Erick Erickson
What about Function Queries? They can essentially take field values and use them as part of the score calculations Best Erick On Fri, Nov 4, 2011 at 6:28 AM, Martin Koch m...@issuu.com wrote: Hi List I have a solr index where I want to include numerical fields in my ranking function as

Re: Jetty logging

2011-11-05 Thread darul
Thank for the reply Erik, I have to check my log4j config on monday as I filtered on some solr packages growing logs for nothing...but without filtering I remember I not see any traces of queries..I will check it again. I remember seeing it in console so I am certainly wrong...again ;) -- View

Dynamic rating based on Like feature

2011-11-05 Thread Eugene Strokin
Hello, I have a task which seems trivial, but I couldn't find any related information from Solr documentation. So I'm asking the community for an advice. I have relatively big amount (about 25 Millions) of documents which are describing products. Those products could be rated by humans and/or

Re: Dynamic rating based on Like feature

2011-11-05 Thread Sujit Pal
Hi Eugene, I proposed a solution for something similar, maybe it will help you. http://sujitpal.blogspot.com/2011/05/custom-sorting-in-solr-using-external.html -sujit On Sat, 2011-11-05 at 16:43 -0400, Eugene Strokin wrote: Hello, I have a task which seems trivial, but I couldn't find any

Re: Dynamic rating based on Like feature

2011-11-05 Thread Erick Erickson
You might find some joy in ExternalFileFields. It's still a pain to update, but at least you wouldn't be re-indexing your documents all that often. You'd have to maintain the age somewhere else though I think. Perhaps a database with uniqueKey and datestamp for each like and then just do a

Re: how to achieve google.com like results for phrase queries

2011-11-05 Thread alxsss
Hi Erick, The term newspaper latimes is not found in latimes.com. However, google places it in the first place. My guess is that mm parameter must not be set as 2lt;-1 in order to achieve google.com like ranking for two word phrase queries. My goal is to set mm parameter in such a way that

Fw: Term frequency question

2011-11-05 Thread Craig Stadler
- Original Message - From: Craig Stadler cstadle...@hotmail.com To: solr-user@lucene.apache.org Sent: Friday, November 04, 2011 1:39 PM Subject: Term frequency question I am using this reference link: http://www.mail-archive.com/solr-user@lucene.apache.org/msg26389.html However

Re: limiting searches to particular sources

2011-11-05 Thread Chris Hostetter
: Yes -- how do I specify the field as a constant in DIH? https://wiki.apache.org/solr/DataImportHandlerFaq#How_would_I_insert_a_static_value_into_a_field_.3F -Hoss

Re: how to achieve google.com like results for phrase queries

2011-11-05 Thread Ted Dunning
Google achieves their results by using data not found in the web pages themselves. This additional data critically includes link text, but also is derived from behavioral information. On Sat, Nov 5, 2011 at 5:07 PM, alx...@aim.com wrote: Hi Erick, The term newspaper latimes is not found