SOLR Search Query : Exception : Software caused connection abort: recv failed

2010-03-10 Thread Kranti™ K K Parisa
Hi, I am trying to test the SOLR search with very big query.. sample code snippet is as follows. when I try that its throwing exceptions. Is SOLR query has any limitations with size or length..etc?? = solrServer = SolrUtils.getSolrServe

HTMLStripTransformer not working with data importer

2010-03-10 Thread James Ostheimer
HI- I am working a contract to index some wordpress data. For the posts I of course have html in the content of the column, I'd like to strip it out. Here is my data importer config Looks perfect according to the wiki docs, but th

Re: Filter to cut out all zeors?

2010-03-10 Thread Norberto Meijome
won't this replace *all* 0s ? ie, 1024 will become 124 ? _ {Beto|Norberto|Numard} Meijome "The only people that never change are the stupid and the dead" Jorge Luis Borges. I speak for myself, not my employer. Contents may be hot. Slippery when wet. Reading disclaimers ma

Filtering search results

2010-03-10 Thread Srikanth B
Hi We are planning to use Solr in one of our projects.We are implementing functionality where the user performs a search & further refines the search results based on various parameters like location, age and other parameters. For example, in dice.com, the visitor can search by some keyword and

Re: SolrConfig - constructing the object

2010-03-10 Thread Chris Hostetter
: My first thought after reading this : "ouch!". : My second thought after reading this : "Hey, I think this guys is right". Please don't take it personally, there's a reason i had to make a standard disclaimer this type of thing. : So, I am new to all things Solr, and am working with some

Re: Using SOLR

2010-03-10 Thread Erick Erickson
Luke won't help you "retrieve the matched taxonomy", it just lets you look at your index and run queries against it WARNING: I haven't personally used MoreLikeThis functionality, but it sounds like that's at least in the ballpark if you consider your Taxonomy a document and want the list of do

Re: Updating FAQ for International Characters?

2010-03-10 Thread Chris Hostetter
: Any time a character like that was index Solr through a unknown entity error. : But if converted to À or À then everything works great. : : I tried out using Tomcat versus Jetty and got the same results. Before I edit Uh, you mean like the characters in exampledocs/utf8-example.xml ? it cont

Re: digest

2010-03-10 Thread Dennis Gearon
I forgot about thread hijacking. Darn those Xheaders :-) Thanks for the other info, however :-) Dennis Gearon Signature Warning EARTH has a Right To Life, otherwise we all die. Read 'Hot, Flat, and Crowded' Laugh at http://www.yert.com/film.php --- On Tue, 3/9/10, Chris Host

Re: Using SOLR

2010-03-10 Thread CP Hennessy
Hi, I may not have stated my aim clearly enough or in case I'm using the wrong terms, I'll restate what I want to be able to do: - I have a fixed set of words and phrases some of which I expect to find in the documents I want to process. This set I call my taxonomy. - I have many documents to

Updating FAQ for International Characters?

2010-03-10 Thread Eric Pugh
Hi all, On the wiki page http://wiki.apache.org/solr/FAQ under the section "Why don't International Characters Work?" there are a number of options specified for dealing with a character like À (an A with a caret, the agrave character). Any time a character like that was index Solr throug

Generating a sitemap

2010-03-10 Thread Ian Evans
Been testing nutch to crawl for solr and I was wondering if anyone had already worked on a system for getting the urls out of solr and generating an XML sitemap for Google.

Re: Matching a string across multiple fields

2010-03-10 Thread Static Void
Alex Thurlow wrote: So on with the questions. I'm indexing music, and I have artist and title fields. It works great if I just search for the artist or just for the title, but if I search for a string of both, that artist, title combination does not come up first. For example, I have: artis

Re: Matching a string across multiple fields

2010-03-10 Thread Jan Høydahl / Cominvent
You may like to check out the DisMax handler instead of hand-coding your queries. It has lots of features to tune stuff like this. -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com Training in Europe - www.solrtraining.com On 10. mars 2010, at 22.15, Alex Thurlow wrote:

Solr training in Norway

2010-03-10 Thread Jan Høydahl / Cominvent
Hi all Solr folks in Europe. Lucid's training course "Developing Search Applications With Solr" is now available in Oslo, Norway. First course is in less than two weeks, next is in June. See www.solrtraining.com Also, have any of you completed the course earlier? It covers quite a broad range

Re: Matching a string across multiple fields

2010-03-10 Thread Erik Hatcher
What's your query parser? What fields are you indexing and what types are they? Are you using any copyField's? What do you get from &debugQuery=true as the parsedQuery output? Erik On Mar 10, 2010, at 4:15 PM, Alex Thurlow wrote: So on with the questions. I'm indexing music, and

Matching a string across multiple fields

2010-03-10 Thread Alex Thurlow
So on with the questions. I'm indexing music, and I have artist and title fields. It works great if I just search for the artist or just for the title, but if I search for a string of both, that artist, title combination does not come up first. For example, I have: artist: Whitney Houston ti

Snapshot / Distribution Process

2010-03-10 Thread Lars R. Noldan
Is anyone aware of a comprehensive guide for setting up the Snapshot Distribution process on Solr 1.3? I'm working through: http://wiki.apache.org/solr/CollectionDistribution#The_Snapshot_and_Distribution_Process And have run into a roadblock where the solr/bin/snappuller finds the appropria

Re: Architectural help

2010-03-10 Thread blargy
So I can just create a view (or temporary table) and then just have a simple "select * from (view or table)" in my DIH config? Constantijn Visinescu wrote: > > Try making a database view that contains everything you want to index, and > then just use the DIH. > > Worked when i tested it ;) >

Re: Boosting exact matches

2010-03-10 Thread Ahmet Arslan
> So I figured out part of my > problem.  The string field type doesn't have the > lowercase filter on it.  I can't seem to figure out how > to add the lowercase filter to it though. > > This: > omitNorms="true"> > > > > > > Gives me this error. > SEVERE: org.apache.solr.common.SolrExceptio

Re: Boosting exact matches

2010-03-10 Thread Alex Thurlow
So I figured out part of my problem. The string field type doesn't have the lowercase filter on it. I can't seem to figure out how to add the lowercase filter to it though. This: Gives me this error. SEVERE: org.apache.solr.common.SolrException: analyzer without class or tokenizer & fi

Re: Architectural help

2010-03-10 Thread Constantijn Visinescu
Try making a database view that contains everything you want to index, and then just use the DIH. Worked when i tested it ;) On Wed, Mar 10, 2010 at 1:56 AM, blargy wrote: > > I was wondering if someone could be so kind to give me some architectural > guidance. > > A little about our setup. We

Boosting exact matches

2010-03-10 Thread Alex Thurlow
Hi all, I've searched the archives and web, but I haven't found a great solution to this issue. I would like to boost the results for things that match exactly. For example, I have an artist field with possible values of: Houston Whitney Houston Marques Houston I would like the Houston

Re: Filter to cut out all zeors?

2010-03-10 Thread Sebastian F
yes, thank you. That was exactly what I was looking for! Great help! From: Ahmet Arslan To: solr-user@lucene.apache.org Sent: Tue, March 9, 2010 7:26:46 PM Subject: Re: Filter to cut out all zeors? > I'm trying to figure out the best way to cut out all zeros >

Re: Solrj - Sorting with range queries

2010-03-10 Thread Yonik Seeley
On Wed, Mar 10, 2010 at 10:57 AM, Steve Radhouani wrote: > I noticed that, Yonik, but I couldnt figure out how to preserve the > "correct" order produced by Solr. Do you have any idea how to preserve it? Open a JIRA issue for this. We prob just need to change _facetQuery = new HashMap(); to

Re: Highlighting

2010-03-10 Thread Lee Smith
Cant see why you would put highlighting in a separate field. Isn't it the idea to highlight the content found in a search result like google would do ? Lee On 10 Mar 2010, at 15:52, Joe Calderon wrote: > no, thats not the case, see this example response in json format: > { > "responseHeader":

Re: Solrj - Sorting with range queries

2010-03-10 Thread Steve Radhouani
I noticed that, Yonik, but I couldnt figure out how to preserve the "correct" order produced by Solr. Do you have any idea how to preserve it? Thanks. 2010/3/10 Yonik Seeley > On Wed, Mar 10, 2010 at 9:49 AM, Steve Radhouani > wrote: > > Thanks Erik. That's what I've done so far, but I was won

Re: Highlighting

2010-03-10 Thread Joe Calderon
no, thats not the case, see this example response in json format: { "responseHeader":{ "status":0, "QTime":0, "params":{ "indent":"on", "q":"title_edge:fami", "hl.fl":"title_edge", "wt":"json", "hl":"on", "rows":"1"}}, "response":{"numFound":1

Boundary match as part of query language?

2010-03-10 Thread Jan Høydahl / Cominvent
Hi, Sometimes you need to anchor your search to start/end of field. Example: 1. title=New York Yankees 2. title=New York 3. title=York If I search title:"New York", or title:"York" I would get a match, but I'd like to anchor my search to beginning and/or end of the field, e.g. with regex synta

Re: Solrj - Sorting with range queries

2010-03-10 Thread Yonik Seeley
On Wed, Mar 10, 2010 at 9:49 AM, Steve Radhouani wrote: > Thanks Erik. That's what I've done so far, but I was wondering wether there > is a "cleaner" solution from Solr itself. Hmmm, AFAIK Solr itself will currently return the facet.query results in the order they were specified. So perhaps this

Fwd: Highlighting

2010-03-10 Thread Lee Smith
Im am getting results no problem with the query. But from what I believe it should wrap around the text in the result. So if I search ie Andrew within the return content Ie would have the contents with the word Andrew and hl.fl=attr_content Thank you for you help Begin forwarded message: >

Re: Highlighting

2010-03-10 Thread Joe Calderon
just to make sure were on the same page, youre saying that the highlight section of the response is empty right? the results section is never highlighted but a separate section contains the highlighted fields specified in hl.fl= On Wed, Mar 10, 2010 at 5:23 AM, Ahmet Arslan wrote: > > >> Yes Cont

Re: get english spell dictionary

2010-03-10 Thread michaelnazaruk
I combine index spell with file dictionary spell! -- View this message in context: http://old.nabble.com/english-%28american%29-spell-dictionary-tp27778741p27850983.html Sent from the Solr - User mailing list archive at Nabble.com.

Re: Solrj - Sorting with range queries

2010-03-10 Thread Steve Radhouani
Thanks Erik. That's what I've done so far, but I was wondering wether there is a "cleaner" solution from Solr itself. -Steve 2010/3/10 Erik Hatcher > You're going to have to employ some UI work to make the arrangement how you > like. Order from Solr isn't going to be guaranteed. > >Eri

Re: distinct on my result

2010-03-10 Thread gwk
Hi, The autosuggest core is filled by a simple script (written in PHP) which request facet values for all the possible strings one can search for and adds them one by one as a document. Our case has some special issues due to the fact that we search in multiple languages (Typing "España" will

Re: SolrConfig - constructing the object

2010-03-10 Thread Kimberly Kantola
My first thought after reading this : "ouch!". My second thought after reading this : "Hey, I think this guys is right". :) So, I am new to all things Solr, and am working with some code which creates some new Handlers to be called by the Solr app. What I am doing is updating these classes

Re: distinct on my result

2010-03-10 Thread stocki
hey. okay. thx my suggestion run in another core ;) do you distinct during the import with DIH ? gwk-4 wrote: > > Hi, > > I ran into the same issue, and what I did (at > http://www.mysecondhome.co.uk/) was to create a separate core just for > autosuggest which is fully updated once an h

embedded server / servlet container

2010-03-10 Thread Dino Di Cola
Dear all, I am trying to provide an embedded server to a web application deployed in a servlet container (like tomcat). I would like to call a method like this protected void configSolrServer() throws ConfigException { String idxHome = systemProp.getProperty(Configuration.Key.IDX_HOME)

Re: distinct on my result

2010-03-10 Thread gwk
Hi, I ran into the same issue, and what I did (at http://www.mysecondhome.co.uk/) was to create a separate core just for autosuggest which is fully updated once an hour which contains the distinct values of the items I want to look for including the count so I can display the approximate amou

Re: get english spell dictionary

2010-03-10 Thread Jan Høydahl / Cominvent
You probably don't want to include words in your dictionary which are not in your index. Have you tried Solr's feature to generate spellcheck dictionary from one or more of your index fields? -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com Training in Europe - www.solr

distinct on my result

2010-03-10 Thread stocki
hello. i implement my suggest-function with edgengramfilter. now when i get my result , is the result not distinct. often ist the name double or more. is it possible that solr gives me only distinct result ? "response":{"numFound":172,"start":0,"docs":[ { "name":"Halloween"},

Re: Issue on stopword list

2010-03-10 Thread Jan Høydahl / Cominvent
Also, eDisMax query parser will be a welcome tool for these kinds of requirements: https://issues.apache.org/jira/browse/SOLR-1553 From the feature list: "advanced stopword handling... stopwords are not required in the mandatory part of the query but are still used (if indexed) in the proximity

Re: Solrj - Sorting with range queries

2010-03-10 Thread Erik Hatcher
You're going to have to employ some UI work to make the arrangement how you like. Order from Solr isn't going to be guaranteed. Erik On Mar 10, 2010, at 5:37 AM, Steve Radhouani wrote: Using Solrj, I would like to sort the response of a range query based on some specific labels.

Re: 2 indexes - 1 Faceted search

2010-03-10 Thread Erick Erickson
Not really SOLR doesn't have a way to implement this kind of thing. You'd have to do some sort of post-processing of the results I think. HTH Erick 2010/3/10 André Maldonado > Hi all. > > I have 2 indexes with different structures, but a few equal fields. There > is some way to do a faceted se

Re: Highlighting

2010-03-10 Thread Ahmet Arslan
> Yes Content is stored and I get same > results adding that parameter. > > Still not highlighting the content :-( > > Any other ideas What is the field type of attr_content? And what is your query? Are you running your query on another field and then requesting snippets from attr_content?

Re: Highlighting

2010-03-10 Thread Lee Smith
Yes Content is stored and I get same results adding that parameter. Still not highlighting the content :-( Any other ideas Lee On 9 Mar 2010, at 23:14, Ahmet Arslan wrote: > >> Yes it shows when I run the debug >> >> -> name="org.apache.solrhandler.component.HighlightComponent"> >> 0.0

Re: Dynamic Solr indexing

2010-03-10 Thread Jan Høydahl / Cominvent
Hi, Yes, it will be a really nice package. I think the aim is to keep the ZK stuff optional, which can be nice for small installs or upgrading without embracing the ZK parts. All of this is still in the beginning of development. Much of the cloud stuff is aimed at 1.5 but there are as usual no

Re: how to do partial word searches?

2010-03-10 Thread Rob Ganly
hi all, i was having the same problem, i needed to be able to search a substring anywhere within a word for a specific field. i used the NGramTokenizerFactory factory in my index analyzer and it seems to work well. ( http://lucene.apache.org/solr/api/org/apache/solr/analysis/NGramTokenizerFactory

Solrj - Sorting with range queries

2010-03-10 Thread Steve Radhouani
Using Solrj, I would like to sort the response of a range query based on some specific labels. For instance, using the query: facet=true &facet.query={!key= Less than 100}[* TO 99] &facet.query={!key=100 - 200}[100 TO 200] &facet.query={!key=200 +}[201 TO *] I would like to display the response i

2 indexes - 1 Faceted search

2010-03-10 Thread André Maldonado
Hi all. I have 2 indexes with different structures, but a few equal fields. There is some way to do a faceted search (faceting the equal fields) in the 2 indexes at same time, with only one response (like a UNION in sql server)? Thank's "Então aproximaram-se os que estavam no barco, e adoraram-

Scaling indexes with high document count

2010-03-10 Thread Peter S
Hello, I wonder if anyone might have some insight/advice on index scaling for high document count vs size deployments... The nature of the incoming data is a steady stream of, on average, 4GB per day. Importantly, the number of documents inserted during this time is ~7million (i.e. lots of sm

RE: search and count ocurrences

2010-03-10 Thread Frederico Azeiteiro
Thanks Chris. Could something like that be implemented in c# ? :) Does anyone has any link where I can start digging? This is not an urgent matter, just something to investigate and implement in a near future. Frederico -Original Message- From: Chris Hostetter [mailto:hossman_luc...@f

Scaling indexes with high document count

2010-03-10 Thread Peter Sturge
Hello, I wonder if anyone might have some insight/advice on index scaling for high document count vs size deployments... The nature of the incoming data is a steady stream of, on average, 4GB per day. Importantly, the number of documents inserted during this time is ~7million (i.e. lots of small

Scaling indexes with high document count

2010-03-10 Thread Peter Sturge
Hello, I wonder if anyone might have some insight/advice on index scaling for high document count vs size deployments... The nature of the incoming data is a steady stream of, on average, 4GB per day. Importantly, the number of documents inserted during this time is ~7million (i.e. lots of small