Re: Seattle Hadoop/Scalability/NoSQL Meetup Tonight!

2010-02-25 Thread Bradford Stephens
Thanks for coming, everyone! We had around 25 people. A *huge* success, for Seattle. And a big thanks to 10gen for sending Richard. Can't wait to see you all next month. On Wed, Feb 24, 2010 at 2:15 PM, Bradford Stephens wrote: > The Seattle Hadoop/Scalability/NoSQL (yeah, we vary the title) mee

Is it possible to disable fieldNorm?

2010-02-25 Thread Jason Chaffee
I would like to either disable fieldNorm in the scoring or make sure that it is the same for all documents. I am creating EdgeNGrams, and that can cause the number of terms for a document to vary, but I do not want it to affect the scoring for this field. Is there an easy way to do this?

Warning : no lockType configured for...

2010-02-25 Thread Mani EZZAT
Hello, I have this warning even though (I think) everything is set correctly. I'm dynamically creating cores with a new index, using the same schema and solrconfig.xml. I looked at the Solr code (SolrCore, Config, SolrConfig, SolrIndexWriter etc...) and everything seems fine to me. The log prov

Re: Autosuggest/Autocomplete with solr 1.4 and EdgeNGrams

2010-02-25 Thread Sachin
Hello Joe, The WhitespaceTokenizerFactory seems to have done the trick. I will keep it like this for now and closely monitor whether using EdgeNGrams has any performance implications, but for now this works like a charm. Thanks! -Original Message- From: Joe Calder

Re: If you could have one feature in Solr...

2010-02-25 Thread Stefano Cherchi
Grant, I'm not a Java developer but a sysadmin, and I've been struggling for a couple of months now to build a full web search engine stack based on Hadoop + Nutch + Solr. I don't know much about the documentation for developers so I trust you if you say it's good. What I do know is that I fou

Re: Seattle Hadoop/Scalability/NoSQL Meetup Tonight!

2010-02-25 Thread Tim Terlegård
2010/2/25 Bradford Stephens : > Thanks for coming, everyone! We had around 25 people. A *huge* > success, for Seattle. And a big thanks to 10gen for sending Richard. > > Can't wait to see you all next month. Did anyone record the event? /Tim

unexpected result using OR in query

2010-02-25 Thread György Frivolt
Hi, I ran into an unexpected behaviour in Solr's query parsing. I need to fetch articles which contain several expressions. However, I noticed the following behaviour: - when I fetch results for query A I get a number of results X - for query B I get a number of results Y - for query A B a

SolrJ commit options

2010-02-25 Thread gunjan_versata
Hi, We are using SolrJ to handle commits to our Solr server. All runs fine, but whenever the commit happens, the server becomes slow and stops responding, thereby resulting in timeout errors on our production. We are using the default commit with waitFlush = true, waitSearcher = true... Can I

Re: If you could have one feature in Solr...

2010-02-25 Thread Robert Muir
Gora, have you tried the Hindi Analyzer in lucene? if you add it to lucene, the results exceed at least everything from FIRE 2008. So I don't really understand where you are getting this information! > Actually, the state of the art for NLP in Indian languages is > quite poor, at least in the o

Re: SolrJ commit options

2010-02-25 Thread Shalin Shekhar Mangar
On Thu, Feb 25, 2010 at 5:34 PM, gunjan_versata wrote: > > We are using SolrJ to handle commits to our solr server.. All runs fine.. > But whenever the commit happens, the server becomes slow and stops > responding.. thereby resulting in TimeOut errors on our production. We are > using the default

Re: If you could have one feature in Solr...

2010-02-25 Thread Gora Mohanty
On Thu, 25 Feb 2010 07:37:33 -0500 Robert Muir wrote: > Gora, have you tried the Hindi Analyzer in lucene? if you add it > to lucene, the results exceed at least everything from FIRE 2008. [...] Oh! No, sorry, I haven't. So far, I have only looked at search through Solr, and I guess I definitely

Re: If you could have one feature in Solr...

2010-02-25 Thread Robert Muir
Gora, I wonder perhaps if there is a documentation issue. e.g. Thai, Arabic, Chinese were mentioned here previously, these are all supported, too. Let me know if you have any ideas! On Thu, Feb 25, 2010 at 7:45 AM, Gora Mohanty wrote: > On Thu, 25 Feb 2010 07:37:33 -0500 > Robert Muir wrote:

CoreAdmin

2010-02-25 Thread Sudhakar_Thangavel
Hi, I am new to Solr. The wiki is not clear to me. Can anyone tell me how to configure CoreAdmin? I need step-by-step instructions. -- View this message in context: http://old.nabble.com/CoreAdmin-tp27714439p27714439.html Sent from the Solr - User mailing list archive at Nabble.com.

CoreAdmin

2010-02-25 Thread Sudhakar_Thangavel
Hi, I am new to Solr. The wiki is not clear to me. Can anyone tell me how to configure CoreAdmin? I need step-by-step instructions. -- View this message in context: http://old.nabble.com/CoreAdmin-tp27714440p27714440.html Sent from the Solr - User mailing list archive at Nabble.com.

Removing duplicate values from multivalued fields

2010-02-25 Thread muneeb
Hi, Is there a way to remove duplicate values from the multivalued fields? I am using Solrj client with solr 1.4 version. Thanks in advance, -Ali -- View this message in context: http://old.nabble.com/Removing-duplicate-values-from-multivalued-fields-tp27714441p27714441.html Sent from the So

Re: unexpected result using OR in query

2010-02-25 Thread Ahmet Arslan
--- On Thu, 2/25/10, György Frivolt wrote: >     I run into an unexpected behaviour for Solr > with query parsing. I need > to fetch article which contain several expressions. > However, I noticed the > following behaviour: > > - when I fetch results for query A I get a number of > results X >

Re: Is it possible to disable fieldNorm?

2010-02-25 Thread Ahmet Arslan
> I would like to either disable > fieldNorm in the scoring or make sure that it is the same > for all documents.  I am creating EdgeNGrams and that > can cause the number of terms for a document to be variable, > but I do not want it do affect the scoring for this > field.  Is there an easy way t

Solr Extract

2010-02-25 Thread Lee Smith
Hey all, I am having a go at extracting some files as per the wiki guide. I cd to the root directory of the folder and run the command with no success, apart from some broken HTML. If you look here: http://screencast.com/t/MGRiZTU5M it might help to understand what I'm doing wrong. hope someon

Re: CoreAdmin

2010-02-25 Thread Siddhant Goel
Hi, Did you *really* go through this page - http://wiki.apache.org/solr/CoreAdmin ? On Thu, Feb 25, 2010 at 7:40 PM, Sudhakar_Thangavel wrote: > > Hi, >Am new to Solr .Am not getting clearly in wiki..can any one tell me > how to configure coreAdmin i need step by step instruction.. > > >

Schema configurations for setting non-case sensitive search and matching partial word in a search string

2010-02-25 Thread Turner, Robbin J
This is probably stated somewhere, but I've looked and am obviously missing it. Is there a specific field type analyzer or setting for case sensitivity? And is there something on the WordDelimiterFilterFactory to allow for a match if it's not the whole word in the query, such as animal matching animals?

Re: If you could have one feature in Solr...

2010-02-25 Thread Gora Mohanty
On Thu, 25 Feb 2010 07:54:06 -0500 Robert Muir wrote: > Gora, I wonder perhaps if there is a documentation issue. > > e.g. Thai, Arabic, Chinese were mentioned here previously, these > are all supported, too. > > Let me know if you have any ideas! Sorry, are you saying that these are available

Re: sorting

2010-02-25 Thread Claudio Martella
Chris Hostetter wrote: > :title^1.2 contentEN^0.8 contentIT^0.8 contentDE^0.8 > :title^1.2 contentEN^0.8 contentIT^0.8 contentDE^0.8 > > FWIW: I don't think you understand what the "bf" param is for ... it's not > analogous to qf and pf, it's for expressing a list of boost functions -- a

Issues with spell checker

2010-02-25 Thread cjkadakia
First of all, I want to thank you guys for your help thus far. It's been very useful during my search-engine integration project. :) So I'm having a few issues with the spell checking component of Solr. For reference, I'm using Solr 1.4. First, I'm not getting any search results period. Here's m

new/first searcher

2010-02-25 Thread solrquestion6
Hi, Is it the wrong approach to have the same warmup queries in both new and first searcher? The wiki shows a sorting query for the newSearcher and the same sorting query plus facet/filter queries for the firstSearcher. -- View this message in context: http://old.nabble.com/new-first-search

Re: Schema configurations for setting non-case sensitive search and matching partial word in a search string

2010-02-25 Thread Erick Erickson
Pipe things through LowerCaseFilterFactory to turn everything into lower case, at both index and query time, assuming you want to perform caseless matches. Depending on the behavior you want as far as "partial word matches", it depends (tm). The specific example you cite would be handled by stemmi
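
As a rough illustration of the lowercasing advice above: the toy Python sketch below is not Solr code, and `analyze`, `build_index`, and `search` are hypothetical names made up for this example. It only tries to show why the same lowercasing step has to run at both index time and query time for matches to be caseless. It does not attempt the partial-word part of the question, so "animal" and "animals" remain different terms here.

```python
def analyze(text):
    """Toy analyzer: split on whitespace, then lowercase every token."""
    return [token.lower() for token in text.split()]


def build_index(docs):
    """Build an inverted index mapping each analyzed term to a set of doc ids."""
    index = {}
    for doc_id, text in docs.items():
        for term in analyze(text):
            index.setdefault(term, set()).add(doc_id)
    return index


def search(index, query):
    """Return the ids of documents that contain every analyzed query term."""
    terms = analyze(query)  # same analysis as at index time
    if not terms:
        return set()
    result = set(index.get(terms[0], set()))
    for term in terms[1:]:
        result &= index.get(term, set())
    return result


# Made-up sample documents, purely for illustration.
docs = {1: "Animal Shelter", 2: "wild ANIMALS", 3: "Plant nursery"}
index = build_index(docs)

print(search(index, "ANIMAL"))   # matches doc 1 regardless of case
print(search(index, "animals"))  # matches doc 2 only; no stemming is applied
```

If the lowercasing were applied on only one side, a query for "ANIMAL" would look up a term that was never stored in that form, which is why the analysis needs to be the same at index and query time.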

Delta Query - DIH

2010-02-25 Thread JavaGuy84
Hi, My data config looks like below. I am able to successfully run the Full-Import query without any issue. I am not sure how I can implement a delta query, as each of the tables gets updated independently and I need the updates of that particular table to get reflected independently (in

DIH Issue in Delta Query

2010-02-25 Thread JavaGuy84
Hi, My data config looks like below. I am able to successfully run the Full-Import query without any issue. I am not sure how I can implement a delta query, as each of the tables gets updated independently and I need the updates of that particular table to get reflected in s

Re: Performance issue in indexing the data with DIH when using subqueries

2010-02-25 Thread JavaGuy84
Thanks a lot Shalin.. This resolved my issue :). Thanks, Barani Shalin Shekhar Mangar wrote: > > On Tue, Feb 23, 2010 at 1:01 AM, JavaGuy84 wrote: > >> >> Hi, >> >> I am facing a performance issue when I am trying to index the data using >> DIH.. I have a model as below >> >> Tables >> ---

Re: Extended stats via JMX

2010-02-25 Thread Shalin Shekhar Mangar
On Thu, Feb 25, 2010 at 10:56 AM, Dan Trainor wrote: > > Right now, being inexperienced with JMX and all, I was wondering if there > was a way to pull all Solr-specific items out of there. I see some general > counters pertaining to each of my Solr instances, but nothing along the > lines of loo

Re: If you could have one feature in Solr...

2010-02-25 Thread Ron Mayer
Erik Hatcher wrote: > Ron - I think SOLR-792 meets the need you describe. What do you think? > It's "tree faceting", allowing you to facet down 2 levels deep > arbitrarily on any two fields. Ideally we'd enhance it to be of > arbitrary depth too. Nice! It certainly handles my main use case. Th

Re: If you could have one feature in Solr...

2010-02-25 Thread Gora Mohanty
On Thu, 25 Feb 2010 13:06:03 -0500 Robert Muir wrote: > Yeah, Thai and Arabic have the stuff in Solr 1.4 > For Chinese, if you want to do CJK bigram indexing, this is there > too. If you want to do word-based "smart" indexing, you need to > add an additional jar file to your classpath. OK, but u

Re: If you could have one feature in Solr...

2010-02-25 Thread Shawn Heisey
I would like to be able to do a delta import on arbitrary data, not a last modified date. Specifically, our database has an auto_increment field called DID, or document identifier. For changes to existing data, this field is updated any time a row is changed in any way, effectively turning it

Re: If you could have one feature in Solr...

2010-02-25 Thread Robert Muir
Yeah, Thai and Arabic have the stuff in Solr 1.4 For Chinese, if you want to do CJK bigram indexing, this is there too. If you want to do word-based "smart" indexing, you need to add an additional jar file to your classpath. we can add a wiki page with examples of how to use these maybe to make it

Re: Extended stats via JMX

2010-02-25 Thread Matthew Runo
https://issues.apache.org/jira/browse/SOLR-1750 might help you, since I don't think that all of stats.jsp is exposed via MBeans. I could be wrong about that though.. (apologies, our solr servers are firewalled and I can't connect via JMX at the moment) Thanks for your time! Matthew Runo Softwa

Solr Cell RTF Woes

2010-02-25 Thread Bill Engle
Any RTF file I tried to index in Solr 1.4 throws these errors out. I have no issues with doc, pdf. Any thoughts? Thanks. Apache Tomcat/6.0.18 - Error report