Thanks for coming, everyone! We had around 25 people. A *huge*
success, for Seattle. And a big thanks to 10gen for sending Richard.
Can't wait to see you all next month.
On Wed, Feb 24, 2010 at 2:15 PM, Bradford Stephens
wrote:
> The Seattle Hadoop/Scalability/NoSQL (yeah, we vary the title) mee
I would like to either disable fieldNorm in the scoring or make sure that it is
the same for all documents. I am creating EdgeNGrams and that can cause the
number of terms for a document to be variable, but I do not want it do affect
the scoring for this field. Is there an easy way to do this?
Hello,
I have this warning even if (I think) everything is setted correctly
I'm dynamically creating cores with a new index, using the same schema
and solrconfig.xml
I looked at the solr code (SolrCore, Config, SolrConfig, SolrIndexWriter
etc...) and everything seems fine to me.
The log prov
Hello Joe,
The whitespacetokenizerfactory seems to have done the trick, I would for now
keep it like this and closely monitor to see if there are any performance
implications of using EdgeNGrams but for now this works like a charm. Thanks!
-Original Message-
From: Joe Calder
Grant, I'm not a java developer but a sysadmin and I've been struggling for a
couple of month now to build a full web search engine stack based on hadoop +
nutch + solr .
I don't know much about the documentation for developers so I trust you if you
say it's good.
What I do know is that I fou
2010/2/25 Bradford Stephens :
> Thanks for coming, everyone! We had around 25 people. A *huge*
> success, for Seattle. And a big thanks to 10gen for sending Richard.
>
> Can't wait to see you all next month.
Did anyone record the event?
/Tim
Hi,
I run into an unexpected behaviour for Solr with query parsing. I need
to fetch article which contain several expressions. However, I noticed the
following behaviour:
- when I fetch results for query A I get a number of results X
- for query B I get a number of results Y
- for query A B a
Hi,
We are using SolrJ to handle commits to our solr server.. All runs fine..
But whenever the commit happens, the server becomes slow and stops
responding.. therby resulting in TimeOut errors on our production. We are
using the default commit with waitFlush = true, waitSearcher = true...
Can I
Gora, have you tried the Hindi Analyzer in lucene? if you add it to lucene,
the results exceed at least everything from FIRE 2008.
So I don't really understand where you are getting this information!
> Actually, the state of the art for NLP in Indian languages is
> quite poor, at least in the o
On Thu, Feb 25, 2010 at 5:34 PM, gunjan_versata wrote:
>
> We are using SolrJ to handle commits to our solr server.. All runs fine..
> But whenever the commit happens, the server becomes slow and stops
> responding.. therby resulting in TimeOut errors on our production. We are
> using the default
On Thu, 25 Feb 2010 07:37:33 -0500
Robert Muir wrote:
> Gora, have you tried the Hindi Analyzer in lucene? if you add it
> to lucene, the results exceed at least everything from FIRE 2008.
[...]
Oh! No, sorry, I haven't. So far, I have only looked at search
through Solr, and I guess I definitely
Gora, I wonder perhaps if there is a documentation issue.
e.g. Thai, Arabic, Chinese were mentioned here previously, these are all
supported, too.
Let me know if you have any ideas!
On Thu, Feb 25, 2010 at 7:45 AM, Gora Mohanty wrote:
> On Thu, 25 Feb 2010 07:37:33 -0500
> Robert Muir wrote:
Hi,
Am new to Solr .Am not getting clearly in wiki..can any one tell me
how to configure coreAdmin i need step by step instruction..
--
View this message in context:
http://old.nabble.com/CoreAdmin-tp27714439p27714439.html
Sent from the Solr - User mailing list archive at Nabble.com.
Hi,
Am new to Solr .Am not getting clearly in wiki..can any one tell me
how to configure coreAdmin i need step by step instruction..
--
View this message in context:
http://old.nabble.com/CoreAdmin-tp27714440p27714440.html
Sent from the Solr - User mailing list archive at Nabble.com.
Hi,
Is there a way to remove duplicate values from the multivalued fields? I am
using Solrj client with solr 1.4 version.
Thanks in advance,
-Ali
--
View this message in context:
http://old.nabble.com/Removing-duplicate-values-from-multivalued-fields-tp27714441p27714441.html
Sent from the So
--- On Thu, 2/25/10, György Frivolt wrote:
> I run into an unexpected behaviour for Solr
> with query parsing. I need
> to fetch article which contain several expressions.
> However, I noticed the
> following behaviour:
>
> - when I fetch results for query A I get a number of
> results X
>
> I would like to either disable
> fieldNorm in the scoring or make sure that it is the same
> for all documents. I am creating EdgeNGrams and that
> can cause the number of terms for a document to be variable,
> but I do not want it do affect the scoring for this
> field. Is there an easy way t
Hey All
I am having a go at extracting some file as per the wiki guide.
I cd to the root directory of the folder and run the command with no success
apart from some broken HTML
If you see this here: http://screencast.com/t/MGRiZTU5M
It might help to understand what Im doing wrong.
hope someon
Hi,
Did you *really* go through this page -
http://wiki.apache.org/solr/CoreAdmin ?
On Thu, Feb 25, 2010 at 7:40 PM, Sudhakar_Thangavel
wrote:
>
> Hi,
>Am new to Solr .Am not getting clearly in wiki..can any one tell me
> how to configure coreAdmin i need step by step instruction..
>
>
>
This is probably stated somewhere, but I've look an obviously missing it. But
is there a specific field type analyzer or setting for case sensitivity? And
something on the worddelimiterfilterfactor to allow for a match if it's not the
whole word in the query, such as animal will match animals?
On Thu, 25 Feb 2010 07:54:06 -0500
Robert Muir wrote:
> Gora, I wonder perhaps if there is a documentation issue.
>
> e.g. Thai, Arabic, Chinese were mentioned here previously, these
> are all supported, too.
>
> Let me know if you have any ideas!
Sorry, are you saying that these are available
Chris Hostetter wrote:
> :title^1.2 contentEN^0.8 contentIT^0.8 contentDE^0.8
> :title^1.2 contentEN^0.8 contentIT^0.8 contentDE^0.8
>
> FWIW: I don't think you understand what the "bf" param is for ... it's not
> analogous to qf and pf, it's for expressing a list of boost functions -- a
First of all, I want to thank you guys for your help thus far. It's been very
useful during my search-engine integration project. :)
So I'm having a few issues with the spell checking component of Solr. For
reference, I'm using Solr 1.4.
First, I'm not getting any search results period. Here's m
Hi,
Is it the wrong approach to have the same warmup queries in both new and
first searcher? The wiki shows a sorting query for the newSearcher and the
same sorting query plus facet/filter queries for the firstSearcher.
--
View this message in context:
http://old.nabble.com/new-first-search
Pipe things through LowerCaseFilterFactory to turn everything into lower
case, at both index and query time, assuming you want to perform caseless
matches.
Depending on the behavior you want as far as "partial word matches", it
depends (tm). The specific example you cite would be handled by stemmi
Hi,My data config looks like below,
I am able to successfully run the
Full-Import query without any issue. I am not sure how can I implement a
delta query as each of the tables get updated independantly and I need the
updates of that particular table to get reflected independently (in
Hi,
My data config looks like below
I am able to successfully run the Full-Import query without any issue. I am
not sure how can I implement a delta query as each of the tables get updated
independantly and I need the updates of that particular table to get
reflected in s
Thanks a lot Shalin.. This resolve my issue :).
Thanks,
Barani
Shalin Shekhar Mangar wrote:
>
> On Tue, Feb 23, 2010 at 1:01 AM, JavaGuy84 wrote:
>
>>
>> Hi,
>>
>> I am facing a performace issue when I am trying to index the data using
>> DIH.. I have a model as below
>>
>> Tables
>> ---
On Thu, Feb 25, 2010 at 10:56 AM, Dan Trainor wrote:
>
> Right now, being inexperienced with JMX and all, I was wondering if there
> was a way to pull all Solr-specific items out of there. I see some general
> counters pertaining to each of my Solr instances, but nothing along the
> lines of loo
Erik Hatcher wrote:
> Ron - I think SOLR-792 meets the need you describe. What do you think?
> It's "tree faceting", allowing you to facet down 2 levels deep
> arbitrarily on any two fields. Ideally we'd enhance it to be of
> arbitrary depth too.
Nice! It certainly handles my main use case.
Th
On Thu, 25 Feb 2010 13:06:03 -0500
Robert Muir wrote:
> Yeah, Thai and Arabic have the stuff in Solr 1.4
> For Chinese, if you want to do CJK bigram indexing, this is there
> too. If you want to do word-based "smart" indexing, you need to
> add an additional jar file to your classpath.
OK, but u
I would like to be able to do a delta import on arbitrary data, not a
last modified date. Specifically, our database has an auto_increment
field called DID, or document identifier. For changes to existing data.
this field is updated anytime a row is changed in any way, effectively
turning it
Yeah, Thai and Arabic have the stuff in Solr 1.4
For Chinese, if you want to do CJK bigram indexing, this is there too.
If you want to do word-based "smart" indexing, you need to add an additional
jar file to your classpath.
we can add a wiki page with examples of how to use these maybe to make it
https://issues.apache.org/jira/browse/SOLR-1750 might help you, since I don't
think that all of stats.jsp is exposed via MBeans. I could be wrong about that
though.. (apologies, our solr servers are firewalled and I can't connect via
JMX at the moment)
Thanks for your time!
Matthew Runo
Softwa
Any RTF file I tried to index in Solr 1.4 throws these errors out. I have
no issues with doc, pdf. Any thoughts? Thanks.
Apache Tomcat/6.0.18 - Error report