Re: Using XSLT with DIH for a URLDataSource

2010-02-24 Thread Noble Paul നോബിള്‍ नोब्ळ्
You are right. The StreamSource class is not throwing the proper exception. Do we really have to handle this? On Thu, Feb 25, 2010 at 9:06 AM, Lance Norskog wrote: > [Taken off the list] > > The problem is that the XSLT code swallows the real exception, and > does not return it as the "deeper" e

Extended stats via JMX

2010-02-24 Thread Dan Trainor
Hi - This is my first night working with JMX, particularly for the purpose of querying Solr statistics running under Tomcat. Before, I was trying to xpath to stats.jsp which just felt dirty. I gotta say - I think this is pretty neat. Right now, being inexperienced with JMX and all, I was wond

Any way to recover a corrupt index from a "live" IndexReader?

2010-02-24 Thread Trey
Hi All, It seems I have a corrupt index on disk on my Master, but the live IndexReader is still working. I don't want to restart Solr (1.4), because I'm pretty sure the corrupt index will be loaded upon restart, causing me to delete and rebuild the index from source. Is there any way to restore

Re: If you could have one feature in Solr...

2010-02-24 Thread Matthew Rushton
Real time search would be awesome. -Matt

Re: If you could have one feature in Solr...

2010-02-24 Thread Gora Mohanty
On Wed, 24 Feb 2010 15:49:15 +0100 Markus Jelsma wrote: > Well, i don't have a specific request in mind. However, i can > imagine a growing internet market for thai, chinese and arabic > speaking people and the native languages on the african > continent. Providing them with stemmers to handle plur

Re: Highlighting inside a field with HTML contents

2010-02-24 Thread Lance Norskog
Yes, the raw HTML will have word inserted. This may put markup where you did not intend. On Mon, Feb 22, 2010 at 7:24 AM, Xavier Schepler wrote: > Hello, > > this field would not be searched, but it would be used to display results. > > A query could be : > > q=table&hl=true&hl.fl=htmlfield&hl.fr

Re: Using XSLT with DIH for a URLDataSource

2010-02-24 Thread Lance Norskog
[Taken off the list] The problem is that the XSLT code swallows the real exception, and does not return it as the "deeper" exception. To show the right error, the code would open a file name or a URL directly. The problem is, the code has to throw an exception on a file or a URL and try the oth

Re: If you could have one feature in Solr...

2010-02-24 Thread Andy
1) Built-in hierarchical faceting Right now there're 2 patches, SOLR-64 and SOLR-792. SOLR-64 seems to be slated for 1.5 release but according to the wiki seems to have poor performance. SOLR-792 has better performance according to the wiki but it's unclear if it'll ever be part of the Solr dist

Re: If you could have one feature in Solr...

2010-02-24 Thread Andy
The Solr documentation feels more like a reference guide detailing all the APIs. It's great for more advanced users, but as a beginner I often feel lost reading the doc. It would be really helpful to have a more step-by-step, tutorial approach in the doc showing how to do things with tips & tr

Re: If you could have one feature in Solr...

2010-02-24 Thread Ron Mayer
Grant Ingersoll wrote: > What would it be? * Run a MapReduce-like job on all docs matching the results of a search? I'm currently working on an app where I hope to be able to do a query (hopefully using solr) and generate a map where every state (or county or zip-code or school district or police

Free Webinar: Mastering Solr 1.4 with Yonik Seeley

2010-02-24 Thread Yonik Seeley
I'd like to invite you to join me for an in-depth review of Solr's powerful, versatile new features and functions. The free webinar, sponsored by my company, Lucid Imagination, covers an intensive "how-to" for the features you need to make the most of Solr for your search application: * Faceti

Re: If you could have one feature in Solr...

2010-02-24 Thread fergus mcmenemie
Chipping in: the wiki-based nature of solr's documentation is rather different compared to most payware and some open source products. However, once you get used to its "style" I found it quite adequate. It also dawned on me that portions of Solr are advancing very quickly and that the wiki styl

Re: stop words make dismax fail

2010-02-24 Thread Ahmet Arslan
> The problem is that when a user enters a query like 'meet > the president', zero results are returned. I imagine it has > something to do with 'the' being stripped out, then only 2 > of the 3 terms matching. As a temporary workaround I set > minshouldmatch to 1 so I do get results. That causes ot
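The hypothesis quoted above (three query terms, one of them a stopword, so at most two can match) can be illustrated with a toy model. This is only a sketch of the "minimum should match" idea under stated assumptions: `toy_match` and the sample documents are hypothetical, and the code does not reproduce how Solr's dismax handler actually parses or scores a query.

```python
def toy_match(query_terms, doc_terms, min_should_match):
    """Toy model: a document matches when at least `min_should_match`
    query terms occur in it. This is not Solr's real matching logic."""
    matched = sum(1 for term in query_terms if term in doc_terms)
    return matched >= min_should_match


query = ["meet", "the", "president"]

# Assumption: "the" was removed from the indexed document as a stopword,
# but is still counted as a required query term.
relevant_doc = {"meet", "president"}
unrelated_doc = {"president", "speech"}

# Requiring all three terms rejects even the relevant document.
all_required = toy_match(query, relevant_doc, 3)

# Requiring only one term accepts the relevant document, but also
# accepts a document that merely mentions "president".
relaxed_relevant = toy_match(query, relevant_doc, 1)
relaxed_unrelated = toy_match(query, unrelated_doc, 1)
```

In this model, lowering the threshold to 1 brings the expected result back but also lets loosely related documents through, which is consistent with the workaround causing other problems.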

stop words make dismax fail

2010-02-24 Thread Nagelberg, Kallin
I'm having a problem when users enter stopwords in their query. I'm using a dismax request handler against a field set up like:

Re: Spell check returns strange suggestion

2010-02-24 Thread darniz
Adding on top of this, there are business requirements we need to meet. Let me add that we have a huge amount of content with make, model and their description. Right now we are copying make, model and description to our spell check field. So our spellcheck fields have words like honda, civic etc.. We are

Seattle Hadoop/Scalability/NoSQL Meetup Tonight!

2010-02-24 Thread Bradford Stephens
The Seattle Hadoop/Scalability/NoSQL (yeah, we vary the title) meetup is tonight! We're going to have a guest speaker from MongoDB :) As always, it's at the University of Washington, Allen Computer Science building, Room 303 at 6:45pm. You can find a map here: http://www.washington.edu/home/maps/s

Re: Date Facets

2010-02-24 Thread Liam O'Boyle
In response to myself, The problem occurs because the date ranges are inclusive. I can fix this by making facet.date.gap = +1MONTH-1SECOND, but is there a way to specify that the upper bound is exclusive, rather than inclusive? Liam On Wed, 2010-02-24 at 16:54 +1100, Liam O'Boyle wrote: > After

Parse query logs

2010-02-24 Thread Jason Rutherglen
Greetings, Is there code in Solr land, or perhaps code someone's written that parses the Solr Tomcat log entries for given queries and their times? Thanks! Jason
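A minimal sketch of such a parser, assuming request log lines of the form `... path=/select params={q=...} hits=N status=N QTime=N`. The sample line, the regular expression, and the names `LINE_RE` and `parse_line` are illustrative assumptions; the actual format depends on the Solr version and the logging configuration, so the pattern would need adjusting against real log output.

```python
import re
from urllib.parse import parse_qs

# Assumed line layout; adjust to match the real log output.
LINE_RE = re.compile(
    r"path=(?P<path>\S+) params=\{(?P<params>.*)\} "
    r"(?:hits=(?P<hits>\d+) )?status=(?P<status>\d+) QTime=(?P<qtime>\d+)"
)


def parse_line(line):
    """Return the query and timing of one request line, or None if the
    line does not look like a request entry."""
    m = LINE_RE.search(line)
    if m is None:
        return None
    params = parse_qs(m.group("params"), keep_blank_values=True)
    return {
        "path": m.group("path"),
        "q": params.get("q", [None])[0],
        "hits": int(m.group("hits")) if m.group("hits") else None,
        "status": int(m.group("status")),
        "qtime_ms": int(m.group("qtime")),
    }


# Hypothetical sample line, not taken from a real log.
SAMPLE = ("INFO: [] webapp=/solr path=/select "
          "params={q=solr&rows=10} hits=42 status=0 QTime=7")
record = parse_line(SAMPLE)
```

Feeding each line of the Tomcat log through `parse_line` and skipping `None` results would give a list of queries with their times, which could then be sorted or aggregated.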

Re: If you could have one feature in Solr...

2010-02-24 Thread Israel Ekpo
Grant, One feature that I would like to see is the ability to do a bitwise search. I have had to work around this with a Query Parser plugin that uses an org.apache.lucene.search.Filter. I think having this feature would be very nice and I prefer it to searching with multiple OR type queries especi

Re: Autosuggest/Autocomplete with solr 1.4 and EdgeNGrams

2010-02-24 Thread Smiley, David W.
On Feb 24, 2010, at 1:17 AM, Sachin wrote: > Hi All, > > I am trying to setup autosuggest using solr 1.4 for my site and needed some > pointers on that. Basically, we provide autosuggest for user typed in > characters in the searchbox. The autosuggest index is created with older user > typed
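For readers unfamiliar with the edge n-grams named in the subject line, the idea can be sketched outside Solr: every indexed word is expanded into its leading prefixes, so whatever the user has typed so far can be looked up as an exact key. The functions `edge_ngrams`, `build_index` and `suggest` and the sample phrases are hypothetical; this is a plain-Python illustration, not the Solr filter or its configuration.

```python
from collections import defaultdict


def edge_ngrams(term, min_gram=1, max_gram=15):
    """Return the leading prefixes of `term` between the two lengths."""
    upper = min(len(term), max_gram)
    return [term[:n] for n in range(min_gram, upper + 1)]


def build_index(phrases, min_gram=1, max_gram=15):
    """Map every prefix of every word to the phrases containing that word."""
    index = defaultdict(set)
    for phrase in phrases:
        for word in phrase.lower().split():
            for gram in edge_ngrams(word, min_gram, max_gram):
                index[gram].add(phrase)
    return index


def suggest(index, typed):
    """Look up the typed characters as an exact prefix key."""
    return sorted(index.get(typed.lower(), set()))


# Hypothetical previously typed search phrases.
index = build_index(["solr replication", "solr faceting", "lucene scoring"])
suggestions = suggest(index, "so")
```

The trade-off is index size: each word is stored once per prefix length, which is why a maximum gram length is usually capped.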

Re: latest lucene libraries in maven repo

2010-02-24 Thread David Smiley @MITRE.org
Lucene 2.9.1 is out of course (and in repos) but the 2.9.1-dev as found in Solr's source control right now is not. This is pretty frustrating and I can only expect it will be a recurring problem. If Solr is going to use -dev versions then I think Solr needs to put them in a repo somewhere. FYI

Re: If you could have one feature in Solr...

2010-02-24 Thread straup
I actually found the documentation pretty great especially since (my experience, anyway) most Java projects seem to default to generic JavaDoc derived documentation (and that makes me cry). That said, more cookbook-style "recipes" or stories would be helpful for some of the more esoteric parts

Re: If you could have one feature in Solr...

2010-02-24 Thread Grant Ingersoll
On Feb 24, 2010, at 11:08 AM, Stefano Cherchi wrote: > Decent documentation. What parts do you feel are lacking? Or is it just across the board? Wikis are both good and bad for documentation, IMO. -Grant

Solr Cell and Deduplication - Get ID of doc

2010-02-24 Thread Bill Engle
Hi - New Solr user here. I am using Solr Cell to index files (PDF, doc, docx, txt, htm, etc.) and there is a good chance that a new file will have duplicate content but not necessarily the same file name. To avoid this I am using the deduplication feature of Solr. true id

Re: Autosuggest/Autocomplete with solr 1.4 and EdgeNGrams

2010-02-24 Thread Joe Calderon
i had to create an autosuggest implementation not too long ago, originally i was using faceting, where i would match wildcards on a tokenized field and facet on an unaltered field, this had the advantage that i could do everything from one index, though it was also limited by the fact suggestions ca

Re: If you could have one feature in Solr...

2010-02-24 Thread Stefano Cherchi
Decent documentation. S -- "Anyone proposing to run Windows on servers should be prepared to explain what they know about servers that Google, Yahoo, and Amazon don't." Paul Graham "A mathematician is a device for turning coffee into theorems." Paul Erdos (wh

Re: Autosuggest/Autocomplete with solr 1.4 and EdgeNGrams

2010-02-24 Thread Grant Ingersoll
You might also look at http://issues.apache.org/jira/browse/SOLR-1316 On Feb 24, 2010, at 1:17 AM, Sachin wrote: > > > Hi All, > > I am trying to setup autosuggest using solr 1.4 for my site and needed some > pointers on that. Basically, we provide autosuggest for user typed in > characters

Re: If you could have one feature in Solr...

2010-02-24 Thread Markus Jelsma
One additional feature within MoreLikeThis might be.. MoreLikeTHESE. This would not be the same as querying multiple documents and fetching MoreLikeThis documents for each individual result. This would then actually only return MoreLikeThis documents based on multiple documents. Another colleg

Re: If you could have one feature in Solr...

2010-02-24 Thread Paul
Limit the number of results when the results are sorted. In other words, if the results are sorted by name and there are 10,000 results, then there will be items of low relevancy mixed in with the results and it is hard for the user to find the relevant ones. If I could say, "give me no more than
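One reading of this request, sketched on the client side: keep only the N highest-scoring hits, then apply the sort to that subset. The function name `most_relevant_sorted`, the `score` and `name` keys, and the sample data are assumptions made for illustration; nothing here is an existing Solr parameter.

```python
import heapq


def most_relevant_sorted(results, limit, sort_key):
    """Keep the `limit` highest-scoring results, then order them by `sort_key`."""
    top = heapq.nlargest(limit, results, key=lambda r: r["score"])
    return sorted(top, key=lambda r: r[sort_key])


# Hypothetical result set: two strong hits and two weak ones.
results = [
    {"name": "Zebra", "score": 9.1},
    {"name": "Apple", "score": 0.2},
    {"name": "Mango", "score": 7.4},
    {"name": "Birch", "score": 0.3},
]
page = most_relevant_sorted(results, limit=2, sort_key="name")
names = [r["name"] for r in page]
```

Sorting all four results by name would put the two weak hits first; trimming by score beforehand keeps them out of the sorted list.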

Re: Documents disappearing

2010-02-24 Thread Pascal Dimassimo
Hoss, Thanks for your answers. You are absolutely right, I should have provided you more details. We index using 4 processes that read from a queue of documents. Each process sends one document at a time to the /update handler. Yes, I double checked that no deletes occur. Since that indexation,

Re: If you could have one feature in Solr...

2010-02-24 Thread Markus Jelsma
Well, i don't have a specific request in mind. However, i can imagine a growing internet market for thai, chinese and arabic speaking people and the native languages on the african continent. Providing them with stemmers to handle plurals etc. will allow for a better search experience. Also, othe

Re: If you could have one feature in Solr...

2010-02-24 Thread Jan Høydahl / Cominvent
A mature document processing pipeline, perhaps integration of www.openpipeline.org which is Apache2.0 licensed

Re: If you could have one feature in Solr...

2010-02-24 Thread Robert Muir
On Wed, Feb 24, 2010 at 9:22 AM, Markus Jelsma wrote: > > - stemmers for many more different languages > > I don't want to hijack this thread, but i would like to know which languages you are interested in! -- Robert Muir rcm...@gmail.com

Re: If you could have one feature in Solr...

2010-02-24 Thread Markus Jelsma
- performing multiple queries at once, perhaps abusing HTTP POST. In some applications there is a page that executes five different queries. The HTTP overhead is not that much of a problem but it would be a nice-to-have. - retrieving documents per facet, not unlike the results from the MoreLikeTh

Re: If you could have one feature in Solr...

2010-02-24 Thread Stephen Duncan Jr
On Wed, Feb 24, 2010 at 8:42 AM, Grant Ingersoll wrote: > What would it be? > Near real-time search & faceting. -- Stephen Duncan Jr www.stephenduncanjr.com

Index size

2010-02-24 Thread Jean-Sebastien Vachon
Hi All, I'm currently looking at integrating Solr and I'd like to have some hints on the size of the index (number of documents) I could possibly host on a server running a Double-Quad server (16 cores) with 48GB of RAM running Linux. Basically, I need to determine how many of these servers wou

Re: If you could have one feature in Solr...

2010-02-24 Thread Patrick Sauts
Synchronisation between the slaves to switch to the new index at the same time after replication. Grant Ingersoll wrote: What would it be?

If you could have one feature in Solr...

2010-02-24 Thread Grant Ingersoll
What would it be?

Re: Solr/Drupal Integration - Query Question

2010-02-24 Thread Peter Wolanin
The Drupal schema and solrconfig and the example schema and solrconfig have different fields and defaults, and likely Drupal won't find the fields it's looking for and might not even be using the right query parser. -Peter On Thu, Feb 11, 2010 at 3:19 PM, jaybytez wrote: > > So I got it to work b

Re: Solr/Drupal Integration - Query Question

2010-02-24 Thread Peter Wolanin
Can you tell me more about the rord() performance issues? I'm one of the maintainers of the Drupal module, so I'd like to switch if there is a better option. Thanks, Peter On Wed, Feb 10, 2010 at 12:00 AM, Lance Norskog wrote: > The admin/form.jsp is supposed to prepopulate fl= with '*,score'

Re: some scores to 0 using omitNorms=false

2010-02-24 Thread Raimon Bosch
We have just tested it with the last version of Solr and we still have scores to 0. adeelmahmood wrote: > > I was gonna ask a question about this but you seem like you might have the > answer for me .. wat exactly is the omitNorms field do (or is expected to > do) .. also if you could please h

Strange search behavior

2010-02-24 Thread Jan Simon Winkelmann
Hi, I'm having some problems understanding why certain search queries don't return any results. I have a field of type "text", which is defined like this:

Re: ERROR:SCHEMA-INDEX-MISMATCH

2010-02-24 Thread Erik Hatcher
I'm confused, you say it's working but then cite the same issue. Recover your index? Did you somehow update your index using those error strings? Did you back up your index before upgrading? Erik On Feb 24, 2010, at 5:06 AM, deepak agrawal wrote: Hi, Thanks for the solution...

Want to create custom inputformat to read from solr

2010-02-24 Thread Rakhi Khatwani
Hi, Has anyone tried creating a custom InputFormat which reads from a Solr index for processing using MapReduce? Is it possible to do that, and how? Regards, Raakhi

Re: ERROR:SCHEMA-INDEX-MISMATCH

2010-02-24 Thread deepak agrawal
Hi, Thanks for the solution. Now it's working. But can you please help me recover my old index, which has the fields - * ERROR:SCHEMA-INDEX-MISMATCH,stringValue=4194304 ERROR:SCHEMA-INDEX-MISMATCH,stringValue=0 ERROR:SCHEMA-INDEX-MISMATCH,stringValue=4 ERROR:SCHEMA-INDEX-MISMAT

Re: ERROR:SCHEMA-INDEX-MISMATCH

2010-02-24 Thread Erik Hatcher
The "long" field type changed from 1.3 to 1.4. Set your fields to be of type "plong", and add the plong field type to your schema: You'll see more about this in Solr's example schema.xml comments. Erik On Feb 24, 2010, at 3:15 AM, deepak agrawal wrote: Hi, I upgrade Solr

Re: Date Facets

2010-02-24 Thread gwk
Hi Liam, This happens because the range searches for date faceting are inclusive on both ends. So values on the exact edges of the intervals are counted twice. You can see some solutions at http://old.nabble.com/Date-Faceting-and-Double-Counting-td25227846.html Regards, gwk On 2/24/2010 6:
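The double counting described here can be reproduced with plain date arithmetic. The sketch below is an illustration only, with hypothetical names (`count_inclusive`, `edges`, `values`) and made-up dates; it models buckets whose bounds are inclusive on both ends and does not call Solr.

```python
from datetime import datetime, timedelta


def count_inclusive(values, edges, trim=timedelta(0)):
    """Count values per bucket, treating both bounds as inclusive.
    `trim` is subtracted from each upper bound."""
    return [
        sum(1 for v in values if lo <= v <= hi - trim)
        for lo, hi in zip(edges, edges[1:])
    ]


# Two monthly buckets: January and February 2010.
edges = [datetime(2010, 1, 1), datetime(2010, 2, 1), datetime(2010, 3, 1)]

# The second value sits exactly on the boundary between the buckets.
values = [datetime(2010, 1, 15), datetime(2010, 2, 1), datetime(2010, 2, 20)]

double_counted = count_inclusive(values, edges)
trimmed = count_inclusive(values, edges, trim=timedelta(seconds=1))
```

With both bounds inclusive, the boundary value is counted in January and in February, so three documents yield a total of four. Trimming one second off each upper bound, in the spirit of a `+1MONTH-1SECOND` gap, counts it once. Note that in this model a value falling inside that final second would be missed if timestamps carried sub-second precision.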

RE: logging

2010-02-24 Thread Peter A. Kirk
Hi, thanks. I looked at these sites, and also the info about "java logging": http://java.sun.com/j2se/1.5.0/docs/guide/logging/overview.html But I couldn't really follow the info about configuration for logging. The user manual for SLF4J tells you how to call the log methods - but not how to ac

ERROR:SCHEMA-INDEX-MISMATCH

2010-02-24 Thread deepak agrawal
Hi, I upgraded Solr v1.3 to v1.4, but in the new version I still use the old index. I also changed the new schema to include the old fields. I have fields in my schema - but after upgrading, when I search I get results like this - * ERROR:SCHEMA-INDEX-MISMATCH,stringValue=4194304 ERROR:SCHEMA-IND