Re: ExtractingRequestHandler and Solr 3.1

2011-04-14 Thread Liam O'Boyle
Hi Grant, After comparing the differences between my solrconfig.xml and that used by the example, the key difference is that I didn't have str name=captureAttrtrue/str in the defaults for the ERH. Commenting out this line in the example configuration causes the example to display the same

Re: jetty update

2011-04-14 Thread ramires
hi i use solr(trunk 4.0) with nutch (1.2.rc4). In nutch I put ./solr/dist/apache-solr-core-4.0-SNAPSHOT.jar ./solr/dist/apache-solr-solrj-4.0-SNAPSHOT.jar becouse of javabin change. In solr I use jetty-6.1.26-patched-JETTY-1340.jar and jetty-util-6.1.26-patched-JETTY-1340.jar but i stil get this

phpnative response writer in SOLR 3.1 ?

2011-04-14 Thread Ralf Kraus
Hello, I just updatet to SOLR 3.1 and wondering if the phpnative response writer plugin is part of it? ( https://issues.apache.org/jira/browse/SOLR-1967 ) When I try to compile the sources files I get some errors : PHPNativeResponseWriter.java:57:

Unsubsribe

2011-04-14 Thread Stephan Raemy
Sent from my iPhone

Unsubsribe

2011-04-14 Thread pankaj bhatt
On Thu, Apr 14, 2011 at 1:32 PM, Stephan Raemy stephan.ra...@gmail.comwrote: Sent from my iPhone

Unsuscribe

2011-04-14 Thread Quirino Brizi
Unsuscribe me

Why are they different?

2011-04-14 Thread m _ 米蟲ы~
String fileUrl = C:/TEMP/table.txt; File file = new File(fileUrl); ContentStreamUpdateRequest up = new ContentStreamUpdateRequest( /update/extract); up.addFile(file);

Do EntityProcessor honor onError=skip when nextRow() fails?

2011-04-14 Thread Reeza Edah Tally
Hi, The document that I am trying to index with DIH contains an entity with fields queried from a DB and an entity with the content of a file extracted with TikaEntityProcessor. I was testing the onError=skip option with TikaEntityProcessor and found out it does not work. It basically behaves

Solr and inner joins1

2011-04-14 Thread Antony Corfield [awc]
Hi, I have a solr core to which I can add simple documents (essentially these consist of a triple: resource URL, referrer URL and timestamp) that provide information on activity data for open-access repositories. Replicating this in a SQL database I can retrieve other resource URLs (for a

most stable way to get facet pivoting

2011-04-14 Thread Nikolas Tautenhahn
Hi, I want to evaluate (and probably use in production) facet pivoting - what is the best approach to get a as-stable-as-can-be version of solr which is able to do facet pivoting? I was hoping to see this in Solr 3.1, but apparently it is only in the dev versions/nightlies... Is it possible to

Grouping

2011-04-14 Thread ramires
hi I use solr 4.0 trunk. I grouped my result perfectly but when i try to cluster these result it gives a error null. All my fields stored and indexed. solr/clustering?q=123123group=truegroup.field=sitestart=0 Problem accessing /solr/clustering. Reason: null

Fast DIH with 1:M multValue entities

2011-04-14 Thread Tim Gilbert
We are working on importing a large number of records into Solr using DIH. We have one schema with ~2000 fields declared which map off to several database schemas so that typically each document will have ~500 fields in use. We have about 2 million rows which we are importing, and we are seeing

Special Character Hightlighting issues after 3.1.0 update

2011-04-14 Thread getagrip
Having updated from 1.4.1 to 3.1.0 some documents are not parsed correctly anymore: 1. Both the result's id field and the highlighting's header do not display special-characters e.g. German Umlauts anymore. 2. The highlighting section is messed up as words appear in random order instead of

Re: Grouping

2011-04-14 Thread Grant Ingersoll
What's the full exception? On Apr 14, 2011, at 5:40 AM, ramires wrote: hi I use solr 4.0 trunk. I grouped my result perfectly but when i try to cluster these result it gives a error null. All my fields stored and indexed.

Re: Special Character Hightlighting issues after 3.1.0 update

2011-04-14 Thread Yonik Seeley
Hmmm, looks like the highlighting code changed. Using the example doc, with 1.4 I get http://localhost:8983/solr/select?q=features:circumflexeshl=truehl.fl=featureswt=jsonindent=true highlighting:{ UTF8TEST:{ features:[eaiou with emcircumflexes/em: êâîôû]}}} With 3.1, this now looks

Re: Fast DIH with 1:M multValue entities

2011-04-14 Thread Erick Erickson
I'm not sure this applies, but have you looked at http://wiki.apache.org/solr/DataImportHandler#CachedSqlEntityProcessor http://wiki.apache.org/solr/DataImportHandler#CachedSqlEntityProcessorBest Erick On Thu, Apr 14, 2011 at 9:12 AM, Tim Gilbert tim.gilb...@morningstar.comwrote: We are

RE: Fast DIH with 1:M multValue entities

2011-04-14 Thread Tim Gilbert
How did I miss that? Thanks, I will try that as it seems to be in memory lookup solution I needed. Thanks Erick, Tim -Original Message- From: Erick Erickson [mailto:erickerick...@gmail.com] Sent: Thursday, April 14, 2011 10:58 AM To: solr-user@lucene.apache.org Subject: Re: Fast DIH

Re: Grouping

2011-04-14 Thread ramires
here full exception HTTP ERROR 500 Problem accessing /solr/clustering. Reason: null java.lang.NullPointerException at org.apache.solr.handler.clustering.ClusteringComponent.process(ClusteringComponent.java:89) at

Re: Special Character Hightlighting issues after 3.1.0 update

2011-04-14 Thread Yonik Seeley
On Thu, Apr 14, 2011 at 10:49 AM, Yonik Seeley yo...@lucidimagination.com wrote: Hmmm, looks like the highlighting code changed.  Using the example doc, with 1.4 I get http://localhost:8983/solr/select?q=features:circumflexeshl=truehl.fl=featureswt=jsonindent=true  highlighting:{  

Spellcheck suggestions as solr docs

2011-04-14 Thread sivaprasad
Hi, I have configured spell check for the terms and it is working fine.But the spell check suggestions are just simple strings.For my requirement, i need the documents.How can i achieve this? Are there any other ways to do spell check?Please suggest .. Regards, Siva -- View this message in

Re: Special Character Hightlighting issues after 3.1.0 update

2011-04-14 Thread Koji Sekiguchi
(11/04/14 23:49), Yonik Seeley wrote: Hmmm, looks like the highlighting code changed. Using the example doc, with 1.4 I get http://localhost:8983/solr/select?q=features:circumflexeshl=truehl.fl=featureswt=jsonindent=true highlighting:{ UTF8TEST:{ features:[eaiou

Re: Spellcheck suggestions as solr docs

2011-04-14 Thread Erick Erickson
I'm not sure what you mean by i need the documents and how that relates to spell check. Are you saying you need the name of the document the suggestion came from? Please clarify a bit Best Erick On Thu, Apr 14, 2011 at 11:25 AM, sivaprasad sivaprasa...@echidnainc.comwrote: Hi, I have

Re: Special Character Hightlighting issues after 3.1.0 update

2011-04-14 Thread Yonik Seeley
On Thu, Apr 14, 2011 at 11:27 AM, Koji Sekiguchi k...@r.email.ne.jp wrote: I'm not sure, but it is due to HtmlEncoder?      !-- Configure the standard encoder --      encoder name=html               default=true               class=solr.highlight.HtmlEncoder / it set as default in example

RE: Spellcheck suggestions as solr docs

2011-04-14 Thread Dyer, James
Siva, If you specify spellcheck.collate=true, then the spell checker will return you a corrected query. Your client has to re-run this query as there is no way to get Solr to automatically redirect the response to the correction. The new query will return documents that have the corrected

Re: RE: Indexing Question for large dataset

2011-04-14 Thread karsten-solr
Hi Joshua, what is the use-case? Do you need only the facets for one field (for each query)? Do you need all facet-values or only the first 10 in .sort=index (FACET_SORT_INDEX / numeric order) / in .sort=count (FACET_SORT_COUNT) ? How many different facet-valuss do you have per field? Do you

Dismax parameters weirdness - boosting/mm causes no match ?

2011-04-14 Thread Marek Tichy
Hi, I'd really appreciate if somebody could help me. Solr 3.1, I have a (multivalued) field called author_split, which splits original author names, i.e. Mills, Joy. When I use the solr web interface to enter author_split:mills the ExtendedDismaxQParser produces query

Re: Vetting Our Architecture: 2 Repeaters and Slaves.

2011-04-14 Thread Parker Johnson
Otis and Erick, Thanks for the responses and for thinking over my potential scenarios. The big draw for me on 2 repeaters idea is that I can: 1. Maximize my hardware. I don't need a standby master. Instead, I can use the second repeater to field customer requests. 2. After the primary

Solr 3.1 core with RAMDirectory isn't reloaded

2011-04-14 Thread nskmda
Hello, We just tried core reloading on a freshly installed Solr 3.1.0 with RamDirectoryFactory. It doesn't seem to happen. With the FSDirectoryFactory everything works fine. Looks like the RamDirectoryFactory implementation caches directory and if it's available it doesn't really reopen it thus

Re: Dismax parameters weirdness - boosting/mm causes no match ?

2011-04-14 Thread Erick Erickson
Let's see the raw query you send from your webapp. Based on this: +(author_split:joy^80.0)~1.0 () I'm going to guess you send your query something like blahblahblahdefType=edismaxblahblahblah rather than blahblahblahqt=edismaxblahblahblah The former expects you to define the all your

Re: Dismax parameters weirdness - boosting/mm causes no match ?

2011-04-14 Thread Marek Tichy
Hi Eric, thanks for the reply. I'm sending qt=partitioned, the request handler is defined as follows: requestHandler name=partitioned class=solr.SearchHandler default=true lst name=defaults str name=defTypeedismax/str str name=echoParamsexplicit/str float name=tie0.1/float

Re: Dismax parameters weirdness - boosting/mm causes no match ?

2011-04-14 Thread Erick Erickson
OK, something doesn't make sense here, but you knew that. I'm guessing that something you're leaving out when cutting/pasting will give us a clue. Here's what I'd like to try. 1 Show us the entire results from debugQuery=on, as well as the URL. Please do this for both the query that works and the

Understanding the DisMax tie parameter

2011-04-14 Thread Burton-West, Tom
Hello, I'm having trouble understanding the relationship of the word tie and tiebreaker to the explanation of this parameter on the wiki. What two (or more things) are in a tie? and how does the number in the range from 0 to 1 break the tie?

Re: Understanding the DisMax tie parameter

2011-04-14 Thread Yonik Seeley
2011 at 5:04 PM, Burton-West, Tom tburt...@umich.edu wrote: Hello, I'm having trouble understanding the relationship of the word tie and tiebreaker to the explanation of this parameter on the wiki. What two (or more things) are in a tie? and how does the number in the range from 0 to 1

Re: Dismax parameters weirdness - boosting/mm causes no match ?

2011-04-14 Thread Marek Tichy
Hi Erick, many many thanks. Headbang^80.0. I was just getting to the same conclusion - that I'm not reaching handler I think I am, when I got your message. As almost always, the mystery has a simple explanation. Not only that I was not reaching the handler I thought I was. In fact I was not

Re: Understanding the DisMax tie parameter

2011-04-14 Thread Jay Hill
Dismax works by first selecting the highest scoring sub-query of all the sub-queries that were run. If I want to search on three fields, manu, name and features, I can configure dismax like this: requestHandler name=search_dismax class=solr.SearchHandler lst name=defaults str

Re: Understanding the DisMax tie parameter

2011-04-14 Thread Chris Hostetter
: Perhaps the parameter could have had a better name. It's essentially : max(score of matching clauses) + tie * (score of matching clauses that : are not the max) : : So it can be used and thought of as a tiebreak only in the sense that : if two docs match a clause (with essentially the same

Re: Dismax parameters weirdness - boosting/mm causes no match ?

2011-04-14 Thread Erick Erickson
Been there, done that, got the tshirt G... Glad it's resolved. Erick On Thu, Apr 14, 2011 at 5:33 PM, Marek Tichy ma...@gn.apc.org wrote: Hi Erick, many many thanks. Headbang^80.0. I was just getting to the same conclusion - that I'm not reaching handler I think I am, when I got your

Re: Vetting Our Architecture: 2 Repeaters and Slaves.

2011-04-14 Thread Lance Norskog
SAN vendors make high-priced super-fast shared file system hardware. They don't use NFS, usually they have a kernel drop-in file system. On 4/14/11, Parker Johnson parker_john...@gap.com wrote: Otis and Erick, Thanks for the responses and for thinking over my potential scenarios. The big

Re: tika/pdfbox knobs levers

2011-04-14 Thread Lance Norskog
Tika creates document-level metadata and text from the input file. That's it. If you want to use PDFbox directly, you need your own Solr plugin. On 4/13/11, Markus Jelsma markus.jel...@openindex.io wrote: Hi, I'm not sure how Solr allows for adjusting these Tika settings to get the desired

Re: Solr and inner joins1

2011-04-14 Thread Otis Gospodnetic
Hi Antony, Check out hit #1: http://search-lucene.com/?q=solr+join Otis Sematext :: http://sematext.com/ :: Solr - Lucene - Nutch Lucene ecosystem search :: http://search-lucene.com/ - Original Message From: Antony Corfield [awc] a...@aber.ac.uk To: solr-user@lucene.apache.org

Re: how to get lots fields this way?

2011-04-14 Thread Otis Gospodnetic
Nope... But look at this: http://search-lucene.com/m/HQWF72Yv4K4 Otis Sematext :: http://sematext.com/ :: Solr - Lucene - Nutch Lucene ecosystem search :: http://search-lucene.com/ - Original Message From: Floyd Wu floyd...@gmail.com To: solr-user@lucene.apache.org Cc: Otis

Search and index Result

2011-04-14 Thread satya swaroop
Hi all, i just made a duplication of solrdispatchfilter as solrdispatchfilter1 and solrdispatchfilter2 such that all the /update or /update/extract things are passed through the solrdispatchfilter1 and all search (/select) things are passes through the solrdispatchfilter2. It is because i