Search and index Result

2011-04-14 Thread satya swaroop
Hi all, i just made a duplication of solrdispatchfilter as solrdispatchfilter1 and solrdispatchfilter2 such that all the /update or /update/extract things are passed through the solrdispatchfilter1 and all search (/select) things are passes through the solrdispatchfilter2. It is because i

Re: how to get lots fields this way?

2011-04-14 Thread Otis Gospodnetic
Nope... But look at this: http://search-lucene.com/m/HQWF72Yv4K4 Otis Sematext :: http://sematext.com/ :: Solr - Lucene - Nutch Lucene ecosystem search :: http://search-lucene.com/ - Original Message > From: Floyd Wu > To: solr-user@lucene.apache.org > Cc: Otis Gospodnetic > Sen

Re: Solr and inner joins1

2011-04-14 Thread Otis Gospodnetic
Hi Antony, Check out hit #1: http://search-lucene.com/?q=solr+join Otis Sematext :: http://sematext.com/ :: Solr - Lucene - Nutch Lucene ecosystem search :: http://search-lucene.com/ - Original Message > From: Antony Corfield [awc] > To: "solr-user@lucene.apache.org" > Sent: Th

Re: tika/pdfbox knobs & levers

2011-04-14 Thread Lance Norskog
Tika creates document-level metadata and text from the input file. That's it. If you want to use PDFbox directly, you need your own Solr plugin. On 4/13/11, Markus Jelsma wrote: > Hi, > > I'm not sure how Solr allows for adjusting these Tika settings to get the > desired output. At least a few de

Re: Vetting Our Architecture: 2 Repeaters and Slaves.

2011-04-14 Thread Lance Norskog
SAN vendors make high-priced super-fast shared file system hardware. They don't use NFS, usually they have a kernel drop-in file system. On 4/14/11, Parker Johnson wrote: > > Otis and Erick, > > Thanks for the responses and for thinking over my potential scenarios. > > The big draw for me on 2 r

Re: Dismax parameters weirdness - boosting/mm causes no match ?

2011-04-14 Thread Erick Erickson
Been there, done that, got the tshirt ... Glad it's resolved. Erick On Thu, Apr 14, 2011 at 5:33 PM, Marek Tichy wrote: > Hi Erick, > many many thanks. Headbang^80.0. > I was just getting to the same conclusion - that I'm not reaching > handler I think I am, when I got your message. As almost a

Re: Understanding the DisMax tie parameter

2011-04-14 Thread Chris Hostetter
: Perhaps the parameter could have had a better name. It's essentially : max(score of matching clauses) + tie * (score of matching clauses that : are not the max) : : So it can be used and thought of as a tiebreak only in the sense that : if two docs match a clause (with essentially the same sco

Re: Understanding the DisMax tie parameter

2011-04-14 Thread Jay Hill
Dismax works by first selecting the highest scoring sub-query of all the sub-queries that were run. If I want to search on three fields, manu, name and features, I can configure dismax like this: dismax * 0.0* manu name features *:* Now I'll use this query: http

Re: Dismax parameters weirdness - boosting/mm causes no match ?

2011-04-14 Thread Marek Tichy
Hi Erick, many many thanks. Headbang^80.0. I was just getting to the same conclusion - that I'm not reaching handler I think I am, when I got your message. As almost always, the mystery has a simple explanation. Not only that I was not reaching the handler I thought I was. In fact I was not reachin

Re: Understanding the DisMax tie parameter

2011-04-14 Thread Yonik Seeley
2011 at 5:04 PM, Burton-West, Tom wrote: > Hello, > > I'm having trouble understanding the relationship of the word "tie" and > "tiebreaker" to the explanation of this parameter on the wiki. > What two (or more things) are in a tie? and how does the number in the range > from 0 to 1 break the ti

Understanding the DisMax tie parameter

2011-04-14 Thread Burton-West, Tom
Hello, I'm having trouble understanding the relationship of the word "tie" and "tiebreaker" to the explanation of this parameter on the wiki. What two (or more things) are in a tie? and how does the number in the range from 0 to 1 break the tie? http://wiki.apache.org/solr/DisMaxQParserPlugin#t

Re: Dismax parameters weirdness - boosting/mm causes no match ?

2011-04-14 Thread Erick Erickson
OK, something doesn't make sense here, but you knew that. I'm guessing that something you're leaving out when cutting/pasting will give us a clue. Here's what I'd like to try. 1> Show us the entire results from &debugQuery=on, as well as the URL. Please do this for both the query that works and th

Re: Dismax parameters weirdness - boosting/mm causes no match ?

2011-04-14 Thread Marek Tichy
Hi Eric, thanks for the reply. I'm sending qt=partitioned, the request handler is defined as follows: edismax explicit 0.1 author_split^80.0 title^20.0 publisher_split^8.0 15 *:* true synopsis,auth

Re: Dismax parameters weirdness - boosting/mm causes no match ?

2011-04-14 Thread Erick Erickson
Let's see the raw query you send from your webapp. Based on this: +(author_split:joy^80.0)~1.0 () I'm going to guess you send your query something like blahblahblah&defType=edismax&blahblahblah rather than blahblahblah&qt=edismax&blahblahblah The former expects you to define the all your parame

Solr 3.1 core with RAMDirectory isn't reloaded

2011-04-14 Thread nskmda
Hello, We just tried core reloading on a freshly installed Solr 3.1.0 with RamDirectoryFactory. It doesn't seem to happen. With the FSDirectoryFactory everything works fine. Looks like the RamDirectoryFactory implementation caches directory and if it's available it doesn't really reopen it thus n

Re: Vetting Our Architecture: 2 Repeaters and Slaves.

2011-04-14 Thread Parker Johnson
Otis and Erick, Thanks for the responses and for thinking over my potential scenarios. The big draw for me on 2 repeaters idea is that I can: 1. Maximize my hardware. I don't need a standby master. Instead, I can use the "second" repeater to field customer requests. 2. After the primary repea

Dismax parameters weirdness - boosting/mm causes no match ?

2011-04-14 Thread Marek Tichy
Hi, I'd really appreciate if somebody could help me. Solr 3.1, I have a (multivalued) field called author_split, which splits original author names, i.e. "Mills, Joy". When I use the solr web interface to enter author_split:mills the ExtendedDismaxQParser produces query +author_split:mi

Re: RE: Indexing Question for large dataset

2011-04-14 Thread karsten-solr
Hi Joshua, what is the use-case? Do you need only the facets for one field (for each query)? Do you need all facet-values or only the first 10 in .sort=index (FACET_SORT_INDEX / numeric order) / in .sort=count (FACET_SORT_COUNT) ? How many different facet-valuss do you have per field? Do you only

RE: Spellcheck suggestions as solr docs

2011-04-14 Thread Dyer, James
Siva, If you specify "spellcheck.collate=true", then the spell checker will return you a corrected query. Your client has to re-run this query as there is no way to get Solr to automatically redirect the response to the correction. The new query will return documents that have the corrected t

Re: Special Character & Hightlighting issues after 3.1.0 update

2011-04-14 Thread Yonik Seeley
On Thu, Apr 14, 2011 at 11:27 AM, Koji Sekiguchi wrote: > I'm not sure, but it is due to HtmlEncoder? > >       >                     default="true" >               class="solr.highlight.HtmlEncoder" /> > > it set as default in example config. Thanks Koji, So it looks like the problems here are

Re: Spellcheck suggestions as solr docs

2011-04-14 Thread Erick Erickson
I'm not sure what you mean by "i need the documents" and how that relates to spell check. Are you saying you need the name of the document the suggestion came from? Please clarify a bit Best Erick On Thu, Apr 14, 2011 at 11:25 AM, sivaprasad wrote: > Hi, > > I have configured spell check fo

Re: Special Character & Hightlighting issues after 3.1.0 update

2011-04-14 Thread Koji Sekiguchi
(11/04/14 23:49), Yonik Seeley wrote: Hmmm, looks like the highlighting code changed. Using the example doc, with 1.4 I get http://localhost:8983/solr/select?q=features:circumflexes&hl=true&hl.fl=features&wt=json&indent=true "highlighting":{ "UTF8TEST":{ "features":["eaiou withcir

Spellcheck suggestions as solr docs

2011-04-14 Thread sivaprasad
Hi, I have configured spell check for the terms and it is working fine.But the spell check suggestions are just simple strings.For my requirement, i need the documents.How can i achieve this? Are there any other ways to do spell check?Please suggest .. Regards, Siva -- View this message in conte

Re: Special Character & Hightlighting issues after 3.1.0 update

2011-04-14 Thread Yonik Seeley
On Thu, Apr 14, 2011 at 10:49 AM, Yonik Seeley wrote: > Hmmm, looks like the highlighting code changed.  Using the example > doc, with 1.4 I get > > http://localhost:8983/solr/select?q=features:circumflexes&hl=true&hl.fl=features&wt=json&indent=true > >  "highlighting":{ >  "UTF8TEST":{ >        "

Re: Grouping

2011-04-14 Thread ramires
here full exception HTTP ERROR 500 Problem accessing /solr/clustering. Reason: null java.lang.NullPointerException at org.apache.solr.handler.clustering.ClusteringComponent.process(ClusteringComponent.java:89) at org.apache.solr.handler.component.SearchHandler.handleRequest

RE: Fast DIH with 1:M multValue entities

2011-04-14 Thread Tim Gilbert
How did I miss that? Thanks, I will try that as it seems to be "in memory" lookup solution I needed. Thanks Erick, Tim -Original Message- From: Erick Erickson [mailto:erickerick...@gmail.com] Sent: Thursday, April 14, 2011 10:58 AM To: solr-user@lucene.apache.org Subject: Re: Fast DIH

Re: Fast DIH with 1:M multValue entities

2011-04-14 Thread Erick Erickson
I'm not sure this applies, but have you looked at http://wiki.apache.org/solr/DataImportHandler#CachedSqlEntityProcessor Best Erick On Thu, Apr 14, 2011 at 9:12 AM, Tim Gilbert wrote: > We are working on importing a large nu

Re: Special Character & Hightlighting issues after 3.1.0 update

2011-04-14 Thread Yonik Seeley
Hmmm, looks like the highlighting code changed. Using the example doc, with 1.4 I get http://localhost:8983/solr/select?q=features:circumflexes&hl=true&hl.fl=features&wt=json&indent=true "highlighting":{ "UTF8TEST":{ "features":["eaiou with circumflexes: êâîôû"]}}} With 3.1, this now

Re: Grouping

2011-04-14 Thread Grant Ingersoll
What's the full exception? On Apr 14, 2011, at 5:40 AM, ramires wrote: > hi > > I use solr 4.0 trunk. I grouped my result perfectly but when i try to > cluster these result it gives a error null. All my fields stored and > indexed. > > solr/clustering?q=123123&group=true&group.field=si

Special Character & Hightlighting issues after 3.1.0 update

2011-04-14 Thread getagrip
Having updated from 1.4.1 to 3.1.0 some documents are not parsed correctly anymore: 1. Both the result's id field and the highlighting's header do not display special-characters e.g. German Umlauts anymore. 2. The highlighting section is messed up as words appear in random order instead of r

Fast DIH with 1:M multValue entities

2011-04-14 Thread Tim Gilbert
We are working on importing a large number of records into Solr using DIH. We have one schema with ~2000 fields declared which map off to several database schemas so that typically each document will have ~500 fields in use. We have about 2 million "rows" which we are importing, and we are seeing

Grouping

2011-04-14 Thread ramires
hi I use solr 4.0 trunk. I grouped my result perfectly but when i try to cluster these result it gives a error null. All my fields stored and indexed. solr/clustering?q=123123&group=true&group.field=site&start=0 Problem accessing /solr/clustering. Reason: null java.lang.NullPointerExcep

most stable way to get facet pivoting

2011-04-14 Thread Nikolas Tautenhahn
Hi, I want to evaluate (and probably use in production) facet pivoting - what is the best approach to get a "as-stable-as-can-be" version of solr which is able to do facet pivoting? I was hoping to see this in Solr 3.1, but apparently it is only in the dev versions/nightlies... Is it possible to

Solr and inner joins1

2011-04-14 Thread Antony Corfield [awc]
Hi, I have a solr core to which I can add simple documents (essentially these consist of a triple: resource URL, referrer URL and timestamp) that provide information on activity data for open-access repositories. Replicating this in a SQL database I can retrieve other resource URLs (for a giv

Do EntityProcessor honor onError=skip when nextRow() fails?

2011-04-14 Thread Reeza Edah Tally
Hi, The document that I am trying to index with DIH contains an entity with fields queried from a DB and an entity with the content of a file extracted with TikaEntityProcessor. I was testing the onError="skip" option with TikaEntityProcessor and found out it does not work. It basically behaves

Why are they different?

2011-04-14 Thread m _ 米蟲ы~
String fileUrl = "C:/TEMP/table.txt"; File file = new File(fileUrl); ContentStreamUpdateRequest up = new ContentStreamUpdateRequest( "/update/extract"); up.addFile(file);

Unsuscribe

2011-04-14 Thread Quirino Brizi
Unsuscribe me

Unsubsribe

2011-04-14 Thread pankaj bhatt
On Thu, Apr 14, 2011 at 1:32 PM, Stephan Raemy wrote: > > > Sent from my iPhone >

Unsubsribe

2011-04-14 Thread Stephan Raemy
Sent from my iPhone

phpnative response writer in SOLR 3.1 ?

2011-04-14 Thread Ralf Kraus
Hello, I just updatet to SOLR 3.1 and wondering if the phpnative response writer plugin is part of it? ( https://issues.apache.org/jira/browse/SOLR-1967 ) When I try to compile the sources files I get some errors : PHPNativeResponseWriter.java:57: org.apache.solr.request.PHPNativeResponseWri

Re: jetty update

2011-04-14 Thread ramires
hi i use solr(trunk 4.0) with nutch (1.2.rc4). In nutch I put ./solr/dist/apache-solr-core-4.0-SNAPSHOT.jar ./solr/dist/apache-solr-solrj-4.0-SNAPSHOT.jar becouse of javabin change. In solr I use jetty-6.1.26-patched-JETTY-1340.jar and jetty-util-6.1.26-patched-JETTY-1340.jar but i stil get this

Re: ExtractingRequestHandler and Solr 3.1

2011-04-14 Thread Liam O'Boyle
Hi Grant, After comparing the differences between my solrconfig.xml and that used by the example, the key difference is that I didn't have true in the defaults for the ERH. Commenting out this line in the example configuration causes the example to display the same behaviour as I'm seeing. I've