solrDocumentList

2012-06-01 Thread gopes
We are using Lucid UI and solr to index our collection of xml files. I am getting the solrDocumentList like this [SolrDocument[{id=1331226833510, Street_Addr=[113 113TH ST], name=[113 113TH ST SASKATOON SK S7N1V8], Municipality_Name=[SASKATOON], Province_Code=[SK], Postal_Code=[S7N1V8]}] But I n

Re: Using Data Import Handler to invoke a stored procedure with output (cursor) parameter

2012-06-01 Thread Michael Della Bitta
Apologies for the terseness of this reply, as I'm on my mobile. To treat the result of a function call as a table in Oracle SQL, use the table() function, like this: select * from table(my_stored_func()) HTH, Michael On Jun 1, 2012 8:01 PM, "Niran Fajemisin" wrote: > So I was able to run some

Re: Replacing payloads for per-document-per-keyword scores

2012-06-01 Thread Chris Hostetter
: > Hoss guessed that we could override Term Frequency with PreAnalyzedField[1] : > for the per-keyword scores, since keywords (tags) always have a Term : > Frequency of 1 and the TF calculation is very fast. However it turns out : > that you can't[2] specify TF in the PreAnalyzedField. Yeah ... s

Re: Using Data Import Handler to invoke a stored procedure with output (cursor) parameter

2012-06-01 Thread Niran Fajemisin
So I was able to run some additional tests today on this. I tried to use a stored function instead of a stored procedure. The hope was that the Stored Function would simply be a wrapper for the Stored Procedure and would simply return the cursor as the return value. This unfortunately did not wor

Re: solr, how can I make search query with fixed slop(distance)

2012-06-01 Thread Jack Krupansky
Take a look at the Surround Query Parser that lets you do span queries: http://wiki.apache.org/solr/SurroundQueryParser solr 2w user But, they are very simple, maybe too simple. OTOH, you may be able to combine them with "nested queries". The Lucid Imagination LucidWorks Enterprise product

Re: Difference between textfield and strfield

2012-06-01 Thread Gau
Is there any option other than sorting? I mean, sorting can affect query performance. Is there a way to embed this into Solr without taking a toll on the system? I tried boosting the scores based on strdist, but that seems to bring in more results than expected. -- View this message in context: http

Re: possible status codes from solr during a (DIH) data import process

2012-06-01 Thread Shawn Heisey
On 6/1/2012 11:51 AM, Savvas Andreas Moysidis wrote: Hello, Driven by the same requirements we also implemented the same polling mechanism (in java) and found it a bit awkward and error prone having to search through the returned response for occurrences of the terms "failure" or "Rollback" etc.

Re: A few random questions about solr queries.

2012-06-01 Thread Shawn Heisey
On 5/29/2012 4:18 AM, santamaria2 wrote: *3)* I've rummaged around a bit, looking for info on when to use q vs fq. I want to clear my doubts for a certain use case. Where should my date range queries go? In q or fq? The default settings in my site show results from the past 90 days with buttons
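The reply above is cut off in this archive, so the following is only a sketch of one common arrangement, not necessarily what was recommended in the thread: keep the user's keywords in q (which drives scoring) and put the date restriction in fq (which filters without affecting the score and is cached separately). The field name `pubdate` and the helper `build_params` are hypothetical.

```python
from urllib.parse import urlencode, parse_qs


def build_params(user_query, date_field="pubdate", days=90):
    """Build Solr request parameters with the date range as a filter query.

    `date_field` is a hypothetical field name; replace it with the date
    field from your own schema.
    """
    return {
        # Scored part of the request: what the user typed.
        "q": user_query,
        # Unscored restriction: documents from the last `days` days.
        # Rounding with NOW/DAY keeps the filter string stable during a day,
        # which lets Solr reuse the cached filter.
        "fq": f"{date_field}:[NOW/DAY-{days}DAYS TO *]",
    }


params = build_params("solr replication")
query_string = urlencode(params)
print(query_string)
```

Changing the 90-day button to another range then only changes the fq string, while the q part and its scoring stay the same.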

Lucene/Solr Search Engineers

2012-06-01 Thread SV
Hi, We are hiring multiple Lucene/Solr engineers, tech leads, architects based in Minneapolis - both full time and consulting for developing new search platform. Please reach out to me - svamb...@gmail.com Thanks, Venkat Ambati Sr. Manager, Best Buy

Re: Data Import Handler fields with different values in column and name

2012-06-01 Thread Jack Krupansky
I'm still looking, but I do see this config for a unit test: public static final String dc_singleEntity = "\n" + "\n" + "\n" + "\n" + " \n" + " \n" + " " + " \n" + "\n" + ""; Suggesting that you

Re: How to find the age of a page

2012-06-01 Thread Jack Krupansky
If you uncomment the "timestamp" field in the Solr example, Solr will automatically initialize it for each new document to be the time when the document is indexed (or most recently indexed). Any field declared with default="NOW" and not explicitly initialized will have the current time when in
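Once such a timestamp field is returned with a document, the age can be computed on the client. The sketch below assumes the value arrives as a UTC string in the form `2012-06-01T10:15:30Z`; the function name `page_age_days` is hypothetical. Note that this measures the time since the document was indexed, which is not necessarily when the page itself was created.

```python
from datetime import datetime, timezone


def page_age_days(indexed_at, now=None):
    """Return the age in days of a document, given its indexing timestamp.

    `indexed_at` is assumed to be a UTC string such as "2012-06-01T10:15:30Z".
    `now` can be passed explicitly to make the result reproducible.
    """
    indexed = datetime.strptime(indexed_at, "%Y-%m-%dT%H:%M:%SZ")
    indexed = indexed.replace(tzinfo=timezone.utc)
    if now is None:
        now = datetime.now(timezone.utc)
    return (now - indexed).total_seconds() / 86400.0


reference = datetime(2012, 6, 1, tzinfo=timezone.utc)
print(page_age_days("2012-05-30T00:00:00Z", now=reference))
```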

Re: possible status codes from solr during a (DIH) data import process

2012-06-01 Thread Savvas Andreas Moysidis
Hello, Driven by the same requirements we also implemented the same polling mechanism (in java) and found it a bit awkward and error prone having to search through the returned response for occurrences of the terms "failure" or "Rollback" etc. It would be *really* handy if the status command retur
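The polling approach described above might look roughly like the sketch below. It is an illustration, not the poster's code: the function names are hypothetical, the only failure markers used are the two terms mentioned in the message, and treating the word "busy" as "still running" is an assumption about the status response. The status fetcher is passed in as a callable so the logic can be tried without a running Solr.

```python
import time

# Terms mentioned in the thread as signs of a failed import.
FAILURE_MARKERS = ("failure", "rollback")


def classify_status(response_text):
    """Classify a DIH status response by searching its text."""
    lowered = response_text.lower()
    if any(marker in lowered for marker in FAILURE_MARKERS):
        return "failed"
    # Assumption: a running import reports the word "busy".
    if "busy" in lowered:
        return "running"
    return "done"


def poll_until_finished(fetch_status, interval_seconds=5.0, max_polls=100):
    """Call `fetch_status()` repeatedly until the import stops running."""
    for _ in range(max_polls):
        state = classify_status(fetch_status())
        if state != "running":
            return state
        time.sleep(interval_seconds)
    return "timeout"


# Usage with canned responses instead of real HTTP calls.
canned = iter(["status: busy", "status: busy", "status: idle"])
print(poll_until_finished(lambda: next(canned), interval_seconds=0))
```

The awkwardness the poster mentions is visible here: the outcome depends on free-text matching, so any change in the wording of the response silently changes the result.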

RE: Data Import Handler fields with different values in column and name

2012-06-01 Thread Dyer, James
I do not see any logging statements in the code, so I don't think there's anything on that end that can be done. It would be easy, though, if he is using multiple mappings, to remove the duplicate and see if that solves it. From a more thorough review of the code, though, I think my initial hunc

Re: eliminate adminPath tag from solr.xml file?

2012-06-01 Thread Chris Hostetter
: http://wiki.apache.org/solr/CoreAdmin#Core_Administration : : if you wanted to eliminate administration of the core from the web site, : : could you eliminate either solr.xml or remove the : : from the solr.xml file? As mentioned on that page... adminPath - Relative path to access the Cor

Re: I got ERROR, Unable to execute query

2012-06-01 Thread Jack Krupansky
Is test_5 created by a stored procedure? If so, is there a possibility that the stored procedure may have done an update and not returned data - but just sometimes? -- Jack Krupansky -Original Message- From: Jihyun Suh Sent: Friday, June 01, 2012 12:02 PM To: solr-user-h...@lucene.ap

solr, how can I make search query with fixed slop(distance)

2012-06-01 Thread Jihyun Suh
I want to search for data within a fixed slop in Solr. For example, I use the query 'title:+solr +user ~2' to find documents that have 'solr' and 'user' within a slop of 2, but it does not work in Solr. I tried some parameters: defType=edismax, pf, qs, ps. They do not change the search results, only their order.

Re: EventListeners of DIH

2012-06-01 Thread khuram120
I am looking to do the same. I also want to update a table after every document is updated/added/deleted from the Solr index. OnImportStart and End only work at the beginning and end of the imports, but I need an event which should be fired after each document in the index rather than the one whic

Re: Data Import Handler fields with different values in column and name

2012-06-01 Thread Rafael Taboada
Hi! I think it works, but using an alias with the Oracle database. Am I wrong about this? Has anyone tried DIH with Oracle? Thanks for your help. On Fri, Jun 1, 2012 at 10:34 AM, Jack Krupansky wrote: > James: Is there some particular DIH logging

Re: per-fieldtype similarity not working

2012-06-01 Thread Robert Muir
On Fri, Jun 1, 2012 at 11:39 AM, Markus Jelsma wrote: > Hi! > > > Ah, it makes sense now! This global configured similarity knows returns a > fieldType defined similarity if available and if not the standard Lucene > similarity. This would, i assume, mean that the two defined similarities > bel

Re: Data Import Handler fields with different values in column and name

2012-06-01 Thread Rafael Taboada
Hi Jack. Logging just show import is successful. Jun 1, 2012 8:50:38 AM org.apache.solr.handler.dataimport.DocBuilder finish INFO: Import completed successfully Jun 1, 2012 8:50:38 AM org.apache.solr.update.DirectUpdateHandler2 commit INFO: start commit(optimize=false,waitFlush=false,waitSearcher

Re: Data Import Handler fields with different values in column and name

2012-06-01 Thread Rafael Taboada
Hi James, I'm not duplicating fields. Just using one field asunto: Thanks for your help. On Fri, Jun 1, 2012 at 9:50 AM, Dyer, James wrote: > Are you leaving both mappings in there, like this... > > > > > > > > > If so, I'm not sure you can map "asunto" to two different fields like

RE: per-fieldtype similarity not working

2012-06-01 Thread Markus Jelsma
Hi! Ah, it makes sense now! This globally configured similarity now returns a fieldType-defined similarity if available and, if not, the standard Lucene similarity. This would, I assume, mean that the two defined similarities below without per fieldType declared similarities would always yield t

Re: Data Import Handler fields with different values in column and name

2012-06-01 Thread Jack Krupansky
James: Is there some particular DIH logging he can turn on to see what is really happening with his field name mapping? In other words, if DIH/Solr really is ignoring that field mapping, to find out exactly why. -- Jack Krupansky -Original Message- From: Dyer, James Sent: Friday, Jun

Search request on Solr Cloud

2012-06-01 Thread Trym R. Møller
Hi I would like to execute the following query on Solr trunk (cloud): http://localhost:8983/solr/select?collection=myCollection&q=*%3A*&start=0&rows=10&wt=xml but it fails with a http 404 error. 1.

Sorting with customized function of score

2012-06-01 Thread Toan V Luu
Hi, When I use "sort=score asc" it works, but when I use a customized function like "sort=sum(score,2) asc" I get the error "can not sort on multivalued field: sum(score,2)". Do you know why and how to solve it? Thanks Toan.

RE: why DIH works in normal mode,error in debug mode

2012-06-01 Thread Dyer, James
Try setting it to 0 or -1. Or check the Mysql JDBC driver documentation about valid values for "Statement.setFetchSize()" I think someone else recently asked on this same list about problems with the latest Mysql driver and fetch sizes, so this driver may be particularly finicky. James Dyer E

[job] Looking for local SOLR developers

2012-06-01 Thread Chambeda
Hi all, My team is looking for developers with SOLR experience for a company in Minneapolis, MN. If interested please reply to this posting and I can fill you in on more details. Thanks -- View this message in context: http://lucene.472066.n3.nabble.com/job-Looking-for-local-SOLR-developers-tp

Re: why DIH works in normal mode,error in debug mode

2012-06-01 Thread wangjing
In my datasource config file: i have done it,set batchSize="500" On Fri, Jun 1, 2012 at 10:38 PM, Dyer, James wrote: > I see this in your stacktrace:  java.sql.SQLException: Illegal value for > setFetchSize(). > > It must be that your JDBC driver doesn't like the default value (300) that is

Re: per-fieldtype similarity not working

2012-06-01 Thread Robert Muir
On Fri, Jun 1, 2012 at 5:13 AM, Markus Jelsma wrote: > Thanks but i am clearly missing something? We declare the similarity in the > fieldType just as in the example and looking at the example again i don't see > how it's being done differently. What am i missnig and where do i miss it? :) > Hi

Re: Strip html

2012-06-01 Thread Tigunn
the xslt do that Jack Krupansky-2 wrote > > The bottom line is that you will need to have your own code that will > detect > the "choice" tag and map it to the desired choice, and you will have to do > that before you "strip" html. > > So, given: > > >

RE: Data Import Handler fields with different values in column and name

2012-06-01 Thread Dyer, James
Are you leaving both mappings in there, like this... If so, I'm not sure you can map "asunto" to two different fields like this. For that, you may need to write a transformer that will duplicate "asunto" for you. Although, in most cases all you need to do is add a in schema.xml to

RE: why DIH works in normal mode,error in debug mode

2012-06-01 Thread Dyer, James
I see this in your stacktrace: java.sql.SQLException: Illegal value for setFetchSize(). It must be that your JDBC driver doesn't like the default value (300) that is used. In your datasource tag, try adding a "batchSize" attribute of either 0 or -1 (if using -1, DIH automatically changes it t

Re: Strip html

2012-06-01 Thread Jack Krupansky
The bottom line is that you will need to have your own code that will detect the "choice" tag and map it to the desired choice, and you will have to do that before you "strip" html. So, given: C

Re: possible status codes from solr during a (DIH) data import process

2012-06-01 Thread geeky2
thank you ALL for the great feedback - very much appreciated! -- View this message in context: http://lucene.472066.n3.nabble.com/possible-status-codes-from-solr-during-a-DIH-data-import-process-tp3987110p3987263.html Sent from the Solr - User mailing list archive at Nabble.com.

eliminate adminPath tag from solr.xml file?

2012-06-01 Thread geeky2
hello all, referring to: http://wiki.apache.org/solr/CoreAdmin#Core_Administration if you wanted to eliminate administration of the core from the web site, could you eliminate either solr.xml or remove the from the solr.xml file? thank you, -- View this message in context: http://lucene.

Re: how to show DIH query sql in log file

2012-06-01 Thread Rahul Warawdekar
Hi, Turn the Solr logging level to "FINE" for the DIH packages/classes and they will show up in the log. http://:/solr//admin/logging On Fri, Jun 1, 2012 at 9:34 AM, wangjing wrote: > how to show DIH query's sql in log file for troubleshooting? > > thanks. > -- Thanks and Regards Rahul A. W

Re: Cannot get highlighting to work

2012-06-01 Thread Asfand Qazi
Ah... on further inspection of the schema, I saw that the field type was a custom one that had been configured differently from the standard 'text' one. I simply got rid of the custom field type and set it back to text. Then as you said I reindexed the data (another blunder on my part before)

Re: Strip html

2012-06-01 Thread Tigunn
Thanks for your answers. Unfortunately, I can't try before Monday. First, my Solr settings: In schema.xml: In my PHP: in a loop over all XML documents in my eXist-db database (an XML database which stores XML files) An example of an XML doc: I follow these steps: 1 - I transform XML to HTML, it's a

Re: How can I remove the home page priority of site home page from search results

2012-06-01 Thread Jack Krupansky
What are the three documents? In any case, it looks like the "fieldNorm" for title is 2.3 times greater for the first document compared to the second document and the third document has an even smaller fieldNorm for title. Further, as explain explains, only the title field is being used. This

Sharing common config between different search handlers

2012-06-01 Thread Jochen Just
Hey there list, my application needs two search modes. The first mode needs to consider synonyms; the other one must not. Currently I have two field types with basically the same configuration, except that one field type uses a SynonymFilter the ot

Re: Cannot get highlighting to work

2012-06-01 Thread Jack Krupansky
I got confused in the last paragraph - does a purely alphabetic term get highlighted properly or not? I am trying to figure out if the problem relates only to terms that decompose into phrases (as alphanumeric terms do) or for all terms. Thanks. If the analyzer changes, the data must be reinde

Re: Stop Words in SpellCheckComponent

2012-06-01 Thread Jack Krupansky
You forgot to give us the field definition for "name". Is it the same as in the 3.6 example, or is it changed? Make sure that you delete all existing data after you change the schema/config. Do a direct query on the spellcheck field (name:the) to verify whether "the" is being indexed or not.

Re: Strip html

2012-06-01 Thread Jack Krupansky
"I tryed to strip_tags() (php function) before index again. But it doesn't work." What does it not do correctly? Show us. Show an actual document as posted to Solr. As Hoss said, if you are stripping HTML before posting the document to Solr, then you want a field type that doesn't use the "s

Re: Multi-words synonyms matching

2012-06-01 Thread O. Klein
Looking for some more background information I stumbled upon https://issues.apache.org/jira/browse/LUCENE-3668. If you read the last post it confirms my issue. So maybe this is a bug? Bernd Fehling-2 wrote > > Are you sure with LUCENE_33 (Use of BitVector)? > > > Am 31.05.2012 17:20, schrieb

Re: How to find the age of a page

2012-06-01 Thread in.abdul
Shameema Umer, you can add a new field to the schema. While updating or indexing, add the timestamp to that field. Thanks and Regards, S SYED ABDUL KATHER On Fri, Jun 1, 2012 at 3:44 PM, Shameema Umer [via Lucene] < ml-node+s472066n3987234...@n3.nabble.co

solr commit execute taking time to execute

2012-06-01 Thread Sri Krishna
The data I use to update is very small, in fact at most 2 to 3 words. Any suggestions for improvement? The requirement is that the updated index needs to be usable there and then, so lazy commit or auto commit are not useful here. Here is the log info: Jun 1, 2012 3:52:39 PM org.apache.solr.core.SolrDe

Re: Strip html

2012-06-01 Thread Tigunn
Excuse me, let me explain my need: I have an XML file like this example: I want to index the XSL transformation; I transform my XML to HTML, and I have: - si les ruches d’abeilles prouvent la monarchie, les fourmillières, les troupes d’éléphants ou de castors prouvent

Facing problem in SOLR replication

2012-06-01 Thread Krishn Murari Mishra
I get the following errors printed on my slave console many times when I use the master-slave replication option: May 31, 2012 11:50:44 PM org.apache.commons.httpclient.HttpMethodDirector executeWithRetry INFO: I/O exception (org.apache.commons.httpclient.NoHttpResponseException) caught when proces

RE: per-fieldtype similarity not working

2012-06-01 Thread Markus Jelsma
Thanks, but I am clearly missing something. We declare the similarity in the fieldType just as in the example, and looking at the example again I don't see how it's being done differently. What am I missing and where do I miss it? :) -Original message- > From:Robert Muir > Sent: Thu 31-Ma

Re: Cannot get highlighting to work

2012-06-01 Thread Asfand Qazi
On 31/05/12 21:10, Jack Krupansky wrote: Try a query that uses a term that doesn't split an alphanumeric term into two terms. Then check to see what field type you used for the symbol and marker_symbol fields and whether the analyzer for that field type has changed in 3.6. Aha - yes, not usin

Re: Hightlighting and excerpt

2012-06-01 Thread Shameema Umer
Hi Togla, to get an excerpt you should add the fragment size parameter, hl.fragsize=300, e.g.: &hl=true&hl.fl=content&hl.fragsize=300 Thanks Shameema On Thu, May 31, 2012 at 7:31 PM, Ahmet Arslan wrote: > > I need something like http://cl.ly/2o2E0g0S422d2p1X203h . See how TCMB > > was stressed? > > Hi
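The highlighting parameters from the message can be assembled into a request query string as in the sketch below. The parameter names `hl`, `hl.fl` and `hl.fragsize` and the field `content` come from the message; the helper name `highlight_params` and the sample query term are only for illustration.

```python
from urllib.parse import urlencode


def highlight_params(query, field="content", fragsize=300):
    """Return Solr parameters that request highlighted excerpts."""
    return {
        "q": query,
        "hl": "true",                  # switch highlighting on
        "hl.fl": field,                # field to take excerpts from
        "hl.fragsize": str(fragsize),  # approximate excerpt length in characters
    }


print(urlencode(highlight_params("TCMB")))
# prints: q=TCMB&hl=true&hl.fl=content&hl.fragsize=300
```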

Re: Stop Words in SpellCheckComponent

2012-06-01 Thread Matthias Müller
> But your most recent email referred to "stopword.txt". > > So, either add "the" to german_stop_long.txt, or change the "words" option > of your stopfilter to refer to "stopwords.txt". Sorry for that confusion: The stopfilter refers to the stopwords.txt Now I'm just talking about the solr exampl

Re: How can I remove the home page priority of site home page from search results

2012-06-01 Thread Shameema Umer
I added braces to keywords and debugged; I really need to boost term frequency. Please help. 1.2125369 = (MATCH) fieldWeight(title:gold in 102), product of: 1.0 = tf(termFreq(title:gold)=1) 4.8501477 = idf(docFreq=11,maxDocs=564) 0.25 = fieldNorm(field=title, doc=102) 0.5304849 = (MATCH) field
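The first score in the debug output can be reproduced by multiplying the three factors it lists. The sketch below does only that arithmetic with the numbers from the message. The square-root relation between termFreq and tf is how the classic default Lucene similarity of that era computed it, and is stated here as background rather than taken from the thread.

```python
import math

# Values copied from the explain output above.
term_freq = 1          # termFreq(title:gold)=1
idf = 4.8501477        # idf(docFreq=11,maxDocs=564)
field_norm = 0.25      # fieldNorm(field=title, doc=102)

# Classic default similarity: tf is the square root of the raw frequency.
tf = math.sqrt(term_freq)

field_weight = tf * idf * field_norm
print(round(field_weight, 7))
# prints: 1.2125369
```

With a term frequency of 1 the tf factor is 1.0, so in this example the score differences between documents come from idf and fieldNorm rather than from tf.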