loading solr from Pig?

2013-08-21 Thread geeky2
Hello All, Is anyone loading Solr from a Pig script / process? I was talking to another group in our company and they have standardized on MongoDB instead of Solr - apparently there is very good support between MongoDB and Pig - allowing users to "stream" data directly from a Pig process in to Mo

Re: translating a character code to an ordinal?

2013-06-10 Thread geeky2
i will try it out and let you know - -- View this message in context: http://lucene.472066.n3.nabble.com/translating-a-character-code-to-an-ordinal-tp4068966p4069339.html Sent from the Solr - User mailing list archive at Nabble.com.

Re: translating a character code to an ordinal?

2013-06-10 Thread geeky2
i will try it. i guess i made a "poor" assumption that you would not get predictable results when copying a code like "mycode" to an int field where where the desired end result in the int field is say, "1". i was worried that some sort of ascii conversion or "wrap around" would happen in the int

custom field tutorial

2013-06-07 Thread geeky2
can someone point me to a "custom field" tutorial. i checked the wiki and this list - but still a little hazy on how i would do this. essentially - when the user issues a query, i want my class to interrogate a string field (containing several codes - example boo, baz, bar) and return a single

Re: translating a character code to an ordinal?

2013-06-07 Thread geeky2
thx, please send me a link to the book so i get/purchase it. thx mark -- View this message in context: http://lucene.472066.n3.nabble.com/translating-a-character-code-to-an-ordinal-tp4068966p4068997.html Sent from the Solr - User mailing list archive at Nabble.com.

Re: translating a character code to an ordinal?

2013-06-07 Thread geeky2
hello jack, thank you for the code ;) what "book" are you referring to? AFAICT - all of the 4.0 books are "future order". we won't be moving to 4.0 (soon enough). so i take it - copyfield will not work, eg - i cannot take a code like ABC and copy it to an int field and then use the regex to tu

translating a character code to an ordinal?

2013-06-07 Thread geeky2
hello all, environment: solr 3.5, centos problem statement: i have several character codes that i want to translate to ordinal (integer) values (for sorting), while retaining the original code field in the document. i was thinking that i could use a copyField from my "code" field to my "ord" fi

Re: seeing lots of "autowarming" messages in log during DIH indexing

2013-05-31 Thread geeky2
the DIH is launched via a script - called by a "cron like" scheduler. clean, commit and optimize are all true. thx mark #!/bin/bash SERVER=$1 PORT=$2 CLEAN=$3 COMMIT=$4 OPTIMIZE=$5 COREPATH=$6 echo SERVER: $SERVER echo PORT: $PORT echo CLEAN: $CLEAN echo COMMIT: $COMMIT echo OPTIMIZE: $OPTIMI

Re: seeing lots of "autowarming" messages in log during DIH indexing

2013-05-20 Thread geeky2
you mean i would add this switch to my script that kicks of the dataimport? exmaple: OUTPUT=$(curl -v http://${SERVER}.intra.searshc.com:${PORT}/solrpartscat/${CORE}/dataimport -F command=full-import -F clean=${CLEAN} -F commit=${COMMIT} -F optimize=${OPTIMIZE} -F openSearcher=false) what need

seeing lots of "autowarming" messages in log during DIH indexing

2013-05-20 Thread geeky2
hello, we are tracking down some performance issues with our DIH process. not sure if this is related - but i am seeing tons of the messages below in the logs during re-indexing of the core. what do these messages mean? 2013-05-18 19:37:30,623 INFO [org.apache.solr.update.UpdateHandler] (pool

Re: having trouble storing large text blob fields - returns binary address in search results

2013-05-18 Thread geeky2
hello your comment made me think - so i decided to double check myself. i opened up the schema in squirrel and made sure that the two columns in question were actually of type TEXT in the schema - check i went in to the db-config.xml and removed all references to ClobTransformer, removed the cas

Re: having trouble storing large text blob fields - returns binary address in search results

2013-05-17 Thread geeky2
Hello Gora, thank you for the reply - i did finally get this to work. i had to cast the column in the DIH to a clob - like this. cast(att.attr_val AS clob) as attr_val, cast(rsr.rsr_val AS clob) as rsr_val, once this was done, the ClobTransformer worked. to my knowledge - this parti

having trouble storing large text blob fields - returns binary address in search results

2013-05-16 Thread geeky2
hello environment: solr 3.5 can someone help me with the correct configuration for some large text blob fields? we have two fields in informix tables that are of type text. when we do a search the results for these fields come back looking like this: [B@17c232ee i have tried setting them u

Re: why does "*" affect case sensitivity of query results

2013-04-30 Thread geeky2
hello erik, thank you for the info - yes - i did notice ;) one more reason for us to upgrade from 3.5. thx mark -- View this message in context: http://lucene.472066.n3.nabble.com/why-does-affect-case-sensitivity-of-query-results-tp4059801p406.html Sent from the Solr - User mailing list

Re: why does "*" affect case sensitivity of query results

2013-04-29 Thread geeky2
here is the jira link: https://issues.apache.org/jira/browse/SOLR-219 -- View this message in context: http://lucene.472066.n3.nabble.com/why-does-affect-case-sensitivity-of-query-results-tp4059801p4059814.html Sent from the Solr - User mailing list archive at Nabble.com.

Re: why does "*" affect case sensitivity of query results

2013-04-29 Thread geeky2
was looking in Smiley's book on page 129 and 130. from the book, >> No text analysis is performed on the search word containing the wildcard, not even lowercasing. So if you want to find a word starting with Sma, then sma* is required instead of Sma*, assuming the index side of the field's type i

why does "*" affect case sensitivity of query results

2013-04-29 Thread geeky2
hello, environment: solr 3.5 problem statement: when query has "*" appended, it turns case sensitive. assumption: query should NOT be case sensitive actual value in database at time of index: 4387828BULK here is a snapshot of what works and does not work. what works: itemModelNoExactMatch

having trouble searching on EdgeNGramFilterFactory field with a length < minGramSize

2013-03-19 Thread geeky2
hello, i am trying to debug the following query in the analyzer: *+itemModelNoExactMatchStr:JVM1640CJ01 +plsBrandId:0432 +plsBrandDesc:ge* the query is going against a field (plsBrandDesc) that is being indexed with solr.EdgeNGramFilterFactory and a minGramSize of 3. i have included the compl

need general advice on how others version and mange core deployments over time

2013-03-14 Thread geeky2
hello everyone, i know this is a general topic - but would really appreciate info from others that are doing this now. - how are others managing this so that users are impacted the least - how are others handling the scenario where users don't want to migrate forward. thx mark -- View

Re: having trouble escaping a character string

2013-03-12 Thread geeky2
oh - now i see what i was doing wrong. i kept trying to use the hex code of %22 as a replacement for the double quote - but that was not working - thank you jack, mark -- View this message in context: http://lucene.472066.n3.nabble.com/having-trouble-escaping-a-character-string-tp404679

Re: having trouble escaping a character string

2013-03-12 Thread geeky2
attempting to upload the screenshot bmp file. the embedded image is difficult to make out. temp1.bmp -- View this message in context: http://lucene.472066.n3.nabble.com/having-trouble-escaping-a-character-string-tp4046796p4046798.

having trouble escaping a character string

2013-03-12 Thread geeky2
hello all, i am searching on this field type: for this string: 30326R-26" TILLER when i use the analyzer and issue the query - it indicates success (please see attached screen shot) but when i issue the searc

Re: question about syntax for multiple terms in filter query

2013-03-12 Thread geeky2
jack, did you mean "function query" or filter query i was going to do this in my request handler for parts +itemType:1 +sellingPrice:[1 TO *] -- View this message in context: http://lucene.472066.n3.nabble.com/question-about-syntax-for-multiple-terms-in-filter-query-tp4046442p4046715

Re: question about syntax for multiple terms in filter query

2013-03-12 Thread geeky2
hello jack, yes - i will always be using the two constraints at the same time. thank you again for the info. thx mark -- View this message in context: http://lucene.472066.n3.nabble.com/question-about-syntax-for-multiple-terms-in-filter-query-tp4046442p4046650.html Sent from the Solr - User

Re: question about syntax for multiple terms in filter query

2013-03-11 Thread geeky2
otis and jack - thank you VERY much for the feedback - jack - >> use a single fq containing two mandatory clauses if those clauses appear together often << this is the use case i have to account for - eg, right now i have this in my request handler ... itemType:1 ...

question about syntax for multiple terms in filter query

2013-03-11 Thread geeky2
hello everyone, i have a question on the filter query syntax for multiple terms, after reading this: http://wiki.apache.org/solr/CommonQueryParameters#fq i see from the above that two (2) syntax constructs are supported fq=term1:foo & fq=term2:bar and fq=+term1:foo +term2:bar is there a reas

searching for q terms that start with a dash/hyphen being interpreted as prohibited clauses

2013-01-17 Thread geeky2
hello environment: solr 3.5 problem statement: i have a requirement to search for part numbers that start with a dash / hyphen. example q= term: *-0004A-0436* example query: http://some_url:some_port/some_core/select?facet=false&sort=score+desc%2C+rankNo+asc%2C+partCnt+desc&start=0&q=*-0004A-

performing a boolean query (OR) with a large number of terms

2013-01-09 Thread geeky2
hello, environment: solr 3.5 i have a requirement to perform a boolean query (like the example below) with a large number of terms. the number of terms could be 15 or possibly larger. after looking over several theads and the smiley book - i think i just have include the parens and string all o

Re: is it possible to save the search query?

2012-11-20 Thread geeky2
Hello, i think you are asking two questions here - i'll see if i can give you some simple examples for both 1) how can i pull data from a solr search result set and compare it to another for analysis? one way - might be to drive the results in to files and then use xslt to extract relevant infor

RE: How do I best detect when my DIH load is done?

2012-11-19 Thread geeky2
James, was it you (cannot remember) that replied to one of my queries on this subject and mentioned that there was consideration being given to "cleaning" up the response codes to remove ambiguity? -- View this message in context: http://lucene.472066.n3.nabble.com/How-do-I-best-detect-when-

Re: How do I best detect when my DIH load is done?

2012-11-19 Thread geeky2
Hello Andy, i had a similar question on this some time ago. http://lucene.472066.n3.nabble.com/possible-status-codes-from-solr-during-a-DIH-data-import-process-td3987110.html#a3987123 http://lucene.472066.n3.nabble.com/need-input-lessons-learned-or-best-practices-for-data-imports-td3801327.html#

Re: large text blobs in string field

2012-11-05 Thread geeky2
Erick, thanks for the insight. FWIW and to add to the context of this discussion, if we do decide to add the previously mentioned content as a multivalued field, we would likely use a DIH hooked to our database schema (this is currently how we add ALL content to our core) and within the DIH, us

Re: large text blobs in string field

2012-11-05 Thread geeky2
Gora, currently our core does use mult-valued fields. however the exsiting multi-valued fields in the schema are will only result in 3 - 10 values. we are thinking of using the text blob approach primarily because of the large number of possible values in this field. if we were to use a multi

large text blobs in string field

2012-11-02 Thread geeky2
hello environment - solr 3.5 i would like to know if anyone is using the technique of placing large text blobs in to a "non-indexed" string field and if so - are there any good/bad aspects to consider? we are thinking of doing this to represent a 1:M relationship with the "Many" being represent

Re: need help with exact match search

2012-10-22 Thread geeky2
hello jack, that was it! thx mark -- View this message in context: http://lucene.472066.n3.nabble.com/need-help-with-exact-match-search-tp4014832p4015103.html Sent from the Solr - User mailing list archive at Nabble.com.

Re: need help with exact match search

2012-10-19 Thread geeky2
hello jack, thank you very much for the reply - i will re-test and let you know. really appreciate it ;) thx mark -- View this message in context: http://lucene.472066.n3.nabble.com/need-help-with-exact-match-search-tp4014832p4014848.html Sent from the Solr - User mailing list archive at Na

need help with exact match search

2012-10-19 Thread geeky2
environment: solr 3.5 Hello, i have a query for an exact match that is bringing back one (1) additional record that is NOT an exact match. when i do an exact match search for 404 - i should get back three (3) document, *but i get back the additional record, with an itemModelNoExactMatchStr of

Re: need help understanding an issue with scoring

2012-08-28 Thread geeky2
Chris, Jack, thank you for the detailed replies and help ;) -- View this message in context: http://lucene.472066.n3.nabble.com/need-help-understanding-an-issue-with-scoring-tp4002897p4003782.html Sent from the Solr - User mailing list archive at Nabble.com.

Re: need help understanding an issue with scoring

2012-08-23 Thread geeky2
hello, this is the query i am using: cat goquery.sh #!/bin/bash SERVER=$1 PORT=$2 QUERY="http://$SERVER.blah.blah.com:${PORT}/solrpartscat/core1/select?qt=itemNoProductTypeBrandSearch&q=9030&rows=2000&debugQuery=on&fl=*,score"; curl -v $QUERY -- View this message in context: http://luc

need help understanding an issue with scoring

2012-08-23 Thread geeky2
hello, i am trying to understand the "debug" output from a query, and specifically - how scores for two (2) documents are derived and why they are so far apart. the user is entering 9030 for the search the search is rightfully returning the top document, however - the question is why is the docu

using tie parameter of edismax to raise a score (disjunction max query)?

2012-08-23 Thread geeky2
Hello all, this "more specific" question is related to my earlier post at: http://lucene.472066.n3.nabble.com/need-help-understanding-an-issue-with-scoring-td4002897.html i am reading here about the tie parameter: http://wiki.apache.org/solr/ExtendedDisMax?highlight=%28edismax%29#tie_.28Tie_brea

Re: Holy cow do I love 4.0's admin screen

2012-08-23 Thread geeky2
Andy, we are not running solr 4.0 here in production. can you elaborate on your comment related to your polling script written in ruby and how the new data import status screen makes your polling app obsolete? i wrote my own polling app (in shell) to work around the very same issues: http://luc

Re: need help understanding an issue with scoring

2012-08-23 Thread geeky2
looks like the original complete list of the results did not get attached to this thread here is a snippet of the list. what i am trying to demonstrate, is the difference in scoring and ultimately, sorting - and the breadth of documents (a few hundred) between the two documents of interest (9030

Re: need help understanding an issue with scoring

2012-08-23 Thread geeky2
update: as an experiment - i changed the query to a wildcard (9030*) instead of an explicit value (9030) example: QUERY="http://$SERVER.intra.searshc.com:${PORT}/solrpartscat/core1/select?qt=itemNoProductTypeBrandSearch&q=9030*&rows=2000&debugQuery=on&fl=*,score"; this resulted in a results lis

need help understanding times used in dataimport?command=status

2012-07-11 Thread geeky2
hello all, i noticed something in one of our logs that periodically polls the status of an data import. can someone help me understand where / how the "times" for "Full Dump Started" are derived? here it shows the dataimport dump starting at 1:32 0 0 db-data-confi

RE: maxNumberOfBackups does not cleanup - jira 3361

2012-07-10 Thread geeky2
thank you James - that is good to know. for the short-term we'll just use cron and kill backup directories that are older than x. for the long-term, we'll just migrate to 4.0 thanks again -- View this message in context: http://lucene.472066.n3.nabble.com/maxNumberOfBackups-does-not-cleanup-j

maxNumberOfBackups does not cleanup - jira 3361

2012-07-10 Thread geeky2
environment: solr 3.5 hello all, i have a question on this jira - https://issues.apache.org/jira/browse/SOLR-3361 the jira states that, "with "backupAfter"=commit, the backups do not get cleaned up" however - we are noticing this same issue in our environment, when using optimize. can someone

avgTimePerRequest JMX M-Bean displays with NaN instead of 0 - when no activity

2012-06-28 Thread geeky2
hello all, environment: solr 3.5, jboss, wily we have been setting up jmx monitoring for our solr installation. while running tests - i noticed that of the 6 JMX M-Beans (avgRequestsPerSecond, avgTimePerRequest, errors, requests, timeouts, totalTime) ... the avgTimePerRequest M-Bean was produci

question about jmx value (avgRequestsPerSecond) output from solr

2012-06-27 Thread geeky2
hello all, environment: centOS, solr 3.5, jboss 5.1 i have been using wily (a monitoring tool) to instrument our solr instances in stress. can someone help me to understand something about the jmx values being output from solr? please note - i am new to JMX. problem / issue statement: for a gi

Re: seeing errors during replication process on slave boxes - read past EOF

2012-06-04 Thread geeky2
hello, i have shell scripts that handle all of the operational tasks. example: curl -v http://${SERVER}.bogus.com:${PORT}/somecore/dataimport -F command=full-import -F clean=${CLEAN} -F commit=${COMMIT} -F optimize=${OPTIMIZE} -- View this message in context: http://lucene.472066.n3.nabble.

seeing errors during replication process on slave boxes - read past EOF

2012-06-03 Thread geeky2
hello all, environment: solr 3.5 1 - master 2 - slave slaves are set to poll master every 10 minutes. i have had replication running on one master and two slaves - for a few weeks now. these boxes are not production boxes - just QA/test boxes. right after i started a re-index on the master -

Re: possible status codes from solr during a (DIH) data import process

2012-06-01 Thread geeky2
thank you ALL for the great feedback - very much appreciated! -- View this message in context: http://lucene.472066.n3.nabble.com/possible-status-codes-from-solr-during-a-DIH-data-import-process-tp3987110p3987263.html Sent from the Solr - User mailing list archive at Nabble.com.

eliminate adminPath tag from solr.xml file?

2012-06-01 Thread geeky2
hello all, referring to: http://wiki.apache.org/solr/CoreAdmin#Core_Administration if you wanted to eliminate administration of the core from the web site, could you eliminate either solr.xml or remove the from the solr.xml file? thank you, -- View this message in context: http://lucene.

possible status codes from solr during a (DIH) data import process

2012-05-31 Thread geeky2
hello all, i have been asked to write a small polling script (bash) to periodically check the status of an import on our Master. our import times are small, but there are business reasons why we want to know the status of an import after a specified amount of time. i need to perform certain acti

need to verify my understanding of default value of mm (minimum match) for edismax

2012-05-24 Thread geeky2
environment: solr 3.5 default operator is OR i want to make sure i understand how the mm param(minimum match) works for the edismax parser http://wiki.apache.org/solr/ExtendedDisMax?highlight=%28dismax%29#mm_.28Minimum_.27Should.27_Match.29 it looks like the rule is 100% of the terms must match

RE: index-time boosting using DIH

2012-05-22 Thread geeky2
thank you james for the feedback - i appreciate it. ultimately - i was trying to decide if i was missing the boat by ONLY using query time boosting, and i should really be using index time boosting. but after your reply, reading the solr book, and looking at the lucene dox - it looks like index-t

RE: index-time boosting using DIH

2012-05-22 Thread geeky2
thanks for the reply, so to use the $docBoost pseudo-field name, would you do something like below - and would this technique likely increase my total index time? ... -- View this message in context: http://lucene.472066.n3.nabble.com/index-tim

index-time boosting using DIH

2012-05-22 Thread geeky2
hello all, can i use the technique described on the wiki at: http://wiki.apache.org/solr/SolrRelevancyFAQ#index-time_boosts if i am populating my core using a DIH? looking at the posts on this subject and the wiki docs - leads me to believe that you can only use this when you are using the xml

Re: doing a full-import after deleting records in the database - maxDocs

2012-05-15 Thread geeky2
hello thanks for the reply this is the output - docsPending = 0 commits : 1786 autocommit maxDocs : 1000 autocommit maxTime : 6ms autocommits : 1786 optimizes : 3 rollbacks : 0 expungeDeletes : 0 docsPending : 0 adds : 0 deletesById : 0 deletesByQuery : 0 errors : 0 cumulative_adds : 178775

doing a full-import after deleting records in the database - maxDocs

2012-05-15 Thread geeky2
hello, After doing a DIH full-import (with clean=true) after deleting records in the database, i noticed that the number of documents processed, did change. example: Indexing completed. Added/Updated: 595908 documents. Deleted 0 documents. however, i noticed the numbers on the statistics page

need help with getting exact matches to score higher

2012-05-15 Thread geeky2
Hello all, i am trying to tune our core for exact matches on a single field (itemNo) and having issues getting it to work. in addition - i need help understanding the output from debugQuery=on where it presents the scoring. my goal is to get exact matches to arrive at the top of the results.

Re: not getting expected results when doing a delta import via full import

2012-05-14 Thread geeky2
update on this: i also tried manipulating the timestamps in the dataimport.properties file to advance the date so that no records could be older than last_index_time example: #Mon May 14 12:42:49 CDT 2012 core1-model.last_index_time=2012-05-15 14\:38\:55 last_index_time=2012-05-15 14\:38\:55 ~

not getting expected results when doing a delta import via full import

2012-05-14 Thread geeky2
hello all, i am not getting the expected results when trying to set up delta imports according to the wiki documentation here: http://wiki.apache.org/solr/DataImportHandlerDeltaQueryViaFullImport?highlight=%28delta%29|%28import%29 i have the following set up in my DIH, query="select [complic

solr snapshots - old school and replication - new school ?

2012-05-03 Thread geeky2
hello all, enviornment: centOS and solr 3.5 i want to make sure i understand the difference between snapshots and solr replication. snapshots are "old school" and have been deprecated with solr replication "new school". do i have this correct? btw: i have replication working (now), between my

Re: should slave replication be turned off / on during master clean and re-index?

2012-05-03 Thread geeky2
thanks for all of the advice / help. i appreciate it ;) -- View this message in context: http://lucene.472066.n3.nabble.com/should-slave-replication-be-turned-off-on-during-master-clean-and-re-index-tp3945531p3959088.html Sent from the Solr - User mailing list archive at Nabble.com.

dataimport handler (DIH) - notify when it has finished?

2012-05-01 Thread geeky2
Hello all, is there a notification / trigger / callback mechanism people use that allows them to know when a dataimport process has finished? we will be doing daily delta-imports and i need some way for an operations group to know when the DIH has finished. thank you, -- View this message in

Re: should slave replication be turned off / on during master clean and re-index?

2012-05-01 Thread geeky2
hello shawn, thanks for the reply. ok - i did some testing and yes you are correct. autocommit is doing the "commit" work in chunks. yes - the slaves are also going to having everything to nothing, then slowly building back up again, lagging behind the master. ... and yes - this is probably n

Re: should slave replication be turned off / on during master clean and re-index?

2012-04-27 Thread geeky2
hello, thank you for the reply, >> Does a "clean" mean issuing a deletion query (e.g. *:*) prior to re-indexing all of your content? I don't think the slaves will download any changes until you've committed at some point on the master. << well, in this case when i say, "clean" (on the Master

should slave replication be turned off / on during master clean and re-index?

2012-04-27 Thread geeky2
hello all, i am just getting replication going on our master and two (2) slaves. from time to time, i may need to do a complete re-index and clean on the master. should replication on the slave - remain On or Off during a full clean and re-index on the Master? thank you, -- View this message i

impact of EdgeNGramFilterFactory on indexing process?

2012-04-26 Thread geeky2
Hello all, i am experimenting with EdgeNGramFilterFactory - on two of the fieldTypes in my schema. i believe i understand this - but want to verify: 1) will this increase my index time? 2) will increase the number of documents in my index? thank you -- View this message in context:

Re: solr replication failing with error: Master at: is not available. Index fetch failed

2012-04-26 Thread geeky2
hello, sorry - i overlooked this message - thanks for checking back and thanks for the info. yes - replication seems to be working now: tailed from logs just now: 2012-04-26 09:21:33,284 INFO [org.apache.solr.handler.SnapPuller] (pool-12-thread-1) Slave in sync with master. 2012-04-26 09:21:53

Re: faceted searches - design question - facet field not part of qf search fields

2012-04-25 Thread geeky2
thank you BOTH, Erick and Hos for the insight. -- View this message in context: http://lucene.472066.n3.nabble.com/faceted-searches-design-question-facet-field-not-part-of-qf-search-fields-tp3936509p3938080.html Sent from the Solr - User mailing list archive at Nabble.com.

faceted searches - design question - facet field not part of qf search fields

2012-04-24 Thread geeky2
hello all, this is more of a design / newbie question on how others combine faceted search fields in to their requestHandlers. say you have a request handler set up like below. does it make sense (from a design perspective) to add a faceted search field that is NOT part of the main search fiel

Re: solr replication failing with error: Master at: is not available. Index fetch failed

2012-04-24 Thread geeky2
that was it! thank you. i did notice something else in the logs now ... what is the meaning or implication of the message, "Connection reset".? 2012-04-24 12:59:19,996 INFO [org.apache.solr.handler.SnapPuller] (pool-12-thread-1) Slave in sync with master. 2012-04-24 12:59:39,998 INFO [org.a

Re: solr replication failing with error: Master at: is not available. Index fetch failed

2012-04-24 Thread geeky2
hello, thank you for the reply, yes - master has been indexed. ok - makes sense - the polling interval needs to change i did check the solr war file on both boxes (master and slave). they are identical. actually - if they were not indentical - this would point to a different issue altogether

correct location in chain for EdgeNGramFilterFactory ?

2012-04-24 Thread geeky2
hello all, i want to experiment with the EdgeNGramFilterFactory at index time. i believe this needs to go in post tokenization - but i am doing a pattern replace as well as other things. should the EdgeNGramFilterFactory go in right after the pattern replace?

solr replication failing with error: Master at: is not available. Index fetch failed

2012-04-23 Thread geeky2
hello all, enviornment: centOS and solr 3.5 i am attempting to set up replication betweeen two solr boxes (master and slave). i am getting the following in the logs on the slave box. 2012-04-23 10:54:59,985 SEVERE [org.apache.solr.handler.SnapPuller] (pool-12-thread-1) Master at: http://someip:

Re: searching across multiple fields using edismax - am i setting this up right?

2012-04-13 Thread geeky2
thank you for the response. it seems to be working well ;) 1) i tried your suggestion about removing the qt parameter - *somecore/partItemNoSearch*&q=dishwasher&debugQuery=on&rows=10 but this results in a 404 error message - is there some configuration i am missing to support this short-hand s

searching across multiple fields using edismax - am i setting this up right?

2012-04-12 Thread geeky2
hello all, i just want to check to make sure i have this right. i was reading on this page: http://wiki.apache.org/solr/ExtendedDisMax, thanks to shawn for educating me. *i want the user to be able to fire a requestHandler but search across multiple fields (itemNo, productType and brand) WITHOUT

Re: is there a downside to combining search fields with copyfield?

2012-04-12 Thread geeky2
>> You end up with one multivalued field, which means that you can only have one analyzer chain. << actually two of the three fields being considered for combination in to a single field ARE multivalued fields. would this be an issue? >> With separate fields, each field can be analyzed differ

is there a downside to combining search fields with copyfield?

2012-04-12 Thread geeky2
hello everyone, can people give me their thoughts on this. currently, my schema has individual fields to search on. are there advantages or disadvantages to taking several of the individual search fields and combining them in to a single search field? would this affect search times, term tokeni

why does building war from source produce a different size file?

2012-03-29 Thread geeky2
hello all, i have been pulling down the 3.5 solr war file from the mirror site. the size of this file is: 6403279 Nov 22 14:54 apache-solr-3.5.0.war when i build the war file from source - i get a different sized file: ./dist/apache-solr-3.5-SNAPSHOT.war 6404098 Mar 29 11:41 ./dist/apache-s

Re: authentication for solr admin page?

2012-03-28 Thread geeky2
update - ok - i was reading about replication here: http://wiki.apache.org/solr/SolrReplication and noticed comments in the solrconfig.xml file related to HTTP Basic Authentication and the usage of the following tags: username password *Can i place these tags in the request handler to

RE: preventing words from being indexed in spellcheck dictionary?

2012-03-28 Thread geeky2
thank you, James. -- View this message in context: http://lucene.472066.n3.nabble.com/preventing-words-from-being-indexed-in-spellcheck-dictionary-tp3861472p3865670.html Sent from the Solr - User mailing list archive at Nabble.com.

authentication for solr admin page?

2012-03-28 Thread geeky2
hello, environment: running solr 3.5 under jboss 5.1 i have been searching the user list along with the locations below - to find out how you require a user to authenticate in to the solr /admin page. i thought this would be a common issue - but maybe not ;) any help would be apprecaited than

RE: preventing words from being indexed in spellcheck dictionary?

2012-03-27 Thread geeky2
hello, should i apply the StopFilterFactory at index time or query time. right now - per the schema below - i am applying it at BOTH index time and query time. is this correct? thank you, mark // snipped from schema.xml

RE: preventing words from being indexed in spellcheck dictionary?

2012-03-27 Thread geeky2
thank you very much for the info ;) -- View this message in context: http://lucene.472066.n3.nabble.com/preventing-words-from-being-indexed-in-spellcheck-dictionary-tp3861472p3861987.html Sent from the Solr - User mailing list archive at Nabble.com.

preventing words from being indexed in spellcheck dictionary?

2012-03-27 Thread geeky2
hello all, i am creating a spellcheck dictionary from the itemDescSpell field in my schema. is there a way to prevent certain words from entering the dictionary - as the dictionary is being built? thanks for any help mark // snipped from solarconfig.xml default itemDescSpell

spellcheck file format - multiple words on a line?

2012-03-23 Thread geeky2
hello all, for business reasons, we are sourcing the spellcheck file from another business group. the file we receive looks like the example data below can solr support this type of format - or do i need to process this file in to a format that has a single word on a single line? thanks for a

suggestions on automated testing for solr output

2012-03-16 Thread geeky2
hello all, i know this is never a fun topic for people, but our SDLC mandates that we have unit test cases that attempt to validate the output from specific solr queries. i have some ideas on how to do this, but would really appreciate feedback from anyone that has done this or is doing it now.

does solr have a mechanism for intercepting requests - before they are handed off to a request handler

2012-03-09 Thread geeky2
hello all, does solr have a mechanism that could intercept a request (before it is handed off to a request handler). the intent (from the business) is to send in a generic request - then pre-parse the url and send it off to a specific request handler. thank you, mark -- View this message in co

need input - lessons learned or best practices for data imports

2012-03-05 Thread geeky2
hello all, we are approaching the time when we will move our first solr core in to a more "production like" environment. as a precursor to this, i am attempting to write some documents on impact assessment and batch load / data import strategies. does anyone have processes or lessons learned - t

does the location of a match (within a field) affect the score?

2012-03-02 Thread geeky2
hello all, example: i have a field named itemNo the user does a search, itemNo:665 there are three document in the core, that look like this doc1 - itemNo = 1237899*665* doc2 - itemNo = *665*1237899 doc3 - itemNo = 123*665*7899 does the location or placement of the search string (beginnin

need to support bi-directional synonyms

2012-02-22 Thread geeky2
hello all, i need to support the following: if the user enters "sprayer" in the desc field - then they get results for BOTH "sprayer" and "washer". and in the other direction if the user enters "washer" in the desc field - then they get results for BOTH "washer" and "sprayer". would i set up

proper syntax for using sort query parameter in responseHandler

2012-02-17 Thread geeky2
what is the proper syntax for including sort directive in my responseHandler? i tried this but got an error: edismax all 10 itemNo^1.0 *:* * rankNo desc* itemType:1 false thank you mark -- View this message in

RE: spellcheck configuration not providing suggestions or corrections

2012-02-13 Thread geeky2
thank you sooo much - that was it. also - thank you for the tip on which field to hit, eg itemDesc in stead of itemDescSpell. thank you, mark -- View this message in context: http://lucene.472066.n3.nabble.com/spellcheck-configuration-not-providing-suggestions-or-corrections-tp3740877p3741783

RE: spellcheck configuration not providing suggestions or corrections

2012-02-13 Thread geeky2
hello thank you for the suggestion - however this did not work. i went in to solrconfig and change the count to 20 - then restarted the server and then did a reimport. is it possible that i am not firing the request handler that i think i am firing ? default false

Re: struggling with solr.WordDelimiterFilterFactory and periods "." or dots

2012-02-10 Thread geeky2
hello, >> Or does your field in schema.xml have anything like autoGeneratePhraseQueries="true" in it? << there is no reference to this in our production schema. this is extremely confusing. i am not completely clear on the issue? reviewing our previous messages - it looks like the data is bein

Re: struggling with solr.WordDelimiterFilterFactory and periods "." or dots

2012-02-09 Thread geeky2
>> OK, first question is why are you searching on two different values? Is that intentional? << yes - our users have to be able to locate a part or model number (that may or may not have periods in that number) even if they do NOT enter the number with the embedded periods. example: actual

Re: struggling with solr.WordDelimiterFilterFactory and periods "." or dots

2012-02-08 Thread geeky2
hello, thanks for sticking with me on this ...very frustrating ok - i did perform the query with the debug parms using two scenarios: 1) a successful search (where i insert the period / dot) in to the itemNo field and the search returns a document. itemNo:BP2.1UAA http://hfsthssolr1.intra.sea

  1   2   >