Re: Nested boosting in map function in solr?

2014-06-03 Thread Jack Krupansky
I think you need to use parameter substitution for those nested queries since the "boost" parameter takes a white-space delimited sequence of function queries. -- Jack Krupansky -Original Message- From: Kamal Kishore Aggarwal Sent: Tuesday, June 3, 2014 2:22 AM To:

Re: suspect SOLR query from D029 (SOLR master)

2014-06-03 Thread Jack Krupansky
11 * 11 or 121 query terms, which shouldn't be so bad. But... maybe the Lucene FST for your synonym list is huge. Someone with deeper Lucene knowledge would have to address that. -- Jack Krupansky -Original Message- From: Branham, Jeremy [HR] Sent: Tuesday, June 3, 2014 3:57

Re: solr multi-tenant: anyone use per-tenant synonyms file?

2014-06-02 Thread Jack Krupansky
Try to stay with a separate collection/core for each tenant - otherwise relevancy for document scores gets "polluted" by other tenants, even if you do use filter queries to isolate what documents get returned for a tenant in a multi-tenant core. -- Jack Krupansky -Origin

Re: change in EnumField configuration - what do you think?

2014-06-02 Thread Jack Krupansky
docvalues, right? -- Jack Krupansky -Original Message- From: Erick Erickson Sent: Monday, June 2, 2014 10:46 AM To: solr-user@lucene.apache.org Subject: Re: change in EnumField configuration - what do you think? Would both then be supported? I see where it would be easily detectable. And I

Re: Solr 4.8 synonyms expansion for each primary term

2014-06-02 Thread Jack Krupansky
x27;t dug into the new REST API to know the answer. -- Jack Krupansky -Original Message- From: Archana R Sent: Monday, June 2, 2014 10:44 AM To: solr-user@lucene.apache.org Subject: Solr 4.8 synonyms expansion for each primary term we recently upgraded to Solr 4.8 and we are using REST A

Re: Uneven shard heap usage

2014-05-31 Thread Jack Krupansky
When you restart, how long does it take it hit the problem? And how much query or update activity is happening in that time? Is there any other activity showing up in the log? If you bring up only a single node in that problematic shard, do you still see the problem? -- Jack Krupansky

Re: How to build Solr4.0 Final?

2014-05-30 Thread Jack Krupansky
Make sure you checkout from the "tagged" branch, not trunk or the actual release branch which has updates since the release. See: http://svn.apache.org/viewvc/lucene/dev/tags/ -- Jack Krupansky -Original Message- From: eShard Sent: Friday, May 30, 2014 11:55 AM To:

Re: search component needs access to results of previous component

2014-05-30 Thread Jack Krupansky
rks Search Solr-based product. It re-executes enhanced queries in a query component. I don't think the source is available though. And I don't recall if it was cloud-enabled. See: http://docs.lucidworks.com/display/lweug/Queries+and+Relevance -- Jack Krupansky -Original Message

Re: How to check and handle empty Solr query

2014-05-30 Thread Jack Krupansky
Consider using the edismax query parser, which allows you to specify q.alt which is an alternative query such as "*:". Or use an application layer to handle the situation before it gets to Solr. -- Jack Krupansky -Original Message- From: vit Sent: Friday, May 30, 2014

Re: Block document for certain search words

2014-05-30 Thread Jack Krupansky
Explain your use case a little more, but you can define terms as stop words with a stop filter, which means they won't appear in the index. -- Jack Krupansky -Original Message- From: Christian Loock Sent: Friday, May 30, 2014 5:38 AM To: solr-user@lucene.apache.org Subject:

Re: wildcard matches in EnumField - what do I need to change in code to enable wildcard matches?

2014-05-29 Thread Jack Krupansky
cases, but my personal imagination is not at issue for this particular thread. -- Jack Krupansky -Original Message- From: Shawn Heisey Sent: Thursday, May 29, 2014 9:46 AM To: solr-user@lucene.apache.org Subject: Re: wildcard matches in EnumField - what do I need to change in code to

Re: wildcard matches in EnumField - what do I need to change in code to enable wildcard matches?

2014-05-29 Thread Jack Krupansky
https://cwiki.apache.org/confluence/display/solr/Working+with+Enum+Fields -- Jack Krupansky -Original Message- From: Elran Dvir Sent: Thursday, May 29, 2014 2:50 AM To: solr-user@lucene.apache.org Subject: wildcard matches in EnumField - what do I need to change in code to enable wildcard matches? H

Re: Solr Cell Tika - date.formats

2014-05-28 Thread Jack Krupansky
HH:mm:ss'Z' -MM-dd'T'HH:mm:ss -MM-dd -MM-dd hh:mm:ss -MM-dd HH:mm:ss EEE MMM d hh:mm:ss z EEE, dd MMM HH:mm:ss zzz , dd-MMM-yy HH:mm:ss zzz EEE MMM d HH:mm:ss See: https://cwiki.apache.org/confluence/display/solr/Uploading+Data+with+Solr+

Re: Regex with local params is not working

2014-05-28 Thread Jack Krupansky
Post the parsed query itself. Yes, edismax should always generate a disjunctionmaxquery - in addition to the regexquery. -- Jack Krupansky -Original Message- From: Lokn Sent: Wednesday, May 28, 2014 1:53 AM To: solr-user@lucene.apache.org Subject: Re: Regex with local params is not

Re: Any Solrj API to obtain field list?

2014-05-27 Thread Jack Krupansky
You might consider an update request processor as an alternative. It runs on the server and might be simpler. You can even use the stateless script update processor to avoid having to write any custom Java code. -- Jack Krupansky -Original Message- From: T. Kuro Kurosaka Sent

Re: Using SolrCloud with RDBMS or without

2014-05-26 Thread Jack Krupansky
You could also consider DataStax Enterprise, which integrates Apache Cassandra as the primary database and Solr for indexing and query. See: http://www.datastax.com/what-we-offer/products-services/datastax-enterprise -- Jack Krupansky -Original Message- From: Ali Nazemian Sent

Re: Solr - Cores not initialised

2014-05-26 Thread Jack Krupansky
f the XML elements. It appears that you mangled that message as well! Feel free to post the complete message here as well. -- Jack Krupansky -Original Message- From: Manikandan Saravanan Sent: Monday, May 26, 2014 1:52 AM To: solr-user@lucene.apache.org Cc: Varuna Venkatesh Subject

Re: about analyzer and tokenizer

2014-05-26 Thread Jack Krupansky
book". Your SYNONYMS.TXT file would contain: macbook,mac book Only use the synonyms filter at index time. The standard query parsers don't support phrases for synonyms. -- Jack Krupansky -Original Message- From: rachun Sent: Monday, May 26, 2014 5:26 AM To: solr-user@lucene.apa

Re: Query translation of User Fields

2014-05-25 Thread Jack Krupansky
. -- Jack Krupansky -Original Message- From: Yonik Seeley Sent: Sunday, May 25, 2014 10:58 AM To: solr-user@lucene.apache.org Subject: Re: Query translation of User Fields On Thu, May 22, 2014 at 10:56 AM, Jack Krupansky wrote: Hmmm... that doesn't sound like what I would

Re: Solr 4.7.2 ValueSourceParser classCast exception

2014-05-23 Thread Jack Krupansky
Are you sure that you compiled your code with the proper Solr jars so that the class signature (extends, implements, and constructors) matches the Solr 4.7.2 jars? I mean, Java is simply complaining that your class is not a valid value source class of the specified type. -- Jack Krupansky

Re: index a repository of documents(.doc) without using post.jar

2014-05-23 Thread Jack Krupansky
Feel free to look at the source code for post.jar. I mean, all it is really doing is scanning the directory (optionally recursively) and then streaming each file to Solr. -- Jack Krupansky -Original Message- From: benjelloun Sent: Friday, May 23, 2014 8:15 AM To: solr-user

Re: index a repository of documents(.doc) without using post.jar

2014-05-23 Thread Jack Krupansky
Is there a particular reason you are adverse to using post.jar? I mean, if there is some bug or inconvenience, let us know so we can fix it! The Solr server itself does not provide any ability to "crawl" file systems (LucidWorks Search does.) post.jar does provide that convenience.

Re: How to Disable Commit Option and Just Manage it via SolrConfig?

2014-05-23 Thread Jack Krupansky
class, or maybe you would have to copy and edit it. Also, note that the delete command also has a commitWithin setting. -- Jack Krupansky -Original Message- From: Furkan KAMACI Sent: Thursday, May 22, 2014 6:36 AM To: solr-user@lucene.apache.org Subject: How to Disable Commit Option and J

Re: Atomic update by query instead of ID

2014-05-22 Thread Jack Krupansky
No, but it sure would be nice to have the Elasticsearch feature of supplying a script for update. -- Jack Krupansky -Original Message- From: Saumitra Srivastav Sent: Thursday, May 22, 2014 11:13 AM To: solr-user@lucene.apache.org Subject: Atomic update by query instead of ID Is is

Re: Query translation of User Fields

2014-05-22 Thread Jack Krupansky
Hmmm... that doesn't sound like what I would have expected - I would have thought that Solr would throw an exception on the "user" field, rather than simply treat it as a text keyword. File a Jira. Either it's a bug or the doc is not complete. -- Jack Krupansky -

Re: multiple queries in single request

2014-05-22 Thread Jack Krupansky
No, I was rejecting BOTH methods 1 and 2. I was suggesting a different method. I'll leave it to somebody else to describe the method so that it is easier to understand. -- Jack Krupansky -Original Message- From: Pavel Belenkovich Sent: Thursday, May 22, 2014 4:00 AM To: solr

Re: pdfs

2014-05-22 Thread Jack Krupansky
Yeah, I recall running into infinite loop issues with PDFBox in Solr years ago. They keep fixing these issues, but they keep popping up again. Sigh. -- Jack Krupansky -Original Message- From: Siegfried Goeschl Sent: Thursday, May 22, 2014 4:35 AM To: solr-user@lucene.apache.org

Re: Applying boosting for keyword search

2014-05-21 Thread Jack Krupansky
Your original message had "q=toyota featured:true^100" and also using bq - both are valid. If either is not working for you, please be specific about what exactly is not behaving as you expected - what the symptom is. Sometimes you have to experiment with the boost factor. -- Jack

Re: pdfs

2014-05-21 Thread Jack Krupansky
cific symptom? -- Jack Krupansky -Original Message- From: Brian McDowell Sent: Thursday, May 22, 2014 12:24 AM To: solr-user@lucene.apache.org Subject: pdfs Has anyone had issues with indexing pdf files? Some pdfs are bringing down Solr completely so that it actually needs to be man

Re: Applying boosting for keyword search

2014-05-21 Thread Jack Krupansky
Just add the boost to the keyword: q=toyota^100. Or, use the dismax or edismax query parsers and then the boost can be specified for the field: qf=keyword^100. -- Jack Krupansky -Original Message- From: manju16832003 Sent: Thursday, May 22, 2014 12:04 AM To: solr-user

Re: Using fq as OR

2014-05-21 Thread Jack Krupansky
The results will be scored, but only based on terms in q, not terms in fq. -- Jack Krupansky -Original Message- From: johnmu...@aol.com Sent: Wednesday, May 21, 2014 6:41 PM To: solr-user@lucene.apache.org Subject: Re: Using fq as OR Interesting!! I did not know that using &qu

Re: boosting multivalued fields

2014-05-21 Thread Jack Krupansky
Yes. -- Jack Krupansky -Original Message- From: vit Sent: Wednesday, May 21, 2014 11:20 AM To: solr-user@lucene.apache.org Subject: boosting multivalued fields is it posiible to boost values of the same field. For example in a query like that: category_id:(2271578^0.5 22718986^0.4

Re: multiple queries in single request

2014-05-21 Thread Jack Krupansky
Nothing special for this use case. This seems to be a use case that I would call "bulk data retrieval - based on ID". I would suggest "batching" your requests - limit each request query to, say, 50 or 100 IDs. -- Jack Krupansky -Original Message- From: Pav

Re: Using fq as OR

2014-05-21 Thread Jack Krupansky
, or add a boosting query using the bq parameter. The latter approach works for the dismax and edismax query parsers only. -- Jack Krupansky -Original Message- From: johnmu...@aol.com Sent: Wednesday, May 21, 2014 5:51 PM To: solr-user@lucene.apache.org Subject: Re: Using fq as OR Hi

Re: Using fq as OR

2014-05-21 Thread Jack Krupansky
filtering terms to participate in the document scoring. In other words, what exactly were you trying to achieve by using fq? -- Jack Krupansky -Original Message- From: johnmu...@aol.com Sent: Wednesday, May 21, 2014 12:19 PM To: solr-user@lucene.apache.org Subject: Re: Using fq as OR

Re: Distributed Search in Solr with different queries per shard

2014-05-21 Thread Jack Krupansky
Unfortunately the same query will be sent to all cores if you use the shards parameter to query multiple cores. Is there some characteristic of the first core that is distinct from the second core so that you could OR the differences between the two? -- Jack Krupansky -Original Message

Re: Using fq as OR

2014-05-21 Thread Jack Krupansky
g of documents that is different? For the latter, the fq is purely a "filter" that removes documents, but the terms in fq do not participate in relevance or scoring calculation of documents, so documents will not get boosted based on the terms present in fq. -- Jack Krupansk

Re: solr-user Digest of: get.100322

2014-05-21 Thread Jack Krupansky
il to follow it. Or, maybe we should enhance Solr to check available memory and log a stern warning if the index size exceeds system memory when Solr is started. -- Jack Krupansky -Original Message- From: Shawn Heisey Sent: Tuesday, May 20, 2014 1:49 PM To: solr-user@lucene.apache.

Re: trigger delete on nested documents

2014-05-20 Thread Jack Krupansky
document would make it discontiguous with the rest of the block of documents. Just update the block by resending the entire block of documents. For e previous discussion of this limitation: http://lucene.472066.n3.nabble.com/block-join-and-atomic-updates-td4117178.html -- Jack Krupansky

Re: Howto Search word which contains the character "

2014-05-20 Thread Jack Krupansky
there are some examples in my e-book that shows how to map @ and _ to ALPHA. -- Jack Krupansky -Original Message- From: Ahmet Arslan Sent: Tuesday, May 20, 2014 4:55 AM To: solr-user@lucene.apache.org Subject: Re: Howto Search word which contains the character " Hi, It is speci

Re: Index / Query IP Address as number.

2014-05-19 Thread Jack Krupansky
addresses. And then you could output to multiple fields - one for the raw string for wildcard matches, say, and one as an integer for proximity or range checks. -- Jack Krupansky -Original Message- From: SolrUser1543 Sent: Monday, May 19, 2014 3:04 PM To: solr-user@lucene.apache.org Subject

Re: Subscribe for Solr Mails

2014-05-19 Thread Jack Krupansky
Read this web page for information about subscribing: https://lucene.apache.org/solr/discussion.html -- Jack Krupansky -Original Message- From: Kamal Kishore Aggarwal Sent: Monday, May 19, 2014 6:12 AM To: solr-user@lucene.apache.org Subject: Subscribe for Solr Mails Dear Team

Re: How to delete docs only in a spec shard within a collection

2014-05-19 Thread Jack Krupansky
Normally a delete-by-query needs to be distributed to all shards since it is not known which document keys it will affect. It sounds like SolrCloud does not support the feature you would really like: support for distrb=false. You can file a Jira request for a feature "improvement."

Re: Index / Query IP Address as number.

2014-05-18 Thread Jack Krupansky
What are you using for your default query operator, and do you have autoGeneratePhraseQueries set to "true" for your field type? I mean, a query for 192.168.1.4 shouldn't match 192.168.1.3 - unless you have autoGeneratePhraseQueries set to "false" (the defa

Re: Solr 4.8: Does eDisMax parser calls analyzer chain to tokenize?

2014-05-17 Thread Jack Krupansky
01 k CD-ROM. => CD-ROM. vs. CD ROM Finally, the default for the splitOnNumerics attribute is "1" (true), which is why "abc123xyz" is split into three terms. If you don't want that split, set splitOnNumerics="0". There are more details on WDF in my e-book:

Re: What is the usage of solr.NumericPayloadTokenFilterFactory

2014-05-17 Thread Jack Krupansky
sure would be nice to see more substantial and easier to use support for Payload in Solr. -- Jack Krupansky -Original Message- From: ienjreny Sent: Saturday, May 17, 2014 1:24 AM To: solr-user@lucene.apache.org Subject: Re: What is the usage of solr.NumericPayloadTokenFilterFactory

Re: slow performance on simple filter

2014-05-16 Thread Jack Krupansky
Add the debugQuery=true parameter and look at the "timing" section to see which search component is consuming the time. Are you using faceting or highlighting? 7 million documents is actually a fairly small index. -- Jack Krupansky -Original Message- From: mizayah Sent:

Re: date range queries efficiency

2014-05-16 Thread Jack Krupansky
exact query matches such as year without expanding the date to a range for the full interval. -- Jack Krupansky -Original Message- From: Dmitry Kan Sent: Friday, May 9, 2014 6:41 AM To: solr-user@lucene.apache.org Subject: date range queries efficiency Hi, There was a mention eith

Re: Difference between search strings

2014-05-16 Thread Jack Krupansky
For these specific examples, the results should be the same, but mostly that's because the term is a simple sequence of letters. I have an extended discussion of characters in terms in my e-book: http://www.lulu.com/us/en/shop/jack-krupansky/solr-4x-deep-dive-early-access-release-7/

Status of mail?

2014-05-16 Thread Jack Krupansky
Is the mail list working again yet?? -- Jack Krupansky

Re: Help to Understand a Solr Query

2014-05-15 Thread Jack Krupansky
Please don't re-use an existing message thread for a new, completely independent question! Also, try to make the subject line indicate something about the actual issue. -- Jack Krupansky -Original Message- From: nativecoder Sent: Tuesday, May 13, 2014 10:56 AM To: solr

Re: Difference between search strings

2014-05-15 Thread Jack Krupansky
Inside of quotes you only have to escape quote and backslash. Add the debugQuery=true parameter to see exactly how Solr processes characters and generates queries. But... in a URL you have to URL-encode URL query parameters: http://en.wikipedia.org/wiki/Query_string -- Jack Krupansky

Re: Indexing DateField timezone problem

2014-05-14 Thread Jack Krupansky
nually add that to dates. -- Jack Krupansky -Original Message- From: hakanbillur Sent: Friday, May 9, 2014 4:38 AM To: solr-user@lucene.apache.org Subject: Indexing DateField timezone problem <http://lucene.472066.n3.nabble.com/file/n4135079/Capture2.png> <http://lucene.472066

Re: KeywordTokenizerFactory splits the string for the exclamation mark

2014-05-14 Thread Jack Krupansky
ry parser: http://lucene.apache.org/core/4_8_0/queryparser/org/apache/lucene/queryparser/classic/package-summary.html#Escaping_Special_Characters -- Jack Krupansky -Original Message- From: Romani Rupasinghe Sent: Tuesday, May 13, 2014 11:14 AM To: solr-user@lucene.apache.org Subject: KeywordT

Re: distrib=false is not honoring

2014-05-14 Thread Jack Krupansky
at is your rationale? What is this odd "mcat.intent" query response writer type that you are specifying with the qt parameter? -- Jack Krupansky -Original Message- From: Aman Tandon Sent: Wednesday, May 14, 2014 1:35 AM To: solr-user@lucene.apache.org Subject: distrib=false i

Re: What is the usage of solr.NumericPayloadTokenFilterFactory

2014-05-13 Thread Jack Krupansky
I do have basic coverage for that filter (and all other filters) and the parameter values in my e-book: http://www.lulu.com/us/en/shop/jack-krupansky/solr-4x-deep-dive-early-access-release-7/ebook/product-21203548.html That said, are you sure you want to be using the payload feature of Lucene

Re: Too many documents Exception

2014-05-13 Thread Jack Krupansky
One of the hard-core Lucene guys is going to have to help you out. Or you may have to write some custom code to fix the index for any such shard. If you have deleted any documents, it may be sufficient to simply optimize the index. -- Jack Krupansky -Original Message- From: yamazaki

Re: Easises way to insatll solr cloud with tomcat

2014-05-13 Thread Jack Krupansky
Check out HDS from Heliosearch - it comes packaged with Tomcat, ready to go: http://heliosearch.com/download.html -- Jack Krupansky -Original Message- From: Aman Tandon Sent: Monday, May 12, 2014 8:23 AM To: solr-user@lucene.apache.org Subject: Re: Easises way to insatll solr cloud

Re: is it possible for solr to calculate and give back the price of a product based on its sub-products

2014-05-11 Thread Jack Krupansky
n any case, please clarify your use case. -- Jack Krupansky -Original Message- From: Gharbi Mohamed Sent: Friday, May 9, 2014 3:29 PM To: solr-user@lucene.apache.org Subject: is it possible for solr to calculate and give back the price of a product based on its sub-products Hi, I am usin

Re: LetterTokenizerFactory doesn't work as expected

2014-05-11 Thread Jack Krupansky
Please post your full field type analyzer. The letter tokenizer should in fact return "I", "can", and "t" - if it is used properly. -- Jack Krupansky -Original Message- From: ienjreny Sent: Saturday, May 10, 2014 8:28 AM To: solr-user@lucene.apache.org S

Re: Use XSD or DTD to make Solr schema?

2014-05-07 Thread Jack Krupansky
Sorry, but you have to create the schema manually, but... you could possibly get by with Solr schemaless mode to dynamically create the schema based on the actual data values. See: https://cwiki.apache.org/confluence/display/solr/Schemaless+Mode -- Jack Krupansky -Original Message

Re: Too many documents Exception

2014-05-07 Thread Jack Krupansky
ad such an overstuffed index. -- Jack Krupansky -Original Message- From: [Tech Fun]山崎 Sent: Tuesday, May 6, 2014 8:54 PM To: solr-user@lucene.apache.org Subject: Too many documents Exception Hello everybody, Solr 4.3.1(and 4.7.1), Num Docs + Deleted Docs > 2147483647(Integer.MAX_VALUE

Re: Help to Understand a Solr Query

2014-05-06 Thread Jack Krupansky
Think of debugQuery as your "Solr BFF"! -- Jack Krupansky -Original Message- From: nativecoder Sent: Tuesday, May 6, 2014 7:36 AM To: solr-user@lucene.apache.org Subject: Re: Help to Understand a Solr Query Thank you this is what I was looking for all this time I

Re: Help to Understand a Solr Query

2014-05-06 Thread Jack Krupansky
alternatives - at least one of the alternatives must match. -- Jack Krupansky -Original Message- From: nativecoder Sent: Tuesday, May 6, 2014 4:44 AM To: solr-user@lucene.apache.org Subject: Re: Help to Understand a Solr Query When I go through the debug results I f

Re: Indexing scanned PDFs

2014-05-06 Thread Jack Krupansky
tly from a word processor source file will retain smooth curves on characters while a PDF generated from scanned page images will show heavy pixelation. -- Jack Krupansky -Original Message- From: Alexandre Rafalovitch Sent: Tuesday, May 6, 2014 1:30 AM To: solr-user@lucene.apache.org S

Re: Strict Search in Apache Solr

2014-05-05 Thread Jack Krupansky
The term "strict search" is not in the Lucene/Solr nomenclature - it could mean any number of things. It sounds as if maybe you want to do a phrase search, looking for an exact phrase - yes, you can do that by enclosing the phrase in quotes. -- Jack Krupansky -Origin

Re: Relevancy help

2014-05-05 Thread Jack Krupansky
what the document scores look like. -- Jack Krupansky -Original Message- From: Ravi Solr Sent: Monday, May 5, 2014 5:41 PM To: solr-user@lucene.apache.org Subject: Relevancy help Hello, I have a weird relevancy requirement. We search news content hence chronology is very importan

Re: Turning on KeywordRepeat and RemoveDups on an existing fieldType.

2014-05-05 Thread Jack Krupansky
an do the reindex incrementally (as you replace existing documents) as well if you don't mind if the difference in relevancy takes an extended time to become apparent. -- Jack Krupansky -Original Message- From: Michael Tracey Sent: Monday, May 5, 2014 4:52 PM To: solr-user@lu

Re: Help to Understand a Solr Query

2014-05-05 Thread Jack Krupansky
"dismax" means Disjunction Maximum, which means Lucene takes the highest scoring clause (field), for each search term. This is effectively an OR of the clauses. -- Jack Krupansky -Original Message- From: nativecoder Sent: Monday, May 5, 2014 11:21 AM To: solr-user@lucene.

Re: Help to Understand a Solr Query

2014-05-05 Thread Jack Krupansky
Read up on the edismax query parser first: http://wiki.apache.org/solr/ExtendedDisMax The "^" operator is known as boosting or field boosting and is used to influence document scores for relevancy. It has no analog in SQL. -- Jack Krupansky -Original Message- From: n

Re: Wildcard malfunctioning

2014-05-05 Thread Jack Krupansky
MappingCharFilterFactory. -- Jack Krupansky -Original Message- From: Román González Sent: Monday, May 5, 2014 7:00 AM To: solr-user@lucene.apache.org Subject: Wildcard malfunctioning Hi all! Sorry in advance if this question was posted but I were unable to find it with search engines

Re: Explain Solr Query Execution

2014-05-04 Thread Jack Krupansky
ntity in your request. Maybe you should start by telling us what you are trying to achieve, in plain English. -- Jack Krupansky -Original Message- From: nativecoder Sent: Sunday, May 4, 2014 2:20 PM To: solr-user@lucene.apache.org Subject: Explain Solr Query Execution How will a

Re: Stemming not working with wildcard search

2014-04-28 Thread Jack Krupansky
Wildcards and stemming are incompatible at query time - you need to manually stem the term before applying your wildcard. Wildcards are not supported in quoted phrases. They will be treated as punctuation, and ignored by the standard tokenizer or the word delimiter filter. -- Jack Krupansky

Re: Wildcard search not working with search term having special characters and digits

2014-04-28 Thread Jack Krupansky
analyzed at index time for wildcard to work. Ditto is your filed type uses the word delimiter filter with the split digits option enabled - the alpha and numeric portions will generate separate terms - and cause a wildcard to fail. -- Jack Krupansky -Original Message- From: Geepalem

Re: How can I convert xml message for updating a Solr index to a javabin file

2014-04-27 Thread Jack Krupansky
Look at the SolrJ source code and doc. JavaBin is more of a protocol than a file format. -- Jack Krupansky -Original Message- From: Elran Dvir Sent: Sunday, April 27, 2014 2:16 AM To: solr-user@lucene.apache.org Subject: RE: How can I convert xml message for updating a Solr index to

Re: get term frequency, just only keywords search

2014-04-26 Thread Jack Krupansky
simply want to boost documents containing a phrase - just use the pf, pf2, and pf3 parameters of edsimax or explicitly boost the phrase, such as "research development"^20. -- Jack Krupansky -Original Message- From: ksmith Sent: Saturday, April 26, 2014 5:38 AM To: solr-user@luce

Re: Not allowing exact match with WordDelimiterFilterFactory

2014-04-25 Thread Jack Krupansky
quot; for your text field type. -- Jack Krupansky -Original Message- From: Kashish Sent: Friday, April 25, 2014 2:49 PM To: solr-user@lucene.apache.org Subject: Not allowing exact match with WordDelimiterFilterFactory Hi, I am having some problem with WordDelimiterFilte

Re: TB scale

2014-04-25 Thread Jack Krupansky
? -- Jack Krupansky -Original Message- From: Ed Smiley Sent: Friday, April 25, 2014 3:48 PM To: solr-user@lucene.apache.org Subject: TB scale Anyone with experience, suggestions or lessons learned in the 10 -100 TB scale they'd like to share? Researching optimum design for a Solr

Re: Search for a mask that matches the requested string

2014-04-25 Thread Jack Krupansky
No, neither Lucene nor Solr provide a "mask match" feature. You could write custom code to emulate such a feature. Elasticsearch appears to have done that with its "percolate" feature: http://www.elasticsearch.org/guide/en/elasticsearch/reference/current/search-per

Re: dynamic field assignments

2014-04-25 Thread Jack Krupansky
these provide the tools to dynamically add fields to a schema. See: https://cwiki.apache.org/confluence/display/solr/Schemaless+Mode Dynamic field are a very powerful feature of Solr, but please don't treat them as a panacea for weak data modeling. Use them only in moderation. -- Jack

Re: dynamic field assignments

2014-04-25 Thread Jack Krupansky
ist, but it is not a substitute for professional review. -- Jack Krupansky -Original Message- From: John Thorhauer Sent: Friday, April 25, 2014 7:10 AM To: solr-user@lucene.apache.org Subject: dynamic field assignments I have a scenario where I would like dynamically assign incoming do

Re: While Indexing in Solr getting "SolrException: Invalid Date String:'12/20/13'

2014-04-21 Thread Jack Krupansky
/ParseDateFieldUpdateProcessorFactory.html There are some examples in my Solr 4.x Deep Dive e-book. -- Jack Krupansky -Original Message- From: Niranjan Sent: Monday, April 21, 2014 12:01 PM To: solr-user@lucene.apache.org Subject: While Indexing in Solr getting "SolrException: Invalid Date String:

Re: space between search terms

2014-04-18 Thread Jack Krupansky
. The simplest approach for now is to do the query-time synonym expansion in your app layer as a preprocessor. -- Jack Krupansky -Original Message- From: Ahmet Arslan Sent: Friday, April 18, 2014 7:38 PM To: solr-user@lucene.apache.org Subject: Re: space between search terms Hi Jack, I

Re: space between search terms

2014-04-18 Thread Jack Krupansky
To resolve that, use a sloppy phrase: q="indiranagar xyz"~1 Or, set qs=1 for the edismax query parser. -- Jack Krupansky -Original Message- From: kumar Sent: Friday, April 18, 2014 6:34 AM To: solr-user@lucene.apache.org Subject: space between search terms Hi, I Have a fiel

Re: Having trouble with German compound words in Solr 4.7

2014-04-18 Thread Jack Krupansky
Make sure your field type has the autoGeneratePhraseQueries="true" attribute (default is false). q.op only applies to explicit terms, not to terms which decompose into multiple terms. Confusing? Yes! -- Jack Krupansky -Original Message- From: Alistair Sent: Friday, April

Re: WordDelimiterFilterFactory and StandardTokenizer

2014-04-16 Thread Jack Krupansky
Typically the white space tokenizer is the best choice when the word delimiter filter will be used. -- Jack Krupansky -Original Message- From: Shawn Heisey Sent: Wednesday, April 16, 2014 11:03 PM To: solr-user@lucene.apache.org Subject: Re: WordDelimiterFilterFactory and

Re: Show the score in the search result

2014-04-16 Thread Jack Krupansky
And try it with wt=xml ... maybe there's something odd with JSON. And also with echoParams=all so we can be sure what's really passed. I suppose if you had an "invariant" for "fl", then fl would be ignored, but... that would be a less likely scenario. -- Jack Kr

Re: Show the score in the search result

2014-04-16 Thread Jack Krupansky
Also, "*:*" is a constant score query, so the score will always be 1.0. Not a terribly good example to request the score. Please provide the Solr query response, with the debug=true parameter so we can see for ourselves that no score is returned. -- Jack Krupansky -Origin

Re: Transformation on a numeric field

2014-04-15 Thread Jack Krupansky
this processor as well. We could also write a generic script that takes a source and destination field name and then does a specified operation on it, like add an offset or multiple by a scale factor. -- Jack Krupansky -Original Message- From: Jean-Sebastien Vachon Sent: Tuesday

Re: What is Overseer?

2014-04-15 Thread Jack Krupansky
I should have suggested three levels in my question: 1) important to average users, 2) expert-only, and 3) internal implementation detail. Yes, expert-only does have a place, but it is good to mark features as such. -- Jack Krupansky -Original Message- From: Chris Hostetter Sent

Re: Fetching document by comparing date to today date

2014-04-12 Thread Jack Krupansky
therwise URL parsing will treat it as a space. See: https://cwiki.apache.org/confluence/display/solr/Working+with+Dates -- Jack Krupansky -Original Message- From: Darniz Sent: Saturday, April 12, 2014 4:33 PM To: solr-user@lucene.apache.org Subject: Fetching document by comparing date

Re: stucked with log4j configuration

2014-04-12 Thread Jack Krupansky
Consider the Heliosearch distribution of Solr (HDS) - it comes pre-configured for Tomcat: http://heliosearch.com/heliosearch-distribution-for-solr/ -- Jack Krupansky -Original Message- From: Aman Tandon Sent: Saturday, April 12, 2014 10:16 AM To: solr-user@lucene.apache.org Subject

Re: Search a list of words and returned order

2014-04-11 Thread Jack Krupansky
all terms and has a significant boost: q=see spot run (+see +spot +run)^10 -- Jack Krupansky -Original Message- From: Croci Francesco Luigi (ID SWS) Sent: Friday, April 11, 2014 9:47 AM To: 'solr-user@lucene.apache.org' Subject: Search a list of words and returned order

Re: Pushing content to Solr from Nutch

2014-04-10 Thread Jack Krupansky
Does your Solr schema match the data output by nutch? It’s up to you to create a Solr schema that matches the output of nutch – read up on the nutch doc for that info. Solr doesn’t define that info, nutch does. -- Jack Krupansky From: Xavier Morera Sent: Thursday, April 10, 2014 12:58 PM To

Re: Regex For *|* at hl.regex.pattern

2014-04-07 Thread Jack Krupansky
The regex pattern should match the text of the fragment. IOW, exclude whatever delimiters are not allowed in the fragment. The default is: [-\w ,\n"']{20,200} -- Jack Krupansky -Original Message- From: Furkan KAMACI Sent: Monday, April 7, 2014 10:21 AM To:

Re: Anyone going to ApacheCon in Denver next week?

2014-04-06 Thread Jack Krupansky
ay evening. -- Jack Krupansky -Original Message- From: Siegfried Goeschl Sent: Sunday, April 6, 2014 9:12 AM To: solr-user@lucene.apache.org Subject: Re: Anyone going to ApacheCon in Denver next week? Hi folks, I’m already here and would love to join :-) Cheers, Siegfried Goeschl

Re: Rounding date indexing to minute

2014-04-06 Thread Jack Krupansky
s the Solr stateless script update processor. -- Jack Krupansky -Original Message- From: Alexandre Rafalovitch Sent: Sunday, April 6, 2014 8:29 AM To: solr-user@lucene.apache.org Subject: Re: Rounding date indexing to minute Have you tried date math formulas? Don't need to round u

Re: Solr Search on Fields name

2014-04-05 Thread Jack Krupansky
One technique is to add a copyField directive to your schema, which can use a wildcard to copy a bunch of fields to a single, combinaed field that you can query directly, such as rullAll:key. Or, consider using a multivalued field. -- Jack Krupansky -Original Message- From

Re: Query and field name with wildcard

2014-04-05 Thread Jack Krupansky
The LucidWorks Search query parser lets you use the "all" pseudo-field to search across all fields. See: http://docs.lucidworks.com/display/lweug/Field+Queries For example: q = all:some_word -- Jack Krupansky -Original Message- From: Ahmet Arslan Sent: Friday, April 4,

Re: Strange behavior of edismax and mm=0 with long queries (bug?)

2014-04-05 Thread Jack Krupansky
query term phrase matches exactly. -- Jack Krupansky From: Nils Kaiser Sent: Friday, April 4, 2014 10:10 AM To: solr-user@lucene.apache.org Subject: Strange behavior of edismax and mm=0 with long queries (bug?) Hey, I am currently using solr to recognize songs and people from a list of user

<    2   3   4   5   6   7   8   9   10   11   >