Re: Searching in stopwords

2013-07-27 Thread Jack Krupansky
Edismax should be able to handle a query consisting of only query-time stop words. What does your text field type analyzer look like? -- Jack Krupansky -Original Message- From: Rohit Kumar Sent: Saturday, July 27, 2013 9:59 PM To: solr-user@lucene.apache.org Subject: Searching

Re: Converti XML response into JavaBin encoding

2013-07-26 Thread Jack Krupansky
Look at org.apache.solr.response.BinaryResponseWriter. This serializes from a SolrQueryResponse. There is no direct XML to JavaBin serialization. It uses org.apache.solr.common.util.JavaBinCodec. This is the server side processing. For the client side, see the SolrJ package. -- Jack

Re: Difference between qf and pf parameters

2013-07-26 Thread Jack Krupansky
in the q query, but if the slop is 1, they won't necessarily be required to be adjacent in the boost. pf, pf2, and pf3 are all related. As are ps, ps2, and ps3. -- Jack Krupansky -Original Message- From: Furkan KAMACI Sent: Friday, July 26, 2013 9:11 AM To: solr-user@lucene.apache.org

Re: Exact Match

2013-07-26 Thread Jack Krupansky
Is url a string field or is it text (and hence analyzed, which typically removes all punctuation)? A URL typically needs to be stored as both string and text so that the user can do both exact match and keyword match. -- Jack Krupansky -Original Message- From: Furkan KAMACI Sent

Re: Spell check SOLR 3.6.1 not working for numbers

2013-07-26 Thread Jack Krupansky
Query converter is a plugin if the app has special requirements. -- Jack Krupansky -Original Message- From: Dyer, James Sent: Friday, July 26, 2013 11:33 AM To: solr-user@lucene.apache.org Subject: RE: Spell check SOLR 3.6.1 not working for numbers That makes sense because the Query

Re: Synonym Phrase

2013-07-26 Thread Jack Krupansky
the query at the application level and generate a Solr query that has the synonyms pre-expanded. Application preprocessing could be as simple as scanning for the synonym phrases and then adding OR terms for the synonym phrases. -- Jack Krupansky -Original Message- From: Furkan KAMACI Sent

Re: Synonym Phrase

2013-07-26 Thread Jack Krupansky
the autogeneratePhraseQuery attribute of the field type set to true.) But, I don't recall the details... and it's not the default, which maybe it should be. -- Jack Krupansky -Original Message- From: Furkan KAMACI Sent: Friday, July 26, 2013 12:18 PM To: solr-user@lucene.apache.org

Re: Exact Search Problem

2013-07-26 Thread Jack Krupansky
Separate fields for URL as string and URL as keywords makes sense. You can also use the URL classifier update processor or a regex filter to have a third field to match solely the domain name, if that is needed. -- Jack Krupansky -Original Message- From: Furkan KAMACI Sent: Friday

Re: Auto Indexing in Solr

2013-07-25 Thread Jack Krupansky
-- Jack Krupansky -Original Message- From: archit2112 Sent: Thursday, July 25, 2013 2:12 AM To: solr-user@lucene.apache.org Subject: Auto Indexing in Solr Hi Im using Solr 4's Data Import Utility to index Oracle 10g XE database. Im using full imports as well as delta imports. I want

Re: maximum number of documents per shard?

2013-07-25 Thread Jack Krupansky
alert, more careful with your hardware selection and network design, etc. -- Jack Krupansky -Original Message- From: Nicole Lacoste Sent: Thursday, July 25, 2013 4:14 AM To: solr-user@lucene.apache.org Subject: Re: maximum number of documents per shard? Is there a limit on the number

Re: SolrCloud commit process is too time consuming, even if documents are light

2013-07-25 Thread Jack Krupansky
. -- Jack Krupansky -Original Message- From: Radu Ghita Sent: Thursday, July 25, 2013 8:20 AM To: solr-user@lucene.apache.org Subject: SolrCloud commit process is too time consuming, even if documents are light Hi, We are having a client with business model that requires indexing each

Re: Solr 4.2.1 limit on number of rows or number of hits per shard?

2013-07-25 Thread Jack Krupansky
talk of supporting streaming, which presumably would allow access to all results, but chunked/paged in some way. -- Jack Krupansky -Original Message- From: Tom Burton-West Sent: Thursday, July 25, 2013 1:39 PM To: solr-user@lucene.apache.org Subject: Solr 4.2.1 limit on number of rows

Re: Wildcard matching of dynamic fields

2013-07-25 Thread Jack Krupansky
patterns or lists of field names. (I have more examples in my book.) -- Jack Krupansky -Original Message- From: Artem Karpenko Sent: Thursday, July 25, 2013 11:05 AM To: solr-user@lucene.apache.org Subject: Wildcard matching of dynamic fields Hi, given a dynamic field dynamicField name

Re: Solr Index Files in a Directories

2013-07-25 Thread Jack Krupansky
-- Jack Krupansky -Original Message- From: Rajesh Jain Sent: Thursday, July 25, 2013 3:57 PM To: solr-user@lucene.apache.org Subject: Solr Index Files in a Directories I have flume sink directory where new files are being written periodically. How can I instruct solr to index the files

Re: how to get unique latest results from solr

2013-07-24 Thread Jack Krupansky
Do your time range query, sort by the time field as descending, and take the first result. -- Jack Krupansky -Original Message- From: Alok Bhandari Sent: Wednesday, July 24, 2013 9:08 AM To: solr-user@lucene.apache.org Subject: how to get unique latest results from solr Hello All

Re: how to get unique latest results from solr

2013-07-24 Thread Jack Krupansky
In that case, the answer is that no, Solr does not have such a feature. You could simulate it by doing a separate query (using the method I suggested) for each of the 10 employees, one at a time. -- Jack Krupansky -Original Message- From: Alok Bhandari Sent: Wednesday, July 24

Re: Usage Of Real Time Get Handler Of Solr

2013-07-24 Thread Jack Krupansky
Details here: http://wiki.apache.org/solr/RealTimeGet -- Jack Krupansky -Original Message- From: Furkan KAMACI Sent: Wednesday, July 24, 2013 5:07 AM To: solr-user@lucene.apache.org Subject: Usage Of Real Time Get Handler Of Solr Hi; There is a real time get handler at Solr

Re: Matched words from document - Stemmed and Synonyms

2013-07-24 Thread Jack Krupansky
The debugQuery=true parameter will give you an explain section that details what terms matched in each document. There is an XML version as well (debug.explain.structured). Unfortunately, these are the analyzed (stemmed, lower case, synonyms expanded) terms. Pick your poison! -- Jack

Re: facet.method value per field

2013-07-24 Thread Jack Krupansky
Yes, per-field facet method is supported. -- Jack Krupansky -Original Message- From: GaneshSe Sent: Wednesday, July 24, 2013 10:20 AM To: solr-user@lucene.apache.org Subject: facet.method value per field We are trying to use facet across multiple fields, we would like to know how

Re: Unexpected search results with date range queries and OR

2013-07-24 Thread Jack Krupansky
Solr supports pure negative queries, but only at the top level. Pure negative sub-queries are not supported. To work around this limitation your need to add *:* to the sub-query: (offTime:[2013-07-24T14:35:46.319Z TO *]) OR (*:* NOT offTime:[* TO *]) -- Jack Krupansky -Original Message

Re: filter query result by user

2013-07-23 Thread Jack Krupansky
There is no such thing as a qf filter - qf is simply a list of names of fields to search for the terms from the query, q, as well as boost factors. Filtering is done with filter queries - fq. -- Jack Krupansky -Original Message- From: Mysurf Mail Sent: Tuesday, July 23, 2013 9:39 AM

Re: Document Similarity Algorithm at Solr/Lucene

2013-07-23 Thread Jack Krupansky
that the top results will be more relevant. -- Jack Krupansky -Original Message- From: Furkan KAMACI Sent: Tuesday, July 23, 2013 6:16 AM To: solr-user@lucene.apache.org Subject: Re: Document Similarity Algorithm at Solr/Lucene Actually I need a specialized algorithm. I want to use

Re: deserializing highlighting json result

2013-07-23 Thread Jack Krupansky
The JSON keys within the highlighting object are the document IDs, and then the keys within those objects are the highlighted field names. Again, I repeat my question: Exactly why is it difficult to deserialize? Seems simple enough. -- Jack Krupansky -Original Message- From: Mysurf

Re: how number of indexed fields effect performance

2013-07-23 Thread Jack Krupansky
that keeps events from happening all at once. Lately, it doesn't seem to be working. (Anonymous - via GTD book) On Tue, Jul 23, 2013 at 12:36 AM, Jack Krupansky j...@basetechnology.comwrote: After restarting Solr and doing a couple of queries to warm the caches, are queries already slow

Re: WikipediaTokenizer for Removing Unnecesary Parts

2013-07-23 Thread Jack Krupansky
Are you actually seeing that output from the WikipediaTokenizerFactory?? Really? Even if you use the Solr Admin UI analysis page? You should just see the text tokens plus the URLs for links. -- Jack Krupansky -Original Message- From: Furkan KAMACI Sent: Tuesday, July 23, 2013 10:53

Re: maximum number of documents per shard?

2013-07-23 Thread Jack Krupansky
performance . There is no hard limit besides that 2.1 billion Lucene limit, but... performance will vary. -- Jack Krupansky -Original Message- From: Ali, Saqib Sent: Tuesday, July 23, 2013 6:18 PM To: solr-user@lucene.apache.org Subject: maximum number of documents per shard? still 2.1

Re: Regex in Stopword.xml

2013-07-22 Thread Jack Krupansky
.TXT file. In any case, Solr and Lucene do not support stop words that are regular expressions, although a regex filter can simulate them to a limited degree. -- Jack Krupansky -Original Message- From: Scatman Sent: Monday, July 22, 2013 7:48 AM To: solr-user@lucene.apache.org Subject

Re: how to improve (keyword) relevance?

2013-07-22 Thread Jack Krupansky
Could you please be more specific about the relevancy problem you are trying to solve? -- Jack Krupansky -Original Message- From: eShard Sent: Monday, July 22, 2013 9:57 AM To: solr-user@lucene.apache.org Subject: how to improve (keyword) relevance? Good morning, I'm currently

Re: deserializing highlighting json result

2013-07-22 Thread Jack Krupansky
Exactly why is it difficult to deserialize? Seems simple enough. -- Jack Krupansky -Original Message- From: Mysurf Mail Sent: Monday, July 22, 2013 11:14 AM To: solr-user@lucene.apache.org Subject: deserializing highlighting json result When I request a json result I get

Re: how to improve (keyword) relevance?

2013-07-22 Thread Jack Krupansky
- qualification only applies to a single term, but you can use parentheses: q=text%3A(test+pdf) -- Jack Krupansky -Original Message- From: eShard Sent: Monday, July 22, 2013 12:34 PM To: solr-user@lucene.apache.org Subject: Re: how to improve (keyword) relevance? Sure, let's say

Re: how number of indexed fields effect performance

2013-07-22 Thread Jack Krupansky
Was all of this running fine previously and only started running slow recently, or is this your first measurement? Are very simple queries (single keyword, no filters or facets or sorting or anything else, and returning only a few fields) working reasonably well? -- Jack Krupansky

Re: /update/extract error

2013-07-22 Thread Jack Krupansky
You need a dynamic field pattern for ignored_* to ignore unmapped metadata. -- Jack Krupansky -Original Message- From: franagan Sent: Monday, July 22, 2013 5:14 PM To: solr-user@lucene.apache.org Subject: /update/extract error Hi all, im testing solrcloud (version 4.3.1) with 2

Re: Question about field boost

2013-07-22 Thread Jack Krupansky
-title fields. -- Jack Krupansky -Original Message- From: Joe Zhang Sent: Monday, July 22, 2013 11:06 PM To: solr-user@lucene.apache.org Subject: Question about field boost Dear Solr experts: Here is my query: defType=dismaxq=term1+term2qf=title^100 content Apparently (at least I thought

Re: how number of indexed fields effect performance

2013-07-22 Thread Jack Krupansky
processing are taking the most time. Also check whether the reported QTime seems to match actual wall clock time; sometimes formatting of the results and network transfer time can dwarf actual query time. How many fields are you returning on a typical query? -- Jack Krupansky -Original

Re: Question about field boost

2013-07-22 Thread Jack Krupansky
That means that for that document china occurs in the title vs. snowden found in a document but not in the title. -- Jack Krupansky -Original Message- From: Joe Zhang Sent: Tuesday, July 23, 2013 12:52 AM To: solr-user@lucene.apache.org Subject: Re: Question about field boost Is my

Re: Order by an expression in Solr

2013-07-20 Thread Jack Krupansky
to sorting, and Solr does have a sort parameter: http://wiki.apache.org/solr/CommonQueryParameters#sort -- Jack Krupansky -Original Message- From: cmd.ares Sent: Saturday, July 20, 2013 2:51 AM To: solr-user@lucene.apache.org Subject: Order by an expression in Solr In SQL you can order

Early Access Release #3 for Solr 4.x Deep Dive book is now available for download on Lulu.com

2013-07-19 Thread Jack Krupansky
Okay, it’s hot off the e-presses: Solr 4.x Deep Dive, Early Access Release #3 is now available for purchase and download as an e-book for $9.99 on Lulu.com at: http://www.lulu.com/shop/jack-krupansky/solr-4x-deep-dive-early-access-release-1/ebook/product-21079719.html (That link says “1

Re: Date for 4.4 solr release

2013-07-19 Thread Jack Krupansky
real_soon:[NOW+3DAYS TO NOW+10DAYS] -- Jack Krupansky -Original Message- From: Jabouille Jean Charles Sent: Friday, July 19, 2013 11:10 AM To: solr-user@lucene.apache.org Subject: Date for 4.4 solr release Hi, we are currently using solr 4.2.1. There are a lot of fix in the 4.4

Re: Indexing CSV files in a Folder

2013-07-19 Thread Jack Krupansky
Read: http://wiki.apache.org/solr/UpdateCSV -- Jack Krupansky -Original Message- From: Rajesh Jain Sent: Friday, July 19, 2013 1:55 PM To: solr-user@lucene.apache.org Subject: Indexing CSV files in a Folder Hi I have flume dumping CSV files in folders and I would like Solr

Re: Collapsing similar queries

2013-07-19 Thread Jack Krupansky
distinct groups there. If you have a specific, narrow domain in mind, a thesaurus of concepts and synonyms for that domain would help you a lot. -- Jack Krupansky -Original Message- From: Otis Gospodnetic Sent: Friday, July 19, 2013 12:33 PM To: solr-user@lucene.apache.org Subject: Collapsing

Re: The way edismax parses colon seems weird

2013-07-19 Thread Jack Krupansky
What field type analyzer and tokenizer are you using, and what does a sample of the input data look like? Generally, a single backslash I all that is needed for escaping. And, escaping is not needed within a quoted phrase, except for quotes and literal backslashes. -- Jack Krupansky

Re: The way edismax parses colon seems weird

2013-07-19 Thread Jack Krupansky
Very good chance that is it. -- Jack Krupansky -Original Message- From: Alexandre Rafalovitch Sent: Friday, July 19, 2013 7:16 PM To: solr-user@lucene.apache.org Subject: Re: The way edismax parses colon seems weird Could this be related: https://issues.apache.org/jira/browse/SOLR

Re: The way edismax parses colon seems weird

2013-07-19 Thread Jack Krupansky
As a workaround, enclose the term in quotes, without the escaping: http://localhost:8080/solr/select?q=12:34defType=edismaxdebug=queryqf=content -- Jack Krupansky -Original Message- From: jefferyyuan Sent: Friday, July 19, 2013 7:09 PM To: solr-user@lucene.apache.org Subject: Re: The way

Re: Doc's FunctionQuery result field in my custom SearchComponent class ?

2013-07-18 Thread Jack Krupansky
As detailed in previous email, termfreq is not a field - it is a transformer or function. Technically, it is actually a ValueSource. If you look at the TextResponseWriter.writeVal method you can see you it kicks off the execution of transformers for writing documents. -- Jack Krupansky

Re: Custom RequestHandlerBase XML Response Issue

2013-07-18 Thread Jack Krupansky
It would probably be better to integrate the responses (document lists.) Solr response writers do a lot of special processing of the response data, so you can't just throw random objects into the response. You may need to explain your use case a little more clearly. -- Jack Krupansky

Re: Getting a large number of documents by id

2013-07-18 Thread Jack Krupansky
). Check it out: http://wiki.apache.org/solr/RealTimeGet -- Jack Krupansky -Original Message- From: Brian Hurt Sent: Thursday, July 18, 2013 10:46 AM To: solr-user@lucene.apache.org Subject: Getting a large number of documents by id I have a situation which is common in our current use

Re: Search with punctuations

2013-07-17 Thread Jack Krupansky
with the Edge n-gram token filter with a min and max of 3 so that It will index INTERNATIONAL as itself plus INT. And then maybe add a regex char filter to combine INT'L into INTL. -- Jack Krupansky -Original Message- From: kobe.free.wo...@gmail.com Sent: Wednesday, July 17, 2013 8:09 AM

Re: Why Sort Doesn't Work?

2013-07-17 Thread Jack Krupansky
In general, sorting doesn't work well for multivalued and tokenized fields. You need to copy your tokenized url to a utl_str string field and then sort that field. -- Jack Krupansky -Original Message- From: Furkan KAMACI Sent: Wednesday, July 17, 2013 5:54 AM To: solr-user

Re: Searching w/explicit Multi-Word Synonym Expansion

2013-07-17 Thread Jack Krupansky
future, not a here and now. Maybe in the 5.0 timeframe? I don't want anyone to get the impression that there are off-the-shelf patches that completely solve the synonym phrase problem. Yes, progress is being made, but we're not there yet. -- Jack Krupansky -Original Message- From

Re: How can I learn the total count of how many documents indexed and how many documents updated?

2013-07-17 Thread Jack Krupansky
I don't think that breakdown is readily available from Solr. Sounds like a good Jira request for improvement in the response. -- Jack Krupansky -Original Message- From: Furkan KAMACI Sent: Wednesday, July 17, 2013 10:06 AM To: solr-user@lucene.apache.org Subject: How can I learn

Re: Searching w/explicit Multi-Word Synonym Expansion

2013-07-17 Thread Jack Krupansky
I would also note that the LucidWorks Search query parser implements query-time synonym phrases. I don't know if anybody has anything better than that. Unfortunately, that is proprietary and is kind of a workaround for current Lucene/Solr limitations than a long-term solution. -- Jack

Re: How can I learn the total count of how many documents indexed and how many documents updated?

2013-07-17 Thread Jack Krupansky
counts. -- Jack Krupansky -Original Message- From: Furkan KAMACI Sent: Wednesday, July 17, 2013 10:55 AM To: solr-user@lucene.apache.org Subject: Re: How can I learn the total count of how many documents indexed and how many documents updated? I will open a Jira for it and apply a patch

Re: Searching w/explicit Multi-Word Synonym Expansion

2013-07-17 Thread Jack Krupansky
Search for query-time phrase synonyms, off-the-shelf, today, no patches required.) -- Jack Krupansky -Original Message- From: Roman Chyla Sent: Wednesday, July 17, 2013 11:44 AM To: solr-user@lucene.apache.org Subject: Re: Searching w/explicit Multi-Word Synonym Expansion OK, let's do

Re: Searching w/explicit Multi-Word Synonym Expansion

2013-07-17 Thread Jack Krupansky
as well. I didn't mean to discourage or denigrate the later, just to highlight that doing custom code is not the same as solutions being available off the shelf. -- Jack Krupansky -Original Message- From: Roman Chyla Sent: Wednesday, July 17, 2013 12:13 PM To: solr-user@lucene.apache.org

Re: Searching w/explicit Multi-Word Synonym Expansion

2013-07-17 Thread Jack Krupansky
phrase.) -- Jack Krupansky -Original Message- From: dmarini Sent: Wednesday, July 17, 2013 2:03 PM To: solr-user@lucene.apache.org Subject: Re: Searching w/explicit Multi-Word Synonym Expansion Roman, As a developer, I understand where you are coming from. My issue is that I specialize

Re: Range query on a substring.

2013-07-16 Thread Jack Krupansky
as a range query on a substring in Solr or Lucene. -- Jack Krupansky -Original Message- From: Marcin Rzewucki Sent: Tuesday, July 16, 2013 5:13 AM To: solr-user@lucene.apache.org Subject: Re: Range query on a substring. By multivalued I meant an array of values. For example: arr name

Re: Doc's FunctionQuery result field in my custom SearchComponent class ?

2013-07-16 Thread Jack Krupansky
, and then a call to DocTransformers.transform() in each response writer will evaluate the embedded function queries and insert their values in the results as they are being written. -- Jack Krupansky -Original Message- From: Tony Mullins Sent: Tuesday, July 16, 2013 1:37 AM To: solr-user

Re: Are analysers applied to each value in a multi-valued field separately?

2013-07-16 Thread Jack Krupansky
the positionIncrementGap (typically 100 for text.) -- Jack Krupansky -Original Message- From: Daniel Collins Sent: Tuesday, July 16, 2013 8:46 AM To: solr-user@lucene.apache.org Subject: Are analysers applied to each value in a multi-valued field separately? I'm guessing the answer is yes, but here's

Re: Are analysers applied to each value in a multi-valued field separately?

2013-07-16 Thread Jack Krupansky
=text_limit4:a23; Indicating that the token position limit filter does work, but only for the relative position, making it not much more useful than the token count limit filter. Oh well. -- Jack Krupansky -Original Message- From: Daniel Collins Sent: Tuesday, July 16, 2013 12:18 PM To: solr

Re: Searching w/explicit Multi-Word Synonym Expansion

2013-07-16 Thread Jack Krupansky
. -- Jack Krupansky -Original Message- From: dmarini Sent: Tuesday, July 16, 2013 5:23 PM To: solr-user@lucene.apache.org Subject: Searching w/explicit Multi-Word Synonym Expansion Hi Everyone, I'm using Solr (version 4.3) for the first time and through much research I got into writing

Re: Range query on a substring.

2013-07-16 Thread Jack Krupansky
values to a fixed number of digits with leading zeros, and then they would be properly ordered. But, I don't think we have a token filter that can do that, although I imagine that a new one could be proposed. -- Jack Krupansky -Original Message- From: Ahmet Arslan Sent: Tuesday, July 16

Re: Running Solr in a cluster - high availability only

2013-07-15 Thread Jack Krupansky
need at least three zookeepers for HA. They need to be external to the cluster in production. * Load balancing - you need to do your own testing to confirm whether you need it. If so, that is outside of Solr. * SolrCloud automatically recovers nodes when they come back up. -- Jack Krupansky

Re: How to Indicate Solr That: Both Ascified and Non-Ascii versions of tokens are same?

2013-07-15 Thread Jack Krupansky
and is a pain with phrases. It is worth a Jira though. -- Jack Krupansky -Original Message- From: Furkan KAMACI Sent: Monday, July 15, 2013 9:06 AM To: solr-user@lucene.apache.org Subject: How to Indicate Solr That: Both Ascified and Non-Ascii versions of tokens are same? When I search

Re: How to Indicate Solr That: Both Ascified and Non-Ascii versions of tokens are same?

2013-07-15 Thread Jack Krupansky
, it is no different that highlighting tokens that have differences in upper and lower case. -- Jack Krupansky -Original Message- From: Jack Krupansky Sent: Monday, July 15, 2013 9:13 AM To: solr-user@lucene.apache.org Subject: Re: How to Indicate Solr That: Both Ascified and Non-Ascii

Re: How to pass null OR empty values to fq?

2013-07-15 Thread Jack Krupansky
so hard just to avoid a few conditional statements in your app layer?? -- Jack Krupansky -Original Message- From: SolrLover Sent: Monday, July 15, 2013 11:43 AM To: solr-user@lucene.apache.org Subject: How to pass null OR empty values to fq? Hi, I am trying to pass empty values to fq

Re: Different 'fl' for first X results

2013-07-15 Thread Jack Krupansky
full control in a custom script. -- Jack Krupansky -Original Message- From: Weber Sent: Monday, July 15, 2013 4:58 PM To: solr-user@lucene.apache.org Subject: Different 'fl' for first X results How to get a different field list in the first X results? For example, in the first 5

Re: Solr 4.3.1: Errors When Attempting to Index LatLon Fields

2013-07-15 Thread Jack Krupansky
Make sure that dynamicFields are within fields rather than types. Solr tends to ignore misplaced configuration elements. -- Jack Krupansky -Original Message- From: Scott Vanderbilt Sent: Monday, July 15, 2013 5:10 PM To: solr-user@lucene.apache.org Subject: Solr 4.3.1: Errors When

Re: Different 'fl' for first X results

2013-07-15 Thread Jack Krupansky
SOLR-5005 - JavaScriptRequestHandler https://issues.apache.org/jira/browse/SOLR-5005 -- Jack Krupansky -Original Message- From: Alexandre Rafalovitch Sent: Monday, July 15, 2013 6:56 PM To: solr-user@lucene.apache.org Subject: Re: Different 'fl' for first X results Is there a JIRA

Re: SolrCloud leader

2013-07-14 Thread Jack Krupansky
. Scheduling is currently outside of the scope of Solr and SolrCloud. -- Jack Krupansky -Original Message- From: kowish.adamosh Sent: Sunday, July 14, 2013 8:42 AM To: solr-user@lucene.apache.org Subject: Re: SolrCloud leader The problem is that I don't want to invoke data import on 8 server

Re: ACL implementation: Pseudo-join performance Atomic Updates

2013-07-14 Thread Jack Krupansky
is that clearly updating all documents in the index is a non-starter. -- Jack Krupansky -Original Message- From: Oleg Burlaca Sent: Sunday, July 14, 2013 11:02 AM To: solr-user@lucene.apache.org Subject: ACL implementation: Pseudo-join performance Atomic Updates Hello all, Situation: We

Re: solr autodetectparser tikaconfig dataimporter error

2013-07-14 Thread Jack Krupansky
Caused by: java.lang.NoSuchMethodError: That means you have some out of date jars or some newer jars mixed in with the old ones. -- Jack Krupansky -Original Message- From: Andreas Owen Sent: Sunday, July 14, 2013 3:07 PM To: solr-user@lucene.apache.org Subject: Re: solr

Re: SolrCloud leader

2013-07-13 Thread Jack Krupansky
it belongs and queries gets distributed to the nodes that have the data needed to satisfy the queries. -- Jack Krupansky -Original Message- From: kowish.adamosh Sent: Saturday, July 13, 2013 5:11 AM To: solr-user@lucene.apache.org Subject: SolrCloud leader Hi guys, Is it possible

Re: SolrCloud leader

2013-07-13 Thread Jack Krupansky
and that distracts attention from optimizing the full application architecture. -- Jack Krupansky -Original Message- From: Radim Kolar Sent: Saturday, July 13, 2013 10:32 AM To: solr-user@lucene.apache.org Subject: Re: SolrCloud leader Since SolrCloud is a master-free architecture, you can send

Re: SolrCloud leader

2013-07-13 Thread Jack Krupansky
. This is an interesting issue to ponder. -- Jack Krupansky -Original Message- From: kowish.adamosh Sent: Saturday, July 13, 2013 10:51 AM To: solr-user@lucene.apache.org Subject: Re: SolrCloud leader Ya :-) I need it for different reason. I have 8 server nodes and one of them has

Re: How to set a condition on the number of docs found

2013-07-12 Thread Jack Krupansky
query (including the fl parameter) if you know a term that has a 1-to-1 relationship to your query results. It is worth filing a Jira to add numfound() as a function query value source. -- Jack Krupansky -Original Message- From: Matt Lieber Sent: Friday, July 12, 2013 1:45 AM To: solr

Re: How to set a condition over stats result

2013-07-12 Thread Jack Krupansky
is not an aggregate function. Ditto for min and max. -- Jack Krupansky -Original Message- From: mihaela olteanu Sent: Friday, July 12, 2013 1:44 AM To: solr-user@lucene.apache.org Subject: Re: How to set a condition over stats result What if you perform sub(sum(myfieldvalue),100) 0 using

Re: zero-valued retrieval scores

2013-07-12 Thread Jack Krupansky
Did you put a boost of 0.0 on the documents, as opposed to the default of 1.0? x * 0.0 = 0.0 -- Jack Krupansky -Original Message- From: Joe Zhang Sent: Friday, July 12, 2013 10:31 PM To: solr-user@lucene.apache.org Subject: zero-valued retrieval scores when I search a keyword

Re: zero-valued retrieval scores

2013-07-12 Thread Jack Krupansky
For the calculation of norm, see note number 6: http://lucene.apache.org/core/4_3_0/core/org/apache/lucene/search/similarities/TFIDFSimilarity.html You would need to talk to the Nutch guys to see why THEY are setting document boost to 0.0. -- Jack Krupansky -Original Message- From

Re: amount of values in a multi value field - is denormalization always the best option?

2013-07-11 Thread Jack Krupansky
as discrete Solr documents. But, it does all depend on your particular data and particular requirements. -- Jack Krupansky -Original Message- From: Flavio Pompermaier Sent: Thursday, July 11, 2013 7:50 AM To: solr-user@lucene.apache.org Subject: Re: amount of values in a multi value field

Re: What happens in indexing request in solr cloud if Zookeepers are all dead?

2013-07-11 Thread Jack Krupansky
no nodes can be down for the cluster to do updates. -- Jack Krupansky -Original Message- From: Zhang, Lisheng Sent: Thursday, July 11, 2013 9:28 AM To: solr-user@lucene.apache.org Subject: What happens in indexing request in solr cloud if Zookeepers are all dead? Hi, In solr cloud

Re: Applying Sum on Field

2013-07-11 Thread Jack Krupansky
-- Jack Krupansky -Original Message- From: Jamshaid Ashraf Sent: Thursday, July 11, 2013 7:56 AM To: solr-user@lucene.apache.org Subject: Applying Sum on Field Hi, I'm a new solr user, I wanted to know is there any way to apply sum on a field in a result document of group query

Re: What happens in indexing request in solr cloud if Zookeepers are all dead?

2013-07-11 Thread Jack Krupansky
Sorry, no updates if no Zookeepers. There would be no way to assure that any node knows the proper configuration. Queries are a little safer using most recent configuration without zookeeper, but update consistency requires accurate configuration information. -- Jack Krupansky -Original

Re: Partial Matching in both query and field

2013-07-11 Thread Jack Krupansky
A couple of possibilities: 1. Make sure to reload the core. 2. Check that the Solr schema version is new enough to recognize autoGeneratePhraseQueries. 3. What query parser are you using? -- Jack Krupansky -Original Message- From: James Bathgate Sent: Thursday, July 11, 2013 5:26

Re: How to set a condition over stats result

2013-07-11 Thread Jack Krupansky
JavaScript logic you wanted. When we get that feature, it might be interesting to implement a variation of the standard stats component as a JavaScript script, and then people could easily hack it such as in your request. Fascinating. -- Jack Krupansky -Original Message- From: Matt

Re: Solr limitations

2013-07-10 Thread Jack Krupansky
to imply that a 96-node or 128-node cluster won't perform well. -- Jack Krupansky -Original Message- From: Ramkumar R. Aiyengar Sent: Wednesday, July 10, 2013 4:03 AM To: solr-user@lucene.apache.org Subject: Re: Solr limitations I understand, thanks. I just wanted to check in case

Re: simple date query

2013-07-10 Thread Jack Krupansky
You can't use two fields in one range query, but you can combine two range queries: startDate_tdt:[* TO NOW] AND endDate_tdt:[NOW TO *] -- Jack Krupansky -Original Message- From: Marcos Mendez Sent: Wednesday, July 10, 2013 9:31 AM To: solr-user@lucene.apache.org Subject: simple

Re: amount of values in a multi value field - is denormalization always the best option?

2013-07-10 Thread Jack Krupansky
as the full document. And, there is no way to address or synchronize individual elements of multivalued fields. Joins are great... if used in moderation. Heavy use of joins is not a great idea. -- Jack Krupansky -Original Message- From: Marcelo Elias Del Valle Sent: Wednesday, July

Re: amount of values in a multi value field - is denormalization always the best option?

2013-07-10 Thread Jack Krupansky
idea. 1. Break things down. 2. Keep things simple. 3. Join is not simple. 4. Only use non-simple features in careful moderation. There is no reasonable short cut to doing a robust data model. Shortcuts may seem enticing in the short run, but will eat you alive in the long run. -- Jack Krupansky

Re: Document count mismatch

2013-07-09 Thread Jack Krupansky
1. Try facet.missing=true to count the number of documents that do not have a value for that field. 2. Try facet.limit=n to set the number of returned facet values to a larger or smaller value than the default of 100. 3. Try reading the Faceting chapter of my book! -- Jack Krupansky

Re: Calculating Solr document score by ignoring the boost field.

2013-07-09 Thread Jack Krupansky
Simple math: x times zero equals zero. That's why the default document boost is 1.0 - score times 1.0 equals score. Any particular reason you wanted to zero out the document score from the document level? -- Jack Krupansky -Original Message- From: Tony Mullins Sent: Tuesday, July

Re: two types of answers in my query

2013-07-09 Thread Jack Krupansky
in the results. -- Jack Krupansky -Original Message- From: Mysurf Mail Sent: Tuesday, July 09, 2013 2:38 AM To: solr-user@lucene.apache.org Subject: two types of answers in my query Hi, A general question: Let's say I have Car And CarParts 1:n relation. And I have discovered

Re: Document count mismatch

2013-07-09 Thread Jack Krupansky
I don't quite follow the question. Give us an example. -- Jack Krupansky -Original Message- From: Furkan KAMACI Sent: Tuesday, July 09, 2013 9:37 AM To: solr-user@lucene.apache.org Subject: Re: Document count mismatch Ok, one more question. I have another field at my schema: *url

Re: Deleted Docs

2013-07-09 Thread Jack Krupansky
Solr (Lucene, actually) will be doing segment merge operations in the background, continually, so generally you won't need to do optimize operations. Generally, an explicit delete and a replace of an existing document are the only two ways that you would get a deleted document. -- Jack

Re: join not working with UUIDs

2013-07-09 Thread Jack Krupansky
Your join is requesting to use the join_id field (from) of documents matching the query of cor_parede:branca, but the join_id field of that document is empty. Maybe you intended to search in the other direction, like acessorio1:Teclado. -- Jack Krupansky -Original Message- From

Re: join not working with UUIDs

2013-07-09 Thread Jack Krupansky
Oops... I misread and confused your q and fq params. -- Jack Krupansky -Original Message- From: Jack Krupansky Sent: Tuesday, July 09, 2013 7:47 PM To: solr-user@lucene.apache.org Subject: Re: join not working with UUIDs Your join is requesting to use the join_id field (from

Re: Solr limitations

2013-07-08 Thread Jack Krupansky
suggested guidelines and you should do okay. -- Jack Krupansky -Original Message- From: Marcelo Elias Del Valle Sent: Monday, July 08, 2013 9:46 AM To: solr-user@lucene.apache.org Subject: Solr limitations Hello everyone, I am trying to search information about possible solr

Re: Atomic updates and indexed fields

2013-07-08 Thread Jack Krupansky
a hybrid NoSql/Solr solution such as DataStax Enterprise, where the data is persisted in Cassandra and indexed in Solr, allowing selective updates of all fields. See: http://www.datastax.com/ -- Jack Krupansky -Original Message- From: Bram Van Dam Sent: Monday, July 08, 2013 10:22 AM

Re: Surround query parser not working?

2013-07-08 Thread Jack Krupansky
Yes, you should be able to used nested query parsers to mix the queries. Solr 4.1(?) made it easier. -- Jack Krupansky -Original Message- From: Abeygunawardena, Niran Sent: Monday, July 08, 2013 7:00 AM To: solr-user@lucene.apache.org Subject: Re: Surround query parser not working

Re: Are the XML element names in schema.xml case sensitive?

2013-07-08 Thread Jack Krupansky
Nope. -- Jack Krupansky -Original Message- From: Alexandre Rafalovitch Sent: Monday, July 08, 2013 7:20 AM To: solr-user@lucene.apache.org Subject: Re: Are the XML element names in schema.xml case sensitive? But not dynamicField or any others? Regards, Alex On 7 Jul 2013 23:39

Re: Solr limitations

2013-07-08 Thread Jack Krupansky
is uncharted territory, and there is no slam-dunk solution that is guaranteed to work really well for all apps in all environments - be prepared to doing multiple Proof of Concept implementations. -- Jack Krupansky -Original Message- From: Marcelo Elias Del Valle Sent: Monday, July 08, 2013

<    7   8   9   10   11   12   13   14   15   16   >