Re: join not working with UUIDs

2013-07-09 Thread Jack Krupansky
Oops... I misread and confused your "q" and "fq" params. -- Jack Krupansky -Original Message- From: Jack Krupansky Sent: Tuesday, July 09, 2013 7:47 PM To: solr-user@lucene.apache.org Subject: Re: join not working with UUIDs Your join is requesting to use the &q

Re: join not working with UUIDs

2013-07-09 Thread Jack Krupansky
Your join is requesting to use the "join_id" field ("from") of documents matching the query of "cor_parede:branca", but the join_id field of that document is empty. Maybe you intended to search in the other direction, like "acessorio1:Teclado".

Re: Deleted Docs

2013-07-09 Thread Jack Krupansky
Solr (Lucene, actually) will be doing segment merge operations in the background, continually, so generally you won't need to do optimize operations. Generally, an explicit delete and a replace of an existing document are the only two ways that you would get a deleted document. --

Re: Document count mismatch

2013-07-09 Thread Jack Krupansky
I don't quite follow the question. Give us an example. -- Jack Krupansky -Original Message- From: Furkan KAMACI Sent: Tuesday, July 09, 2013 9:37 AM To: solr-user@lucene.apache.org Subject: Re: Document count mismatch Ok, one more question. I have another field at my schema:

Re: two types of answers in my query

2013-07-09 Thread Jack Krupansky
if a car or part was matched in the results. -- Jack Krupansky -Original Message- From: Mysurf Mail Sent: Tuesday, July 09, 2013 2:38 AM To: solr-user@lucene.apache.org Subject: two types of answers in my query Hi, A general question: Let's say I have Car And CarParts 1:n relatio

Re: Calculating Solr document score by ignoring the field.

2013-07-09 Thread Jack Krupansky
Simple math: x times zero equals zero. That's why the default document boost is 1.0 - score times 1.0 equals score. Any particular reason you wanted to zero out the document score from the document level? -- Jack Krupansky -Original Message- From: Tony Mullins Sent: Tuesday,

Re: Document count mismatch

2013-07-09 Thread Jack Krupansky
1. Try facet.missing=true to count the number of documents that do not have a value for that field. 2. Try facet.limit=n to set the number of returned facet values to a larger or smaller value than the default of 100. 3. Try reading the Faceting chapter of my book! -- Jack Krupansky

Re: Indexing fails for docs with high Latin1 chars

2013-07-08 Thread Jack Krupansky
Right, the charset must agree with the charset of the program that wrote the file. -- Jack Krupansky -Original Message- From: Shawn Heisey Sent: Monday, July 08, 2013 7:43 PM To: solr-user@lucene.apache.org Subject: Re: Indexing fails for docs with high Latin1 chars On 7/8/2013 4:43

Re: Indexing fails for docs with high Latin1 chars

2013-07-08 Thread Jack Krupansky
Maybe you need to add "; charset=UTF-8" to your Content-type: curl "http://localhost:8080/solr/update/?commit=true&stream.file=c:/solr/tml/exampledocs/57917486.xml&stream.contentType=application/xml; charset=UTF-8” -- Jack Krupansky -Original Message- F

Re: solr way to exclude terms

2013-07-08 Thread Jack Krupansky
What is the actual use case? In other words, why is the list so long? Maybe exclusion by keyword is not the proper solution... but we need what the underlying problem is. Is this for document access control? -- Jack Krupansky -Original Message- From: Angela Zhu Sent: Monday, July

Re: Solr limitations

2013-07-08 Thread Jack Krupansky
t/schemaless mode is uncharted territory, and there is no slam-dunk solution that is guaranteed to work really well for all apps in all environments - be prepared to doing multiple Proof of Concept implementations. -- Jack Krupansky -Original Message- From: Marcelo Elias Del Valle S

Re: Are the XML element names in schema.xml case sensitive?

2013-07-08 Thread Jack Krupansky
Nope. -- Jack Krupansky -Original Message- From: Alexandre Rafalovitch Sent: Monday, July 08, 2013 7:20 AM To: solr-user@lucene.apache.org Subject: Re: Are the XML element names in schema.xml case sensitive? But not dynamicField or any others? Regards, Alex On 7 Jul 2013 23:39

Re: Surround query parser not working?

2013-07-08 Thread Jack Krupansky
Yes, you should be able to used nested query parsers to mix the queries. Solr 4.1(?) made it easier. -- Jack Krupansky -Original Message- From: Abeygunawardena, Niran Sent: Monday, July 08, 2013 7:00 AM To: solr-user@lucene.apache.org Subject: Re: Surround query parser not working

Re: Atomic updates and indexed fields

2013-07-08 Thread Jack Krupansky
consider a hybrid NoSql/Solr solution such as DataStax Enterprise, where the data is persisted in Cassandra and indexed in Solr, allowing selective updates of all fields. See: http://www.datastax.com/ -- Jack Krupansky -Original Message- From: Bram Van Dam Sent: Monday, July 08, 2013 1

Re: Solr limitations

2013-07-08 Thread Jack Krupansky
eters, for example. Please don't go there! In short, put up a small cluster and start doing a Proof of Concept cluster. Stay within my suggested guidelines and you should do okay. -- Jack Krupansky -Original Message- From: Marcelo Elias Del Valle Sent: Monday, July 08, 2013 9

Re: TZ parameter

2013-07-07 Thread Jack Krupansky
The TZ parameter only applies to "date math" rounding. You haven't used any date math. What are you actually trying to accomplish? Would [* TO NOW] cover your cases? Or [* TO NOW/DAY+1DAY]? -- Jack Krupansky -Original Message- From: Matt Mitchell Sent: Sunday, July

Re: Are the XML element names in schema.xml case sensitive?

2013-07-07 Thread Jack Krupansky
Yes, the XML element names (tags) and attribute names are all case sensitive, but... Solr has a special hack for "fieldtype" as well as "fieldType". -- Jack Krupansky -Original Message- From: Alexandre Rafalovitch Sent: Sunday, July 07, 2013 11:25 PM To: solr-us

Re: Why shouldn't lang-id component work at query-time?

2013-07-07 Thread Jack Krupansky
Default Lucen/Solr searching doesn't support "qf" or a list of fields to search, so you can't use that technique there. -- Jack Krupansky -Original Message- From: adfel70 Sent: Sunday, July 07, 2013 1:54 PM To: solr-user@lucene.apache.org Subject: Re: Why shouldn

Re: Why shouldn't lang-id component work at query-time?

2013-07-07 Thread Jack Krupansky
e for each language is probably the best you're going to do without doing something super-sophisticated. -- Jack Krupansky -Original Message- From: adfel70 Sent: Sunday, July 07, 2013 1:32 PM To: solr-user@lucene.apache.org Subject: Why shouldn't lang-id component work at q

Re: searching both english and japanese

2013-07-07 Thread Jack Krupansky
Just use the edismax query parser and set "qf" to all of the searchable first, in both languages. And then just use to copy from the main field(s) to the parallel, alternate language fields. -- Jack Krupansky -Original Message- From: Shalom Ben-Zvi Kazaz Sent: Sunday, Jul

Re: Is it possible to find a leader from a list of cores in solr via java code

2013-07-06 Thread Jack Krupansky
remature optimization" are very poor tools to lead a system design. Focus the energy on the data modeling and the overall application design. -- Jack Krupansky -Original Message- From: vicky desai Sent: Sunday, July 07, 2013 1:17 AM To: solr-user@lucene.apache.org Subject: Re: Is it

Re: Changing the number of shards?

2013-07-06 Thread Jack Krupansky
ific primary shards. In particular, increasing the number of primary shards (number of partitions of key values.) Now, I am certainly not an expert in ES and my elaboration may still be flawed, but I am endeavoring to ascertain what the facts are, either from the doc or from others who are m

Changing the number of shards?

2013-07-05 Thread Jack Krupansky
primary fails.”) See: http://www.elasticsearch.org/guide/reference/glossary/ My understanding is that with Solr you can do things like split shards and change the number of shards per node. Is this an advantage that ES does not offer? Any ES experts want to comment? -- Jack Krupansky

Re: [Announcement] Norch- a search engine for node.js

2013-07-05 Thread Jack Krupansky
And... is is based on Lucene/Solr? -- Jack Krupansky -Original Message- From: Ali, Saqib Sent: Friday, July 05, 2013 6:09 PM To: solr-user@lucene.apache.org Subject: Re: [Announcement] Norch- a search engine for node.js Very interesting. What is the upper limit on the number of

Re: Sorting

2013-07-05 Thread Jack Krupansky
And don't forget to test with sortable DocValues. I mean, sorting (and faceting) was one of the main motivations for DocValues. -- Jack Krupansky -Original Message- From: Otis Gospodnetic Sent: Friday, July 05, 2013 3:42 PM To: solr-user@lucene.apache.org Subject: Re: Sortin

Re: Sending Documents via SolrServer as MapReduce Jobs at Solrj

2013-07-05 Thread Jack Krupansky
easier to use! -- Jack Krupansky -Original Message- From: Walter Underwood Sent: Friday, July 05, 2013 12:11 PM To: solr-user@lucene.apache.org Subject: Re: Sending Documents via SolrServer as MapReduce Jobs at Solrj Why is it better to require another large software system (Hadoop

Early Access Release #2 for Solr 4.x Deep Dive book is now available for download on Lulu.com

2013-07-04 Thread Jack Krupansky
Okay, it’s hot off the e-presses: Solr 4.x Deep Dive, Early Access Release #2 is now available for purchase and download as an e-book for $9.99 on Lulu.com at: http://www.lulu.com/shop/jack-krupansky/solr-4x-deep-dive-early-access-release-1/ebook/product-21079719.html (That link says “1”, but

Re: Auto Soft commit not working !!!

2013-07-04 Thread Jack Krupansky
1. Do you have an update processor chain that doesn't have RunUpdate in it? 2. Is the solrconfig directive missing? 3. Is _version_ missing from your schema? -- Jack Krupansky -Original Message- From: Rohit Kumar Sent: Thursday, July 04, 2013 9:22 PM To: solr-user@lucene.apach

Re: Total Term Frequency per ResultSet in Solr 4.3 ?

2013-07-04 Thread Jack Krupansky
expose whatever alternative metrics you wish. But, before you embark on such a venture, be aware that the performance of such an alternative relevance model might not be as appealing as you might want. You'll have to do a proof of concept to see how well things actually work out. -- Jac

Re: Find related words

2013-07-04 Thread Jack Krupansky
You can take a look at the MoreLikeThis/Find Similar feature. That gives you an approximation, but using documents rather than discrete terms. You would have to write a custom component of your own based on logic from MLT. -- Jack Krupansky -Original Message- From: Dotan Cohen Sent

Re: Total Term Frequency per ResultSet in Solr 4.3 ?

2013-07-04 Thread Jack Krupansky
ew feature/improvement. -- Jack Krupansky -Original Message- From: Tony Mullins Sent: Thursday, July 04, 2013 9:45 AM To: solr-user@lucene.apache.org Subject: Total Term Frequency per ResultSet in Solr 4.3 ? Hi , I have lots of crawled data, indexed in my Solr (4.3.0) and lets say user

Re: omitTermFreqAndPositions="true" in easy English, please?

2013-07-03 Thread Jack Krupansky
Oops... I wasn't reading carefully enough - frequencies and positions only relate to tokenized fields (text) - not string fields. That doesn't impact your ability to do AND and OR of discrete string terms of a multivalued string field. -- Jack Krupansky -Original Message-

Re: omitTermFreqAndPositions="true" in easy English, please?

2013-07-03 Thread Jack Krupansky
Yes, but it is simply doing an AND or OR of the individual terms - no phrases or implied ordering of the terms. -- Jack Krupansky -Original Message- From: Ali, Saqib Sent: Thursday, July 04, 2013 12:52 AM To: solr-user@lucene.apache.org Subject: Re: omitTermFreqAndPositions="tru

Re: omitTermFreqAndPositions="true" in easy English, please?

2013-07-03 Thread Jack Krupansky
phrases, and there is no scoring difference whether a term occurs once or a thousand times in that field for each document. A lot less information needs to be stored in the index. -- Jack Krupansky -Original Message- From: Ali, Saqib Sent: Wednesday, July 03, 2013 10:31 PM To: solr-user

Re: unused fields in Solr schema.xml increase the index size

2013-07-03 Thread Jack Krupansky
view differences. -- Jack Krupansky -Original Message- From: Ali, Saqib Sent: Wednesday, July 03, 2013 11:55 AM To: solr-user@lucene.apache.org Subject: unused fields in Solr schema.xml increase the index size Hello all, Do unused fields in Solr Schem.xml increase the size of the

Re: Search for string ending with question mark

2013-07-03 Thread Jack Krupansky
nce it is a wildcard character. Yes, string_field:*\? should match any string field that ends with a "?". -- Jack Krupansky -Original Message- From: JZ Sent: Wednesday, July 03, 2013 10:59 AM To: solr-user@lucene.apache.org Subject: Search for string ending with question mark

Re: Use case indexed="false" stored="false" field

2013-07-03 Thread Jack Krupansky
to undefined fields. In other words, you are telling Solr that it is okay to have inputs for these fields - simply ignore them. But... you could still have update processors that look at the values of "ignored" fields and maybe assigns them to other, non-ignored fields. -- Jack

Re: Newbie SolR - Need advice

2013-07-03 Thread Jack Krupansky
Design your own application layer for both indexing and query that knows about both SQL and Solr. Give it a REST API and then your client applications can talk to your REST API and not have to care about the details of Solr or SQL. That's the best starting point. -- Jack Krup

Re: Partial Matching in both query and field

2013-07-02 Thread Jack Krupansky
Ahhh... you put autoGeneratePhraseQueries="false" on the field - but it needs to be on the field type. You can see from the parsed query that it generated the phrase. -- Jack Krupansky -Original Message- From: James Bathgate Sent: Tuesday, July 02, 2013 5:35 PM To:

Re: Partial Matching in both query and field

2013-07-02 Thread Jack Krupansky
You will need to set q.op to "OR", and... use a field type that has the autoGeneratePhraseQueries attribute set to "false". -- Jack Krupansky -Original Message- From: James Bathgate Sent: Tuesday, July 02, 2013 5:10 PM To: solr-user@lucene.apache.org Subject: Part

Re: How to show just the parent domains from results in Solr

2013-07-02 Thread Jack Krupansky
tom script with the Stateless Script update processor. My book has examples for URL Classify. -- Jack Krupansky -Original Message- From: A Geek Sent: Tuesday, July 02, 2013 1:47 PM To: solr user Subject: How to show just the parent domains from results in Solr hi All, I've indexed

Re: How to query Solr for empty field or specific value

2013-07-02 Thread Jack Krupansky
*&fq=((*:* -color.not_null:[* TO *]) OR color:blue) -- Jack Krupansky -Original Message- From: Van Tassell, Kristian Sent: Tuesday, July 02, 2013 3:47 PM To: solr-user@lucene.apache.org Subject: How to query Solr for empty field or specific value Hello, I'm using Solr 4.2 and am trying to get a s

Re: Newbie SolR - Need advice

2013-07-02 Thread Jack Krupansky
Consider DataStax Enterprise - it combines Cassandra for NoSql data storage with Solr for indexing - fully integrated. http://www.datastax.com/ -- Jack Krupansky -Original Message- From: fabio1605 Sent: Tuesday, July 02, 2013 12:44 PM To: solr-user@lucene.apache.org Subject: Re

Re: Newbie SolR - Need advice

2013-07-02 Thread Jack Krupansky
Start with the Solr Tutorial. http://lucene.apache.org/solr/tutorial.html -- Jack Krupansky -Original Message- From: fabio1605 Sent: Tuesday, July 02, 2013 11:16 AM To: solr-user@lucene.apache.org Subject: Newbie SolR - Need advice Hi we have a MSSQL Server which is just getting

Re: Converting nested data model to solr schema

2013-07-02 Thread Jack Krupansky
It sounds like 4.4 will have an RC next week, so the prospects for block join in 4.4 are kind of dim. I mean, such a significant feature should have more than a few days to bake before getting released. But... who knows what Yonik has planned! -- Jack Krupansky -Original Message

Re: need distance in miles not in kilometers

2013-07-02 Thread Jack Krupansky
Simply multiply by the number of miles per kilometer, 0.621371: fl=_dist_:mul(geodist(),0.621371) -- Jack Krupansky -Original Message- From: irshad siddiqui Sent: Tuesday, July 02, 2013 5:19 AM To: solr-user@lucene.apache.org Subject: need distance in miles not in kilometers Hi, I

Re: Solr 4.3 Pivot Performance Issue

2013-07-02 Thread Jack Krupansky
What is the nature of your degradation? -- Jack Krupansky -Original Message- From: solrUserJM Sent: Tuesday, July 02, 2013 4:22 AM To: solr-user@lucene.apache.org Subject: Solr 4.3 Pivot Performance Issue Hi There, I notice with the upgrade from solr 4.0 to solr 4.3 that we had a

Re: How to re-index Solr & get term frequency within documents

2013-07-01 Thread Jack Krupansky
sources. But, yeah, as Otis says, "re-index" is really just a euphemism for deleting your Solr data directory and indexing from scratch from the original data sources. -- Jack Krupansky -Original Message- From: Otis Gospodnetic Sent: Monday, July 01, 2013 2:26 PM To:

Re: are fields stored or unstored by default xml

2013-07-01 Thread Jack Krupansky
Correct - the field definitions inherit the attributes of the field type, and it is the field type that has the actual default values for indexed and stored (and other attributes.) -- Jack Krupansky -Original Message- From: Yonik Seeley Sent: Monday, July 01, 2013 3:56 PM To: solr

Re: are fields stored or unstored by default xml

2013-07-01 Thread Jack Krupansky
"stored" and "indexed" both default to "true". This is legal: This detail will be in Early Access Release #2 of my book on Friday. -- Jack Krupansky -Original Message- From: Otis Gospodnetic Sent: Monday, July 01, 2013 2:21 PM To: solr-user@lucen

Re: How to re-index Solr & get term frequency within documents

2013-07-01 Thread Jack Krupansky
You can write any function query in the field list of the "fl" parameter. Sounds like you want "termfreq": termfreq(field_arg,term) fl=id,a,b,c,termfreq(a,xyz) -- Jack Krupansky -Original Message- From: Tony Mullins Sent: Monday, July 01, 2013 10

Re: Distinct values in multivalued fields

2013-07-01 Thread Jack Krupansky
Unfortunately, update processors only "see" the new, fresh, incoming data, not any existing document data. This is a case where your best bet may be to read the document first and then merge your new value into the existing list of values. -- Jack Krupansky -Original Message-

Re: Converting nested data model to solr schema

2013-07-01 Thread Jack Krupansky
to get parent or child IDs and then do a second query filtered by those IDs. And, yes, this only approximates the full power of an SQL join - but at a tiny fraction of the cost. -- Jack Krupansky -Original Message- From: adfel70 Sent: Monday, July 01, 2013 9:56 AM To: solr-user

Re: RemoveDuplicatesTokenFilterFactory to avoid import duplicate values in multivalued field

2013-07-01 Thread Jack Krupansky
g" is inappropriate for this email list (or any email list.) -- Jack Krupansky -Original Message- From: tuedel Sent: Monday, July 01, 2013 8:15 AM To: solr-user@lucene.apache.org Subject: Re: RemoveDuplicatesTokenFilterFactory to avoid import duplicate values in multivalued field H

Re: Unique key error while indexing pdf files

2013-07-01 Thread Jack Krupansky
quot; data model - which includes what expectations you have about the unique ID/key for each document. So, for that first PDF file, what expectation (according to your data model) do you have for what its ID/key should be? -- Jack Krupansky -Original Message- From: archit2112 Sent

Re: Unique key error while indexing pdf files

2013-07-01 Thread Jack Krupansky
It all depends on your data model - tell us more about your data model. For example, how will users or applications query these documents and what will they expect to be able to do with the ID/key for the documents? How are you expecting to identify documents in your data model? -- Jack

Re: No date.gap on pivoted facets

2013-06-30 Thread Jack Krupansky
s correspond to your date gap. You can do that with an update processor, or do it before you send the data to Solr. In the next release of my book I have a script for a StatelessScriptUpdateProccessor (with examples) that supports truncation of dates to a desired resolution, copying or modifyi

Re: increase search score of certain category only for certain keyword

2013-06-29 Thread Jack Krupansky
is good for keyword search. Use the text variant in qf. -- Jack Krupansky -Original Message- From: winsu Sent: Friday, June 28, 2013 9:26 PM To: solr-user@lucene.apache.org Subject: increase search score of certain category only for certain keyword Hi, Currently i've certain sample

Re: Schema design for parent child field

2013-06-29 Thread Jack Krupansky
to simulate the effect of a simple join in a single clean query. But you can do a separate query to get parent record details. -- Jack Krupansky -Original Message- From: Sperrink Sent: Saturday, June 29, 2013 5:08 AM To: solr-user@lucene.apache.org Subject: Schema design for parent child

Re: Replicating files containing external file fields

2013-06-28 Thread Jack Krupansky
to.) Sorry, I don't have the answer to the reload question at the tip of my tongue. -- Jack Krupansky -Original Message- From: Arun Rangarajan Sent: Friday, June 28, 2013 7:42 PM To: solr-user@lucene.apache.org Subject: Re: Replicating files containing external file fields Ja

Re: change solr core schema and config via http

2013-06-28 Thread Jack Krupansky
Ah, yes, good old multi-tenant - I should have known. Yeah, the Solr API is evolving, albeit too slowly for the needs of some. -- Jack Krupansky -Original Message- From: Wu, James C. Sent: Friday, June 28, 2013 7:06 PM To: solr-user@lucene.apache.org Subject: RE: change solr core

Re: change solr core schema and config via http

2013-06-28 Thread Jack Krupansky
How could you not have ssh access to the Solr host machine? I mean, how are you managing that server, without ssh access? And if you are not managing the server, what business do you have trying to change the Solr configuration?!?!? Something fishy here! -- Jack Krupansky -Original

Re: An issue with atomic updates?

2013-06-28 Thread Jack Krupansky
Well, it is known to me and documented in my book. BTW, that field value is simply ignored. There are tons of places in Solr where undefined values or outright garbage are simply ignored, silently. Go ahead and file a Jira though. -- Jack Krupansky -Original Message- From: Sam

Re: Replicating files containing external file fields

2013-06-28 Thread Jack Krupansky
Show us your directive. Maybe there is some subtle error in the file name. -- Jack Krupansky -Original Message- From: Arun Rangarajan Sent: Friday, June 28, 2013 1:06 PM To: solr-user@lucene.apache.org Subject: Re: Replicating files containing external file fields Erick, Thx for

Re: Context search in solr

2013-06-28 Thread Jack Krupansky
. Sure, people don't like seeing the mis-matched results in the list and a larger number of results, but it's all a tradeoff to assure that the most relevant results are higher and exact matching is a little looser. -- Jack Krupansky -Original Message- From: Erick Erickson Sent:

Re: displaying one result per domain

2013-06-27 Thread Jack Krupansky
in the book. You can also use a regular expression tokenfilter to extract the host name as well. And you can use standard Solr "grouping" to group by the field containing host name. -- Jack Krupansky -Original Message- From: Wojciech Kapelinski Sent: Thursday, June 27, 20

Re: how to delete on column of a doc in solr

2013-06-27 Thread Jack Krupansky
me, and then you can update with atomic update. You may want to rethink your data model. -- Jack Krupansky -Original Message- From: anurag.jain Sent: Thursday, June 27, 2013 8:28 AM To: solr-user@lucene.apache.org Subject: how to delete on column of a doc in solr In my solr sche

Re: Solr admin search with wildcard

2013-06-27 Thread Jack Krupansky
Just from the string field to a "text" field and use standard tokenization, then you can search the text field for "youtube" or even "something" that is a component of the URL path. No wildcard required. -- Jack Krupansky -Original Message- From: Amit

Re: Solr admin search with wildcard

2013-06-27 Thread Jack Krupansky
No, you cannot use wildcards within a quoted term. Tell us a little more about what your strings look like. You might want to consider tokenizing or using ngrams to avoid the need for wildcards. -- Jack Krupansky -Original Message- From: Amit Sela Sent: Thursday, June 27, 2013 3:33

Re: Solr document auto-upload?

2013-06-26 Thread Jack Krupansky
directly implemented in Solr -- Jack Krupansky -Original Message- From: aspielman Sent: Wednesday, June 26, 2013 2:16 PM To: solr-user@lucene.apache.org Subject: Solr document auto-upload? Is it possible to to configure Solr to automatically grab documents in a specidfied directory, with

Re: Solr 4.2.1 - master taking long time to respond after tomcat restart

2013-06-26 Thread Jack Krupansky
You need to do occasional hard commits, otherwise the update log just grows and grows and gets replayed on each server start. -- Jack Krupansky -Original Message- From: Arun Rangarajan Sent: Wednesday, June 26, 2013 1:18 PM To: solr-user@lucene.apache.org Subject: Solr 4.2.1 - master

Re: Dynamic Type For Solr Schema

2013-06-26 Thread Jack Krupansky
ence Guide nor current release from Lucid, but see the detailed examples in my book. -- Jack Krupansky -Original Message- From: Furkan KAMACI Sent: Wednesday, June 26, 2013 10:51 AM To: solr-user@lucene.apache.org Subject: Dynamic Type For Solr Schema I use Solr 4.3.1 as SolrCloud. I k

Re: Solr indexer and Hadoop

2013-06-26 Thread Jack Krupansky
o 4.4. If not in 4.4, 4.5 is probably a slam-dunk. -- Jack Krupansky -Original Message- From: David Larochelle Sent: Wednesday, June 26, 2013 11:24 AM To: solr-user@lucene.apache.org Subject: Re: Solr indexer and Hadoop Pardon, my unfamiliarity with the Solr development process. Now

Re: URL search and indexing

2013-06-26 Thread Jack Krupansky
If there is a bug... we should identify it. What's a sample post command that you issued? -- Jack Krupansky -Original Message- From: Flavio Pompermaier Sent: Wednesday, June 26, 2013 10:53 AM To: solr-user@lucene.apache.org Subject: Re: URL search and indexing I was doing ex

Re: How to truncate a particular field, LimitTokenCountAnalyzer or LimitTokenCountFilter?

2013-06-26 Thread Jack Krupansky
/4_3_1/analyzers-common/org/apache/lucene/analysis/miscellaneous/LimitTokenCountFilterFactory.html The new Apache Solr Reference? No mention of the filter. -- Jack Krupansky -Original Message- From: Daniel Collins Sent: Wednesday, June 26, 2013 3:38 AM To: solr-user@lucene.apache.org

Re: StatsComponent doesn't work if field's type is TextField - can I change field's type to String

2013-06-26 Thread Jack Krupansky
You could use an update processor to turn the text string into multiple string values. A short snippet of JavaScript in a StatelessScriptUpdateProcessor could do the trick. The field could then be a multivalued string field. -- Jack Krupansky -Original Message- From: Elran Dvir

Re: Is there a way to capture div tag by id?

2013-06-25 Thread Jack Krupansky
Guide mislead people with examples that clearly can never run as expected with real data. -- Jack Krupansky -Original Message- From: eShard Sent: Tuesday, June 25, 2013 1:17 PM To: solr-user@lucene.apache.org Subject: Is there a way to capture div tag by id? let's say I have a div

Re: Is it possible to searh Solr with a longer query string?

2013-06-25 Thread Jack Krupansky
/tomcat-5.5-doc/config/http.html) --- If you're not using Tomcat, your container may have a similar limit. -- Jack Krupansky -Original Message- From: yang, gang Sent: Tuesday, June 25, 2013 5:47 PM To: solr-user@lucene.apache.org Cc: Meng, Fan Subject: RE: Is it possible to searh

Re: Querying multiple collections in SolrCloud

2013-06-25 Thread Jack Krupansky
ection - add all the fields to one schema - there is no time or space penalty if most of the field are empty for most documents. -- Jack Krupansky -Original Message- From: Chris Toomey Sent: Tuesday, June 25, 2013 6:08 PM To: solr-user@lucene.apache.org Subject: Querying multiple col

Re: URL search and indexing

2013-06-25 Thread Jack Krupansky
), you automatically get most of that. The user can query by a URL fragment, such as "apache.org", ".org", "lucene.apache.org", etc. and the tokenization will strip out the punctuation. I'll add this script to my list of examples to add in the next rev of my

Re: Solr indexer and Hadoop

2013-06-25 Thread Jack Krupansky
??? Hadoop=HDFS If the data is not in Hadoop/HDFS, just use the normal Solr indexing tools, including SolrCell and Data Import Handler, and possibly ManifoldCF. -- Jack Krupansky -Original Message- From: engy.morsy Sent: Tuesday, June 25, 2013 8:10 AM To: solr-user

Re: URL search and indexing

2013-06-25 Thread Jack Krupansky
-sequences that occur in the URL without the need for wildcards or regular expressions. -- Jack Krupansky -Original Message- From: Jan Høydahl Sent: Tuesday, June 25, 2013 6:28 AM To: solr-user@lucene.apache.org Subject: Re: URL search and indexing Probably a good match for the RegExp

Re: URL search and indexing

2013-06-25 Thread Jack Krupansky
There are examples in my book: http://www.lulu.com/shop/jack-krupansky/solr-4x-deep-dive-early-access-release-1/ebook/product-21079719.html But... I still think you should use a tokenized text field as well - use all three: raw string, tokenized text, and URL classification fields. -- Jack

Re: Pivot-Facets with ranges

2013-06-25 Thread Jack Krupansky
No, facet.pivot takes a comma-separated list of "fields", with no support for "ranges". But, you can have a combination of field and range facets without pivoting. -- Jack Krupansky -Original Message- From: Jakob Frank Sent: Tuesday, June 25, 2013 6

Re: Solr indexer and Hadoop

2013-06-25 Thread Jack Krupansky
Solr does not have any integrated Hadoop/HDFS crawling or indexing support today. Sorry. LucidWorks Search does have HDFS crawling support: http://docs.lucidworks.com/display/lweug/Using+the+High+Volume+HDFS+Crawler Cloudera Search has HDFS support as well. -- Jack Krupansky -Original

Re: Book progress (Solr 4.x Deep Dive) - see my blog

2013-06-25 Thread Jack Krupansky
Please report any comments or issues to my email address or comment on my blog. Comments on the blog will benefit other readers, but the choice is yours. Thanks! -- Jack Krupansky -Original Message- From: Bernd Fehling Sent: Tuesday, June 25, 2013 2:06 AM To: solr-user

Re: why does the has to be indexed.

2013-06-24 Thread Jack Krupansky
If you use the edismax query parser, the "uf" parameter can be set to restrict the fields that the user can directly reference. -- Jack Krupansky -Original Message- From: Mysurf Mail Sent: Monday, June 24, 2013 11:51 AM To: solr-user@lucene.apache.org Subject: why does the

Re: (e)dismax Query Generation

2013-06-24 Thread Jack Krupansky
SolrJ: SolrQuery query = new SolrQuery(); query.setQuery("video"); query.setParam("defType", "edismax"); query.setParam("qf", "features^20.0+text^0.3"); query.setParam("bq", "cat:electronics^5.0"); QueryResponse queryResp

Re: Creating solr query.

2013-06-24 Thread Jack Krupansky
xample tries to facet on the "cat" field, but the query restricts results to the single category "music". -- Jack Krupansky -Original Message- From: Ashwin Tandel Sent: Monday, June 24, 2013 2:37 PM To: solr-user Subject: Creating solr query. Hi, It's with

Re: What should be the definitions ( field type ) for a field that will be search with user free text

2013-06-24 Thread Jack Krupansky
. -- Jack Krupansky -Original Message- From: Mysurf Mail Sent: Monday, June 24, 2013 10:26 AM To: solr-user@lucene.apache.org Subject: What should be the definitions ( field type ) for a field that will be search with user free text currently I am using text_general. I want to search with

Re: Need assistance in defining search urls

2013-06-24 Thread Jack Krupansky
day - even though LucidWorks Search does in fact have it. I'm not even sure if anybody has filed a Jira for it. -- Jack Krupansky -Original Message- From: Mysurf Mail Sent: Monday, June 24, 2013 10:43 AM To: solr-user@lucene.apache.org Subject: Re: Need assistance in defi

Book progress (Solr 4.x Deep Dive) - see my blog

2013-06-24 Thread Jack Krupansky
I won’t continue to bore annoy anybody on this list with tedious comments about my new Solr book on Lulu.com... please bookmark my blog, http://basetechnology.blogspot.com/, for further updates on the book. The book itself is here: http://www.lulu.com/shop/jack-krupansky/solr-4x-deep-dive-early

Re: Need assistance in defining search urls

2013-06-24 Thread Jack Krupansky
Search query parser does support that feature. -- Jack Krupansky -Original Message- From: Mysurf Mail Sent: Monday, June 24, 2013 7:26 AM To: solr-user@lucene.apache.org Subject: Need assistance in defining search urls Now, each doc looks like this (i generated random user text in the freetext

Re: Help with synonyms

2013-06-24 Thread Jack Krupansky
h sometimes means that they can't be identical - since replacement rules mean that a term will not appear in the index. -- Jack Krupansky -Original Message- From: Peter Kirk Sent: Monday, June 24, 2013 4:10 AM To: solr-user@lucene.apache.org Subject: Help with synonyms Hi I hav

Re: document id in nutch/solr

2013-06-23 Thread Jack Krupansky
ing named fields. Or... talk to the nutch people about how to do field name mapping on the nutch side of the fence. Hold off on UUIDs until you figure all of the above out and everything is working without them. -- Jack Krupansky -Original Message- From: Joe Zhang Sent: Sunday, J

Re: Solr File System Search

2013-06-23 Thread Jack Krupansky
t; rootEntity="true" dataSource="dir"> (Seems like I had a copy of of the processors for some reason. Sorry, no recollection.) And this was my request handler in solrconfig: class="org.apache.solr.handler.dataimport.DataImp

Re: edismax: date range facet with queries containing OR clause

2013-06-23 Thread Jack Krupansky
Do the requests in parallel (separate threads) and then the performance won't be impacted significantly. -- Jack Krupansky -Original Message- From: Sourajit Basak Sent: Sunday, June 23, 2013 10:23 AM To: solr-user@lucene.apache.org Subject: Re: edismax: date range facet with qu

Re: edismax: date range facet with queries containing OR clause

2013-06-23 Thread Jack Krupansky
If your keywords are the value in some other field, then, yes, you can use facet pivots: facet.pivot=keyword_field,date_field (See the example in the book! Or on the wiki.) -- Jack Krupansky -Original Message- From: Sourajit Basak Sent: Sunday, June 23, 2013 10:29 AM To: solr-user

Re: edismax: date range facet with queries containing OR clause

2013-06-23 Thread Jack Krupansky
Just do separate faceted query requests: q= keyword1 facet.range=date_field_name ... facet=true q= keyword2 facet.range=date_field_name ... facet=true Where the "..." means fill in the additional facet.range.xxx parameters (start, end, gap, etc.) -- Jack Krupansky -Origin

Re: modeling multiple values on 1:n connection

2013-06-23 Thread Jack Krupansky
en them), maybe in conjunction with a copyField of the raw, uncoordinated multvalued fields for easy query access. -- Jack Krupansky -Original Message- From: Mysurf Mail Sent: Sunday, June 23, 2013 4:37 AM To: solr-user@lucene.apache.org Subject: Re: modeling multiple values on 1:n conne

<    8   9   10   11   12   13   14   15   16   17   >