Re: DocValues without re-index?

2014-07-22 Thread Mikhail Khludnev
Michael, What's first re-indexing? I'm sure you are aware of binary/numeric DocValues updates, but they work only for existing column strides. I guess you are talking about something like a sidecar index: http://www.youtube.com/watch?v=9h3ax5Wmxpk On Tue, Jul 22, 2014 at 6:50 AM, Michael Ryan

Re: Solr Cassandra MySQL Best Practice Indexing

2014-07-22 Thread Yavar Husain
Thanks Jack for your guidance on DSE. However it would be great if somebody could help me solving my use case: So my full text data lies on Cassandra along with an ID. Now I have a lot of structured data linked to the ID which lies on an RDBMS (read MySQL). I need this structured data as it would

Re: SOLR 4.4 - Slave always replicates full index

2014-07-22 Thread Robin Woods
I did observe the same.. 1. updated an existing document.. means potentially marking the previous document as deleted and adding a new version of it.. posted the JSON doc using the Documents interface on the Admin UI.. left the default commit within 1000 ms there on the Documents UI.. 2. NOT

spatial search: find result in bbox OR first result outside bbox

2014-07-22 Thread elisabeth benoit
Hello, I am using solr 4.2.1. I have the following use case. I should find results inside bbox OR if there is none, first result outside bbox within a 1000 km distance. I was wondering what is the best way to proceed. I was considering doing a geofilt search from the center of my bounding box

Re: Mixing ordinary and nested documents

2014-07-22 Thread Bjørn Axelsen
Thanks, Umesh You can get the parent bitset by running the parent doc type query on the Solr IndexSearcher. Then get the child bitset by running the child doc type query. Then use these together to create an int[] where int[i] = parent of i. Can you kindly add an example? I am not quite sure how

wrong docFreq while executing query based on uniqueKey-field

2014-07-22 Thread Johannes Siegert
Hi. My solr-index (version 4.7.2) has an id-field: <field name="id" type="string" indexed="true" stored="true"/> ... <uniqueKey>id</uniqueKey> The index will be updated once per hour. I use the following query to retrieve some documents: q=id:2^2 id:1^1 I would expect that the document(2) should be

Re: wrong docFreq while executing query based on uniqueKey-field

2014-07-22 Thread Apoorva Gaurav
I faced the same issue some time back; the root cause is docs getting deleted and created again without the index being optimized. Here is the discussion: http://www.signaldump.org/solr/qpod/22731/docfreq-coming-to-be-more-than-1-for-unique-id-field On Tue, Jul 22, 2014 at 4:56 PM, Johannes Siegert

Re: wrong docFreq while executing query based on uniqueKey-field

2014-07-22 Thread Jack Krupansky
Deleted documents remain in the Lucene index until an optimize or segment merge operation removes them. As a result they are still counted in document frequency. An update is a combination of a delete and an add of a fresh document. -- Jack Krupansky -Original Message- From:
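The behaviour described above can be sketched with a toy model. This is not Lucene's API; the class `DocFreqSketch` and its methods are hypothetical names used only to illustrate why a uniqueKey term can report a document frequency greater than 1 until a merge happens.

```java
import java.util.ArrayList;
import java.util.List;

/** Toy model (not Lucene's API): docs are flagged deleted and only dropped on merge. */
final class DocFreqSketch {
    private static final class Doc {
        final String id;
        boolean deleted;
        Doc(String id) { this.id = id; }
    }

    private final List<Doc> docs = new ArrayList<>();

    /** An update is a delete of the old version plus an add of a fresh document. */
    void update(String id) {
        for (Doc d : docs) {
            if (d.id.equals(id)) d.deleted = true;
        }
        docs.add(new Doc(id));
    }

    /** Counts every entry for the id, including documents flagged as deleted. */
    int docFreq(String id) {
        int n = 0;
        for (Doc d : docs) {
            if (d.id.equals(id)) n++;
        }
        return n;
    }

    /** Stands in for an optimize or segment merge: deleted docs are physically removed. */
    void merge() {
        docs.removeIf(d -> d.deleted);
    }

    public static void main(String[] args) {
        DocFreqSketch index = new DocFreqSketch();
        index.update("2");
        index.update("2");
        index.update("2");
        System.out.println(index.docFreq("2")); // 3: two deleted versions are still counted
        index.merge();
        System.out.println(index.docFreq("2")); // 1: only the live document remains
    }
}
```

In this model, scoring that depends on docFreq would treat id 2 as a more common term than id 1 until the merge runs, which is consistent with the symptom reported in this thread.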

Re: Solr Cassandra MySQL Best Practice Indexing

2014-07-22 Thread Jack Krupansky
I don't think the Solr Data Import Handler has a Cassandra plugin (entity processor) yet, so the most straightforward approach is to write a Java app that reads from Cassandra, then reads the corresponding RDBMS data, combines the data, and then uses SolrJ to add documents to Solr. Your best

RE: DocValues without re-index?

2014-07-22 Thread Michael Ryan
I mean re-adding all of the documents in my index. The DocValues wiki page says that this is necessary, but I wanted to know if there was a way around it. -Michael -Original Message- From: Mikhail Khludnev [mailto:mkhlud...@griddynamics.com] Sent: Tuesday, July 22, 2014 2:14 AM To:

Re: Solr Cassandra MySQL Best Practice Indexing

2014-07-22 Thread Yavar Husain
Exactly. Thanks a lot Jack. +1 for Your best bet is to get that RDBMS data moved to Cassandra or DSE ASAP. On Tue, Jul 22, 2014 at 5:15 PM, Jack Krupansky j...@basetechnology.com wrote: I don't think the Solr Data Import Handler has a Cassandra plugin (entity processor) yet, so the most

Re: DocValues without re-index?

2014-07-22 Thread Shawn Heisey
On 7/22/2014 6:14 AM, Michael Ryan wrote: I mean re-adding all of the documents in my index. The DocValues wiki page says that this is necessary, but I wanted to know if there was a way around it. If your index meets the strict criteria for Atomic Updates, you could update all the documents

Re: Edit Example Post.jar to read ALL file types

2014-07-22 Thread jrusnak
So by using the SimplePostTool I can define the application type and handling of specific documents (Such as word, powerpoint, xml, png, etcetera). I have defined these and they are handled based on their type. In my file system however, I have a large number of files that can be read as plain

Re: Edit Example Post.jar to read ALL file types

2014-07-22 Thread jrusnak
I am copy-pasting the file extensions from the text document into the source code, not from the source code. My typing mistake. -- View this message in context: http://lucene.472066.n3.nabble.com/Edit-Example-Post-jar-to-read-ALL-file-types-tp4148312p4148567.html Sent from the Solr -

Re: Mixing ordinary and nested documents

2014-07-22 Thread Umesh Prasad
Query parentFilterQuery = new TermQuery(new Term("document_type", "parent")); int[] childToParentDocMapping = new int[searcher.maxDoc()]; DocSet allParentDocSet = searcher.getDocSet(parentFilterQuery); DocIterator iter = allParentDocSet.iterator();

Re: Mixing ordinary and nested documents

2014-07-22 Thread Umesh Prasad
public static DocSet mapChildDocsToParentOnly(DocSet childDocSet) { DocSet mappedParentDocSet = new BitDocSet(); DocIterator childIterator = childDocSet.iterator(); while (childIterator.hasNext()) { int childDoc = childIterator.nextDoc(); int
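Since the snippets in this thread are cut off, here is a minimal, self-contained sketch of the same idea using `java.util.BitSet` in place of Solr's `DocSet`. It assumes the block-join layout in which children are indexed immediately before their parent, so the parent of a child is the next parent doc id at or after it. The class name `ChildToParentSketch` and the method signatures are hypothetical, not the author's code. Ordinary (non-nested) documents would need to be flagged as parents or excluded for this mapping to hold.

```java
import java.util.BitSet;

/** Sketch only: BitSet stands in for Solr's DocSet; assumes children precede their parent. */
final class ChildToParentSketch {

    /** mapping[i] = doc id of the next parent at or after i (parents map to themselves), or -1. */
    static int[] childToParent(BitSet parents, int maxDoc) {
        int[] mapping = new int[maxDoc];
        for (int doc = 0; doc < maxDoc; doc++) {
            mapping[doc] = parents.nextSetBit(doc);
        }
        return mapping;
    }

    /** Replaces every child doc in the set with its parent doc. */
    static BitSet mapChildDocsToParentOnly(BitSet children, int[] mapping) {
        BitSet result = new BitSet(mapping.length);
        for (int c = children.nextSetBit(0); c >= 0; c = children.nextSetBit(c + 1)) {
            if (mapping[c] >= 0) {
                result.set(mapping[c]);
            }
        }
        return result;
    }

    public static void main(String[] args) {
        // Two blocks: docs 0-1 are children of parent 2, docs 3-4 are children of parent 5.
        BitSet parents = new BitSet();
        parents.set(2);
        parents.set(5);
        int[] mapping = childToParent(parents, 6);

        BitSet matchedChildren = new BitSet();
        matchedChildren.set(0);
        matchedChildren.set(4);
        System.out.println(mapChildDocsToParentOnly(matchedChildren, mapping)); // {2, 5}
    }
}
```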

RE: Multiterm analysis in complexphrase query

2014-07-22 Thread Allison, Timothy B.
Hi Gopal, I just started a repository on github (https://github.com/tballison/tallison-lucene-addons) to host a standalone version of LUCENE-5205 (with other patches to come). SOLR-5410 is next (Solr wrapper of the SpanQueryParser), and then I'll try to add LUCENE-5317 (concordance) and

Java heap Space error

2014-07-22 Thread Ameya Aware
Hi i am running into java heap space issue. Please see below log. ERROR - 2014-07-22 11:38:59.370; org.apache.solr.common.SolrException; null:java.lang.RuntimeException: java.lang.OutOfMemoryError: Java heap space at

Re: Java heap Space error

2014-07-22 Thread Shawn Heisey
On 7/22/2014 11:37 AM, Ameya Aware wrote: i am running into java heap space issue. Please see below log. All we have here is an out of memory exception. It is impossible to know *why* you are out of memory from the exception. With enough investigation, we could determine the area of code where

Re: Java heap Space error

2014-07-22 Thread Ameya Aware
So can I overcome this exception by increasing the heap size somewhere? Thanks, Ameya On Tue, Jul 22, 2014 at 2:00 PM, Shawn Heisey s...@elyograg.org wrote: On 7/22/2014 11:37 AM, Ameya Aware wrote: i am running into java heap space issue. Please see below log. All we have here is an out of

Re: Java heap Space error

2014-07-22 Thread Rafał Kuć
Hello! Yes, just edit your Jetty configuration file and add -Xmx and -Xms parameters. For example, the file you may be looking for is /etc/default/jetty. -- Regards, Rafał Kuć Performance Monitoring * Log Analytics * Search Analytics Solr & Elasticsearch Support * http://sematext.com/ So can
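One way to confirm that a changed -Xmx value actually reached the JVM is to ask the runtime for its maximum heap. The sketch below is a standalone check, not part of Solr or Jetty; the class name `HeapCheck` is hypothetical, and the reported figure can be slightly lower than the -Xmx value depending on the JVM and garbage collector.

```java
/** Standalone check: prints the maximum heap the current JVM will attempt to use. */
final class HeapCheck {

    /** Maximum heap in MiB as reported by the runtime. */
    static long maxHeapMiB() {
        return Runtime.getRuntime().maxMemory() / (1024L * 1024L);
    }

    public static void main(String[] args) {
        System.out.println("Max heap (MiB): " + maxHeapMiB());
    }
}
```

Running it with, for example, `java -Xms512m -Xmx2g HeapCheck` (the sizes are illustrative only) should report a value close to 2048.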

How to get Lacuma to match Lucuma

2014-07-22 Thread Warren Bell
What field type or filters do I use to get something like the word “Lacuma” to return results with “Lucuma” in it? The word “Lucuma” has been indexed in a field with field type text_en_splitting that came with the original Solr examples. Thanks, Warren <fieldType name="text_en_splitting"

NoClassDefFoundError while indexing in Solr

2014-07-22 Thread Ameya Aware
Hi I am running into below error while indexing a file in solr. Can you please help to fix this? ERROR - 2014-07-22 16:40:32.126; org.apache.solr.common.SolrException; null:java.lang.RuntimeException: java.lang.NoClassDefFoundError: com/uwyn/jhighlight/renderer/XhtmlRendererFactory at

Re: How to get Lacuma to match Lucuma

2014-07-22 Thread Anshum Gupta
Hi Warren, Check out the section about fuzzy search here https://cwiki.apache.org/confluence/display/solr/The+Standard+Query+Parser. On Tue, Jul 22, 2014 at 1:29 PM, Warren Bell warr...@clarksnutrition.com wrote: What field type or filters do I use to get something like the word “Lacuma” to
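Fuzzy search matches terms within a small edit distance of the query term, so a query such as `Lacuma~1` should be able to reach "Lucuma". The sketch below computes plain Levenshtein distance to show that the two words differ by a single edit. It is a simplified stand-in for illustration, not Lucene's implementation, and the class name `EditDistanceSketch` is hypothetical.

```java
/** Plain Levenshtein distance, used here only to illustrate why a one-edit fuzzy query matches. */
final class EditDistanceSketch {

    static int levenshtein(String a, String b) {
        int[] prev = new int[b.length() + 1];
        int[] curr = new int[b.length() + 1];
        for (int j = 0; j <= b.length(); j++) {
            prev[j] = j; // distance from the empty prefix of a
        }
        for (int i = 1; i <= a.length(); i++) {
            curr[0] = i; // distance to the empty prefix of b
            for (int j = 1; j <= b.length(); j++) {
                int cost = a.charAt(i - 1) == b.charAt(j - 1) ? 0 : 1;
                int insertOrDelete = Math.min(curr[j - 1] + 1, prev[j] + 1);
                curr[j] = Math.min(insertOrDelete, prev[j - 1] + cost);
            }
            int[] tmp = prev;
            prev = curr;
            curr = tmp;
        }
        return prev[b.length()];
    }

    public static void main(String[] args) {
        System.out.println(levenshtein("lacuma", "lucuma")); // 1
    }
}
```

The words are compared in lowercase here on the assumption that the field's analyzer lowercases terms at index time.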

Re: How to get Lacuma to match Lucuma

2014-07-22 Thread Jack Krupansky
Or possibly use the synonym filter at query or index time for common misspellings or misunderstandings about the spelling. That would be automatic, without the user needing to add the explicit fuzzy query operator. -- Jack Krupansky -Original Message- From: Anshum Gupta Sent:

Re: SOLR 4.4 - Slave always replicates full index

2014-07-22 Thread Robin Woods
I think I found the issue! I forgot to mention a very important step that I did, which is a CORE SWAP. Otherwise, it's not replicating the full index. When we do a CORE SWAP, doesn't it do the same checks and copy only deltas? -- View this message in context:

Re: SOLR 4.4 - Slave always replicates full index

2014-07-22 Thread Shawn Heisey
On 7/22/2014 5:00 PM, Robin Woods wrote: I think, I found the issue! I actually missed to mention a very important step that I did, which is, CORE SWAP otherwise, it's not replicating the full index. when we do CORE SWAP, doesn't it do the same checks of copying only deltas? Yes, it will

Query using doc Id

2014-07-22 Thread Mukundaraman Valakumaresan
Hi, Is it possible to execute queries using doc Id as a query parameter? For example, query docs whose doc Id is between 100 and 200. Thanks & Regards, Mukund

Re: Query using doc Id

2014-07-22 Thread santosh sidnal
I guess you can use these two params in your query, rows=100&start=100, which will give you 100 documents after the 100th document. On Wed, Jul 23, 2014 at 10:19 AM, Mukundaraman Valakumaresan muk...@8kmiles.com wrote: Hi, Is it possible to execute queries using doc Id as a query parameter

Re: Query using doc Id

2014-07-22 Thread Alexandre Rafalovitch
Do you mean something different from docId:[100 TO 200] ? Regards, Alex. Personal: http://www.outerthoughts.com/ and @arafalov Solr resources: http://www.solr-start.com/ and @solrstart Solr popularizers community: https://www.linkedin.com/groups?gid=6713853 On Wed, Jul 23, 2014 at 11:49 AM,
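The range query suggested above has to be URL-encoded when sent over HTTP. The sketch below builds such a request URL with the JDK only. The base URL, the core name `collection1`, and the class name `RangeQueryUrl` are assumptions for illustration, and `docId` is assumed to be an indexed numeric field in the schema (a range on a string field would compare lexicographically).

```java
import java.io.UnsupportedEncodingException;
import java.net.URLEncoder;

/** Builds a /select URL for an inclusive range query on a single field. */
final class RangeQueryUrl {

    static String selectUrl(String baseUrl, String field, int from, int to)
            throws UnsupportedEncodingException {
        String q = field + ":[" + from + " TO " + to + "]";
        return baseUrl + "/select?q=" + URLEncoder.encode(q, "UTF-8");
    }

    public static void main(String[] args) throws Exception {
        // Hypothetical host and core name.
        System.out.println(selectUrl("http://localhost:8983/solr/collection1", "docId", 100, 200));
        // http://localhost:8983/solr/collection1/select?q=docId%3A%5B100+TO+200%5D
    }
}
```

Unlike the rows/start parameters, which page through whatever order the results come back in, the range query selects documents by the value stored in the field.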

Re: NoClassDefFoundError while indexing in Solr

2014-07-22 Thread Shalin Shekhar Mangar
Solr is trying to load com/uwyn/jhighlight/renderer/XhtmlRendererFactory but that is not a class which is shipped or used by Solr. I think you have some custom plugins (a highlighter perhaps?) which use that class and the classpath is not set up correctly. On Wed, Jul 23, 2014 at 2:20 AM, Ameya

Re: Replication Problem from solr-3.6 to solr-4.0

2014-07-22 Thread askumar1444
Same with me too, in a multi-core Master/Slave. 11:17:30.476 [snapPuller-8-thread-1] INFO o.a.s.h.SnapPuller - Master's generation: 87 11:17:30.476 [snapPuller-8-thread-1] INFO o.a.s.h.SnapPuller - Slave's generation: 3 11:17:30.476 [snapPuller-8-thread-1] INFO o.a.s.h.SnapPuller - Starting