Re:Re: solr 5.2.1, data import issue, shown processed rows doesn't match acturally indexed doc quantity.

2016-04-14 Thread cqlangyi
hi guys, thank you very much for the help. sorry been so lated to reply. 1. "commit" didn't help. after commit, the 'numFound' of "*:*" query is still the same. 2. "id" field in every doc is generated by solr using UUID, i have idea how to check if there is a duplicated one. but i ass

Re: Growing memory?

2016-04-14 Thread Shawn Heisey
On 4/14/2016 1:25 PM, Betsey Benagh wrote: > bin/solr status shows the memory usage increasing, as does the admin ui. > > I¹m running this on a shared machine that is supporting several other > applications, so I can¹t be particularly greedy with memory usage. Is > there anything out there that gi

Re: How to get stats on currency field?

2016-04-14 Thread Chris Hostetter
The thing to remember about currency fields is that even if you tend to only put one currency value in it, any question of interpreting the values in that field has to be done relative to a specific currency, and the exchange rates may change dynamically. So use the currency function to get a

Re: Adding replica on solr - 5.50

2016-04-14 Thread John Bickerstaff
Another thought - again probably not it, but just in case... Shouldn't this: &node=x.x.x.x:9001_solr Actually be this? &node=x.x.x.x:9001/solr

[ANNOUNCE] YCSB 0.8.0 Release

2016-04-14 Thread Chrisjan Matser
On behalf of the development community, I am pleased to announce the release of YCSB 0.8.0. Though there were no major Solr updates in this release, we are always interested in having members from the community help with ensuring that we have compliance with Solr's latest and greatest. Highlights

Re: Adding replica on solr - 5.50

2016-04-14 Thread John Bickerstaff
Jay - it's probably too simple, but the error says "not currently active" which could, of course, mean that although it's up and running, it's not listening on the port you have in the command line... Or that the port is blocked by a firewall or other network problem. I note that you're using por

How to get stats on currency field?

2016-04-14 Thread Pranaya Behera
Hi, I have a currency field type. How do I get StatsComponent to work with it ? Currently StatsComponent works with strings, numerics but not currency field. Another question is how to copy only the value part from a currency field ? e.g. if my field name is "mrp" and the value is "62.00, U

Re: Adding replica on solr - 5.50

2016-04-14 Thread John Bickerstaff
Thanks Eric! I'll look into that immediately - yes, I think that cURL would qualify as scriptable for my IT lead. In the end, I found I could do it two ways... Either copy the entire solr data directory over to /var/solr/data on the new machine, change the directory name and the entries in the c

Re: Adding replica on solr - 5.50

2016-04-14 Thread Erick Erickson
Post your clusterstate.json file? You shouldn't even have a clusterstate.json file with anything in it. In the 5x code line the state of each collection is kept under the relevant collections z-noed in "state.json". Confusingly, though, the clusterstate.json node still exists but is empty... Bes

Re: Adding replica on solr - 5.50

2016-04-14 Thread Jay Potharaju
Thanks for the help John. > On Apr 14, 2016, at 6:22 PM, John Bickerstaff > wrote: > > Sure - couldn't agree more. > > I couldn't find any good documentation on the Solr site about how to add a > replica to a Solr cloud. The Admin UI appears to require that the > directories be created anyway

Re: Adding replica on solr - 5.50

2016-04-14 Thread Erick Erickson
bq: the Solr site about how to add a replica to a Solr cloud. The Admin UI appears to require that the directories be created anyway No, no, a thousand times NO! You're getting confused, I think, with the difference between _cores_ and _collections_ (or replicas in a collection). Do not use the

Re: Adding replica on solr - 5.50

2016-04-14 Thread Jay Potharaju
Jeff, I couldn't agree more with you. I think the reason it is not working is because of screwed up clusterstate.json, not sure how to fix it. Have already restarted my zk servers. Any more suggestions regarding the same. > On Apr 14, 2016, at 5:21 PM, Jeff Wartes wrote: > > I’m all for findin

Re: Adding replica on solr - 5.50

2016-04-14 Thread John Bickerstaff
Sure - couldn't agree more. I couldn't find any good documentation on the Solr site about how to add a replica to a Solr cloud. The Admin UI appears to require that the directories be created anyway. There is probably a way to do it through the UI, once Solr is installed on a new machine - and I

JSON Facet Stats Mincount

2016-04-14 Thread Nick Vasilyev
Hello, I am trying to get a list of items that have more than one manufacturer using the following json facet query. This works fine without mincount, but errors out as soon as I add it. Is this possible or am I doing something wrong? json.facet={ groupID: { type: terms, field: gro

Re: Adding replica on solr - 5.50

2016-04-14 Thread Jeff Wartes
I’m all for finding another way to make something work, but I feel like this is the wrong advice. There are two options: 1) You are doing something wrong. In which case, you should probably invest in figuring out what. 2) Solr is doing something wrong. In which case, you should probably invest

Re: HTTP Client Only

2016-04-14 Thread Jeff Wartes
If you’re already using java, just use the CloudSolrClient. If you’re using the default router, (CompositeId) it’ll figure out the leaders and send documents to the right place for you. If you’re not using java, then I’d still look there for hints on how to duplicate the functionality. On

Re: UUID processor handling of empty string

2016-04-14 Thread Susmit Shukla
Hi Chris/Erick, Does not work in the sense the order of documents does not change on changing sort from asc to desc. This could be just a trivial bug where UUID processor factory is generating uuid even if it is empty. This is on solr 5.3.0 Thanks, Susmit On Thu, Apr 14, 2016 at 2:30 PM, Chr

Re: Adding replica on solr - 5.50

2016-04-14 Thread John Bickerstaff
5.4 This problem drove me insane for about a month... I'll send you the doc. On Thu, Apr 14, 2016 at 5:02 PM, Jay Potharaju wrote: > Thanks John, which version of solr are you using? > > On Thu, Apr 14, 2016 at 3:59 PM, John Bickerstaff < > j...@johnbickerstaff.com> > wrote: > > > su - solr -c

Re: Adding replica on solr - 5.50

2016-04-14 Thread Jay Potharaju
Thanks John, which version of solr are you using? On Thu, Apr 14, 2016 at 3:59 PM, John Bickerstaff wrote: > su - solr -c "/opt/solr/bin/solr create -c statdx -d /home/john/conf > -shards 1 -replicationFactor 2" > > However, this won't work by itself. There is some preparation > necessary... I

Re: Adding replica on solr - 5.50

2016-04-14 Thread John Bickerstaff
su - solr -c "/opt/solr/bin/solr create -c statdx -d /home/john/conf -shards 1 -replicationFactor 2" However, this won't work by itself. There is some preparation necessary... I'll send you the doc. On Thu, Apr 14, 2016 at 4:55 PM, Jay Potharaju wrote: > Curious what command did you use? > >

Re: Adding replica on solr - 5.50

2016-04-14 Thread Jay Potharaju
Curious what command did you use? On Thu, Apr 14, 2016 at 3:48 PM, John Bickerstaff wrote: > I had a hard time getting replicas made via the API, once I had created the > collection for the first time although that may have been ignorance on > my part. > > I was able to get it done fairly ea

Re: Adding replica on solr - 5.50

2016-04-14 Thread John Bickerstaff
I had a hard time getting replicas made via the API, once I had created the collection for the first time although that may have been ignorance on my part. I was able to get it done fairly easily on the Linux command line. If that's an option and you're interested, let me know - I have a roug

Adding replica on solr - 5.50

2016-04-14 Thread Jay Potharaju
Hi, I am using solr 5.5 and testing adding a new replica when a solr instance comes up. When I run the following command I get an error. I have 1 replica and trying to add another replica. http://x.x.x.x:8984/solr/admin/collections?action=ADDREPLICA&collection=test2&shard=shard1&node=x.x.x.x:9001_

Re: Referencing incoming search terms in searchHandler XML

2016-04-14 Thread Walter Underwood
> On Apr 14, 2016, at 12:18 PM, John Bickerstaff > wrote: > > If a user types in "foobarbaz figo" I want all documents with "figo" in the > contentType field boosted above every other document in the results. This is a very common requirement that seems like a good idea, but has very bad corn

Re: UUID processor handling of empty string

2016-04-14 Thread Chris Hostetter
I'm also confused by what exactly you mean by "doesn't work" but a general suggestion you can try is putting the RemoveBlankFieldUpdateProcessorFactory before your UUID Processor... https://lucene.apache.org/solr/6_0_0/solr-core/org/apache/solr/update/processor/RemoveBlankFieldUpdateProcessorFa

Re: Singular Plural Results Inconsistent - SOLR v3.6 and EnglishMinimalStemFilterFactor

2016-04-14 Thread Sara Woodmansee
Thanks Jack. So - if I understand (all email feedback thus far) correctly: — Upgrading to newer version vital (5.5 —6.0) — EnglishMinimalStemFilter: upgrading to v5.5-6.0 will NOT help with stemming issues, as code has not been updated. — PorterStemFilter: Has been updated to work with bet

Re: Referencing incoming search terms in searchHandler XML

2016-04-14 Thread John Bickerstaff
Thanks - so this: bq=contentType:(original query text here)^1000 is exactly what I want to do to every incoming query via an entry in a custom requestHandler. Thus my question about how to reference the original query text in the requestHandler xml... I believe that if I want to do that, I'm go

RE: Shard ranges seem incorrect

2016-04-14 Thread Chris Hostetter
: Hi - bumping this issue. Any thoughts to share? Shawn's response to your email seemed spot on acurate to me -- is there something about his answer that doesn't match up with what you're seeing? can you clarify/elaborate your concerns? http://mail-archives.apache.org/mod_mbox/lucene-solr-user

RE: Shard ranges seem incorrect

2016-04-14 Thread Markus Jelsma
Hi - bumping this issue. Any thoughts to share? Thanks, M -Original message- > From:Markus Jelsma > Sent: Tuesday 12th April 2016 13:49 > To: solr-user > Subject: Shard ranges seem incorrect > > Hi - i've just created a 3 shard 3 replica collection on Solr 6.0.0 and we > noticed s

HTTP Client Only

2016-04-14 Thread Robert Brown
Hi, I have a collection with 2 shards, 1 replica each. When I send updates, I currently /admin/ping each of the nodes, and then pick one at random. I'm guessing it makes more sense to only send updates to one of the leaders, so I'm contemplating getting the collection status instead, and fi

Re: DIH with Nested Documents - Configuration Issue

2016-04-14 Thread Mikhail Khludnev
Giving child="true" Solr 5.5 creates a documents block with implicit relations across parent and nested children. These later retrievable via https://cwiki.apache.org/confluence/display/solr/Other+Parsers#OtherParsers-BlockJoinQueryParsers only. Giving the fact you run 4.10 I don't think you really

Re: Solr 5.5 timeout of solrj client

2016-04-14 Thread Novin Novin
Thanks for the great advice Erick. On 14 April 2016 at 18:18, Erick Erickson wrote: > BTW, the place optimize seems best used is when the index isn't > updated very often. I've seen a pattern where the index is updated > once a night (or even less). In that situation, optimization makes > more s

Re: Singular Plural Results Inconsistent - SOLR v3.6 and EnglishMinimalStemFilterFactor

2016-04-14 Thread Jack Krupansky
BTW, I did check and that stemmer code is the same today as it was in 3.x, so there should be no change in stemmer behavior there. -- Jack Krupansky On Thu, Apr 14, 2016 at 3:47 PM, Sara Woodmansee wrote: > Hi Shawn, > > Thanks so much the feedback. And for the heads-up regarding (the bad form

Re: Singular Plural Results Inconsistent - SOLR v3.6 and EnglishMinimalStemFilterFactor

2016-04-14 Thread Sara Woodmansee
Hi Shawn, Thanks so much the feedback. And for the heads-up regarding (the bad form of) starting a new discussion from an existing one. Thought removing all content wouldn’t track to original. (Sigh). This is what you get when you have photographers posting to high-end forums. Thanks Erick, r

Re: Referencing incoming search terms in searchHandler XML

2016-04-14 Thread Erick Erickson
Right, edismax is where I'd start. NOTE: there are about a zillion options here so you may find yourself lost in a bit of a maze for a while, but it's usually faster than coding it yourself ;). In this case, take a look at the "bq" parameter to edismax and make it something like bq=contentType:(or

Re: Growing memory?

2016-04-14 Thread Erick Erickson
In a word, "no", there are simply too many variables. It's like asking "how much memory will a Java program need?" But Solr does like memory, both the Java heap and the OS memory. Here's a long blog on how to scope this out: https://lucidworks.com/blog/2012/07/23/sizing-hardware-in-the-abstra

Re: Singular Plural Results Inconsistent - SOLR v3.6 and EnglishMinimalStemFilterFactor

2016-04-14 Thread Erick Erickson
re: upgrading to 5.x... 5X Solr's are NOT guaranteed to read 3x indexes, you'd have to go through 4x to do that. If you can re-index from scratch that would be best. Best, Erick On Thu, Apr 14, 2016 at 12:29 PM, Shawn Heisey wrote: > On 4/14/2016 11:17 AM, Sara Woodmansee wrote: >> I posted yes

Re: How to declare field type for IntPoint field in solr 6.0 schema?

2016-04-14 Thread rafis
Thank you, Shawn! It can wait. There are other features in 6.0 I was waiting for. It is always nice to have such improvements! -- View this message in context: http://lucene.472066.n3.nabble.com/How-to-declare-field-type-for-IntPoint-field-in-solr-6-0-schema-tp4270040p4270256.html Sent from th

Re: UUID processor handling of empty string

2016-04-14 Thread Erick Erickson
What do you mean "doesn't work"? An empty string is different than not being present. Thee UUID update processor (I'm pretty sure) only adds a field if it is _absent_. Specifying it as an empty string fails that test so no value is added. At that point, if this uuid field is also the , then each d

Re: Singular Plural Results Inconsistent - SOLR v3.6 and EnglishMinimalStemFilterFactor

2016-04-14 Thread Shawn Heisey
On 4/14/2016 11:17 AM, Sara Woodmansee wrote: > I posted yesterday, however I never received my own post, so worried it did > not go through (?) I *did* see your previous message, but couldn't immediately think of anything constructive to say. I've had a little bit of time on my lunch break toda

Re: Growing memory?

2016-04-14 Thread Betsey Benagh
bin/solr status shows the memory usage increasing, as does the admin ui. I¹m running this on a shared machine that is supporting several other applications, so I can¹t be particularly greedy with memory usage. Is there anything out there that gives guidelines on what an appropriate amount of heap

UUID processor handling of empty string

2016-04-14 Thread Susmit Shukla
Hi, I have configured solr schema to generate unique id for a collection using UUIDUpdateProcessorFactory I am seeing a peculiar behavior - if the unique 'id' field is explicitly set as empty string in the SolrInputDocument, the document gets indexed with UUID update processor generating the id.

Re: Referencing incoming search terms in searchHandler XML

2016-04-14 Thread John Bickerstaff
OK - that's interesting. Perhaps I'm thinking too much like a developer and just want to be able to reach into context and grab anything any time I want... Thanks for the input... = To clarify, I want to boost the document's score if the user enters a term found in the contentType field. A

Re: Singular Plural Results Inconsistent - SOLR v3.6 and EnglishMinimalStemFilterFactor

2016-04-14 Thread Sara Woodmansee
Hi Walter and Jack, Many thanks for your feedback! I have no idea why the developer is using such an old version, but hoping that your feedback and suggestions will give them a push in the right direction. Is it a huge undertaking to upgrade from v3.6 to v5.5?? (I surely hope not.) Thanks agai

Re: Growing memory?

2016-04-14 Thread Erick Erickson
Yes, it will do GC automatically, but only after some threshold has been reached. It doesn't collect as soon as something is no longer referenced. So you typically see a sawtooth pattern where memory increases for a while, then drops back when a GC happens, then increases, then drops back. Proble

Re: Growing memory?

2016-04-14 Thread Shawn Heisey
On 4/14/2016 12:45 PM, Betsey Benagh wrote: > I'm running solr 6.0.0 in server mode. I have one core. I loaded about 2000 > documents in, and it was using about 54 MB of memory. No problem. Nobody was > issuing queries or doing anything else, but over the course of about 4 hours, > the memory us

Re: Referencing incoming search terms in searchHandler XML

2016-04-14 Thread Erick Erickson
You really don't do that in solrconfig.xml. This seems like an XY problem. You're trying to solve some particular use-case and accessing the terms in solrconfig.xml. You've already found the ability to configure edismax as your defType and apply boosts to particular fields... Best, Erick On Thu,

Re: Growing memory?

2016-04-14 Thread Betsey Benagh
Thanks for the quick response. Forgive the naïve question, but shouldn¹t it be doing garbage collection automatically? Having to manually force GC via jconsole isn¹t a sustainable solution. Thanks again, betsey On 4/14/16, 2:54 PM, "Erick Erickson" wrote: >well, things _are_ running, specifica

Re: Singular Plural Results Inconsistent - SOLR v3.6 and EnglishMinimalStemFilterFactor

2016-04-14 Thread Jack Krupansky
Yes, this is the intended behavior. All of the Solr stemmers are based on heuristics that are not perfect, and are not based on the real dictionary. You can solve one problem by switching to another stemmer, but then you run into a different problem, rinse and repeat. The code has a specific rule

Re: Growing memory?

2016-04-14 Thread Erick Erickson
well, things _are_ running, specifically the communications channels are looking for incoming messages and the like, generating garbage etc. Try attaching jconsole to the process and hitting the GC button to force a garbage collection. As long as your memory gets to some level and drops back to th

Re: Referencing incoming search terms in searchHandler XML

2016-04-14 Thread John Bickerstaff
Maybe I'm overdoing it... It seems to me that qf= text contentType^1000 would do this for me more easily - as it appears to assume the incoming search terms... However, I'd still like to know the simplest way to reference the search terms in the XML - or possibly get a URL that points the way. T

Growing memory?

2016-04-14 Thread Betsey Benagh
X-posted from stack overflow... I'm running solr 6.0.0 in server mode. I have one core. I loaded about 2000 documents in, and it was using about 54 MB of memory. No problem. Nobody was issuing queries or doing anything else, but over the course of about 4 hours, the memory usage had tripled to

Referencing incoming search terms in searchHandler XML

2016-04-14 Thread John Bickerstaff
I have the following (essentially hard-coded) line in the Solr Admin Query UI = bq: contentType:(searchTerm1 searchTerm2 searchTerm2)^1000 = The "searchTerm" entries represent whatever the user typed into the search box. This can be one or more words. Usually less than 5. I want to put

SolrTestCaseJ4 errors with SOLR 4.9 (works with SOLR 4.8.1)

2016-04-14 Thread vsrikanthp
Hi, I'm trying to upgrade from SOLR 4.8.1 to SOLR 4.9. Some of our test cases (using SolrTestCaseJ4 framework) which work with 4.8.1 are failing when I try to run them with SOLR 4.9. I'm trying to figure out (and fix) the test cases. We are using maven surefire plugin with JUNIT to run the tests

Re: Singular Plural Results Inconsistent - SOLR v3.6 and EnglishMinimalStemFilterFactor

2016-04-14 Thread Walter Underwood
Solr 3.6 is a VERY old release. You won’t see any fixes for that. I would recommend starting with Solr 5.5 and keeping an eye on Solr 6.x, which has just started releases. Removing -ing endings is pretty aggressive. That changes “tracking meeting” into “track meet”. Most of the time, you’ll be

Re: How to search for a First, Last of contact which are stored in differnet multivalued fields

2016-04-14 Thread Thrinadh Kuppili
Thank you Eric Will try it and let you know. -- View this message in context: http://lucene.472066.n3.nabble.com/How-to-search-for-a-First-Last-of-contact-which-are-stored-in-differnet-multivalued-fields-tp4269901p4270192.html Sent from the Solr - User mailing list archive at Nabble.com.

DIH error - Bad Request

2016-04-14 Thread Brian Narsi
We have solr 5.1.0 running for several months retrieving about 10.5 millions records with no issues and no errors or warning in logs. I checked several times and the number of records reported as processed in DIH was exactly the same number in the collection. Recently I reviewed logs and found out

Re: Solr 5.5 timeout of solrj client

2016-04-14 Thread Erick Erickson
BTW, the place optimize seems best used is when the index isn't updated very often. I've seen a pattern where the index is updated once a night (or even less). In that situation, optimization makes more sense. But when an index is continually updated, it's mostly wasted effort. Best, Erick On Thu

Singular Plural Results Inconsistent - SOLR v3.6 and EnglishMinimalStemFilterFactor

2016-04-14 Thread Sara Woodmansee
Hello all, I posted yesterday, however I never received my own post, so worried it did not go through (?) Also, I am not a coder, so apologies if not appropriate to post here. I honestly don't know where else to turn, and am determined to find a solution, as search is essential to our site. We

Re: Solr 5.5 timeout of solrj client

2016-04-14 Thread Erick Erickson
Unless you have somewhat unusual circumstances, I wouldn't optimize at all, despite the name it really doesn't help all that much in _most_ cases. If your percentage deleted docs doesn't exceed, say, 15-20% I wouldn't bother. Most of what optimize does is reclaim resources from deleted docs. This

RE: Solr Support for BM25F

2016-04-14 Thread Jay Parashar
To use per-field similarity you have to add to your schema.xml file: And then in individual fields you can use the BM25 with different k1 and b. -Original Message- From: David Cawley [mailto:david.cawl...@mail.dcu.ie] Sent: Thursday, April 14, 2016 11:42 AM To: solr-user@lucene.apache.or

Re: Solr 5.5 timeout of solrj client

2016-04-14 Thread Novin Novin
Thanks Erick, for pointing out. You are right. I was optimizing every 10 mins. And I have change this to every day in night. On 14-Apr-2016 5:20 pm, "Erick Erickson" wrote: > don't issue an optimize command... either you have a solrj client that > issues a client.optimize() command or you pre

RE: Multiple data-config.xml in one collection?

2016-04-14 Thread Jay Parashar
Thanks a lot Daniel. -Original Message- From: Davis, Daniel (NIH/NLM) [C] [mailto:daniel.da...@nih.gov] Sent: Thursday, April 14, 2016 11:41 AM To: solr-user@lucene.apache.org Subject: RE: Multiple data-config.xml in one collection? Jay Parashar wrote: > One thing, and please correct if

Re: Solr Support for BM25F

2016-04-14 Thread Diego Ceccarelli (BLOOMBERG/ LONDON)
Hi David, I implemented bm25f for Europeana on Solr 4.x a couple of years ago, you can find it here: https://github.com/europeana/contrib/tree/master/bm25f-ranking maybe I should contribute it back.. Please do not hesitate to contact me if you need help :) Cheers, Diego From: solr-user@lucen

Re: Solr Support for BM25F

2016-04-14 Thread Chris Hostetter
: a per field basis. I understand BM25 similarity is now supported in Solr BM25 has been supported for a while, the major change recently is that it is now the underlying default in Solr 6. : but I was hoping to be able to configure k1 and b for different fields such : as title, description, an

Re: Solr Support for BM25F

2016-04-14 Thread Doug Turnbull
Hey David You can configure BM25 differently for each field by configuring the similarity per field type, as shown here in this example from the Solr tests https://github.com/sudarshang/lucene-solr/blob/master/solr/core/src/test-files/solr/conf/schema-bm25.xml#L32 On Thu, Apr 14, 2016 at 12:41 P

Solr Support for BM25F

2016-04-14 Thread David Cawley
Hello, I am developing an enterprise search engine for a project and I was hoping to implement BM25F ranking algorithm to configure the tuning parameters on a per field basis. I understand BM25 similarity is now supported in Solr but I was hoping to be able to configure k1 and b for different field

RE: Multiple data-config.xml in one collection?

2016-04-14 Thread Davis, Daniel (NIH/NLM) [C]
Jay Parashar wrote: > One thing, and please correct if wrong, I have noticed running DataImport for > a particular config overwrites the existing data for a document...that is, > there is > no way to preserve the existing data. > > For example if you have a schema of 5 fields and running the

Re: MiniSolrCloudCluster usage in solr 7.0.0

2016-04-14 Thread Erick Erickson
Rohana: Let's back up a bit, this really feels like an XY problem. Why do you want to do this? MiniSolrCloudCluster is designed as a test mechanism, it is not intended (AFAIK) for any kind of stand-alone operation so you'd be on your own if thats your goal... Best, Erick On Thu, Apr 14, 2016 at

Re: Solr 5.5 timeout of solrj client

2016-04-14 Thread Erick Erickson
don't issue an optimize command... either you have a solrj client that issues a client.optimize() command or you pressed the "optimize now" in the admin UI. Solr doesn't do this by itself. Best, Erick On Thu, Apr 14, 2016 at 8:30 AM, Novin Novin wrote: > How can I stop happening "DirectUpdateHan

RE: Multiple data-config.xml in one collection?

2016-04-14 Thread Jay Parashar
You have to specify which one to run. Each DIH will run only one XML (e.g. health-topics-conf.xml) One thing, and please correct if wrong, I have noticed running DataImport for a particular config overwrites the existing data for a document...that is, there is no way to preserve the existing d

Re: Solr 5.5 timeout of solrj client

2016-04-14 Thread Novin Novin
How can I stop happening "DirectUpdateHandler2 Starting optimize... Reading and rewriting the entire index! Use with care" Thanks novin On 14 April 2016 at 14:36, Shawn Heisey wrote: > On 4/14/2016 7:23 AM, Novin Novin wrote: > > Thanks for reply Shawn. > > > > Below is snippet of jetty.xml and

Re: Optimal indexing speed in Solr

2016-04-14 Thread John Bickerstaff
Stupid phone autocorrect... If you add updated documents of the same ID over time, optimizing your collection(s) may help. On Thu, Apr 14, 2016 at 7:50 AM, John Bickerstaff wrote: > If you delete a lot of documents over time, or if you add updated > documents of the same I'd over time, optimizi

RE: MiniSolrCloudCluster usage in solr 7.0.0

2016-04-14 Thread Rohana Rajapakse
Thanks Shawn. I have added few dependency jars into my project. There are no compilation errors or ClassNotFound exceptions, but Zookeeper exception " KeeperException$NodeExistsException: KeeperErrorCode = NodeExists for /solr/solr.xml ". My temporary solrHome folder has a solr.xml. No other f

Re: Solr best practices for many to many relations...

2016-04-14 Thread Jack Krupansky
Solr is a search engine, not a database. JOINs? Although Solr does have some limited JOIN capabilities, they are more for special situations, not the front-line go-to technique for data modeling for search. Rather, denormalization is the front-line go-to technique for data modeling in Solr. In a

Solr best practices for many to many relations...

2016-04-14 Thread Bastien Latard - MDPI AG
Hi Guys, /I am upgrading from solr 4.2 to 6.0.// //I successfully (after some time) migrated the config files and other parameters.../ Now I'm just wondering if my indexes are following the best practices...(and they are probably not :-) ) What would be the best if we have this kind of sql

Re: Optimal indexing speed in Solr

2016-04-14 Thread John Bickerstaff
If you delete a lot of documents over time, or if you add updated documents of the same I'd over time, optimizing your collection(s) may help. On Apr 14, 2016 3:52 AM, "Emir Arnautovic" wrote: > Hi Edwin, > Indexing speed depends on multiple factors: HW, Solr configurations and > load, documents,

DIH with Nested Documents - Configuration Issue

2016-04-14 Thread Jeff Chastain
I am working on a project where the specification requires a parent - child relationship within the Solr data collection ... i.e. a user and the collection of languages they speak (each of which is made up of multiple data fields). My production system is a 4.10 Solr implementation but I have a

Re: Solr 5.5 timeout of solrj client

2016-04-14 Thread Shawn Heisey
On 4/14/2016 7:23 AM, Novin Novin wrote: > Thanks for reply Shawn. > > Below is snippet of jetty.xml and jetty-https.xml > > jetty.xml:38: name="solr.jetty.threads.idle.timeout" default="5000"/> > /// I presume this one I should increase, But I believe 5 second is enough > time for 250 docs to

Re: dynamicField and type solr.LatLonType

2016-04-14 Thread Shawn Heisey
On 4/14/2016 4:56 AM, Vangelis Katsikaros wrote: > but can't index with > > > subFieldSuffix="_coordinate"/> > > > > > > > I get the following: > HTTP Status 400 - ERROR: [doc=123] Error adding field > 'lala_1'='50.657398,-2.366020' When a LatLonType field is indexed, it will create two addi

Re: Solr 5.5 timeout of solrj client

2016-04-14 Thread Novin Novin
Thanks for reply Shawn. Below is snippet of jetty.xml and jetty-https.xml jetty.xml:38: /// I presume this one I should increase, But I believe 5 second is enough time for 250 docs to add to solr. jetty.xml:39: jetty-https.xml:45: I'm also seeing "DirectUpdateHandler2 Starting

Re: Solr 5.5 timeout of solrj client

2016-04-14 Thread Shawn Heisey
On 4/14/2016 4:40 AM, Novin Novin wrote: > I'm having error > > when sending solr doc > mid15955728 > org.apache.solr.client.solrj.SolrServerException: Timeout occured > while waiting response from server at: > http://localhost.com:8983/solr/analysis > Caused by: java.net.SocketTimeoutException

Re: MiniSolrCloudCluster usage in solr 7.0.0

2016-04-14 Thread Shawn Heisey
On 4/14/2016 2:01 AM, Rohana Rajapakse wrote: > Can someone give a sample code snippet to create MiniSolrCloudCluster from a > separate java application (outside of solr codebase). Wants to know > dependency jars and config files you need. I would imagine that you need to start with solr-test-f

Re: How to declare field type for IntPoint field in solr 6.0 schema?

2016-04-14 Thread Shawn Heisey
On 4/13/2016 8:57 PM, Rafis Ismagilov wrote: > Should it be PointType, BinaryField, or something else. All examples use > TrieIntField for int. Solr doesn't have support for the new Point types in Lucene yet. They are a recent introduction, and Solr was caught a little off guard by how fast they

dynamicField and type solr.LatLonType

2016-04-14 Thread Vangelis Katsikaros
Hi I use solr [1] on ubuntu 14.04. I am trying to define a dynamicField on a custom type (ie nont built in like "int"). I don't see something mentioned in the documentation that prohibits it but I can't seem to make it work. For a built in type, my code can index fine the following in schema.

Solr 5.5 timeout of solrj client

2016-04-14 Thread Novin Novin
Hi guys, I'm having error when sending solr doc mid15955728 org.apache.solr.client.solrj.SolrServerException: Timeout occured while waiting response from server at: http://localhost.com:8983/solr/analysis at org.apache.solr.client.solrj.impl.HttpSolrClient.executeMethod(HttpSolrClient.j

Re: Optimal indexing speed in Solr

2016-04-14 Thread Emir Arnautovic
Hi Edwin, Indexing speed depends on multiple factors: HW, Solr configurations and load, documents, indexing client: More complex documents, more CPU time to process each document before indexing structure is written down to disk. Bigger the index, more heap is used, more frequent GCs. Maybe you

MiniSolrCloudCluster usage in solr 7.0.0

2016-04-14 Thread Rohana Rajapakse
Can someone give a sample code snippet to create MiniSolrCloudCluster from a separate java application (outside of solr codebase). Wants to know dependency jars and config files you need. Thanks Rohana Registered Office: 24 Darklake View, Estover, Plymouth, PL6 7TL. Company Registration N