Re: File not found error starting Solr 7.4.0 on Windows

2018-07-30 Thread Michael Glasson
Thanks Erick. I tried option 2, changing instances of 'file:%' to 'file:///%' solr.cmd. There were many instances. In fact, 'file:%' was used consistently. When I made the changes, all the errors went away. Regards, Michael

Re: Solr fails even ZK quorum has majority

2018-07-23 Thread Michael Braun
Per the exception, this looks like a network / DNS resolution issue, independent of Solr and Zookeeper code: Caused by: org.apache.solr.common.SolrException: java.net.UnknownHostException: ditsearch001.es.com: Name or service not known Is this address actually resolvable at the time? On Mon, Jul

File not found error starting Solr 7.4.0 on Windows

2018-07-17 Thread Michael Glasson
Has anyone seen this one? F:\Users\user\Downloads\solr-7.4.0>bin\solr.cmd start ERROR StatusLogger Unable to access file:/F:/Users/user/Downloads/solr-7.4.0/server/file:F:/Users/user/Downloads /solr-7.4.0/server/resources/log4j2.xml java.io.FileNotFoundException: F:\Users\user \Downloads\solr

Re: Hardware-Aware Solr Coud Sharding?

2018-07-16 Thread Michael Braun
ld be sensitive to that. > > > > WARNING: haven't done this myself so don't have any examples to point > to > > > > Best, > > Erick > > > > On Tue, Jun 12, 2018 at 8:34 AM, Shawn Heisey > wrote: > >> On 6/12/2018 9:12 AM,

Re: How to split index more than 2GB in size

2018-06-20 Thread Michael Kuhlmann
dexes of hundreds of megabytes in size, which won't fit into an individual machine. Best, Michael Am 20.06.2018 um 19:58 schrieb Sushant Vengurlekar: > Thank you for the detailed response Eric. Very much appreciated. The reason > I am looking into splitting the index into two is because it’s m

Hardware-Aware Solr Coud Sharding?

2018-06-12 Thread Michael Braun
instances, and 4 instances, respectively). Has anyone looked into other ways of handling this that don't require the additional Solr instance deployments? -Michael

Re: Solr Cloud 7.3.1 backups (autofs/NFS)

2018-06-01 Thread Michael Joyner
installed and enabled on nodes, only the NFS host needs any real configuration this way) On 05/31/2018 05:28 PM, Greg Roodt wrote: Thanks! I wasn't aware this existed. Have you used it with Solr backups? On Fri, 1 Jun 2018 at 00:07, Michael Joyner <mailto:mich...@newsrx.com>> wro

Can one set a short node name in Solr Cloud

2018-05-30 Thread Michael Schumann
We are running Solr Cloud version 7.2. The node names in ZooKeeper are very long: over 50 characters. Is there a way to set a friendlier short name both for display purposes in the admin console and also to use when interacting with collections API? Thank you, Michael

Re: Shard size variation

2018-05-03 Thread Michael Joyner
h no parameters respects the maximum segment size, which is a change from now. Finally, expungeDeletes may be useful as that too will respect max segment size, again after LUCENE-7976 is committed. Best, Erick On Wed, May 2, 2018 at 9:22 AM, Michael Joyner wrote: The main reason we go this route is th

Load balanced Solr cluster not updating leader

2018-05-02 Thread Michael B. Klein
d be a big help. Thanks, Michael -- Michael B. Klein Lead Developer, Repository Development and Administration Northwestern University Libraries

Re: Shard size variation

2018-05-02 Thread Michael Joyner
uffer from the problem that they create massive segments that then stick around for a very long time, see: https://lucidworks.com/2017/10/13/segment-merging-deleted-documents-optimize-may-bad/ Best, Erick On Mon, Apr 30, 2018 at 1:56 PM, Michael Joyner wrote: Based on experience, 2x head room is r

Re: Shard size variation

2018-04-30 Thread Michael Joyner
Based on experience, 2x head room is room is not always enough, sometimes not even 3x, if you are optimizing from many segments down to 1 segment in a single go. We have however figured out a way that can work with as little as 51% free space via the following iteration cycle: public void so

How can I prevent adding duplicated copyfield into managed schema

2018-02-06 Thread Michael Hu
containing managed-schema.xml) because each core has its own schema update lock. Above approach does not work for this case. Please advise! Thank you so much! --Michael

Re: 7.2.1 cluster dies within minutes after restart

2018-01-29 Thread Michael Braun
Believe this is reported in https://issues.apache.org/jira/browse/SOLR-10471 On Mon, Jan 29, 2018 at 2:55 PM, Markus Jelsma wrote: > Hello SG, > > The default in solr.in.sh is commented so it defaults to the value set in > bin/solr, which is fifteen seconds. Just uncomment the setting in > solr

Re: LTR original score feature

2018-01-29 Thread Michael Alcorn
>It seems to me that the original score feature is not useful because it is not normalized across all queries and therefore cannot be used to compare relevance in different queries. I don't agree with this statement and it's not what Alessandro was suggesting ("When you put the original score toge

Re: LTR original score feature

2018-01-12 Thread Michael Alcorn
What you're suggesting is that there's a "nonlinear relationship " between the original score (the input variable) and some measure of "relevance" (the outpu

Re: SolrException undefined field *

2018-01-09 Thread Michael Kuhlmann
p you better when you pass the full query string (if you're able to fetch it). -Michael Am 09.01.2018 um 16:38 schrieb Michael Kuhlmann: > First, you might want to index, but what Solr is executing here is a > search request. > > Second, you're querying for a dynamic fiel

Re: SolrException undefined field *

2018-01-09 Thread Michael Kuhlmann
s) is doing the wrong thing. Or your schema definition is not matching what the client expects. Since we don't know what client code you're using, we can't tell more. -Michael Am 09.01.2018 um 16:31 schrieb padmanabhan: > I get the below error whenever an indexing is executed..

Re: Edismax leading wildcard search

2017-12-22 Thread Michael Kuhlmann
e Lucene can't just look in the index for all terms beginning with something; it needs to look in all terms instead. Basically, indexed terms are in alphabetical order, but that doesn't help with leading wildcards. There's a ReversedWildcardFilterFactory in Solr to address this issue. -Michael

Re: Are the entries in managed-schema order dependent?

2017-12-20 Thread Michael Joyner
Thanks! On 12/20/2017 11:37 AM, Erick Erickson wrote: The schema is not order dependent, I freely mix-n-match the fieldType, copyField and field definitions for instance. On Wed, Dec 20, 2017 at 8:29 AM, Michael Joyner wrote: Hey all, I'm wanting to update our managed-schemas to in

Are the entries in managed-schema order dependent?

2017-12-20 Thread Michael Joyner
Hey all, I'm wanting to update our managed-schemas to include the latest options available in the 6.6.2 branch. (point types for one) I would like to be able to sort them and diff them (production vs dist supplied) to create a simple patch that can be reviewed, edited if necessary, and then

Re: How to sort on dates?

2017-12-18 Thread Michael Kuhlmann
oesn't really matter much. I only used normal date fields yet, usually they're fast enough. As a rule of thunb, only use DateRangeField if you really need to index date ranges. -Michael

Re: Wildcard searches with special character gives zero result

2017-12-15 Thread Michael Kuhlmann
-them/ for example. If you really need results for such queries, I suggest to have a copyField which is unstemmed and only tokenized on whitespaces. If you then detect a wildcard character in your query string, search on that field instead of the others. -Michael Am 15.12.2017 um 11:59 schrieb Selvam

Re: How to sort on dates?

2017-12-15 Thread Michael Kuhlmann
Hi Georgios, DateRangeField is a kind of SpatialField which is not sortable at all. For sorting, use a DatePointField instead. It's not deprecated; the deprecated class is TrieDateField. Best, Michael Am 15.12.2017 um 10:53 schrieb Georgios Petasis: > Hi all, > > I have a

Re: NullPointerException in PeerSync.handleUpdates

2017-11-22 Thread Michael Braun
I went ahead and resolved the jira - it was never seen again by us in later versions of Solr. There are a number of bug fixes since the 6.2 release, so I personally recommend updating! On Wed, Nov 22, 2017 at 11:48 AM, Pushkar Raste wrote: > As mentioned in the JIRA, exception seems to be coming

Re: Solr LTR plugin - Training

2017-11-16 Thread Michael Alcorn
Hi, Not sure if this is your issue or not, but the FieldQParser automatically converts multi-term arguments to phrases, so you might have to switch to the DisMaxQParser. I talk a little bit more about it here <https://github.com/airalcorn2/Solr-LTR#learning-to-rank>. -Michael On Thu,

Re: Long blocking during indexing + deleteByQuery

2017-11-08 Thread Michael McCandless
I'm not sure this is what's affecting you, but you might try upgrading to Lucene/Solr 7.1; in 7.0 there were big improvements in using multiple threads to resolve deletions: http://blog.mikemccandless.com/2017/07/lucene-gets-concurrent-deletes-and.html Mike McCandless http://blog.mikemccandless.c

Re: recent utf8 problems

2017-11-06 Thread Dr. Mario Michael Krell
Hi, thank you for your time and trying to narrow down my problem. 1) When looking for Tübingen in the title, I am expecting the 3092484 results. That sounds like a reasonable result. Furthermore, when looking at some of the results, they are exactly what I am looking for. 2) I am testing them

Re: recent utf8 problems

2017-11-06 Thread Dr. Mario Michael Krell
Java upgrade > then you need to show your config before folks can help. > Cheers -- Rick > > > On November 4, 2017 5:11:36 PM EDT, "Dr. Mario Michael Krell" > wrote: >> Hi, >> >> We recently discovered issues with solr with converting utf8 code in &g

recent utf8 problems

2017-11-04 Thread Dr. Mario Michael Krell
Hi, We recently discovered issues with solr with converting utf8 code in the search. One or two month ago everything was still working. - What might have caused it is a Java update (Java 8 Update 151). - We are using firefox as well as chrome for displaying results. - We tested it with Solr 6.5

From Zero to Learning to Rank in Apache Solr

2017-11-02 Thread Michael Alcorn
Here's a tutorial I wrote that some of you all might find useful: https://github.com/airalcorn2/Solr-LTR. Feedback is welcome. Thanks, Michael A. Alcorn

Re: SOLR-11504: Provide a config to restrict number of indexing threads

2017-11-02 Thread Michael McCandless
Actually, it's one lucene segment per *concurrent* indexing thread. So if you have 10 indexing threads in Lucene at once, then 10 in-memory segments will be created and will have to be written on refresh/commit. Elasticsearch uses a bounded thread pool to service all indexing requests, which I th

How to Efficiently Extract Learning to Rank Similarity Features From Solr?

2017-10-23 Thread Michael Alcorn
r, to extract these similarity features from Solr, I have to perform a Solr query for each query, which introduces several bottlenecks. Are there more efficient means of computing these similarity measures for large numbers of queries (other than increased parallelism)? Thanks, Michael A. Alcorn

Using pint field as uniqueKey

2017-10-17 Thread Michael Kondratiev
I'm trying to set up uniqueKey ( what is integer) like that: id But when I upload configuration into solr i see following error: uniqueKey field (id) can not be configured to use a Points based FieldType: pint If i set type=“string” everything seems to be ok.

pint as uniquekey

2017-10-17 Thread Michael Kondratiev
Hello! I'm trying to set up uniqueKey ( what is integer) like that: id But when I upload configuration into solr i see following error: uniqueKey field (id) can not be configured to use a Points based FieldType: pint If i set type=“string” everything seems to be ok.

Re: Strange Behavior When Extracting Features

2017-10-16 Thread Michael Alcorn
If anyone else is following this thread, I replied on the Jira. On Mon, Oct 16, 2017 at 4:07 AM, alessandro.benedetti wrote: > This is interesting, the EFI parameter resolution should work using the > quotes independently of the query parser. > At that point, the query parsers (both) receive a m

Re: Strange Behavior When Extracting Features

2017-10-13 Thread Michael Alcorn
I believe I've discovered a workaround. If you use: { "store": "redhat_efi_feature_store", "name": "case_description_issue_tfidf", "class": "org.apache.solr.ltr.feature.SolrFeature", "params": { "q":"{!dismax qf=text_tfidf}${text}" } } i

Re: Parsing of rq queries in LTR

2017-10-13 Thread Michael Alcorn
feature.SolrFeature", "params": { "q": "{!field f=issue_tfidf}${case_description}" } } you can then use single quotes to incorporate multi-term arguments as Alessandro suggested. I've added this information to the Jira. On Th

Re: Parsing of rq queries in LTR

2017-10-12 Thread Michael Alcorn
It turns out my last comment on that Jira was mistaken. Multi-term EFI arguments still exhibit unexpected behavior. Binoy is trying to help me figure out what the issue is. I plan on updating the Jira once we've figured out the problem. On Thu, Oct 12, 2017 at 3:41 AM, alessandro.benedetti wrote:

Re: Newbie question about why represent timestamps as "float" values

2017-10-10 Thread Michael Kuhlmann
test updates, or chosing which document is more recent, it's totally okay to index it as a long value. But definitely not as a float. -Michael Am 10.10.2017 um 10:55 schrieb alessandro.benedetti: > There was time ago a Solr installation which had the same problem, and the > author

Is there a parsing issue with "OR NOT" or is something else going on? (Solr 6)

2017-10-02 Thread Michael Joyner
'+*.*)^1.0' component in the 2nd query's parsedquery which is not in the 1st query's parsedquery response. The first query does not seem to be matching any of the "NOT" articles to include in the union of sets and is not giving us the expected results. Is wrapping &

Re: Where the uploaded configset from SOLR into zookeeper ensemble resides?

2017-09-27 Thread Michael Kuhlmann
Do you find your configs in the Solr admin panel, in the Cloud --> Tree folder? -Michael Am 28.09.2017 um 04:50 schrieb Gunalan V: > Hello, > > Could you please let me know where can I find the uploaded configset from > SOLR into zookeeper ensemble ? > > In docs it says

Re: Modifing create_core's instanceDir attribute

2017-09-27 Thread Michael Kuhlmann
ute the following part of the URL as additional commands, which of course fails. Just put the URL in quotes, and it will work much better. -Michael Am 27.09.2017 um 23:14 schrieb Miller, William K - Norman, OK - Contractor: > I understand that this has to be done on the command line, but I don

Re: Moving to Point, trouble with IntPoint.newRangeQuery()

2017-09-26 Thread Michael Kuhlmann
Arrgh, forget my question. I just see that newExactQuery() simply triggers newRangeQuery() like you already do. -Michael Am 26.09.2017 um 13:29 schrieb Michael Kuhlmann: > Hi Markus, > > I don't know why there aren't any results. But just out of curiosity, > why don'

Re: Moving to Point, trouble with IntPoint.newRangeQuery()

2017-09-26 Thread Michael Kuhlmann
Hi Markus, I don't know why there aren't any results. But just out of curiosity, why don't you use the better choice IntPoint.newExectQuery(String,int)? What happens if you use that? -Michael Am 26.09.2017 um 13:22 schrieb Markus Jelsma: > Hello, > > I have a QPars

Re: Solr nodes crashing (OOM) after 6.6 upgrade

2017-09-22 Thread Michael Kuhlmann
look at the Amazon cloud settings? Best, Michael Am 22.09.2017 um 12:00 schrieb shamik: > All the tuning and scaling down of memory seemed to be stable for a couple of > days but then came down due to a huge spike in CPU usage, contributed by G1 > Old Generation GC. I'm rea

Strange Behavior When Extracting Features

2017-09-20 Thread Michael Alcorn
05e2229434b38aed11cc65 Have any of you run into this issue? Seems like it could be a bug. Thanks, Michael A. Alcorn

Re: solr Facet.contains

2017-09-15 Thread Michael Kuhlmann
What is the field type? Which Analyzers are configured? How do you split at "~"? (You have to do it by yourself, or configure some tokenizer for that.) What do you get when you don't filter your facets? What do you mean with "it is not working"? What is your result now?

Re: ways to check if document is in a huge search result set

2017-09-12 Thread Michael Kuhlmann
Am 13.09.2017 um 04:04 schrieb Derek Poh: > Hi Michael > > "Then continue using binary search depending on the returned score > values." > > May I know what do you mean by using binary search? An example algorithm is in Java method java.util.Arrays::binarySear

Re: ways to check if document is in a huge search result set

2017-09-12 Thread Michael Kuhlmann
n-thousand results at once. But be warned, the result position of these elements can vary with every single commit, esp. when there're lots of documents with the same score value. -Michael Am 12.09.2017 um 03:21 schrieb Derek Poh: > Some additional information. > > I have a query from

Re: ways to check if document is in a huge search result set

2017-09-10 Thread Michael Kuhlmann
Maybe I don't understand your problem, but why don't you just filter by "supplier information"? -Michael Am 11.09.2017 um 04:12 schrieb Derek Poh: > Hi > > I have a collection of productdocument. > Each productdocument has supplier information in it. > > I

Re: Solr Issue

2017-09-07 Thread Michael Kuhlmann
Hi Patrick, can you attach the query you're sending to Solr and one example result? Or more specific, what are your hl.* parameters? -Michael Am 07.09.2017 um 09:36 schrieb Patrick Fallert: > > Hey Guys,  > i´ve got a problem with my Solr Highlighter.. > When I search for a

Re: Solr6.6 Issue/Bug

2017-09-06 Thread Michael Kuhlmann
Why would you need to start Solr as root? You should definitely not do this, there's no reason for that. And even if you *really* want this: What's so bad about the -force option? -Michael Am 06.09.2017 um 07:26 schrieb Kasim Jinwala: > Dear team, > I am using solr

Re: SOLR Learning to Rank Questions

2017-08-22 Thread Michael Nilsson
Hey Jaoa! To also address your second question, the purpose of the normalizers is to ensure that whatever manipulation you did to your feature values offline at training time (say to minimize floating point precision roundoff) also get reflected online at query rerank time, since you will be passi

Re: Per Text Field Similarity Measures for Learning to Rank

2017-08-22 Thread Michael Nilsson
Hi Michael, Using your example, if you have 5 different fields, you could create 5 individual SolrFeatures against those fields. The one tricky thing here is that you want to use different similarity scoring mechanisms against your fields. By default, Solr uses a single Similarity class <ht

Re: Solr 6.6.0 - High CPU during indexing

2017-08-18 Thread Michael Braun
Have you attached JVisualVM or a similar application to the process to sample where the time is being spent? It can be very helpful for debugging this sort of problem. On Fri, Aug 18, 2017 at 12:37 PM, Joe Obernberger < joseph.obernber...@gmail.com> wrote: > Indexing about 15 million documents pe

Is there a way to determine fields available for faceting for a search without doing the faceting?

2017-08-10 Thread Michael Joyner
Hey all! Is there a way to determine fields available for faceting (those with data) for a search without actually doing the faceting for the fields? -Mike/NewsRx

Re: Highlighting Performance improvement suggestions required - Solr 6.5.1

2017-08-09 Thread Michael Braun
Have you attached JVisualVM or a similar tool for sampling when Solr is answering the requests with highlight? What relevant methods are coming up? On Wed, Aug 9, 2017 at 11:26 AM, sasarun wrote: > Hi Amrit, > > Thanks for the response. I did went through both and that is how I landed > up > wit

Per Text Field Similarity Measures for Learning to Rank

2017-08-04 Thread Michael Alcorn
that type of thing in Solr. It looks like the SolrFeature class might be the way to go, but my colleagues who are more familiar with Solr than I am weren't sure it was possible. Thanks, Michael A. Alcorn

Re: Replication Question

2017-08-02 Thread Michael B. Klein
And the one that isn't getting the updates is the one marked in the cloud diagram as the leader. /me bangs head on desk On Wed, Aug 2, 2017 at 10:31 AM, Michael B. Klein wrote: > Another observation: After bringing the cluster back up just now, the > "1-in-3 nodes don't g

Re: Replication Question

2017-08-02 Thread Michael B. Klein
Another observation: After bringing the cluster back up just now, the "1-in-3 nodes don't get the updates" issue persists, even with the cloud diagram showing 3 nodes, all green. On Wed, Aug 2, 2017 at 9:56 AM, Michael B. Klein wrote: > Thanks for your responses, Shawn a

Re: Replication Question

2017-08-02 Thread Michael B. Klein
fact that this happened (and the way it happened) is making me wonder if/how I can automate this automated staging environment scaling reliably and with confidence that it will Just Work™. Comments and suggestions would be GREATLY appreciated. Michael On Tue, Aug 1, 2017 at 8:14 PM, Erick E

Replication Question

2017-08-01 Thread Michael B. Klein
`replicateAfter` options on the `/replication` requestHandler, or is that a non-solrcloud, standalone-replication thing? Michael

phrase highlight, exact phrases only?

2017-07-25 Thread Michael Joyner
Hello, We are using highlighting and are looking for the exact phrase "HIV Prevention" but are receiving back highlighted snippets like the following where non-phrase matching portions are being highlighted, is there a setting to highlight the entire phrase instead of any partial token match

Re: mm = 1 and multi-field searches (update)

2017-07-24 Thread Michael Joyner
keywords_text_general^0.5 keywords_en^0.1 On 07/21/2017 01:46 PM, Susheel Kumar wrote: Interesting. If its working for you then its good but to your original question, qf seems to be working. Adding to mailing list for the benefit of others. On Fri, Jul 21, 2017 at 9:41 AM, Michael Joyner wrote: Thanks, We

mm = 1 and multi-field searches

2017-07-10 Thread Michael Joyner
Hello all, How does setting mm = 1 for edismax impact multi-field searches? We set mm to 1 and get zero results back when specifying multiple fields to search across. Is there a way to set mm = 1 for each field, but to OR the individual field searches together? -Mike/NewsRx

RE: CDCR - how to deal with the transaction log files

2017-07-10 Thread Michael McCarthy
We have been experiencing this same issue for months now, with version 6.2. No solution to date. -Original Message- From: Xie, Sean [mailto:sean@finra.org] Sent: Sunday, July 09, 2017 9:41 PM To: solr-user@lucene.apache.org Subject: [EXTERNAL] Re: CDCR - how to deal with the transact

Cross Data Center Replication

2017-07-06 Thread Michael McCarthy
what could be causing this issue now? [cid:image001.png@01D2F647.54578B20] Michael McCarthy Senior Software Engineer T 470.769.6687 | M 678.986.4127 Georgia IT Innovation Center - South 2010 Warsaw Road, Roswell, GA 30076 [gm_logos] Confidentiality Note: The above email contains information

Same score for different length matches

2017-06-30 Thread Thomas Michael Engelke
Hey, we have multiple documents that are matches for the query in question ("name:hubwagen"). Thing is, some of the documents only contain the query, while others match 100% in the "name" field: Hochhubwagen 5.9861565 Hubwagen 5.9861565 The debug looks like this (for the first and 5th

Apache 4.9.1 - trouble trying to use complex phrase query parser.

2017-06-28 Thread Michael Craven
Hi - I am trying to use the complex phrase query parser on my Drupal installation. Our core is sore 4.9.1, so I thought it should be no problem. Search works fine when I use a local parameter to do a search of type lucene, dismax, or edismax, (a la {!lucene} etc.), but when I try to do a searc

Proximity searches with a wildcard

2017-06-23 Thread Michael Craven
that could give me some advice or point me to a resource that can? Thanks -Michael

Re: Error after moving index

2017-06-22 Thread Moritz Michael
BTW, is there a better/recommended way to transfer an index to another solr? On Thu, Jun 22, 2017 at 6:09 PM +0200, "Moritz Michael&quo

Re: Error after moving index

2017-06-22 Thread Moritz Michael
Hello Michael, I used the backup functionality to create a snapshot and uploaded this snapshot, so I feel it should be save.  I'll try it again. Maybe the copy operation wasn't successful. 

Re: Error after moving index

2017-06-22 Thread Michael Kuhlmann
Hi Moritz, did you stop your local Solr sever before? Copying data from a running instance may cause headaches. If yes, what happens if you copy everything again? It seems that your copy operations wasn't successful. Best, Michael Am 22.06.2017 um 14:37 schrieb Moritz Munte: &g

Re: org.apache.lucene.index.CheckIndex throws Illegal initial capacity: -16777216

2017-06-18 Thread Moritz Michael
ctory over a 4x index, but then you say you're aware that you can't do that so I'm all confused. Did you try the 4.6 checkindex program? Best, Erick On Sat, Jun 17, 2017 at 1:43 PM, Moritz Michael wrote: > > > > > > > Thx for the advice Alan. I

Re: org.apache.lucene.index.CheckIndex throws Illegal initial capacity: -16777216

2017-06-17 Thread Moritz Michael
rom 5.x to 6 using the current upgrade tool. Alan Woodward www.flax.co.uk > On 17 Jun 2017, at 10:08, Moritz Michael wrote: > > Hello, > > I'm trying to upgrade a Solr 4.6 index to Solr 6. > The upgrade does fail with an error. > > I tried to check the index with o

org.apache.lucene.index.CheckIndex throws Illegal initial capacity: -16777216

2017-06-17 Thread Moritz Michael
Hello, I'm trying to upgrade a Solr 4.6 index to Solr 6. The upgrade does fail with an error. I tried to check the index with org.apache.lucene.index.CheckIndex using this command: java -cp lucene-core-5.5.4.jar:lucene-backward-codecs-5.5.4.jar -ea:org.apache.lucene... org.apache.lucene.index.Che

Re: Solr NLS custom query parser

2017-06-15 Thread Michael Kuhlmann
Hi Arun, your question is too generic. What do you mean with nlp search? What do you expect to happen? The short answer is: No, there is no such parser because the individual requirements will vary a lot. -Michael Am 14.06.2017 um 16:32 schrieb aruninfo100: > Hi, > > I am trying to

Re: StandardDirectoryReader.java:: applyAllDeletes, writeAllDeletes

2017-05-29 Thread Michael McCandless
> > > On Sun, May 28, 2017 at 2:16 PM, Nawab Zada Asad Iqbal > wrote: > >> Thanks Michael and Shawn for the detailed response. I was later able to >> pull the full history using gitk; and found the commits behind this patch. >> >> Mike: >> >> So, in solr

eDisMax with nested documents

2017-05-29 Thread Moritz Michael
Hello, I'm new to this list and having a question regarding nested documents with scoring through eDisMax. We do have an index of e-books that contains the metadata of the e-book and each page as a single document: - Book 1 - Page 1 - Page 2 - Page 3 - ... - Pag

Re: StandardDirectoryReader.java:: applyAllDeletes, writeAllDeletes

2017-05-28 Thread Michael McCandless
Sorry, yes, that commit was one of many on a feature branch I used to work on LUCENE-5438, which added near-real-time index replication to Lucene. Before this change, Lucene's replication module required a commit in order to replicate, which is a heavy operation. The writeAllDeletes boolean option

The unified highlighter html escaping. Seems rather extreme...

2017-05-26 Thread Michael Joyner
t Mark J. Mogul Marshall A. Schorin Mary A. Bonilla Mary L. Schmidt Melanie A. Comito Melissa A. Forouhar Michael B. Harris Michael K. Richards Michael S. Isakoff Minnie Abromowitch Mohamad M. Al-Rahawan Muna Qayed Narayana Gowda Nibal A. Zaghloul Nichole L. Bryant Nina S. Kadan-Lottick Nkechi I. Mba Pa

Is IndexSchema addFields and addCopyFields concurrent?

2017-05-17 Thread Michael Hu
copyFields (one field is source, the other is destination). This causes core initialization failure because the associated fields for copyFields do not exist? Can someone help me? Thank you! --Michael Hu synchronized (oldSchema.getSchemaUpdateLock

newbie question re solr.PatternReplaceFilterFactory

2017-05-10 Thread Michael Tobias
I am sure this is very simple but I cannot get the pattern right. How can I use solr.PatternReplaceFilterFactory to remove all words in brackets from being indexed? eg [ignore this] thanks Michael

recommended zookeeper version for solr cloud

2017-04-26 Thread David Michael Gang
Hi all, Which version of external zookeper is recommended to use in production environments? 3.4.6 which is the version shipped with solr or 3.4.10 which is the latest stable? Thanks, David

Re: Mixing AND OR conditions with query parameters

2017-04-24 Thread Michael Kuhlmann
Make sure to have a whitespace are the OR operator. The parenthesises should be around the OR query, not including the "fq:" -- this should be outside the parenthesises (which are not necessary at all). What exactly are you expecting? -Michael Am 24.04.2017 um 12:59 schrieb VJ: >

Re: AnalyzingInfixSuggester performance

2017-04-18 Thread Michael McCandless
xSuggester instead of a regular Solr index (since both are > using standard Lucene?) is that the AInfixSuggester does sorting at > index-time using the weightField? So it's only ever advantageous to use > this Suggester if you need sorting based on a field? > > Thanks > > On T

Re: extract multi-features for one solr feature extractor in solr learning to rank

2017-04-18 Thread Michael Nilsson
tion of the Feature Transformer <http://lucene.apache.org/solr/6_5_0/solr-ltr/org/apache/solr/ltr/response/transform/LTRFeatureLoggerTransformerFactory.html> to return features that actually triggered in a sparse format <https://cwiki.apache.org/confluence/display/solr/Learning+To+Rank#LearningT

Re: AnalyzingInfixSuggester performance

2017-04-18 Thread Michael McCandless
AnalyzingInfixSuggester uses index-time sort, to sort all postings by the suggest weight, so that lookup, as long as your sort by the suggest weight is extremely fast. But if you need to rank at lookup time by something not "congruent" with the index-time sort then you lose that benefit. Mike McC

Re: Solr Cloud 6.5.0 Replicas go down while indexing

2017-04-04 Thread Michael Joyner
Try Increasing the number of connections your ZooKeeper allows to a very large number. On 04/04/2017 09:02 AM, Salih Sen wrote: Hi, One of the replicas went down again today somehow disabling all updates to cluster with error message "Cannot talk to ZooKeeper - Updates are disabled.” half a

Re: fq performance

2017-03-17 Thread Michael Kuhlmann
should consider disabling the cache for this fq filter so that your filter cache (for other filters you probably have) won't be polluted: fq=*{!cache=false}*access_control:(g1 g2 g5 g99 ...). See http://yonik.com/advanced-filter-caching-in-solr/ for information on that. -Michael Am 1

Re: fq performance

2017-03-16 Thread Michael Kuhlmann
all of your fq filters fit into the cache. Each possible fq filter will take 1 million/8 == 125k bytes, so having hundreds of possible access groups conditions might blow up your query cache (which must fit into RAM). -Michael Am 16.03.2017 um 13:02 schrieb Ganesh M: Hi, We have 1 million o

Data Import Handler on 6.4.1

2017-03-15 Thread Michael Tobias
? ?? And then later in the solrconfig.xml I have: db-data-config.xml Any help much appreciated. Regards Michael -Original Message- From: David Hastings [mailto:hastings.recurs...@gmail.com] Sent: 15 March 2017 17:47 To: solr-user@lucene.apache.org Subject: Re: Get handler

Re: LTR on multiple shards

2017-03-08 Thread Michael Nilsson
helps, Michael On Wed, Mar 8, 2017 at 5:01 AM, Vincent wrote: > Hi all, > > It seems that the curl commands from the LTR wiki ( > https://cwiki.apache.org/confluence/display/solr/Learning+To+Rank) to > post and/or delete features from and to the feature store only affect one >

Re: question related to solr LTR plugin

2017-03-07 Thread Michael Nilsson
this info in a parallel collection and look it up there. Solr recently added jdbc driver support, so you might even be able to use that to get data from a sql store as well. I haven't messed around with these two options myself so you'd be treading somewhat new ground. If

Re: What is the bottleneck for an optimise operation? / solve the disk space and time issues by specifying multiple segments to optimize

2017-03-02 Thread Michael Joyner
You can solve the disk space and time issues by specifying multiple segments to optimize down to instead of a single segment. When we reindex we have to optimize or we end up with hundreds of segments and very horrible performance. We optimize down to like 16 segments or so and it doesn't do

messages in gc log not connected to gcs in indexing time

2017-03-02 Thread David Michael Gang
Hi all, When indexing data i get in the gc log messages like: 2017-03-02T10:43:17.872+: 1088.957: Total time for which application threads were stopped: 0.0002071 seconds, Stopping threads took: 0.888 seconds 2017-03-02T10:43:17.885+: 1088.970: Total time for which application threads

Re: minimal solrconfig example

2017-03-02 Thread David Michael Gang
Thanks Charly. This is what i looked for. On Thu, Mar 2, 2017 at 11:07 AM David Michael Gang wrote: I use the latest version. Solr 6.4.1 On Thu, Mar 2, 2017 at 9:15 AM Aravind Durvasula wrote: Hi David, What is the solr version you are using? To get started, it's better to use the c

Re: minimal solrconfig example

2017-03-02 Thread David Michael Gang
I use the latest version. Solr 6.4.1 On Thu, Mar 2, 2017 at 9:15 AM Aravind Durvasula wrote: > Hi David, > > What is the solr version you are using? > To get started, it's better to use the config file that comes out of the > box. > > Thanks, > Aravind > > > > -- > View this message in context:

minimal solrconfig example

2017-03-01 Thread David Michael Gang
Hi all, I want to create my first solr collection I found an example of solrconfig here. https://github.com/apache/lucene-solr/blob/master/solr/example/files/conf/solrconfig.xml This is a file of more than thousand lines. As i understand this file shows all possible configurations possible. What i

<    1   2   3   4   5   6   7   8   9   10   >