SOl6.3 Alchemy Annotator Not Working

2016-11-23 Thread soumitra80
Hi, I am getting below exception when I am trying to do data Import in solr 6.3 for Alchemy Annotator. 2016-11-24 06:23:12.121 ERROR (qtp-1937374684-28) [ x:dev] o.a.s.h.RequestHandlerBase org.apache.solr.common.SolrException: missing content stream at

Re: SOLR vs mongdb

2016-11-23 Thread Will Martin
mongdb governance criticisms. and a recognition that OSS community is full of people trying new combinations, or old negated scenarios. so governance models have to be coded; Solr has such. This Lucene listserv serves that purpose, among other resources, in coaching work in a secure and best

Need help to update multiple documents

2016-11-23 Thread Reddy Sankar
Hi Team, I am facing an issue updating multiple documents in Solr at one time in my batch job. Could you please help me by giving an example or documentation for the same? Thanks, Sankar Reddy M.B { "add": { "doc": { "quoteNumber": "133940", "quoteStatus": { "set":
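For readers following this thread, one common way to update several documents in a single request is to send a JSON array of atomic-update documents to the collection's update handler. The sketch below only builds the request body and sends nothing. It assumes `quoteNumber` is the collection's uniqueKey, and the status values and second quote number are made up for illustration.

```python
import json


def build_atomic_updates(updates, key_field="quoteNumber"):
    """Return a JSON array body with one atomic-update document per entry.

    `updates` maps a unique-key value to a dict of {field: new_value}.
    Each field is wrapped in a {"set": value} modifier.
    """
    docs = []
    for key, fields in updates.items():
        doc = {key_field: key}  # assumed uniqueKey field
        for name, value in fields.items():
            doc[name] = {"set": value}
        docs.append(doc)
    return json.dumps(docs)


# Hypothetical values; only "133940" appears in the original message.
body = build_atomic_updates({
    "133940": {"quoteStatus": "APPROVED"},
    "133941": {"quoteStatus": "REJECTED"},
})
print(body)
```

The resulting body would typically be POSTed to the collection's `/update` handler with `Content-Type: application/json`, followed by a commit.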

Re: SOLR vs mongdb

2016-11-23 Thread Walter Underwood
Sure. Someone sends an HTTP request that deletes all the content. I’m glad to share the curl request. Or you can put content in with fields that are indexed but not stored. Then the content is “gone” as soon as you send it to Solr. Or you change the schema and need to reindex, but don’t have

RE: SOLR vs mongdb

2016-11-23 Thread Kris Musshorn
Will someone please give me a detailed scenario where solr content could "disappear"? Disappear means what exactly? TIA, Kris -Original Message- From: Walter Underwood [mailto:wun...@wunderwood.org] Sent: Wednesday, November 23, 2016 7:47 PM To: solr-user@lucene.apache.org Subject:

Re: SolrCloud -Distribued Indexing

2016-11-23 Thread Jerome Yang
Hi, 1. You can use the Solr Collections API to create a collection with the "*implicit*" router. Please check CREATE: https://cwiki.apache.org/confluence/display/solr/Collections+API#CollectionsAPI-api1 2. There are several ways to indicate which collection you want to send a request to. a>
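As a rough illustration of the first point, the sketch below builds a Collections API CREATE URL that uses the implicit router, plus an update URL that targets one shard through the `_route_` parameter. The base URL, collection name, and shard names are assumptions chosen for the example, and no request is actually sent.

```python
from urllib.parse import urlencode

BASE = "http://localhost:8983/solr"  # assumed local Solr node


def create_implicit_collection_url(name, shards):
    """Build a CREATE call that names its shards explicitly (implicit router)."""
    params = {
        "action": "CREATE",
        "name": name,
        "router.name": "implicit",
        "shards": ",".join(shards),
    }
    return f"{BASE}/admin/collections?{urlencode(params)}"


def routed_update_url(collection, shard):
    """Build an update URL whose documents should land on the given shard."""
    return f"{BASE}/{collection}/update?{urlencode({'_route_': shard})}"


create_url = create_implicit_collection_url("mycoll", ["shardA", "shardB"])
update_url = routed_update_url("mycoll", "shardA")
print(create_url)
print(update_url)
```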

Re: Reload schema or configs failed then drop index, can not recreate that index.

2016-11-23 Thread Jerome Yang
It's Solr 6.1, cloud mode. Please ignore the first message. Just check my second email. I mean if I modify an existing collection's managed-schema and the modification makes the collection reload fail. Then I delete the collection, and delete the configs from ZooKeeper. After that upload an

Re: SOLR vs mongdb

2016-11-23 Thread Walter Underwood
Well, I didn’t actually recommend MongoDB as a repository. :-) If you want transactions and search, buy MarkLogic. I worked there for two years, and that is serious non-muggle technology. wunder Walter Underwood wun...@wunderwood.org http://observer.wunderwood.org/ (my blog) > On Nov 23,

Re: SOLR vs mongdb

2016-11-23 Thread Alexandre Rafalovitch
Actually, you need to be ok that your content will disappear when you use MongoDB as well :-( But I understand what you were trying to say. http://www.solr-start.com/ - Resources for Solr users, new and experienced On 24 November 2016 at 11:34, Walter Underwood

Re: SOLR vs mongdb

2016-11-23 Thread Walter Underwood
The choice is simple. Are you OK if all your content disappears and you need to reload? If so, use Solr. If not, you need some kind of repository. It can be files in Amazon S3. But Solr is not designed to preserve your data. wunder Walter Underwood wun...@wunderwood.org

Re: SOLR vs mongdb

2016-11-23 Thread Alexandre Rafalovitch
Solr supports automatic detection of content types for new fields. That was - unfortunately - named as schemaless mode. It still is typed under the covers and has limitations. Such as needing all automatically created fields to be multivalued (by the default schemaless definition). MongoDB is

Re: Comparing a Date value in solr

2016-11-23 Thread Sadheera Vithanage
Thank you Erick, The requirement doesn't really let me use the query like that. Rather, what I would be storing in my document is the day number. E.g: Day : 1, Day : 2 etc I can even store this in milliseconds like 8640,17280. And I want to compare if those days fall within the

RE: negation search help

2016-11-23 Thread Allison, Timothy B.
You've gotten far better answers on this already, but you can use the SpanNotQuery in the SpanQueryParser I maintain and have published to maven central [1][2][3]. This does not carry out any nlp, but this would allow literal "headache (no not)"!~5,0 -> "headache" but not if "no" or "not"

RE: Frequent mismatch in the numDocs between replicas

2016-11-23 Thread Lewin Joy (TMS)
Hi, Tried this. The explicit commit after Indexing is also not working. As for the leader's document count, the number of records in the leader is also not proper. It is not just the replicas having wrong numbers. Both the leader and replica are having wrong counts. And it

RE: SOLR vs mongdb

2016-11-23 Thread Prateek Jain J
Solr also supports schemaless behaviour, and my question is the same: why and where should we prefer MongoDB? Web search didn't help me on this. Regards, Prateek Jain -Original Message- From: Rohit Kanchan [mailto:rohitkan2...@gmail.com] Sent: 23 November 2016 07:07 PM To:

stream, features and train

2016-11-23 Thread Joe Obernberger
Hi - I'm trying to experiment with the new train, features, model, classify capabilities of Solr 6.3.0. I'm following along on: https://cwiki.apache.org/confluence/display/solr/Streaming+Expressions#StreamingExpressions-StreamSources When I execute: features(UNCLASS, q="*:*",

FileDictionaryFactory along with FuzzyLookupFactory not sending any suggestions

2016-11-23 Thread diwakar bhardwaj
Hi, I've already asked the question here: http://stackoverflow.com/questions/40771451/couldnt-get-data-in-suggester-even-when-storedir-getting-created-by-filediction so, I have a list of cities onto which I want to implement spell-checker. I have the priorities/weights of these cities with me.

Re: Wildcard searches with space in TextField/StrField

2016-11-23 Thread Reth RM
what is the fieldType of those records? On Tue, Nov 22, 2016 at 4:18 AM, Sandeep Khanzode < sandeep_khanz...@yahoo.com.invalid> wrote: > Hi Erick, > I gave this a try. > These are my results. There is a record with "John D. Smith", and another > named "John Doe". > > 1.] {!complexphrase

Re: SOLR vs mongdb

2016-11-23 Thread Rohit Kanchan
Hi Prateek, I think you are talking about two different animals. Solr(actually embedded lucene) is actually a search engine where you can use different features like faceting, highlighting etc but it is a document store where for each text it does create an Inverted index and map that to

SOLR vs mongdb

2016-11-23 Thread Prateek Jain J
Hi All, I have started to use mongodb and solr recently. Please feel free to correct me where my understanding is not upto the mark: 1. Solr is indexing engine but it stores both data and indexes in same directory. Although we can select fields to store/persist in solr via schema.xml.

Re: Should zookeeper be run on the worker machines?

2016-11-23 Thread Will Martin
This is laughable; the so use case wording and the request here. imo of course. On 11/23/2016 11:00 AM, Tech Id wrote: > Hi, > > Can someone please respond to this zookeeper-for-Solr Stack-Overflow > question: http://stackoverflow.com/questions/40755137/should- >

Re: Should zookeeper be run on the worker machines?

2016-11-23 Thread Dorian Hoxha
You can, but you should not. Source: heavy load may slow zookeeper resulting in timeouts etc. On Wed, Nov 23, 2016 at 5:00 PM, Tech Id wrote: > Hi, > > Can someone please respond to this zookeeper-for-Solr Stack-Overflow > question:

Re: negation search help

2016-11-23 Thread Rafa Haro
Hi Hem, The negation detection problem within the clinical or medical context has been tackled by many researchers. At https://github.com/chapmanbe/negex you can find different implementations using different approaches. I have not tested all of them, but I have widely used this concrete

Re: Solr 6 Performance Suggestions

2016-11-23 Thread Walter Underwood
I recently ran benchmarks on 4.10.4 and 6.2.1 and found very little difference in query performance. This was with 8 million documents (homework problems) from production. I used query logs from production. The load is a constant number of requests per minute from 100 threads. CPU usage is

Re: Solr 6 Performance Suggestions

2016-11-23 Thread Bram Van Dam
On 22/11/16 15:34, Prateek Jain J wrote: > I am not sure but I heard this in one of discussions, that you cant migrate > directly from solr 4 to solr 6. It has to be incremental like solr 4 to solr > 5 and then to solr 6. I might be wrong but is worth trying. Ideally the index needs to be

Re: Need to know how to read from solr index from update processor

2016-11-23 Thread Erick Erickson
This looks at first blush like an XY problem. Making a script work in SolrCloud is simple, just distribute it to all your replicas, update your configs and reload the collection. What's not simple is that apparently you want to pull data from the last doc indexed. There's no guarantee at all

Re: StatelessScriptUpdateProcessorFactory with multiple shards

2016-11-23 Thread Erick Erickson
Well, have you distributed both your script and the configs? And what does "doesn't work" mean? If you've changed your configs and pushed them to Zookeeper, and reloaded your collection (by restarting the Solr nodes, issuing the collections API RELOAD command whatever) then all the nodes for any
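The reload step mentioned above can be issued through the Collections API RELOAD action. The sketch below only constructs the URL; the base URL and collection name are placeholders, and pushing the changed configs to ZooKeeper is assumed to have happened already.

```python
from urllib.parse import urlencode

BASE = "http://localhost:8983/solr"  # assumed local Solr node


def reload_collection_url(name):
    """Build a Collections API RELOAD call for the named collection."""
    return f"{BASE}/admin/collections?{urlencode({'action': 'RELOAD', 'name': name})}"


reload_url = reload_collection_url("mycoll")  # "mycoll" is a placeholder name
print(reload_url)
```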

Re: Problem with Cross Data Center Replication

2016-11-23 Thread Erick Erickson
Your _source_ (i.e. cdcr_testa) doesn't have the CDCR update log configured. This section isn't in solrconfig for cdcr_testa: ${solr.ulog.dir:} The update log is the transfer mechanism between the source and target clusters, so it needs to be configured in both. Best, Erick. P.S. kudos

Should zookeeper be run on the worker machines?

2016-11-23 Thread Tech Id
Hi, Can someone please respond to this zookeeper-for-Solr Stack-Overflow question: http://stackoverflow.com/questions/40755137/should- zookeeper-be-run-on-the-worker-machines Thanks TI

SolrCloud -Distribued Indexing

2016-11-23 Thread Udit Tyagi
Hi, I am a Solr user, I am using solr-6.3.0 version, I have some doubts about distributed indexing and sharding in SolrCloud, please clarify, 1. How can I index documents to a specific shard (I heard about document routing, but the documentation is not proper for that). I am using solr create command

Re: negation search help

2016-11-23 Thread Alessandro Benedetti
I add to the thread a friend of mine, Rafa, who just presented at ApacheCon a medical system which deals exactly with a negation engine (he will publish the slides soon). Hope it helps! Cheers On Wed, Nov 23, 2016 at 3:03 PM, Alexandre Rafalovitch wrote: > Well, then 'no'

Re: negation search help

2016-11-23 Thread Alexandre Rafalovitch
Well, then 'no' becomes a signal token. So, the question is how many tokens after that it affects in its circle of negation? You could probably use something like https://cwiki.apache.org/confluence/display/solr/Other+Parsers#OtherParsers-SurroundQueryParser to say (if user said 'headache').

RE: Combined Dismax and Block Join Scoring on nested documents

2016-11-23 Thread Mike Allen
Will do once I've validated what I've done. As I'm a total solr novice, on numerous occasions I've done stuff I thought was right, but was actually returning incorrect, but difficult to notice, results. In fact, I've just noticed I'm having issues with adding filter queries, so perhaps I still

RE: negation search help

2016-11-23 Thread Hem Naidu
Correct Alex. The use case is when provider searches on patient medical information for certain symptoms, the mentions likes "no headache" , "no blood loss", "not diabetic" should not show up in the search results. Thanks -Original Message- From: Alessandro Benedetti

Re: negation search help

2016-11-23 Thread Alessandro Benedetti
Now that I read better, do you mean that at indexing time those negations must be recognized, in the way that they are no match ? Cheers On Wed, Nov 23, 2016 at 2:20 PM, Alessandro Benedetti < benedetti.ale...@gmail.com> wrote: > Hi Hem, > are you expecting Solr to parse your natural language

Re: negation search help

2016-11-23 Thread Alessandro Benedetti
Hi Hem, are you expecting Solr to parse your natural language query out of the box? Are you using any custom query parser? If not, you need to follow the Lucene syntax to define negative queries. And be careful with the edge cases [1]. Cheers [1]
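To make the Lucene-syntax point concrete, the sketch below assembles two query strings: one that requires a term while excluding negated phrases, and one that anchors a negative-only clause to `*:*`, since a purely negative clause on its own may match nothing when nested. The field name `notes` and the phrases are hypothetical, and this does plain string matching, not negation detection.

```python
def exclude_phrases(field, positive, negated):
    """Match `positive` in `field`, excluding docs containing any negated phrase."""
    clauses = [f'{field}:"{positive}"']
    clauses += [f'-{field}:"{phrase}"' for phrase in negated]
    return " ".join(clauses)


def pure_negative(field, phrase):
    """Anchor a negative-only clause to all documents so it has a set to subtract from."""
    return f'(*:* -{field}:"{phrase}")'


q1 = exclude_phrases("notes", "headache", ["no headache", "not headache"])
q2 = pure_negative("notes", "diabetic")
print(q1)
print(q2)
```

Note that the first query drops any document containing "no headache" anywhere, even one that also reports a headache elsewhere, which is one reason the thread moves on to span queries and dedicated negation detectors.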

Re: negation search help

2016-11-23 Thread Hem Naidu
Alex Whenever the keywords or sentence followed by "no", "not", etc should be excluded from the search results. Does solr support this feature? Thanks Sent from my iPhone > On Nov 23, 2016, at 12:09 AM, Alexandre Rafalovitch > wrote: > > How do you _know_ it is not

StatelessScriptUpdateProcessorFactory with multiple shards

2016-11-23 Thread Lambrou, Ioannis
Hello all, I am using solr 5.5 and I am trying to use StatelessScriptUpdateProcessorFactory to append the data I am adding to solr to existing documents. This is working fine when I use one shard but it doesn't work on multiple shards. Is there any way to use it with multiple shards? This is

Re: Error "unexpected docvalues type NUMERIC for field" using rord() function query on single valued int field

2016-11-23 Thread Jaco de Vroed
Hello, We've figured out a workaround for this, using another field that's multivalued and populated with , and using that field in the rord() function query. Nevertheless, this feels like a bug to me. Bye, Jaco. On 23 November 2016 at 09:04, Jaco de Vroed wrote: > Hi,

Problem with Cross Data Center Replication

2016-11-23 Thread WILLMES Gero (SAFRAN IDENTITY AND SECURITY)
Hi Solr users, I am trying to configure Cross Data Center Replication according to https://cwiki.apache.org/confluence/pages/viewpage.action?pageId=62687462 I set up two independent Solr clouds. I created the collection "cdcr_testa" on the source cloud and the collection "backup_collection" on the

Need to know how to read from solr index from update processor

2016-11-23 Thread Prafful Nagpal
Hi All I have developed a script through which I am able to search on single core of solr cloud setup What i want to know is there a way through which i can search the entire collection ( all cores and shards ) with unique key and get a field of that document . var Term =

Re: Solr/lucene "planet" + recommendations for blogs to follow

2016-11-23 Thread Charlie Hull
Hi all, We also blog about various Solr topics at www.flax.co.uk/blog and also run the London Lucene/Solr Meetup. I'd encourage you to attend a Meetup if you can find one locally, they're great places to hear about Solr projects and meet others working in the field. Alex & others efforts in

Re: Solr/lucene "planet" + recommendations for blogs to follow

2016-11-23 Thread hairymcclarey
I've also tried searching for stuff like this, personally I really like podcasts as you fit them in when you have no time (or brain power) to read. There are a bunch of old podcasts but most are no longer active (but still have some good content). The only one that I know that occasionally still

Re: Solr/lucene "planet" + recommendations for blogs to follow

2016-11-23 Thread Alexandre Rafalovitch
The plugin/extension story is a bit messy. Nobody is tracking those publicly, apart from solr.cool. There is more in my list somewhere, but I'd need to collate them. Regards, Alex. http://www.solr-start.com/ - Resources for Solr users, new and experienced On 23 November 2016 at 19:30,

update a document without changing anything

2016-11-23 Thread Dorian Hoxha
Hello searcherers, So, I have document that is fully stored. Then I make small change in schema. And now I have to reinsert every document. But I'm afraid of doing a get+insert, because something else may change the document in the meantime. So I want to do an "update" of nothing, so internally

Re: Solr/lucene "planet" + recommendations for blogs to follow

2016-11-23 Thread Dorian Hoxha
It's why I mentioned the sponsoring. Another thing that's missing is a list of plugins and extensions. How to find those? I've seen solr.cool but I thought there would be more, it looks kind of incomplete. On Tue, Nov 22, 2016 at 12:56 PM, Alexandre Rafalovitch wrote: > I tried

Re: Error "unexpected docvalues type NUMERIC for field" using rord() function query on single valued int field

2016-11-23 Thread Jaco de Vroed
Hi, No, I reproduced the original issue, with the rord() function, on a brand new index with docValues=true, with just one doc indexed in it. Any clues? Thanks, Jaco. On 21 November 2016 at 15:06, Pushkar Raste wrote: > Did you turn on/off docValues on a already