Re: Detect term occurrences

2015-09-13 Thread Francisco Andrés Fernández
Thanks again.
For the moment I think it won't be a problem. I have ~500 documents.
Regards,

Francisco

El vie., 11 de sept. de 2015 a la(s) 6:08 p. m., simon 
escribió:

> +1 on Sujit's recommendation: we have a similar use case (detecting drug
> names / disease entities /MeSH terms ) and have been using the
> SolrTextTagger with great success.
>
> We run a separate Solr instance as a tagging  service and add the detected
> tags as metadata fields to a document before it is ingested into our main
> Solr collection.
>
> How many documents/product leaflets do you have ? The tagger is very fast
> at the Solr level but I'm seeing quite a bit of HTTP overhead.
>
> best
>
> -Simon
>
> On Fri, Sep 11, 2015 at 1:39 PM, Sujit Pal  wrote:
>
> > Hi Francisco,
> >
> > >> I have many drug products leaflets, each corresponding to 1 product.
> In
> > the
> > other hand we have a medical dictionary with about 10^5 terms.
> > I want to detect all the occurrences of those terms for any leaflet
> > document.
> > Take a look at SolrTextTagger for this use case.
> > https://github.com/OpenSextant/SolrTextTagger
> >
> > 10^5 entries are not that large, I am using it for much larger
> dictionaries
> > at the moment with very good results.
> >
> > Its a project built (at least originally) by David Smiley, who is also
> > quite active in this group.
> >
> > -sujit
> >
> >
> > On Fri, Sep 11, 2015 at 7:29 AM, Alexandre Rafalovitch <
> arafa...@gmail.com
> > >
> > wrote:
> >
> > > Assuming the medical dictionary is constant, I would do a copyField of
> > > text into a separate field and have that separate field use:
> > >
> > >
> >
> http://www.solr-start.com/javadoc/solr-lucene/org/apache/lucene/analysis/miscellaneous/KeepWordFilterFactory.html
> > > with words coming from the dictionary (normalized).
> > >
> > > That way that new field will ONLY have your dictionary terms from the
> > > text. Then you can do facet against that field or anything else. Or
> > > even search and just be a lot more efficient.
> > >
> > > The main issue would be a gigantic filter, which may mean speed and/or
> > > memory issues. Solr has some ways to deal with such large set matches
> > > by compiling them into a state machine (used for auto-complete), but I
> > > don't know if that's exposed for your purpose.
> > >
> > > But could make a fun custom filter to build.
> > >
> > > Regards,
> > >Alex.
> > > 
> > > Solr Analyzers, Tokenizers, Filters, URPs and even a newsletter:
> > > http://www.solr-start.com/
> > >
> > >
> > > On 10 September 2015 at 22:21, Francisco Andrés Fernández
> > >  wrote:
> > > > Yes.
> > > > I have many drug products leaflets, each corresponding to 1 product.
> In
> > > the
> > > > other hand we have a medical dictionary with about 10^5 terms.
> > > > I want to detect all the occurrences of those terms for any leaflet
> > > > document.
> > > > Could you give me a clue about how is the best way to perform it?
> > > > Perhaps, the best way is (as Walter suggests) to do all the queries
> > every
> > > > time, as needed.
> > > > Regards,
> > > >
> > > > Francisco
> > > >
> > > > El jue., 10 de sept. de 2015 a la(s) 11:14 a. m., Alexandre
> > Rafalovitch <
> > > > arafa...@gmail.com> escribió:
> > > >
> > > >> Can you tell us a bit more about the business case? Not the current
> > > >> technical one. Because it is entirely possible Solr can solve the
> > > >> higher level problem out of the box without you doing manual term
> > > >> comparisons.In which case, your problem scope is not quite right.
> > > >>
> > > >> Regards,
> > > >>Alex.
> > > >> 
> > > >> Solr Analyzers, Tokenizers, Filters, URPs and even a newsletter:
> > > >> http://www.solr-start.com/
> > > >>
> > > >>
> > > >> On 10 September 2015 at 09:58, Francisco Andrés Fernández
> > > >>  wrote:
> > > >> > Hi all, I'm new to Solr.
> > > >> > I want to detect all ocurrences of terms existing in a thesaurus
> > into
> > > 1
> > > >> or
> > > >> > more documents.
> > > >> > What´s the best strategy to make it?
> > > >> > Doing a query for each term doesn't seem to be the best way.
> > > >> > Many thanks,
> > > >> >
> > > >> > Francisco
> > > >>
> > >
> >
>


Re: search on multiValued field

2015-09-13 Thread Upayavira
Your "correct" doc isn't valid json. Try tag:["tag1", "tag2"] which
would be valid.

Upayavira

On Sat, Sep 12, 2015, at 08:49 AM, sara hajili wrote:
> hi
>  in my schema i have a tag field.
> this field set multiValued="true".
> now my question is about search on multivalued field.
> if i add thid doc to sor:
> {
> id:1,
> tag:{"tag1","tag2"}
> },
> 
> {
> id:2,
> tag:{"tag1","tag2"}
> },
> 
> when i search on q="tag1"
> i get just :
> {
> id:1,
> tag:{"tag1","tag2"}
> },
> 
> but if i add docs in this way:
> {
> id:1,
> tag:"tag1",
> tag:"tag2"
> },
> {
> id:2,
> tag:"tag1",
> tag:"tag2"
> },
> now if i search to q="tag1"
> i get both of docs as a result.
> i'm confused.!what is difference between 2 docs?
> and can i search on this doc ?
> {
> id:1,
> tag:{"tag1","tag2"}
> },
> and get correct result(correct result is getting 2 doc all together)
> tnx.


Re: Solr authentication - Error 401 Unauthorized

2015-09-13 Thread Noble Paul
It is not that solr is over protected, it is just that the clients,
SolrJ as well as bin/solr are not provided with basic auth
capabilities.

 I have opened a ticket to track this
https://issues.apache.org/jira/browse/SOLR-8048

On Sat, Sep 12, 2015 at 7:14 PM, Dan Davis  wrote:
> Noble,
>
> You should also look at this if it is intended to be more than an internal
> API.   Using the minor protections I added to test SOLR-8000, I was able to
> reproduce a problem very like this:
>
> bin/solr healthcheck -z localhost:2181 -c mycollection
>
> Since Solr /select is protected...
>
> On Sat, Sep 12, 2015 at 9:40 AM, Dan Davis  wrote:
>
>> It seems that you have secured Solr so thoroughly that you cannot now run
>> bin/solr status!
>>
>> bin/solr has no arguments as yet for providing a username/password - as a
>> mostly user like you I'm not sure of the roadmap.
>>
>> I think you should relax those restrictions a bit and try again.
>>
>> On Fri, Sep 11, 2015 at 5:06 AM, Merlin Morgenstern <
>> merlin.morgenst...@gmail.com> wrote:
>>
>>> I have secured solr cloud via basic authentication.
>>>
>>> Now I am having difficulties creating cores and getting status
>>> information.
>>> Solr keeps telling me that the request is unothorized. However, I have
>>> access to the admin UI after login.
>>>
>>> How do I configure solr to use the basic authentication credentials?
>>>
>>> This is the error message:
>>>
>>> /opt/solr-5.3.0/bin/solr status
>>>
>>> Found 1 Solr nodes:
>>>
>>> Solr process 31114 running on port 8983
>>>
>>> ERROR: Failed to get system information from http://localhost:8983/solr
>>> due
>>> to: org.apache.http.client.ClientProtocolException: Expected JSON response
>>> from server but received: 
>>>
>>> 
>>>
>>> 
>>>
>>> Error 401 Unauthorized
>>>
>>> 
>>>
>>> HTTP ERROR 401
>>>
>>> Problem accessing /solr/admin/info/system. Reason:
>>>
>>> UnauthorizedPowered by
>>> Jetty://
>>>
>>>
>>> 
>>>
>>> 
>>>
>>
>>



-- 
-
Noble Paul


Re: error while running query on solr slave

2015-09-13 Thread shahper

I some how managed to solve my second issue.

But still facing problem with the first issue. If I shutdown my master 
And start working with slave.Then I am getting error As mention below:-



"error":{
"msg":"undefined field ENTITYTYPE",
"code":400}}


On Thursday 10 September 2015 09:32 PM, shahper wrote:

Sorry for late reply.

I am facing one more issue now.

1. When I am shutting down my master and start working with my slave. 
I am not able to fetch any data.As I can check data folder in my core 
its same as master. but then also I am not able to get and data when I 
run any query.


"error":{
"msg":"undefined field ENTITYTYPE",
"code":400}}




2. This issue came across when I was testing on my master. When I 
update any entity in my database it did not get updated on my indexes .



version of solr :- 5.2.1
running with jetty


On Wednesday 09 September 2015 08:15 PM, Erick Erickson wrote:

Please review:
http://wiki.apache.org/solr/UsingMailingLists

You've essentially said "it doesn't work". There's not enough
information to say _anything_ intelligent.

How does it fail? An messages in the log file? What is
  the query you're sending? Does the slave start up without
error?

Best,
Erick

On Wed, Sep 9, 2015 at 3:44 AM, shahper 
 wrote:

Hi ,

I haves setup master slave solr version 5.2.1.

I have done indexing on master .

And replication is done.

When I am trying to run any query on slave its showing me error its not
running.


Shahper

























--
Shahper Jamil

System Administrator

Tel: +91 124 4548383 Ext- 1033
UK: +44 845 0047 142 Ext- 5133

TBS Website 
Techblue Software Pvt. Ltd
The Palms, Plot No 73, Sector 5, IMT Manesar,
Gurgaon- 122050 (Hr.)

www.techbluesoftware.co.in 


	TBS Facebook 
 
TBS Twitter  TBS Google+ 
 TBS Linked In 



TBS Branding 



Re: Solr health check monitor for multiple cores

2015-09-13 Thread Otis Gospodnetić
Hi Daniel,

Lots of organizations use our SPM to monitor Solr / SolrCloud (see
http://sematext.com/spm/integrations/solr-monitoring.html ) .  We're
working on "Top Queries" type of reporting for Solr, which I'm very excited
about. :)

Otis
--
Monitoring * Alerting * Anomaly Detection * Centralized Log Management
Solr & Elasticsearch Support * http://sematext.com/


On Fri, Aug 21, 2015 at 5:28 PM, Davis, Daniel (NIH/NLM) [C] <
daniel.da...@nih.gov> wrote:

> I'm wondering what different folks do out there for a health monitor for
> Solr.   I'm running Solr 5.2.1, so far without Solr Cloud, and I anticipate
> having multiple cores.
>
> For now, I can make use solr/corename/admin/ping, but how can I have Solr
> ping all cores?
>
> Dan Davis, Systems/Applications Architect (Contractor),
> Office of Computer and Communications Systems,
> National Library of Medicine, NIH
>
>


Re: solr training

2015-09-13 Thread Otis Gospodnetić
Hi Tim,

A slightly delayed reply ;)
We are running Solr training in NYC next month -
http://sematext.com/training/solr-training.html - 2nd seat is 50% off.

Otis
--
Monitoring * Alerting * Anomaly Detection * Centralized Log Management
Solr & Elasticsearch Support * http://sematext.com/


On Fri, May 1, 2015 at 2:18 PM, Tim Dunphy  wrote:

> Hey guys,
>
>  My company has a training budget that it wants me to use. So what I'd like
> to find out is if there is any instructor lead courses in the NY/NJ area,
> or courses online that are instructor lead that you could recommend?
>
> Thanks,
> Tim
>
> --
> GPG me!!
>
> gpg --keyserver pool.sks-keyservers.net --recv-keys F186197B
>


Re: solr training

2015-09-13 Thread Tim Dunphy
Cool, I'll check it out. Thanks!

On Sun, Sep 13, 2015 at 9:53 PM, Otis Gospodnetić <
otis.gospodne...@gmail.com> wrote:

> Hi Tim,
>
> A slightly delayed reply ;)
> We are running Solr training in NYC next month -
> http://sematext.com/training/solr-training.html - 2nd seat is 50% off.
>
> Otis
> --
> Monitoring * Alerting * Anomaly Detection * Centralized Log Management
> Solr & Elasticsearch Support * http://sematext.com/
>
>
> On Fri, May 1, 2015 at 2:18 PM, Tim Dunphy  wrote:
>
> > Hey guys,
> >
> >  My company has a training budget that it wants me to use. So what I'd
> like
> > to find out is if there is any instructor lead courses in the NY/NJ area,
> > or courses online that are instructor lead that you could recommend?
> >
> > Thanks,
> > Tim
> >
> > --
> > GPG me!!
> >
> > gpg --keyserver pool.sks-keyservers.net --recv-keys F186197B
> >
>



-- 
GPG me!!

gpg --keyserver pool.sks-keyservers.net --recv-keys F186197B