RE: Highlighting brings in irrelevant words

2016-09-30 Thread Bade, Vidya (Sagar)
Forgot to include: All the three fields used for highlighting are configured as 
follows:





Thank You,
:Sagar

-Original Message-
From: Bade, Vidya (Sagar) [mailto:vb...@webmd.net] 
Sent: Friday, September 30, 2016 2:12 PM
To: solr-user@lucene.apache.org
Subject: Highlighting brings in irrelevant words

Hi,

I am using Solr 4.10.2 following is my request:

defType(Parser)=edismax
fl=id,title,description,link_title
qf=title description keywords
pf=title description
hl=true
hl.fl=title,description,link_title
hl.q=lupus
q=lupus

I have two records in the index. Both about lupus. When I query using the 
above, the highlights are returned but not just the query term but also other 
irrelevant terms as shown below. The only synonym I have for lupus is 
"lupus,SLE,systemic lupus erythematosus" and I don't have the words "Symptoms" 
or "Sleep" in the synonyms file.

Can anyone tell me what I am doing wrong or how to fix this issue?

"highlighting":{
"07":{
  "title":["What Are the Symptoms of Lupus?"],
  "description":["Medical guide to the symptoms of lupus."],
  "link_title":["Lupus: The Symptoms and Signs"]},
"09":{
  "title":["Lupus and Sleep"],
  "description":["Join this team to get the tips you need to sleep better 
while living with lupus."],
  "link_title":["Lupus and Sleep"]}}

Thank You,
Sagar


Highlighting brings in irrelevant words

2016-09-30 Thread Bade, Vidya (Sagar)
Hi,

I am using Solr 4.10.2 following is my request:

defType(Parser)=edismax
fl=id,title,description,link_title
qf=title description keywords
pf=title description
hl=true
hl.fl=title,description,link_title
hl.q=lupus
q=lupus

I have two records in the index. Both about lupus. When I query using the 
above, the highlights are returned but not just the query term but also other 
irrelevant terms as shown below. The only synonym I have for lupus is 
"lupus,SLE,systemic lupus erythematosus" and I don't have the words "Symptoms" 
or "Sleep" in the synonyms file.

Can anyone tell me what I am doing wrong or how to fix this issue?

"highlighting":{
"07":{
  "title":["What Are the Symptoms of Lupus?"],
  "description":["Medical guide to the symptoms of lupus."],
  "link_title":["Lupus: The Symptoms and Signs"]},
"09":{
  "title":["Lupus and Sleep"],
  "description":["Join this team to get the tips you need to sleep better 
while living with lupus."],
  "link_title":["Lupus and Sleep"]}}

Thank You,
Sagar


Re: Re[2]: Traversal of documents through network

2016-04-21 Thread vidya
(1) So,dispalying the content(traversal of documents) depends on my
pagination ?
If i specify all 500 documents to be dispalyed and first 10 on the first
page and remaining on the other, that implies that all documents traverse
through network ?

(2) In my application, front end of UI is developed by one team and back end
of indexing is done by one team. How to implement highlighting feature in
solr. By developing a request handler with highlighting parameters turned on
and with termvector components, can i highlight the term queried ? Or any
special configuration needed ?

Please help me on these two.

Thanks



--
View this message in context: 
http://lucene.472066.n3.nabble.com/Traversal-of-documents-through-network-tp4271555p4272053.html
Sent from the Solr - User mailing list archive at Nabble.com.


Re: Traversal of documents through network

2016-04-20 Thread vidya
ok. I understand that. So, you would say documents traverse through network.
If i specify some 100 docs to be dispalyed on my first page, will it effect
performance. While docs gets traversed, will there be any high volume
traffic and effects performance of the application.


And whats the time solr takes to index 1GB of data in general.


Thanks



--
View this message in context: 
http://lucene.472066.n3.nabble.com/Traversal-of-documents-through-network-tp4271555p4271743.html
Sent from the Solr - User mailing list archive at Nabble.com.


Traversal of documents through network

2016-04-20 Thread vidya
Hi

When i queried a word in solr, documents having that keyword is displayed in
500 documents,lets say. Will all those documents traverse through network ?
Or how it happens ?

Please help me on this.



--
View this message in context: 
http://lucene.472066.n3.nabble.com/Traversal-of-documents-through-network-tp4271555.html
Sent from the Solr - User mailing list archive at Nabble.com.


Re: indexing pdf files using post tool

2016-03-16 Thread vidya
Sorry for conveying it in wrong way. I want my data of 1 pdf file to be
indexed with different fields in a document of solr according to data in it
like name;id;title;content etc

Thanks 



--
View this message in context: 
http://lucene.472066.n3.nabble.com/indexing-pdf-files-using-post-tool-tp4263811p4264052.html
Sent from the Solr - User mailing list archive at Nabble.com.


Re: indexing pdf files using post tool

2016-03-15 Thread vidya
Hi
I got data into my content field. But i wanted to have differnt fields to be
allocated for data in my file.How can I achieve this ?



--
View this message in context: 
http://lucene.472066.n3.nabble.com/indexing-pdf-files-using-post-tool-tp4263811p4263840.html
Sent from the Solr - User mailing list archive at Nabble.com.


indexing pdf files using post tool

2016-03-15 Thread vidya
Hi
I am trying to index a pdf file by using post tool in my linux system,When i
give the command
bin/post -c core2 -p 8984 /root/solr/My_CV.pdf
it is showing the search results like 
"response": {
"numFound": 1,
"start": 0,
"docs": [
  {
"id": "/root/solr-5.5.0/My_CV.pdf",
"meta_creation_date": [
  "2016-03-15T06:22:17Z"
],
"pdf_pdfversion": [
  1.4
],
"dcterms_created": [
  "2016-03-15T06:22:17Z"
],
"x_parsed_by": [
  "org.apache.tika.parser.DefaultParser",
  "org.apache.tika.parser.pdf.PDFParser"
],
"xmptpg_npages": [
  1
],
"creation_date": [
  "2016-03-15T06:22:17Z"
],
"pdf_encrypted": [
  false
],
"title": [
  "My CV"
],
"stream_content_type": [
  "application/pdf"
],
"created": [
  "Tue Mar 15 06:22:17 UTC 2016"
],
"stream_size": [
  18289
],
"dc_format": [
  "application/pdf; version=1.4"
],
"producer": [
  "wkhtmltopdf"
],
"content_type": [
  "application/pdf"
],
"xmp_creatortool": [
  "þÿ"
],
"resourcename": [
  "/root/solr/My_CV.pdf"
],
"dc_title": [
  "My CV"
],
"_version_": 1528851429701189600
  }


but not the actual content in pdf file.
How to index that dat.
Please help me on this.
Can post tool be used for indexing data from HDFS ?



--
View this message in context: 
http://lucene.472066.n3.nabble.com/indexing-pdf-files-using-post-tool-tp4263811.html
Sent from the Solr - User mailing list archive at Nabble.com.


accessing data in hdfs by solr in standalone mode

2016-03-14 Thread vidya
Hi
   can solr access the data from HDFS in standalone mode? If so, can u brief 
how it is done.

  Thnaks in advance



--
View this message in context: 
http://lucene.472066.n3.nabble.com/accessing-data-in-hdfs-by-solr-in-standalone-mode-tp4263805.html
Sent from the Solr - User mailing list archive at Nabble.com.


Re: Data Import Handler Usage

2016-02-16 Thread vidya
Hi

Dataimport section in web ui page still shows me that no data import handler
is defined. And no data is being added to my new collection.



--
View this message in context: 
http://lucene.472066.n3.nabble.com/Data-Import-Handler-Usage-tp4257518p4257576.html
Sent from the Solr - User mailing list archive at Nabble.com.


Data Import Handler Usage

2016-02-15 Thread vidya
Hi

I have gone through documents to define data import handler in solr. But i
couldnot implement it.
I have created data-config.xml file that specifies moving data from
collection1 core to another collection, i donno where i need to specify that
second collection.


  
http://localhost:8983/solr/collection1"; query="*:*"/>
  


and request handler is defined as follows in solrconfig.xml



  /home/username/data-config.xml

  

Even after adding this, i couldnot get any data import handler in web url
page for importing.
Why is it so? And what changes need to be done?
I have followed the following url : 
http://www.codewrecks.com/blog/index.php/2013/4/29/loading-data-from-sql-server-to-solr-with-a-data-import-handler




--
View this message in context: 
http://lucene.472066.n3.nabble.com/Data-Import-Handler-Usage-tp4257518.html
Sent from the Solr - User mailing list archive at Nabble.com.


Solr-kerbarose URL not accessible

2016-02-12 Thread vidya
Hi

  When I am trying to access my solrCloud web UI page, deployed in cloudera
cluster, I have encountered with the error "DEFECTED TOKENS DETECTED" . Find
the attachment of the error that is added here. It is because of kerbarose
installed on cluster.

Is there any other way that I can access solr in this scenario with
kerbarose installed ?
Writing a java program helps in any way? While writing a java program also,
i have to give connection to solr URL with port or zookeeper host variable.
Will that java program work out?

Please help me out.



--
View this message in context: 
http://lucene.472066.n3.nabble.com/Solr-kerbarose-URL-not-accessible-tp4256926.html
Sent from the Solr - User mailing list archive at Nabble.com.


Solrj-collection creation

2016-02-10 Thread vidya
Hi
 
  I want to connect to solrCloud server from java program using
zookeeperHost variable. I know that data can be indexed and searched from a
collection using java program. but Can i able to create a collection
initially from java program?


My problem is that i cannot access solr web page, i'm getting an error like
"defected tokens detected". So, i wanted to connect to solr using solrj API.
For that I need to create a collection initially.Got struck here.

Anyone can help me.
Thanks in advance



--
View this message in context: 
http://lucene.472066.n3.nabble.com/Solrj-collection-creation-tp4256380.html
Sent from the Solr - User mailing list archive at Nabble.com.


RE: Multi-lingual search

2016-02-08 Thread vidya
Hi
  Can i implement proximity search if i use 
>seperate core per language 
>field per language
>multilingual field that supports all languages.

And what does proximity search exactly mean?

searching for walk word when walking is indexed,should fetch and display the
record?
It will be included in stemming filter.right?

Thanks in advance



--
View this message in context: 
http://lucene.472066.n3.nabble.com/Multi-lingual-search-tp4254398p4256094.html
Sent from the Solr - User mailing list archive at Nabble.com.


RE: Multi-lingual search

2016-02-08 Thread vidya
Hi 
I need to search on these languages which includes proximity search also.
1.Malay
2.Tamil
3.Bahasa Indonesia
4.Vietnamese
5.Cantonese
Will IndicNormalizationFilter work fine or any other filter? Help me if you
have already worked on it or have any idea.


Thanks in advance




--
View this message in context: 
http://lucene.472066.n3.nabble.com/Multi-lingual-search-tp4254398p4255826.html
Sent from the Solr - User mailing list archive at Nabble.com.


Import data from one core to another

2016-02-01 Thread vidya
Hi 

 How to import data from one solr core to another using request handler and
data-config.xml?
In solr-config.xml : I included this in target collection,



/root/Desktop/vidya/solr-data-config.xml



And solr-data-config.xml in the path as mentioned in request handler class

  
http://localhost:8983/solr/";>

  


Then i reloaded the collection and executed this query
http://10.138.90.227:8983/solr/#/student_shard1_replica1/dataimport?command=full-import

But i got an error as "NO DATA IMPORT HANDLER IS DEFINED"

What is the query to fully import data from source to target?

Thanks in advance




--
View this message in context: 
http://lucene.472066.n3.nabble.com/Import-data-from-one-core-to-another-tp4254590.html
Sent from the Solr - User mailing list archive at Nabble.com.


Multi-lingual search

2016-02-01 Thread vidya
Hi

 My use case is to index and able to query different languages in solr which
are not in-built languages supported by solr. How can i implement this ? 

My input document consists of different languages in a field. I came across
"Solr in action" book with searching content in multiple languages i.e.,
chapter 14. For built in languages i have implemented this approach. But for
languages like Tamil, how to implement? Do i need to find for filter classes
of that particular language or any libraries in specific.

Please help me on this.

Thanks in advance.



--
View this message in context: 
http://lucene.472066.n3.nabble.com/Multi-lingual-search-tp4254398.html
Sent from the Solr - User mailing list archive at Nabble.com.


Re: Restoring backups of solrcores

2016-02-01 Thread vidya
Hi 

How can that be useful, can u please explain.
I want to have the same collection name everytime when I index data i.e.,
current_collection.

By collection aliasing, i can create a new collection and point my alias
(say ALIAS) to new collection but cannot rename that collection to the same
current_collection which i have created and indexed previous week.

So, are you asking me to create whatever collection name i want to create
but point out my alias with name i want and change that alias pointing to
new collection that i create and query using my alias name.

Please help me on this.

Thanks in advance



--
View this message in context: 
http://lucene.472066.n3.nabble.com/Restoring-backups-of-solrcores-tp4254080p4254366.html
Sent from the Solr - User mailing list archive at Nabble.com.


Restoring backups of solrcores

2016-01-29 Thread vidya
Hi

I wanted to restore the core into a new one in solrcloud by following the
below link.

https://cwiki.apache.org/confluence/display/solr/Making+and+Restoring+Backups+of+SolrCores

But I couldnt get the success value when restore is given.
It cant be implemented for solrcloud? If so, then what is the alternative
that can be suggested.

My requirement is say,
I have one collection called current_collection which has current weeks
data. In the later week, I want to move the data in current_collection to
previous_collection1 and index new week's data into current_collection by
removing previous data and storing it in previous_collection1.
How can I achieve this?

Thanks in advance



--
View this message in context: 
http://lucene.472066.n3.nabble.com/Restoring-backups-of-solrcores-tp4254080.html
Sent from the Solr - User mailing list archive at Nabble.com.


Re: collection aliasing

2016-01-29 Thread vidya
Hi

 How can a full index rebuild can be done ?



--
View this message in context: 
http://lucene.472066.n3.nabble.com/collection-aliasing-tp4252527p4254077.html
Sent from the Solr - User mailing list archive at Nabble.com.


Re: collection aliasing

2016-01-28 Thread vidya
Hi

Then what is the difference between collection aliasing and shards parameter
mentioned in request handler of solrconfig.xml.

In request handler of new collection's solrconfig.xml
   shards =
http://localhost:8983/solr/collection1,http://localhost:8983/solr/collection1
I can query both data of collection1 and collection2 in new collection which
is same as collection aliasing.

Is my understanding correct ? If so, then what is the special characteristic
of collection alaising. Please help me.

Thanks in advance




--
View this message in context: 
http://lucene.472066.n3.nabble.com/collection-aliasing-tp4252527p4253787.html
Sent from the Solr - User mailing list archive at Nabble.com.


Re: collection aliasing

2016-01-24 Thread vidya
Yeah, while querying and indexing also, we can directly use our collection
names. Then what is the use of aliasing ?



--
View this message in context: 
http://lucene.472066.n3.nabble.com/collection-aliasing-tp4252527p4252885.html
Sent from the Solr - User mailing list archive at Nabble.com.


collection aliasing

2016-01-22 Thread vidya
Hi

I wanted to mainatain two sets of indexes or collections for maintaing my
large input data for indexing for which i found collection aliasing is
helpful. I have created alais for 2 collections. but my problem is , how can
i point out my alias to 2 different colletions at 2 different times.

Thanks in advance.



--
View this message in context: 
http://lucene.472066.n3.nabble.com/collection-aliasing-tp4252527.html
Sent from the Solr - User mailing list archive at Nabble.com.


collection aliasing,solrctl

2016-01-20 Thread vidya
Hi 
 
I am using solr with cloudera distribution to index data from hdfs and I am
using "solrctl" utility for my deployment. Now i wanted to create collection
alias. How can i perform the action of creating collection aliasing by
commands.

>From google i got : " /admin/collections?action=CREATE "  . How to achieve
this by solrctl or by anything by commands from terminal.

Thanks in advance



--
View this message in context: 
http://lucene.472066.n3.nabble.com/collection-aliasing-solrctl-tp4252210.html
Sent from the Solr - User mailing list archive at Nabble.com.


Error while reloading collection

2016-01-13 Thread vidya
Hi
 I am using solrcloud on cloudera cluster. I have created collections using
solrctl command which is supported by cloudera search tool. I included one
class of java in schema.xml for creating a field type which is dependent on
a jar that i have included in solrconfig.xml. But when i reload that
collection, I am getting an error that ERROR LOADING THAT CLASS what i
included in schema.xml. What else do i need to include ?

In solrconfig.xml : 

Error while relaoding in command line interface : 







0

197



org.apache.solr.client.solrj.impl.HttpSolrServer$RemoteSolrException:Error
handling 'reload' action






Please help me on this.Thanks in advance



--
View this message in context: 
http://lucene.472066.n3.nabble.com/Error-while-reloading-collection-tp4250635.html
Sent from the Solr - User mailing list archive at Nabble.com.


collection reflection in resource manager node

2016-01-12 Thread vidya
Hi
I have created a collection in one datanode on which solr server is deployed
say DN1. I am having another datanode on which solr server is deployed which
has resource manager service also running on it,say DN2. When i created a
collection using solrctl command in DN1, it got reflected in DN2 but not
DN1. Why is it so ?
Please help me on this.
If i need to put some jars for indexing in my collection,where do i need to
put ? in  DN! or DN2

Thanks in advance



--
View this message in context: 
http://lucene.472066.n3.nabble.com/collection-reflection-in-resource-manager-node-tp4250382.html
Sent from the Solr - User mailing list archive at Nabble.com.


Re: solrcloud -How to delete a doc at a specific shard

2016-01-12 Thread vidya
So, you have deployed solr server on three nodes namely 
  192.168.100.210;211;212 .

Am I correct ?



--
View this message in context: 
http://lucene.472066.n3.nabble.com/solrcloud-How-to-delete-a-doc-at-a-specific-shard-tp4249354p4250117.html
Sent from the Solr - User mailing list archive at Nabble.com.


Re: solrcloud -How to delete a doc at a specific shard

2016-01-11 Thread vidya
Hi

I am new to solr and having a doubt on how one can know that a particular
shard is in that particular node or ip address.

Thanks in advance



--
View this message in context: 
http://lucene.472066.n3.nabble.com/solrcloud-How-to-delete-a-doc-at-a-specific-shard-tp4249354p4250072.html
Sent from the Solr - User mailing list archive at Nabble.com.


solr in action - multiple language content in one field

2016-01-11 Thread vidya
Hi

I have gone through solr in action 14th chapter which tells - "searching
content in multiple languages" . But i have a doubt that when i put
documents in solr web UI, it recognises every language and gives me the
result when queried for it. What exactly did they depict in that chapter.
can't solr recognise and process all languages at a time?



--
View this message in context: 
http://lucene.472066.n3.nabble.com/solr-in-action-multiple-language-content-in-one-field-tp4250071.html
Sent from the Solr - User mailing list archive at Nabble.com.


Re: core,Collection,Shard,Replication

2016-01-06 Thread vidya
Hi 
You described that sharding is to distribute data over multiple machines.Do
I have to make sure that all those machines have solr server or gateway
should be deplyed ?
And what multiple JVM processes run behind a solr server running?
I wanted to know what is a node. -> I understood like a mchine with solr
server deployed.
then what is a solr instance?

Am I correct.If not,please help me

Thanks in advance



--
View this message in context: 
http://lucene.472066.n3.nabble.com/core-Collection-Shard-Replication-tp4248850p4248865.html
Sent from the Solr - User mailing list archive at Nabble.com.


core,Collection,Shard,Replication

2016-01-06 Thread vidya
Hi

I am new to solr. i have a doubt in understanding difference between core
and collection.
>As far as i understand, cores can be created when solr is run in local mode
and collections in solrCloud.
Can you please help me if i am wrong.
>And why do we shard a collection? i read like -
When your data is too large for one node, you can break it up and store it
in sections by creating one or more shards. Each is a portion of the logical
index, or core, and it's the set of all nodes containing that section of the
index.
But when indexing a document in one shard,it gets reflected in every shard
of that collection. But main intention of creating shard is to break up the
data.
>Why do we replicate a collection?

Thanks in advance



--
View this message in context: 
http://lucene.472066.n3.nabble.com/core-Collection-Shard-Replication-tp4248850.html
Sent from the Solr - User mailing list archive at Nabble.com.


Data migration from one collection to the other collection

2016-01-05 Thread vidya
Hi

I would like to maintain two cores for history data and current data where
hdfs is my datasource. My requirement is that data input should be given to
only one collection and previous data should be moved to history collection.
1)Creating two cores and migrating data from current to history collection
by data-config.xml using solrEntityProcessor. In data-config.xml, where
should i represent two collections for migrating source collection to the
other collection.And how to make sure that happens.Do I need to run a job or
how to make sure that data migration occurs.
https://wiki.apache.org/solr/DataImportHandler#SolrEntityProcessor

2)collection aliasing is a concept which creates new collection after a
period of time.
I read that concept but lagging in how to implement it.Like where do i need
to make changes in my solrcloud.
http://blog.cloudera.com/blog/2013/10/collection-aliasing-near-real-time-search-for-really-big-data/

Please help me on this.

Thanks in advance



--
View this message in context: 
http://lucene.472066.n3.nabble.com/Data-migration-from-one-collection-to-the-other-collection-tp4248646.html
Sent from the Solr - User mailing list archive at Nabble.com.


Querying with action parameter included in URL

2016-01-04 Thread vidya
Hi
 
I am pretty new to solr and when i am going through the tutorials , I came
across urls for querying like 
"http://localhost:8983/solr/admin/configs?action=CREATE&name=booksConfig&baseConfigSet=genericTemplate";
.
I wanted to know how to implement the same by doing changes in schema.xml or
solrconfig.xml. Where should i make changes when an "action=" is specified.

Thanks in advance



--
View this message in context: 
http://lucene.472066.n3.nabble.com/Querying-with-action-parameter-included-in-URL-tp4248576.html
Sent from the Solr - User mailing list archive at Nabble.com.


Re: MapReduceIndexerTool Indexing

2016-01-04 Thread vidya
Hi

I would like to index only new data but not already indexed data(delta
Indexing). how can i achieve it using MRIT.

Thanks in advance



--
View this message in context: 
http://lucene.472066.n3.nabble.com/MapReduceIndexerTool-Indexing-tp4248387p4248573.html
Sent from the Solr - User mailing list archive at Nabble.com.


MapReduceIndexerTool Indexing

2016-01-04 Thread vidya
Hi

I have used MapReduceIndexerTool to index data in my hdfs to solr inorder to
search it. I want to know whether it indexes entire data when some new data
is added to that path, again when tool is run on it.

Thanks in advance



--
View this message in context: 
http://lucene.472066.n3.nabble.com/MapReduceIndexerTool-Indexing-tp4248387.html
Sent from the Solr - User mailing list archive at Nabble.com.


RE: Using the date field for searching

2015-08-11 Thread Bade, Vidya (Sagar)
You can use filter query and form the date as follows when a user enters just 
the year or year and month:

If just the year (1885) was entered -> date:[1885-01-01T00:00:00Z TO 
1886-01-01T00:00:00Z]
If just the year and month (1885-06) were entered -> date:[1885-06-01T00:00:00Z 
TO 1885-07-01T00:00:00Z]

Alternatively use DateRangeField as described at the bottom in the following 
webpage: 

https://cwiki.apache.org/confluence/display/solr/Working+with+Dates

:Sagar

-Original Message-
From: Scott Derrick [mailto:sc...@tnstaafl.net] 
Sent: Tuesday, August 11, 2015 3:02 PM
To: solr-user@lucene.apache.org
Subject: Using the date field for searching

If I query date:1885

I get an error

org.apache.solr.common.SolrException: Invalid Date String:'1885'

If I query date:1885*

I get no results.

and yet there are numerous docs with a year of 1885 in the date string, like so

1885-02-08T00:00:00Z

if I query date:1885-02-08T00:00:00Z

I get 9 results??

Do the users really have to specify a full xml compliant date string to use the 
date: field for searching?

thanks,

Scott


Help with separate root entities in DIH - One each for full and delta import.

2015-07-27 Thread Bade, Vidya (Sagar)
Hi,

I am currently using Solr 4.10.2 and having issues with Delta-imports. For some 
reason delta seems to be inconsistent when using query caching. I am using 
SqlEntityProcessor. To overcome the issue I want to try having two root 
entities - one each for full and delta. Can someone help with a sample format. 
Following is my current data import setup:



 
 
 
 
 

 
  

Thank You,
Sagar