how to check Solr correction path?

2019-06-19 Thread Wendy2
Hi,

How to check Solr correction path?
I created a several collections in Solr 7.3.1. and use DIH to index mongoDB.
How can I get the path for the current Solr correction where DIH is running?

Thanks!  



--
Sent from: http://lucene.472066.n3.nabble.com/Solr-User-f472068.html


indexing MongoDB using DIH

2019-06-17 Thread Wendy2
Hi,

Has any one tried with the following project to index MongoDB via DIH?
I tried to use it. But could not add a filter in the find() method.

Any suggestions?   Thanks! 

https://github.com/james75/SolrMongoImporter



--
Sent from: http://lucene.472066.n3.nabble.com/Solr-User-f472068.html


Indexing MongoDB via DIH

2019-06-16 Thread Wendy2
Hi,

I need to index several large collection of mongoDB with filters via DIH.
Ruled out vis mongo-connector. 

Any recommendations? 

Thanks!



--
Sent from: http://lucene.472066.n3.nabble.com/Solr-User-f472068.html


Re: strange behavior

2019-06-06 Thread Wendy2
Hi David,

I see. It fixed now by adding the ().  Thank you so much!
q=audit_author.name:(Burley,%20S.K.)%20AND%20entity.type:polymer



--
Sent from: http://lucene.472066.n3.nabble.com/Solr-User-f472068.html


Re: strange behavior

2019-06-06 Thread Wendy2
Hi Shawn,

I see. 

I added () and it works now. Thank you very much for your help!

q=audit_author.name:(Burley,%20S.K.)%20AND%20entity.type:polymer&rows=1





--
Sent from: http://lucene.472066.n3.nabble.com/Solr-User-f472068.html


strange behavior

2019-06-06 Thread Wendy2


Hi,

Why "AND" didn't work anymore?  

I use Solr 7.3.1 and edismax parser.
Could someone explain to me why the following query doesn't work any more?  
What could be the cause? Thanks! 

q=audit_author.name:Burley,%20S.K.%20AND%20entity.type:polymer

It worked previously but now returned very lower number of documents. 
I had to use "fq" to make it work correctly:

q=audit_author.name:Burley,%20S.K.&fq=entity.type:polymer&rows=1







--
Sent from: http://lucene.472066.n3.nabble.com/Solr-User-f472068.html


Re: where to see deleted document in Solr log

2019-06-05 Thread Wendy2
Hi Shawn,

I see. Thank you so much for your reply!   Best!



--
Sent from: http://lucene.472066.n3.nabble.com/Solr-User-f472068.html


where to see deleted document in Solr log

2019-06-03 Thread Wendy2
Hi,

I am using Solr 7.3.1 to index data via DIH.
Solr admin panel indicated that 152160 documents got indexed, while 3944
documents were deleted. But DIH indicated that added/update: 662059
documents. Deleted 0 documents.
I try to find the deleted documents, but I don't see anywhere in the solr
log. Where could I see it? Is there something I need to configure?  Why the
two status summaries were so different?   Thanks!
 


 


 



--
Sent from: http://lucene.472066.n3.nabble.com/Solr-User-f472068.html


Re: Why did Solr stats min/max values were returned as float number for field of type="pint"?

2019-05-03 Thread Wendy2
Hi Joel,

Thanks for your response.

Regarding your response "This syntax is bringing back correct data types",

I have a pint field, the stats returned the following min/max values.
"min":0.0, 
"max":1356.0, 

But I was expecting min/max values like below. Is it possible?Thanks! 
"min":0 
"max":1356



--
Sent from: http://lucene.472066.n3.nabble.com/Solr-User-f472068.html


Why did Solr stats min/max values were returned as float number for field of type="pint"?

2019-05-02 Thread Wendy2
Hi Solr users,

I have a pint field: 


But Solr stats min/max values were returned as float numbers ( "min":0.0, 
"max":1356.0) . I thought "pint" type fields should return min/max as int.
Is there something that user can do to make sure it returns as int type
(which matches the field definition)?   Thanks!


{
  "responseHeader":{
"status":0,
"QTime":17,
"params":{
  "q":"*:*",
  "stats":"true",
  "fl":"",
  "rows":"0",
  "stats.field":"rcsb_entry_info.disulfide_bond_count"}},
  "response":{"numFound":151364,"start":0,"docs":[]
  },
  "stats":{
"stats_fields":{
  "rcsb_entry_info.disulfide_bond_count":{
"min":0.0,
"max":1356.0,
"count":151363,
"missing":1,
"sum":208560.0,
"sumOfSquares":5660388.0,
"mean":1.3778796667613618,
"stddev":5.958002695748158



--
Sent from: http://lucene.472066.n3.nabble.com/Solr-User-f472068.html


Re: Can Solr facet on a particular element in a list

2019-03-26 Thread Wendy2
Hi Erick,

Thank you very much for your response!  OK, I understand what you mean and
will flaten the list into multiple fields. 

Thanks!




--
Sent from: http://lucene.472066.n3.nabble.com/Solr-User-f472068.html


Can Solr facet on a particular element in a list

2019-03-26 Thread Wendy2
Hi Solr Users,

Can Solr facet on a particular element in a list?
For example, I have a list with 4 elements as below.  Is it possible to
facet on the individual element? 
facet on  rcsb_enzyme_lineage.name(0), rcsb_enzyme_lineage.name(1), etc??
Thanks!

"rcsb_enzyme_lineage.name":["NAD(+) glycohydrolase",
  "Hydrolases",
  "Hydrolyzing N-glycosyl compounds",
  "Glycosylases"],



--
Sent from: http://lucene.472066.n3.nabble.com/Solr-User-f472068.html


Re: Help with slow retrieving data

2019-03-26 Thread Wendy2
Hi Eric,

Thank you for your response!  

On the old system, I changed to use docValues=true, and had better
performance. But the searcher was not warmed before I measured it. Also the
local disk was too small so I used an attached volume which turned out was a
big cause of the slow retrieve.

On the new system, I didn't use docValues=true, but used SSD, so the
retrieve was much much faster.

In both cases, the QTime were good. 

I will keep tuning the performance for sorting, facets, etc. 

Thanks and all the best! 



--
Sent from: http://lucene.472066.n3.nabble.com/Solr-User-f472068.html


Re: Help with slow retrieving data

2019-03-25 Thread Wendy2
Hi Eric,

Thank you very much for your response! I tried 

"Try this: 
1> insure docValues=true for the field. You’ll have to re-index all your
docs. "

I tried the above approach as you recommended, the performance was getting
better, reduced about 3 seconds.

Then I tested on a new cloud server with local SSD for one core on Solr, the
performance was great.
With 5 rows to retrieve, the response time was 0.2s, which is better
than our acceptance criteria :-)
So happy.  Thank you!

=testing
 wget -O output.txt
'http://localhost:8983/solr/s_entry/select?fl=pdb_id,score&q=human&start=0&rows=5'
--2019-03-25 10:23:21-- 
http://localhost:8983/solr/s_entry/select?fl=pdb_id,score&q=human&start=0&rows=5
Resolving localhost (localhost)... ::1, 127.0.0.1
Connecting to localhost (localhost)|::1|:8983... connected.
HTTP request sent, awaiting response... 200 OK
Length: unspecified [application/json]
Saving to: 'output.txt'

output.txt [
<=> 

 
]   2.90M  16.1MB/sin 0.2s





--
Sent from: http://lucene.472066.n3.nabble.com/Solr-User-f472068.html


Re: Help with slow retrieving data

2019-03-25 Thread Wendy2
Hi Eric,

Thank you very much for your response!

"Try this: 
1> insure docValues=true for the field. You’ll have to re-index all your
docs."

I tried use docValues and reduced about 3 seconds.  Now I am going to "try
2> if that doesn’t make much of a difference, try adding
useDocValuesAsStored for the field." and will report back.   Thanks! 
 



--
Sent from: http://lucene.472066.n3.nabble.com/Solr-User-f472068.html


Re: Help with slow retrieving data

2019-03-24 Thread Wendy2
Hi Shawn,Thanks for your response.  I have several Solr cores on the same
Solr instance. The particular core with slow retrieve response has 6 gb
data. Sorry for the confusion.I restart Solr and ran same query with rows=0
vs 1, QTime for both are OK, so I guess it is the retrieving slow? I
also tried return different rows, the more rows, the longer retrieving time.
The machine has 64G ram, I tried 32G for Solr Heap memory, but the
performance didn't improve much.  Any suggestions?  Thank you very
much!=Return 0 rows:232  --.-KB/sin 0s { 
"responseHeader":{"status":0,"QTime":96,"params":{ 
"q":"human",  "fl":"pdb_id,score",  "start":"0",  "rows":"0"}}, 
"response":{"numFound":67428,"start":0,"maxScore":246.08528,"docs":[]  }}~  
Return 1 rows: 584.46K  65.4KB/sin 8.9s {  "responseHeader":{   
"status":0,"QTime":39,"params":{  "q":"human", 
"fl":"pdb_id,score",  "start":"0",  "rows":"1"}}, 
"response":{"numFound":67428,"start":0,"maxScore":246.08528,"docs":[  {
 



--
Sent from: http://lucene.472066.n3.nabble.com/Solr-User-f472068.html

Re: Help with slow retrieving data

2019-03-24 Thread Wendy2
Hi Shawn,

Thank you very much for your response! Here is a screen shot.  Is the CPU an
issue?


 



--
Sent from: http://lucene.472066.n3.nabble.com/Solr-User-f472068.html


Help with slow retrieving data

2019-03-24 Thread Wendy2
Hi Solr users:I use Solr 7.3.1 and 150,000 documents and about 6GB in total.
When I try to retrieve 2 ids (4 letter code, indexed and stored), it
took 17s to retrieve 1.14M size data. I tried to increase RAM and cache, but
only helped to some degree (from 25s to 17s).  Any idea/suggestions where I
should look?   Thanks!  wget -O
output.txt
'http://localhost:8983/solr/s_entry/search?fl=pdb_id,score&q=human&start=0&rows=2'
  
1.14M  66.7KB/sin 17s 



--
Sent from: http://lucene.472066.n3.nabble.com/Solr-User-f472068.html

Re: Help with error on indexing mongoDB document by Solr DataImportHandler

2018-08-14 Thread Wendy2
H Shawn,

Thank you very much for your response! 

Solr DataImportHandler "could not" directly index MongoDB collections. I
used open source SolrMongoImporter project
(https://github.com/james75/SolrMongoImporter) on top of Solr DIH to to
directly index data of MongoDB collections.

What I did was that I wrote a new Java class (public class EntryTransformer
extends Transformer) to convert the ObjectId to a String and to flatten
MongoDB document.  

Here is my data-config.xml file:











So overall, I didn't change any code in the following projects. 

solr-dataimporthandler-7.3.1.jar
solr-dataimporthandler-extras-7.3.1.jar
solr-mongo-importer-1.1.0.jar

Since MongoDB or other NOo-SQL database are in the trend of increasing
usage, I wish that Solr will develop new tools or extend its
DataImportHandler to allow directly index No-SQL document.

All the best!  



--
Sent from: http://lucene.472066.n3.nabble.com/Solr-User-f472068.html


Re: Help with error on indexing mongoDB document by Solr DataImportHandler

2018-08-13 Thread Wendy2
Update:

I resolved this issue by checking key:value to convert ObjectId to String: 

if(value instanceof ObjectId) {
   map.put(key, (String) value.toString());
} else {
  ..
}
   

A Solr happy user :-)



--
Sent from: http://lucene.472066.n3.nabble.com/Solr-User-f472068.html


Help with error on indexing mongoDB document by Solr DataImportHandler

2018-08-13 Thread Wendy2
Hi Solr users:I encountered the following error when indexing MongoDB data by
using Solr DataImportHandler:org.apache.solr.common.SolrException:
TransactionLog doesn't know how to serialize class org.bson.types.ObjectId;
try implementing ObjectResolver?Is there any fix or workaround for this
issue? I am using solr 7.3.1 and mongoDB 3.6.5.   ===Here is the source
code===https://github.com/apache/lucene-solr/blob/master/solr/core/src/java/org/apache/solr/update/TransactionLog.java//
write a BytesRef as a byte array  static final JavaBinCodec.ObjectResolver
resolver = new JavaBinCodec.ObjectResolver() {@Overridepublic Object
resolve(Object o, JavaBinCodec codec) throws IOException {  if (o
instanceof BytesRef) {BytesRef br = (BytesRef)o;   
codec.writeByteArray(br.bytes, br.offset, br.length);return null; 
}  // Fallback: we have no idea how to serialize this.  Be noisy to
prevent insidious bugs  throw new
SolrException(SolrException.ErrorCode.SERVER_ERROR,  "TransactionLog
doesn't know how to serialize " + o.getClass() + "; try implementing
ObjectResolver?");}  };



--
Sent from: http://lucene.472066.n3.nabble.com/Solr-User-f472068.html

Help on indexing nested documenta in MongoDB

2018-07-27 Thread Wendy2
Hi fellow Solr users,

I am looking for a way to index nested documents in mongodb using Solr's
DataImportHandler. Is there any recommendations? 

I googled on the web in the last two weeks and found the following posts. I
was able to index the top level fields W/O any issue, but had trouble in
indexing the nested object. 

In the past, I used mongo-connector to index simple JSON documents in
mongodb. But for deeply nested documents in MongoDB, I am looking for a more
powerful way to do it. 

Is there any recommendations or good posts on indexing nested documents in
mongodb using Solr's DataImportHandler?   Thanks!

===tried by following the following references==

https://stackoverflow.com/questions/21450555/steps-to-connect-mongodb-and-solr-using-dataimporthandler



https://github.com/james75/SolrMongoImporter/blob/master/src/main/org/apache/solr/handler/dataimport/MongoMapperTransformer.java

https://github.com/5missions/mongoSolrImporter

https://stackoverflow.com/questions/21450555/steps-to-connect-mongodb-and-solr-using-dataimporthandler

https://mrstevenzhao.blogspot.com/2016/05/apache-solr-install-w-mongodb-indexing.html?showComment=1532376114861#c9086728334737074426

 



--
Sent from: http://lucene.472066.n3.nabble.com/Solr-User-f472068.html


Re: need help on search on last name + middile initial

2018-04-20 Thread Wendy2
The issue was resolved.

*I created a new fieldType:*

  

 

  
  


  
 


*A reference:*
https://opensourceconnections.com/blog/2013/08/21/name-search-in-solr/



--
Sent from: http://lucene.472066.n3.nabble.com/Solr-User-f472068.html


Re: need help on search on last name + middile initial

2018-04-20 Thread Wendy2
Hi Shawn,

The issue got resolved :-)  Thank you very much for your help!!

*I created a new fieldType:*

  

 

  
  


  
 

*
A reference:*
https://opensourceconnections.com/blog/2013/08/21/name-search-in-solr/



--
Sent from: http://lucene.472066.n3.nabble.com/Solr-User-f472068.html


Re: How to protect middile initials during search

2018-04-20 Thread Wendy2
Hi Alessandro,

Thank you very much for your reply!

I got the issue resolved based on the suggestion from the article below:
https://opensourceconnections.com/blog/2013/08/21/name-search-in-solr/

*I created a new fieldType:*

  

 

  
  


  




--
Sent from: http://lucene.472066.n3.nabble.com/Solr-User-f472068.html


Re: How to protect middile initials during search

2018-04-19 Thread Wendy2
Hi Jay,

Thank you very much for your reply!
I re-indexed the data after removing the stopword
filter. It looks that Solr parsed the data correctly but didn't return any
results. Anything else could I try?  Thank you again!  

===debugQuery Output=
{
  "responseHeader":{
"status":0,
"QTime":6,
"params":{
  "q":"\"Ellington, A.\"",
  "indent":"on",
  "fl":"audit_author.name",
  "wt":"json",
  "debugQuery":"true"}},
  "response":{"numFound":0,"start":0,"docs":[]
  },
  "debug":{
"rawquerystring":"\"Ellington, A.\"",
"querystring":"\"Ellington, A.\"",
"parsedquery":"(+DisjunctionMaxQuery(((pdb_id:Ellington, A.)^5.0 |
(audit_author.name:\"ellington, a.\")^5.
.
"parsedquery_toString":"+((pdb_id:Ellington, A.)^5.0 |
(audit_author.name:\"ellington, a.\")^5.0 |
.
 "QParser":"ExtendedDismaxQParser",




--
Sent from: http://lucene.472066.n3.nabble.com/Solr-User-f472068.html


Re: need help on search on last name + middile initial

2018-04-19 Thread Wendy2
Hi Shawn,

Thank you very much for your reply!
Per your suggestion, I re-indexed the data after removing the stopword
filter. It looks that Solr parsed the data correctly but didn't return any
results. Anything else could I try?  Thank you again!  

===debugQuery Output=
{
  "responseHeader":{
"status":0,
"QTime":6,
"params":{
  "q":"\"Ellington, A.\"",
  "indent":"on",
  "fl":"audit_author.name",
  "wt":"json",
  "debugQuery":"true"}},
  "response":{"numFound":0,"start":0,"docs":[]
  },
  "debug":{
"rawquerystring":"\"Ellington, A.\"",
"querystring":"\"Ellington, A.\"",
"parsedquery":"(+DisjunctionMaxQuery(((pdb_id:Ellington, A.)^5.0 |
(audit_author.name:\"ellington, a.\")^5.
.
"parsedquery_toString":"+((pdb_id:Ellington, A.)^5.0 |
(audit_author.name:\"ellington, a.\")^5.0 |
.
 "QParser":"ExtendedDismaxQParser",

==Here is the analysis from Solr Admin UI==
     





--
Sent from: http://lucene.472066.n3.nabble.com/Solr-User-f472068.html


How to ptotect middile initials during search

2018-04-18 Thread Wendy2
Hi fellow Users,

Why did Solr return "Ellington, W.R." when I did a name search for
"Ellington, A."?  
I even added "A." in the protwords.txt file. The debugQuery shows that the
middle initial got dropped in the parsedquery.
How can I make Solr NOT to drop the middle initial?  Thanks for your help!! 
 
==Search results
Ellington, A.D.
Ellington, R.W..

===debugQuery=
{
  "responseHeader":{
"status":0,
"QTime":51,
"params":{
  "q":"\"Ellington, A.\"",
  "indent":"on",
  "fl":"audit_author.name",
  "wt":"json",
  "debugQuery":"true"}},
  "response":{"numFound":2,"start":0,"docs":[
  {
"audit_author.name":"Azzi, A., Clark, S.A., Ellington, R.W.,
Chapman, M.S."},
  {
"audit_author.name":"Ye, X., Gorin, A., Ellington, A.D., Patel,
D.J."}]
  },
  "debug":{
"rawquerystring":"\"Ellington, A.\"",
"querystring":"\"Ellington, A.\"",
   
"parsedquery":"(+DisjunctionMaxQuery(((entity_name_com.name:ellington)^20.0)))/no_coord",
"parsedquery_toString":"+((entity_name_com.name:ellington)^20.0)",
   "QParser":"ExtendedDismaxQParser",




--
Sent from: http://lucene.472066.n3.nabble.com/Solr-User-f472068.html


need help on search on last name + middile initial

2018-04-18 Thread Wendy2
Hi Solr experts:

How can I make sure Solr doesn't drop middle initial when I do a name
search?
I did a search with double quotes for "Ellington, A.", but Solr parser
dropped the middle initial, so I got both back:
I even tried keeping A. in the protwords.txt file, but didn't work. 
Any work around or suggestions?  Thanks!!!

*RESULTS:*
 "response":{"numFound":2,"start":0,"docs":[
  {
"audit_author.name":"Azzi, A., Clark, S.A., Ellington, R.W.,
Chapman, M.S."},
  {
"audit_author.name":"Ye, X., Gorin, A., Ellington, A.D., Patel,
D.J."}]
  },
  "debug":{
"debugQuery mode indicates that Solr dropped the ""A."" when parsing the
query:
  ""debug"":{
""rawquerystring"":""\""Ellington, A.\,
""querystring"":""\""Ellington, A.\,
   
""parsedquery"":""(+DisjunctionMaxQuery(((entity_name_com.name:ellington)^20.0)))/no_coord"",
""parsedquery_toString"":""+((entity_name_com.name:ellington)^20.0)"",
   ""QParser"":""ExtendedDismaxQParser"", "



--
Sent from: http://lucene.472066.n3.nabble.com/Solr-User-f472068.html


Re: How can I prevent adding duplicated copyfield into managed schema

2018-02-06 Thread Wendy2
What if you include the core name as part of the direct path?
In my practice, managed-schema file is stored under ~/solr/coreName, so it
is core specific and will never be shared with other cores. 

Wendy  



--
Sent from: http://lucene.472066.n3.nabble.com/Solr-User-f472068.html


Re: Help with Boolean search using Solr parser edismax

2018-02-02 Thread Wendy2
Hi Erick,

Yes. Currently I re-index the database on a weekly basis because we only
have weekly release.
As part of the Solr weekly re-index, the batch job will delete the
/solr/core/data folder, restart Solr server, then re-index.
We use Luigi to build/control pipelines of Solr re-index batch jobs.

Thanks for al your help and support!

All the best,

Wendy 
a happy Solr user :-) 



--
Sent from: http://lucene.472066.n3.nabble.com/Solr-User-f472068.html


Re: Help with Boolean search using Solr parser edismax

2018-02-02 Thread Wendy2
Hi Erick,

Thank you very much for the clarification. I will keep it in my mind since
we are now in the process of migrating MySQL database to mongoDB.

Best Regards,

Wendy 
a happy Solr user 



--
Sent from: http://lucene.472066.n3.nabble.com/Solr-User-f472068.html


Re: Help with Boolean search using Solr parser edismax

2018-02-02 Thread Wendy2
Good morning, Emir,

Thanks for letting me know that. I used dots to add tableName. as a field
prefix because several columns from different tables have the same names.  
In your opinion, what will be the best way to replace dots?

Happy Friday!

Wendy



--
Sent from: http://lucene.472066.n3.nabble.com/Solr-User-f472068.html


Re: Help with Boolean search using Solr parser edismax

2018-02-01 Thread Wendy2
And the coupon has no expiration date on it (LOL).  Thank you again, Emir!

Best Regards,

Wendy



--
Sent from: http://lucene.472066.n3.nabble.com/Solr-User-f472068.html


Re: Help with Boolean search using Solr parser edismax

2018-02-01 Thread Wendy2
Excellent!!! Thank you so much for all your help, Emir!

Both worked now and I got 997 result counts back as the expected number :-) 

/rcsb/search?q=method:"x-ray*" "Solution NMR"&mm=1
/rcsb/search?q=+method:"x-ray*" +"Solution NMR"&mm=1

I will keep this in my mind regarding query with multiple parsers:  
/select?q=method:”x-ray*” OR _query({!edismax mm=7
qf=‘title_field_stem^3,….’}”Solution NMR”). 

Thanks again and have a wonderful Thursday!
If you ever come to NJ area, I would like to take you out for a lunch to
thank you for all your help!

Wendy



--
Sent from: http://lucene.472066.n3.nabble.com/Solr-User-f472068.html


Re: Help with Boolean search using Solr parser edismax

2018-02-01 Thread Wendy2
Good morning, Emir,

Here are the debug output for case 1f-a (q=method:"x-ray*" "Solution NMR"),
1f-b (q=+method:"x-ray*" +"Solution NMR"). both returned zero counts. It
looks that the querystrings are the same.   Thanks for following up on my
post and your help! -- Wendy


*=DebugQuery Outputs for case 1f-a,
1f-b===*
*1f-a (/search?q=method:"x-ray*" "Solution NMR"): result count = 0*
 "debug":{
"rawquerystring":"method:\"x-ray*\" \"Solution NMR\"",
"querystring":"method:\"x-ray*\" \"Solution NMR\"",
"parsedquery":"(+(PhraseQuery(method:\"x rai\")
DisjunctionMaxQuery(((pdb_id:Solution NMR)^5.0 |
(entity_name_com.name:\"solut nmr\")^20.0 | (citation_author.name:\"solut
nmr\")^5.0 | (audit_author.name:\"solut nmr\")^5.0 |
rest_fields_stem:\"solut nmr\" | (title_fields_stem:\"solut nmr\")^3.0 |
(classification:\"solut nmr\")^15.0 | (struct_keywords.text:\"solut
nmr\")^12.0 | (entity.pdbx_description:\"solut nmr\")^10.0 |
(pdbx_descriptor_stem:\"solut nmr\")^10.0 | (citation.title:\"solut
nmr\")^25.0 | (struct_keywords.pdbx_keywords:\"solut nmr\")^15.0 |
(entity_src_gen_concat_stem:\"solut nmr\")^15.0 | (struct.title:\"solut
nmr\")^35.0 | (group_id_stem:\"solut nmr\")^10.0)))~2)/no_coord",
"parsedquery_toString":"+((method:\"x rai\" ((pdb_id:Solution NMR)^5.0 |
(entity_name_com.name:\"solut nmr\")^20.0 | (citation_author.name:\"solut
nmr\")^5.0 | (audit_author.name:\"solut nmr\")^5.0 |
rest_fields_stem:\"solut nmr\" | (title_fields_stem:\"solut nmr\")^3.0 |
(classification:\"solut nmr\")^15.0 | (struct_keywords.text:\"solut
nmr\")^12.0 | (entity.pdbx_description:\"solut nmr\")^10.0 |
(pdbx_descriptor_stem:\"solut nmr\")^10.0 | (citation.title:\"solut
nmr\")^25.0 | (struct_keywords.pdbx_keywords:\"solut nmr\")^15.0 |
(entity_src_gen_concat_stem:\"solut nmr\")^15.0 | (struct.title:\"solut
nmr\")^35.0 | (group_id_stem:\"solut nmr\")^10.0))~2)",


*1f-b (/search?q=+method:"x-ray*" +"Solution NMR") result count = 0:*
"debug":{
"rawquerystring":" method:\"x-ray*\"  \"Solution NMR\"",
"querystring":" method:\"x-ray*\"  \"Solution NMR\"",
"parsedquery":"(+(PhraseQuery(method:\"x rai\")
DisjunctionMaxQuery(((pdb_id:Solution NMR)^5.0 |
(entity_name_com.name:\"solut nmr\")^20.0 | (citation_author.name:\"solut
nmr\")^5.0 | (audit_author.name:\"solut nmr\")^5.0 |
rest_fields_stem:\"solut nmr\" | (title_fields_stem:\"solut nmr\")^3.0 |
(classification:\"solut nmr\")^15.0 | (struct_keywords.text:\"solut
nmr\")^12.0 | (entity.pdbx_description:\"solut nmr\")^10.0 |
(pdbx_descriptor_stem:\"solut nmr\")^10.0 | (citation.title:\"solut
nmr\")^25.0 | (struct_keywords.pdbx_keywords:\"solut nmr\")^15.0 |
(entity_src_gen_concat_stem:\"solut nmr\")^15.0 | (struct.title:\"solut
nmr\")^35.0 | (group_id_stem:\"solut nmr\")^10.0)))~2)/no_coord",
"parsedquery_toString":"+((method:\"x rai\" ((pdb_id:Solution NMR)^5.0 |
(entity_name_com.name:\"solut nmr\")^20.0 | (citation_author.name:\"solut
nmr\")^5.0 | (audit_author.name:\"solut nmr\")^5.0 |
rest_fields_stem:\"solut nmr\" | (title_fields_stem:\"solut nmr\")^3.0 |
(classification:\"solut nmr\")^15.0 | (struct_keywords.text:\"solut
nmr\")^12.0 | (entity.pdbx_description:\"solut nmr\")^10.0 |
(pdbx_descriptor_stem:\"solut nmr\")^10.0 | (citation.title:\"solut
nmr\")^25.0 | (struct_keywords.pdbx_keywords:\"solut nmr\")^15.0 |
(entity_src_gen_concat_stem:\"solut nmr\")^15.0 | (struct.title:\"solut
nmr\")^35.0 | (group_id_stem:\"solut nmr\")^10.0))~2)",



--
Sent from: http://lucene.472066.n3.nabble.com/Solr-User-f472068.html


Re: Help with Boolean search using Solr parser edismax

2018-01-31 Thread Wendy2
Hi Emir,

Listed below are the debugQuery outputs from query without "OR" operator. I
really appreciate your help!  --Wendy

===DebugQuery Outputs for case 1f-a, 1f-b  without "OR"
operator=
*1f-a (/search?q=+method:"x-ray*" +method:"Solution NMR") result counts = 0:
*

  "debug":{
"rawquerystring":" method:\"x-ray*\"  method:\"Solution NMR\"",
"querystring":" method:\"x-ray*\"  method:\"Solution NMR\"",
"parsedquery":"(+(PhraseQuery(method:\"x rai\")
PhraseQuery(method:\"solut nmr\"))~2)/no_coord",
"parsedquery_toString":"+((method:\"x rai\" method:\"solut nmr\")~2)",


*1f-b (/search?q=method:"x-ray*" method:"Solution NMR") result counts = 0: *

"debug":{
"rawquerystring":"method:\"x-ray*\" method:\"Solution NMR\"",
"querystring":"method:\"x-ray*\" method:\"Solution NMR\"",
"parsedquery":"(+(PhraseQuery(method:\"x rai\")
PhraseQuery(method:\"solut nmr\"))~2)/no_coord",
"parsedquery_toString":"+((method:\"x rai\" method:\"solut nmr\")~2)",



--
Sent from: http://lucene.472066.n3.nabble.com/Solr-User-f472068.html


Re: Help with Boolean search using Solr parser edismax

2018-01-31 Thread Wendy2
Hi Emir,

Thank you so much for following up with your ticket.
Listed below are the parts of debugQuery outputs via /search request
handler. The reason I used * in the query term is that there are a couple of
methods starting with "x-ray". When I used space surrounding the "OR"
boolean search operator (refer to 1f) below, I got zero results. If I remove
the space, the result count = 19. 

Thank you very much for investigating this issue. I am a happy Solr user. We
implemented Solr to our web site text search (www.rcsb.org) last year and
have improved the search results :-). Now we want to expand our text search
to support Boolean search and I am facing this issue.  Thank you again for
all your help and support! --Wendy  

===DebugQuery Outputs for case 1d, 1e, 1f =
*1d(/search?q=method:"x-ray*"):* result counts = 884

"debug":{
"rawquerystring":"method:\"x-ray*\"",
"querystring":"method:\"x-ray*\"",
"parsedquery":"(+PhraseQuery(method:\"x rai\"))/no_coord",
"parsedquery_toString":"+method:\"x rai\"",

*1e (/search?q=method:"Solution NMR"):* result counts = 153

 "debug":{
"rawquerystring":"method:\"Solution NMR\"",
"querystring":"method:\"Solution NMR\"",
"parsedquery":"(+PhraseQuery(method:\"solut nmr\"))/no_coord",
"parsedquery_toString":"+method:\"solut nmr\"",

*1f (/search?q=method:"x-ray*" OR "Solution NMR"):* result counts = 0

 "debug":{
"rawquerystring":"method:\"x-ray*\" OR \"Solution NMR\"",
"querystring":"method:\"x-ray*\" OR \"Solution NMR\"",
"parsedquery":"(+(PhraseQuery(method:\"x rai\")
DisjunctionMaxQuery(((pdb_id:OR)^5.0)) DisjunctionMaxQuery(((pdb_id:Solution
NMR)^5.0 | (entity_name_com.name:\"solut nmr\")^20.0 |
(citation_author.name:\"solut nmr\")^5.0 | (audit_author.name:\"solut
nmr\")^5.0 | rest_fields_stem:\"solut nmr\" | (title_fields_stem:\"solut
nmr\")^3.0 | (classification:\"solut nmr\")^15.0 |
(struct_keywords.text:\"solut nmr\")^12.0 | (entity.pdbx_description:\"solut
nmr\")^10.0 | (pdbx_descriptor_stem:\"solut nmr\")^10.0 |
(citation.title:\"solut nmr\")^25.0 | (struct_keywords.pdbx_keywords:\"solut
nmr\")^15.0 | (entity_src_gen_concat_stem:\"solut nmr\")^15.0 |
(struct.title:\"solut nmr\")^35.0 | (group_id_stem:\"solut
nmr\")^10.0)))~3)/no_coord",
"parsedquery_toString":"+((method:\"x rai\" ((pdb_id:OR)^5.0)
((pdb_id:Solution NMR)^5.0 | (entity_name_com.name:\"solut nmr\")^20.0 |
(citation_author.name:\"solut nmr\")^5.0 | (audit_author.name:\"solut
nmr\")^5.0 | rest_fields_stem:\"solut nmr\" | (title_fields_stem:\"solut
nmr\")^3.0 | (classification:\"solut nmr\")^15.0 |
(struct_keywords.text:\"solut nmr\")^12.0 | (entity.pdbx_description:\"solut
nmr\")^10.0 | (pdbx_descriptor_stem:\"solut nmr\")^10.0 |
(citation.title:\"solut nmr\")^25.0 | (struct_keywords.pdbx_keywords:\"solut
nmr\")^15.0 | (entity_src_gen_concat_stem:\"solut nmr\")^15.0 |
(struct.title:\"solut nmr\")^35.0 | (group_id_stem:\"solut
nmr\")^10.0))~3)",
"explain":{},





--
Sent from: http://lucene.472066.n3.nabble.com/Solr-User-f472068.html


Re: Help with Boolean search using Solr parser edismax

2018-01-30 Thread Wendy2
Hi Emlr,

Thank you for reading my post and for your reply. I updated my post with
debug info and a better view of the definition of  /search request handler. 

Any suggestion on what I should try? 

Thanks,

Wendy



--
Sent from: http://lucene.472066.n3.nabble.com/Solr-User-f472068.html


Re: Help with Boolean search using Solr parser edismax

2018-01-30 Thread Wendy2
Hi Emir,

Thank you so much for your response. I updated my post with an image which
display the configuration of the /search request handler. Any suggestions?

Thanks,

Wendy




--
Sent from: http://lucene.472066.n3.nabble.com/Solr-User-f472068.html


Help with Boolean search using Solr parser edismax

2018-01-29 Thread Wendy2
Hi Solr users,I am having an issue on boolean search with Solr parser
edismax. The search "OR" doesn't work. The image below shows the different
results tested on different Solr versions. There are two types of search
requester handlers, /select vs /search. The /select requester uses Lucene
default parser, while /search uses Solr edismax parser.  I also listed the
search requester handler below. I am expecting the result count of 997 (844
+ 153) but I only get the correct count via the default /select request
handler on Solr v5.3.0 and 6.2.0.  I I go back to use the old Lucene default
parser via /select request handler, I lose all the nice customization
ranking and sorting :-(Does anyone know some workaround/solution to fix this
type of search issue? THANKS! 

Part of the /search request handler in solrconfig.xml
file:trueexplicitedismaxpdb_id^5.0struct.title^35.0citation.title^25.0title_fields_stem^3.0...rest_fields_stem
^0.3score desc,release_date desc,pdb_id desc7100text



--
Sent from: http://lucene.472066.n3.nabble.com/Solr-User-f472068.html