#506: Provide PS, type and field code searching
------------------------+----------------------
  Reporter:  tbrooks    |      Owner:  valkyrie
      Type:  defect     |     Status:  assigned
  Priority:  major      |  Milestone:
 Component:  WebSearch  |    Version:
Resolution:             |   Keywords:  INSPIRE
------------------------+----------------------
Changes (by simko):

 * status:  in_merge => assigned


Comment:

 1) You can make indexes by inserting proper configuration statements
 to the top-level `Makefile`; see for example what I did earlier for
 the "firstauthor" index.  (INSPIRE repo, commit 2146ab19)

 2) The journal index is already made, so you can test the journal
 synonym searching (including volume and pages) even without creating
 the other indexes.  In the journal index synonym configuration, the
 branch currently uses the massaging function `leading_to_number`, but
 I think you should rather use `leading_to_comma`, because INSPIRE
 convention for journal index is to separate //journal,volume,page//
 values by commas.  So with `leading_to_number`, journal searches
 including volume and page would not work.  For the two other indexes,
 the `exact` massaging function seems appropriate.

 3) For the two other classification/doctype indexes, we may perhaps
 consider using the index-time synonyms instead of search-time
 synonyms, especially if people are used to values like `E` from SPIRES
 times.

 4) But I'd like to clarify the terminology regarding
 classification/doctype indexes first.

 WRT "classification" KB, it generates values for 65017 field which is
 called "subject" in cataloguing tools.  How do we want to call the new
 index in the user facing parts of INSPIRE, "classification" or
 "subject" or "fc" or something else?  Consider that people may be
 typing and/or seeing query terms like `classification:E` or
 `classification:"Experiment-HEP"`, so we'd better choose something
 nice.  Maybe stick to "subject" like in cataloguing tools, maybe stick
 to "fc" if we choose this to be the user-facing canonical index name
 and not only an alias, etc.

 WRT "doctype", there is a similar naming mismatch.  Moreover, here the
 word doctype has a very concrete meaning in Invenio, namely the type
 of a document attached to a record.  So it may be misleading to call
 it that.  (BTW, see also somewhat related filetype/doctype index issue
 in ticket:473.)

 5) When updating KB/index names, we may want to amend the following
 description `TALKTYPEDESC='Mapping of... something?'` a bit. :)

 6) You should also document in the commit log the weblinks/oalinks fix
 done alongside the process.  Ideally, we should perhaps separate this
 fix into a commit of its own, since it is unrelated to synonym KBs.

-- 
Ticket URL: <http://invenio-software.org/ticket/506#comment:3>
Invenio <http://invenio-software.org>

Reply via email to