Re: [Dspace-tech] fulltext in browse and search

2011-10-07 Thread helix84
On Fri, Oct 7, 2011 at 02:39, Kim Shepherd kim.sheph...@gmail.com wrote:
> RE #2: You could use query-time boosting to boost the relevance of the
> 'fulltext' field... I don't have an example off the top of my head but I think
> this should be possible... maybe even combining your original query with a
> fulltext:{query}^10 (where '10' is the boost value... this is probably too
> high! but suffices for demo purposes.)

Thanks Kim, that's what I was looking for. Your comment about the
boost value being too high - does it matter at all if the two OR-ed
queries are identical?

I'll try to integrate this into the Discovery search interface as
an optional checkbox (on by default in my repository). Would others also
be interested in this functionality? Should I submit a patch?

Regards,
~~helix84

--
All of the data generated in your IT infrastructure is seriously valuable.
Why? It contains a definitive record of application performance, security
threats, fraudulent activity, and more. Splunk takes this data and makes
sense of it. IT sense. And common sense.
http://p.sf.net/sfu/splunk-d2dcopy2
___
DSpace-tech mailing list
DSpace-tech@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/dspace-tech


Re: [Dspace-tech] fulltext in browse and search

2011-10-06 Thread Bram Luyten
Hi Helix,

1. To do this, you can use [* TO *] as the value of a filter query on the
fulltext field, i.e. a query like fulltext:[* TO *].
See it in action:
http://demo.dspace.org/xmlui/discover?scope=%2F&query=fulltext%3A[*+TO+*]&submit=Go

2. As far as I know, that's not possible.

best regards,

Bram

-- 
*Bram Luyten* *@mire*
*2888 Loker Avenue East, Suite 305, Carlsbad, CA. 92010*
*Esperantolaan 4, Heverlee 3001, Belgium*
http://www.atmire.com/





Re: [Dspace-tech] fulltext in browse and search

2011-10-06 Thread helix84
On Thu, Oct 6, 2011 at 14:05, Bram Luyten b...@mire.be wrote:
> 1. To this goal, you can put [* TO *] in the fulltext filter query, with a
> query like  fulltext:[* TO *].
> see it in action:
> http://demo.dspace.org/xmlui/discover?scope=%2F&query=fulltext%3A[*+TO+*]&submit=Go

Thanks, that worked great. I had to run filter-media first; it didn't
work without that step.
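
For anyone wanting to generate that link programmatically, here is a minimal Python sketch that builds the same Discovery URL with its parameters properly escaped. The function name fulltext_only_url is made up for illustration; the base URL and the scope, query and submit parameters are the ones from the link quoted above.

```python
from urllib.parse import urlencode, parse_qs, urlsplit

# Base URL of the Discovery search page on the demo server quoted above.
DISCOVER_URL = "http://demo.dspace.org/xmlui/discover"


def fulltext_only_url(base=DISCOVER_URL, scope="/"):
    """Return a Discovery URL matching items whose fulltext field has any value.

    Hypothetical helper: only the parameter names come from the quoted link.
    """
    params = {
        "scope": scope,
        "query": "fulltext:[* TO *]",  # open-ended range: the field is not empty
        "submit": "Go",
    }
    # urlencode escapes ':', '[', ']', '*' and joins the pairs with '&'.
    return base + "?" + urlencode(params)


url = fulltext_only_url()
print(url)

# Decoding the query string again gives back the original Solr query.
print(parse_qs(urlsplit(url).query)["query"][0])
```

Building the URL from a dictionary avoids hand-escaping the brackets and keeps the '&' separators intact.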

One more related question - it would be nice to display the number of
full texts (the same query) on the repository front page. What would
the XSL for that be?

Regards,
~~helix84



Re: [Dspace-tech] fulltext in browse and search

2011-10-06 Thread helix84
On Thu, Oct 6, 2011 at 15:28, helix84 heli...@centrum.sk wrote:
> On Thu, Oct 6, 2011 at 14:05, Bram Luyten b...@mire.be wrote:
>> 1. To this goal, you can put [* TO *] in the fulltext filter query, with a
>> query like  fulltext:[* TO *].
>> see it in action:
>> http://demo.dspace.org/xmlui/discover?scope=%2F&query=fulltext%3A[*+TO+*]&submit=Go

Also, I still don't know how to make the facet. I tried adding
",fulltext" to the solr.facets.* configuration variables in
dspace-solr-search.cfg and running update-discovery-index, but this
had no effect.

Regards,
~~helix84



Re: [Dspace-tech] fulltext in browse and search

2011-10-06 Thread helix84
On Thu, Oct 6, 2011 at 15:28, helix84 heli...@centrum.sk wrote:
> On Thu, Oct 6, 2011 at 14:05, Bram Luyten b...@mire.be wrote:
>> 1. To this goal, you can put [* TO *] in the fulltext filter query, with a
>> query like  fulltext:[* TO *].
>> see it in action:
>> http://demo.dspace.org/xmlui/discover?scope=%2F&query=fulltext%3A[*+TO+*]&submit=Go
>
> Thanks, that worked great. I had to run filter-media first, it didn't
> work without that step.
>
> One more related question - it would be nice to display the number of
> full texts (the same query) on the repository front page. What would
> the XSL for that be?

Thanks to Bram's suggestion I was able to figure this out:

<xsl:variable name="fulltext"
select="document('http://localhost:8080/solr/search/select/?q=fulltext%3A[*+TO+*]&amp;version=2.2&amp;start=0&amp;rows=0&amp;indent=on&amp;facet=true')"/>
Full texts: <xsl:value-of
select="$fulltext/response/result[@numFound]/@numFound"/>
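
To make it clearer what the XPath picks out, here is a rough Python equivalent that reads the numFound attribute from a Solr XML response. The sample response and its count of 42 are invented for illustration, and count_from_solr_xml is a hypothetical helper; only the response/result/@numFound path comes from the XSL above.

```python
import xml.etree.ElementTree as ET

# Made-up sample of what Solr's XML response writer returns for a rows=0
# query; the numFound value is illustrative, not real data.
SAMPLE_RESPONSE = """<response>
  <lst name="responseHeader"><int name="status">0</int></lst>
  <result name="response" numFound="42" start="0"/>
</response>"""


def count_from_solr_xml(xml_text):
    """Return the numFound attribute of the top-level <result> element."""
    root = ET.fromstring(xml_text)  # root is the <response> element
    result = root.find("result")    # same node as response/result in the XPath
    if result is None or "numFound" not in result.attrib:
        raise ValueError("no <result numFound=...> element in response")
    return int(result.attrib["numFound"])


fulltext_count = count_from_solr_xml(SAMPLE_RESPONSE)
print("Full texts:", fulltext_count)
```

With rows=0 Solr returns no documents, so the response stays small and only the hit count is read.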


Still working on the facet. I'm on trunk (practically speaking, 1.8).

Regards,
~~helix84



Re: [Dspace-tech] fulltext in browse and search

2011-10-06 Thread Kim Shepherd
RE #2: You could use query-time boosting to boost the relevance of the
'fulltext' field... I don't have an example off the top of my head but I
think this should be possible... maybe even combining your original query
with a fulltext:{query}^10 (where '10' is the boost value... this is
probably too high! but suffices for demo purposes.)

Another example:

* A normal search for Brian Cox:
http://demo.dspace.org/xmlui/discover?scope=%2F&query=Brian+Cox
Note that the first result is a non-text bitstream, so it has no full text. The
second result is a text bitstream, which should be indexed as fulltext.

* A boosted text search for Brian Cox:
http://demo.dspace.org/xmlui/discover?scope=%2F&query=Brian+Cox+OR+(fulltext:*+AND+Brian+Cox)^10
Here the fulltext item appears first, because the clause I OR-ed in (which
just combines the original query with a fulltext:* query) is boosted by 10.
^VALUE is standard Lucene query parser syntax, so it's easy to use with Solr
too.
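
As a rough illustration of the pattern, this Python sketch assembles such a boosted query string from whatever the user typed. The helper name boost_fulltext is hypothetical, and it uses the fulltext:[* TO *] range form from Bram's reply in place of fulltext:*, on the assumption that both simply test for a non-empty field. It does not escape special characters in the user's query.

```python
def boost_fulltext(user_query, boost=10):
    """OR the original query with a boosted copy limited to full-text items.

    Hypothetical helper; user_query is inserted as-is, without escaping.
    """
    if boost <= 0:
        raise ValueError("boost must be positive")
    q = user_query.strip()
    # ^N after the parenthesised clause is Lucene's query-time boost syntax.
    return f"({q}) OR (fulltext:[* TO *] AND ({q}))^{boost}"


print(boost_fulltext("Brian Cox"))
# prints: (Brian Cox) OR (fulltext:[* TO *] AND (Brian Cox))^10
```

Every item that matched the original query still matches; items that also have full text match the second clause as well and so score higher.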

Cheers,

Kim




[Dspace-tech] fulltext in browse and search

2011-10-04 Thread helix84
Hello,
I'm using DSpace trunk with Discovery and I will stay on 1.8 when it comes out.

Most of our records are bibliographic only, and we will add full texts
in small amounts as we gain access to them. We would like to make
records with full texts prominently visible and also make it possible
to search/browse only in items with full texts.

Therefore I'd like to ask:
1) Is it possible to have a Discovery facet for items with fulltext?
2) Is it possible to rank items with full text before items without
full text in browse/search?
If so, how do I do it?

Let's define a fulltext item as an item that has at least one bitstream
in the ORIGINAL bundle.

Thanks in advance.

Regards,
~~helix84
