Re: [Dspace-tech] Problem with Search Result display in Mirage2 theme (additional information)

2015-06-22 Thread George Stanley Kozak
Hi,
This is an update to a problem with search results that I reported for my 
DSpace 5.2 Mirage2 XMLUI installation.
To make a long story short, I found that if I do a search on a term and that 
term appears in the bitstream name, then the extracted text’s bitstream name 
will appear in the snippet results of the search (as part of the fulltext).  I 
don’t know if this is a misconfiguration on my part or if this is how the 
fulltext searching is expected to display in the snippets.  Below is an example 
that will detail what I am talking about.  If anyone has any thoughts that they 
would like to share on this, I will be grateful.

Example:
I did a search on the term “Westbrooks” and in the search results, I saw:
Language Data and Rights
Westbrooks, Elaine (2006-03-31)
stream_source_info MWG_2006-03-31_Westbrooks.ppt.txt stream_content_type 
text/plain stream_size 9868 Content-Encoding UTF-8 stream_name 
MWG_2006-03-31_Westbrooks.ppt.txt Content-Type text/plain; charset=UTF-8 
Language...
Data and Rights Elaine L. Westbrooks March 31, 2006 Overview Ethics 
Undocumented Rights : Problems Rights Expression (Solutions) Ethics (1) 
Language Documentation facets of ethical research Code of Conduct Informed 
consent Ethics (2) Language Archiving...

(bitstream name: stream_source_info MWG_2006-03-31_Westbrooks.ppt.txt – this is 
the name of the extracted text which contains the fulltext and which fulfills 
the search criteria)

Another search, let’s say for the phrase “Overview Ethics Undocumented Rights” 
which appears within the extracted text will result in:
Language Data and Rights
Westbrooks, Elaine (2006-03-31)
Data and Rights Elaine L. Westbrooks March 31, 2006 Overview Ethics 
Undocumented Rights : Problems Rights Expression (Solutions) Ethics (1) 
Language Documentation facets of ethical research Code of Conduct Informed 
consent Ethics (2) Language Archiving...
Intellectual property rights International & Protected groups Preserve data 
Access management Human Subjects Institutional Review Board Confidentiality, 
privacy Problems Institutions/archives are responsible Linguists knowledgeable 
of access rights Documented...

(the name of the extracted text file does not appear because it does not meet 
the search criteria)

George Kozak
Digital Library Specialist
Cornell University Library Information Technologies (CUL-IT)
218 Olin Library
Cornell University
Ithaca, NY 14853
607-255-8924



--
Monitor 25 network devices or servers for free with OpManager!
OpManager is web-based network management software that monitors 
network devices and physical & virtual servers, alerts via email & sms 
for fault. Monitor 25 devices for free with no restriction. Download now
http://ad.doubleclick.net/ddm/clk/292181274;119417398;o___
DSpace-tech mailing list
DSpace-tech@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/dspace-tech
List Etiquette: https://wiki.duraspace.org/display/DSPACE/Mailing+List+Etiquette

Re: [Dspace-tech] Problem with Search Result display in Mirage2 theme (update)

2015-06-18 Thread George Stanley Kozak
Thanks, Brian:
I’ll take a look.

George Kozak
Digital Library Specialist
Cornell University Library Information Technologies (CUL-IT)
218 Olin Library
Cornell University
Ithaca, NY 14853
607-255-8924



From: Brian Freels-Stendel [mailto:bfre...@unm.edu]
Sent: Thursday, June 18, 2015 6:25 PM
To: George Stanley Kozak; 'dspace-tech@lists.sourceforge.net'
Subject: RE: [Dspace-tech] Problem with Search Result display in Mirage2 theme 
(update)

Good afternoon,

I'm not positive, but I think there's a chance it may be coming from 
dspace-xmlui-mirage2\src\main\webapp\xsl\aspect\discovery\discovery.xsl, lines 
272-280.

B--

From: George Stanley Kozak [mailto:g...@cornell.edu]
Sent: Thursday, June 18, 2015 3:07 PM
To: 'dspace-tech@lists.sourceforge.net'
Subject: [Dspace-tech] Problem with Search Result display in Mirage2 theme 
(update)

Hi,
I previously reported a problem with search results that I was seeing with my 
DSpace 5.2 Mirage2 XMLUI installation.  helix84 and Hilton Gibson pointed me to 
the search snippets and the solr index.
I’ve done some poking around, and I can’t seem to find what’s causing this.  
So, I am going to see if anyone can help.
Here is the problem:
In the search results display (if the item has no abstract), the search 
snippets displays field names in the search results, such as   
“stream_source_info” and “stream_content_type” and “stream_size”

Here’s an example (I did a search on the term “vineyard”):

Pellegrini Vineyards
Hearn, Russell (2008)
stream_source_info Hearn_PellegriniVineyards_WIWP_2008.pdf.txt 
stream_content_type text/plain stream_size 2036 Content-Encoding UTF-8 
stream_name Hearn_PellegriniVineyards_WIWP_2008.pdf.txt Content-Type 
text/plain; charset=UTF-8...
Pellegrini Vineyards 4/30/08 Pellegrini Vineyards •  FIRST VINTAGE: 1991 •  
OPERATED BY: Bob and Joyce Pellegrini •  WINEMAKER: Russell Hearn •  WINES: •  
East End Select- Chardonnay, Rosé, Merlot •  Pellegrini Vineyards- Chardonnay, 
Merlot...

If I add an abstract to the item (Let’s say, “This is a draft of the Vineyard 
Report from 2008”, then the search displays changes to:

Pellegrini Vineyards
Hearn, Russell (2008)
This is a draft of the Vineyard Report from 2008

Does anyone have any ideas why these field names would appear in my search 
results?  I rebuilt my solr indexes  (dspace index-discovery –b) and that 
didn’t seem to help.

George Kozak
Digital Library Specialist
Cornell University Library Information Technologies (CUL-IT)
218 Olin Library
Cornell University
Ithaca, NY 14853
607-255-8924



--
___
DSpace-tech mailing list
DSpace-tech@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/dspace-tech
List Etiquette: https://wiki.duraspace.org/display/DSPACE/Mailing+List+Etiquette

Re: [Dspace-tech] Problem with Search Result display in Mirage2 theme (update)

2015-06-18 Thread Brian Freels-Stendel
Good afternoon,

I'm not positive, but I think there's a chance it may be coming from 
dspace-xmlui-mirage2\src\main\webapp\xsl\aspect\discovery\discovery.xsl, lines 
272-280.

B--

From: George Stanley Kozak [mailto:g...@cornell.edu]
Sent: Thursday, June 18, 2015 3:07 PM
To: 'dspace-tech@lists.sourceforge.net'
Subject: [Dspace-tech] Problem with Search Result display in Mirage2 theme 
(update)

Hi,
I previously reported a problem with search results that I was seeing with my 
DSpace 5.2 Mirage2 XMLUI installation.  helix84 and Hilton Gibson pointed me to 
the search snippets and the solr index.
I’ve done some poking around, and I can’t seem to find what’s causing this.  
So, I am going to see if anyone can help.
Here is the problem:
In the search results display (if the item has no abstract), the search 
snippets displays field names in the search results, such as   
“stream_source_info” and “stream_content_type” and “stream_size”

Here’s an example (I did a search on the term “vineyard”):

Pellegrini Vineyards
Hearn, Russell (2008)
stream_source_info Hearn_PellegriniVineyards_WIWP_2008.pdf.txt 
stream_content_type text/plain stream_size 2036 Content-Encoding UTF-8 
stream_name Hearn_PellegriniVineyards_WIWP_2008.pdf.txt Content-Type 
text/plain; charset=UTF-8...
Pellegrini Vineyards 4/30/08 Pellegrini Vineyards •  FIRST VINTAGE: 1991 •  
OPERATED BY: Bob and Joyce Pellegrini •  WINEMAKER: Russell Hearn •  WINES: •  
East End Select- Chardonnay, Rosé, Merlot •  Pellegrini Vineyards- Chardonnay, 
Merlot...

If I add an abstract to the item (Let’s say, “This is a draft of the Vineyard 
Report from 2008”, then the search displays changes to:

Pellegrini Vineyards
Hearn, Russell (2008)
This is a draft of the Vineyard Report from 2008

Does anyone have any ideas why these field names would appear in my search 
results?  I rebuilt my solr indexes  (dspace index-discovery –b) and that 
didn’t seem to help.

George Kozak
Digital Library Specialist
Cornell University Library Information Technologies (CUL-IT)
218 Olin Library
Cornell University
Ithaca, NY 14853
607-255-8924



--
___
DSpace-tech mailing list
DSpace-tech@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/dspace-tech
List Etiquette: https://wiki.duraspace.org/display/DSPACE/Mailing+List+Etiquette

[Dspace-tech] Problem with Search Result display in Mirage2 theme (update)

2015-06-18 Thread George Stanley Kozak
Hi,
I previously reported a problem with search results that I was seeing with my 
DSpace 5.2 Mirage2 XMLUI installation.  helix84 and Hilton Gibson pointed me to 
the search snippets and the solr index.
I’ve done some poking around, and I can’t seem to find what’s causing this.  
So, I am going to see if anyone can help.
Here is the problem:
In the search results display (if the item has no abstract), the search 
snippets displays field names in the search results, such as   
“stream_source_info” and “stream_content_type” and “stream_size”

Here’s an example (I did a search on the term “vineyard”):

Pellegrini Vineyards
Hearn, Russell (2008)
stream_source_info Hearn_PellegriniVineyards_WIWP_2008.pdf.txt 
stream_content_type text/plain stream_size 2036 Content-Encoding UTF-8 
stream_name Hearn_PellegriniVineyards_WIWP_2008.pdf.txt Content-Type 
text/plain; charset=UTF-8...
Pellegrini Vineyards 4/30/08 Pellegrini Vineyards •  FIRST VINTAGE: 1991 •  
OPERATED BY: Bob and Joyce Pellegrini •  WINEMAKER: Russell Hearn •  WINES: •  
East End Select- Chardonnay, Rosé, Merlot •  Pellegrini Vineyards- Chardonnay, 
Merlot...

If I add an abstract to the item (Let’s say, “This is a draft of the Vineyard 
Report from 2008”, then the search displays changes to:

Pellegrini Vineyards
Hearn, Russell (2008)
This is a draft of the Vineyard Report from 2008

Does anyone have any ideas why these field names would appear in my search 
results?  I rebuilt my solr indexes  (dspace index-discovery –b) and that 
didn’t seem to help.

George Kozak
Digital Library Specialist
Cornell University Library Information Technologies (CUL-IT)
218 Olin Library
Cornell University
Ithaca, NY 14853
607-255-8924



--
___
DSpace-tech mailing list
DSpace-tech@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/dspace-tech
List Etiquette: https://wiki.duraspace.org/display/DSPACE/Mailing+List+Etiquette

Re: [Dspace-tech] Problem with Search Result display in Mirage2 theme (resend)

2015-06-17 Thread George Stanley Kozak
Thanks, helix84!

George Kozak
Digital Library Specialist
Cornell University Library Information Technologies (CUL-IT)
218 Olin Library
Cornell University
Ithaca, NY 14853
607-255-8924



From: ivan.ma...@gmail.com [mailto:ivan.ma...@gmail.com] On Behalf Of helix84
Sent: Wednesday, June 17, 2015 9:57 AM
To: George Stanley Kozak
Cc: dspace-tech@lists.sourceforge.net
Subject: Re: [Dspace-tech] Problem with Search Result display in Mirage2 theme 
(resend)

The text being displayed "stream_source_info ..." is the literal value coming 
from the "fulltext" or "fulltext_hl" field value in your Solr "search" core. 
I'm not sure why the literal value is being shown instead of just the parts of 
fulltext containing the search terms, but this should give you an idea where to 
start looking.


Regards,
~~helix84

Compulsory reading: DSpace Mailing List Etiquette
https://wiki.duraspace.org/display/DSPACE/Mailing+List+Etiquette
--
___
DSpace-tech mailing list
DSpace-tech@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/dspace-tech
List Etiquette: https://wiki.duraspace.org/display/DSPACE/Mailing+List+Etiquette

Re: [Dspace-tech] Problem with Search Result display in Mirage2 theme (resend)

2015-06-17 Thread Hilton Gibson
Hi George

It's always simple once you understand ;-)
Thats how I try to fashion my documentation, simplify it.
It takes genius to simplify, it only take a smattering of intelligence to
complicate things.
I think somebody said that before ;-) But how very true.

Cheers

hg

*Hilton Gibson*
Ubuntu Linux Systems Administrator
Stellenbosch University Library
http://staff.lib.sun.ac.za/~hgibson/docs/cv/cv.html


On 17 June 2015 at 15:49, George Stanley Kozak  wrote:

>  Hilton:
>
>
>
> Ahh, I see…so this is tied to the snippet view.  Now I understand.  My
> original example had no abstract, so the snippet view showed me data from
> the bitstream that had the keyword I was looking for and when I placed an
> abstract with the keyword, that was shown instead.
>
> Thanks for pointing this out to me.  Now I can explain it to others here
> at Cornell.
>
>
>
> George Kozak
>
> Digital Library Specialist
>
> Cornell University Library Information Technologies (CUL-IT)
>
> 218 Olin Library
>
> Cornell University
>
> Ithaca, NY 14853
>
> 607-255-8924
>
>
>
>
>
>
>
> *From:* Hilton Gibson [mailto:hilton.gib...@gmail.com]
> *Sent:* Wednesday, June 17, 2015 9:23 AM
> *To:* George Stanley Kozak
> *Cc:* dspace-tech@lists.sourceforge.net
> *Subject:* Re: [Dspace-tech] Problem with Search Result display in
> Mirage2 theme (resend)
>
>
>
> Hi George
>
>
>
> I understand the search as delivering snippets with the search term
> highlighted.
>
> See screenshot below.
>
>
> ​
>
> Perhaps this is a new behaviour with advanced search?
>
>
>
> Cheers
>
>
>
> hg
>
>
>  *Hilton Gibson*
>
> Ubuntu Linux Systems Administrator
>
> Stellenbosch University Library
>
> http://staff.lib.sun.ac.za/~hgibson/docs/cv/cv.html
>
>
>
>
>
> On 17 June 2015 at 15:15, George Stanley Kozak  wrote:
>
>  Hi…
>
>
>
> I sent this to the list back when Open Repositories was goingon, and I
> suspect that it was lost in the shuffle, because I haven’t heard anything.
> So, I am resending.
>
>
>
> I am not sure if this has already been reported, but I found something
> strange with my DSpace 5.2 install which I have in development and am
> planning to move to production.
>
> I am using the XMLUI with Mirage2 theme.
>
>
>
> I found that in the search results display, if there is no abstract, the
> system displays random information.
>
>
>
> For instance:
>
>
>
> Pellegrini Vineyards
>
> Hearn, Russell (2008)
>
> stream_source_info Hearn_PellegriniVineyards_WIWP_2008.pdf.txt
> stream_content_type text/plain stream_size 2036 Content-Encoding UTF-8
> stream_name Hearn_PellegriniVineyards_WIWP_2008.pdf.txt Content-Type
> text/plain; charset=UTF-8...
>
> Pellegrini Vineyards 4/30/08 Pellegrini Vineyards • FIRST VINTAGE: 1991 •
> OPERATED BY: Bob and Joyce Pellegrini • WINEMAKER: Russell Hearn • WINES: •
> East End Select- Chardonnay, Rosé, Merlot • Pellegrini Vineyards-
> Chardonnay, Merlot...
>
>
>
> If I add an abstract to the item, then the search displays changes to:
>
>
>
> Pellegrini Vineyards
>
> Hearn, Russell (2008)
>
> Added abstract here…
>
>
>
> Has anyone else seen this?  Does anyone know what is going on or where I
> can go to fix this?
>
>
>
> George Kozak
>
> Digital Library Specialist
>
> Cornell University Library Information Technologies (CUL-IT)
>
> 218 Olin Library
>
> Cornell University
>
> Ithaca, NY 14853
>
> 607-255-8924
>
>
>
>
>
>
>
>
>
> --
>
> ___
> DSpace-tech mailing list
> DSpace-tech@lists.sourceforge.net
> https://lists.sourceforge.net/lists/listinfo/dspace-tech
> List Etiquette:
> https://wiki.duraspace.org/display/DSPACE/Mailing+List+Etiquette
>
>
>
--
___
DSpace-tech mailing list
DSpace-tech@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/dspace-tech
List Etiquette: https://wiki.duraspace.org/display/DSPACE/Mailing+List+Etiquette

Re: [Dspace-tech] Problem with Search Result display in Mirage2 theme (resend)

2015-06-17 Thread helix84
The text being displayed "stream_source_info ..." is the literal value
coming from the "fulltext" or "fulltext_hl" field value in your Solr
"search" core. I'm not sure why the literal value is being shown instead of
just the parts of fulltext containing the search terms, but this should
give you an idea where to start looking.


Regards,
~~helix84

Compulsory reading: DSpace Mailing List Etiquette
https://wiki.duraspace.org/display/DSPACE/Mailing+List+Etiquette
--
___
DSpace-tech mailing list
DSpace-tech@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/dspace-tech
List Etiquette: https://wiki.duraspace.org/display/DSPACE/Mailing+List+Etiquette

Re: [Dspace-tech] Problem with Search Result display in Mirage2 theme (resend)

2015-06-17 Thread George Stanley Kozak
Hilton:

Ahh, I see…so this is tied to the snippet view.  Now I understand.  My original 
example had no abstract, so the snippet view showed me data from the bitstream 
that had the keyword I was looking for and when I placed an abstract with the 
keyword, that was shown instead.
Thanks for pointing this out to me.  Now I can explain it to others here at 
Cornell.

George Kozak
Digital Library Specialist
Cornell University Library Information Technologies (CUL-IT)
218 Olin Library
Cornell University
Ithaca, NY 14853
607-255-8924



From: Hilton Gibson [mailto:hilton.gib...@gmail.com]
Sent: Wednesday, June 17, 2015 9:23 AM
To: George Stanley Kozak
Cc: dspace-tech@lists.sourceforge.net
Subject: Re: [Dspace-tech] Problem with Search Result display in Mirage2 theme 
(resend)

Hi George

I understand the search as delivering snippets with the search term highlighted.
See screenshot below.
[cid:image002.jpg@01D0A8E2.80F50180]
​
Perhaps this is a new behaviour with advanced search?

Cheers

hg

Hilton Gibson
Ubuntu Linux Systems Administrator
Stellenbosch University Library
http://staff.lib.sun.ac.za/~hgibson/docs/cv/cv.html


On 17 June 2015 at 15:15, George Stanley Kozak 
mailto:g...@cornell.edu>> wrote:
Hi…

I sent this to the list back when Open Repositories was goingon, and I suspect 
that it was lost in the shuffle, because I haven’t heard anything.  So, I am 
resending.

I am not sure if this has already been reported, but I found something strange 
with my DSpace 5.2 install which I have in development and am planning to move 
to production.
I am using the XMLUI with Mirage2 theme.

I found that in the search results display, if there is no abstract, the system 
displays random information.

For instance:

Pellegrini Vineyards
Hearn, Russell (2008)
stream_source_info Hearn_PellegriniVineyards_WIWP_2008.pdf.txt 
stream_content_type text/plain stream_size 2036 Content-Encoding UTF-8 
stream_name Hearn_PellegriniVineyards_WIWP_2008.pdf.txt Content-Type 
text/plain; charset=UTF-8...
Pellegrini Vineyards 4/30/08 Pellegrini Vineyards •  FIRST VINTAGE: 1991 •  
OPERATED BY: Bob and Joyce Pellegrini •  WINEMAKER: Russell Hearn •  WINES: •  
East End Select- Chardonnay, Rosé, Merlot •  Pellegrini Vineyards- Chardonnay, 
Merlot...

If I add an abstract to the item, then the search displays changes to:

Pellegrini Vineyards
Hearn, Russell (2008)
Added abstract here…

Has anyone else seen this?  Does anyone know what is going on or where I can go 
to fix this?

George Kozak
Digital Library Specialist
Cornell University Library Information Technologies (CUL-IT)
218 Olin Library
Cornell University
Ithaca, NY 14853
607-255-8924




--

___
DSpace-tech mailing list
DSpace-tech@lists.sourceforge.net<mailto:DSpace-tech@lists.sourceforge.net>
https://lists.sourceforge.net/lists/listinfo/dspace-tech
List Etiquette: https://wiki.duraspace.org/display/DSPACE/Mailing+List+Etiquette

--
___
DSpace-tech mailing list
DSpace-tech@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/dspace-tech
List Etiquette: https://wiki.duraspace.org/display/DSPACE/Mailing+List+Etiquette

Re: [Dspace-tech] Problem with Search Result display in Mirage2 theme (resend)

2015-06-17 Thread George Stanley Kozak
Hi…

I sent this to the list back when Open Repositories was goingon, and I suspect 
that it was lost in the shuffle, because I haven’t heard anything.  So, I am 
resending.

I am not sure if this has already been reported, but I found something strange 
with my DSpace 5.2 install which I have in development and am planning to move 
to production.
I am using the XMLUI with Mirage2 theme.

I found that in the search results display, if there is no abstract, the system 
displays random information.

For instance:

Pellegrini Vineyards
Hearn, Russell (2008)
stream_source_info Hearn_PellegriniVineyards_WIWP_2008.pdf.txt 
stream_content_type text/plain stream_size 2036 Content-Encoding UTF-8 
stream_name Hearn_PellegriniVineyards_WIWP_2008.pdf.txt Content-Type 
text/plain; charset=UTF-8...
Pellegrini Vineyards 4/30/08 Pellegrini Vineyards •  FIRST VINTAGE: 1991 •  
OPERATED BY: Bob and Joyce Pellegrini •  WINEMAKER: Russell Hearn •  WINES: •  
East End Select- Chardonnay, Rosé, Merlot •  Pellegrini Vineyards- Chardonnay, 
Merlot...

If I add an abstract to the item, then the search displays changes to:

Pellegrini Vineyards
Hearn, Russell (2008)
Added abstract here…

Has anyone else seen this?  Does anyone know what is going on or where I can go 
to fix this?

George Kozak
Digital Library Specialist
Cornell University Library Information Technologies (CUL-IT)
218 Olin Library
Cornell University
Ithaca, NY 14853
607-255-8924



--
___
DSpace-tech mailing list
DSpace-tech@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/dspace-tech
List Etiquette: https://wiki.duraspace.org/display/DSPACE/Mailing+List+Etiquette

[Dspace-tech] Problem with Search Result display in Mirage2 theme

2015-06-11 Thread George Stanley Kozak
Hi…

I am not sure if this has already been reported, but I found something strange 
with my DSpace 5.2 install which I have in development and am planning to move 
to production.
I am using the XMLUI with Mirage2 theme.

I found that in the search results display, if there is no abstract, the system 
displays random information.

For instance:

Pellegrini Vineyards
Hearn, Russell (2008)
stream_source_info Hearn_PellegriniVineyards_WIWP_2008.pdf.txt 
stream_content_type text/plain stream_size 2036 Content-Encoding UTF-8 
stream_name Hearn_PellegriniVineyards_WIWP_2008.pdf.txt Content-Type 
text/plain; charset=UTF-8...
Pellegrini Vineyards 4/30/08 Pellegrini Vineyards •  FIRST VINTAGE: 1991 •  
OPERATED BY: Bob and Joyce Pellegrini •  WINEMAKER: Russell Hearn •  WINES: •  
East End Select- Chardonnay, Rosé, Merlot •  Pellegrini Vineyards- Chardonnay, 
Merlot...

If I add an abstract to the item, then the search displays changes to:

Pellegrini Vineyards
Hearn, Russell (2008)
Added abstract here…

Has anyone else seen this?  Does anyone know what is going on or where I can go 
to fix this?

George Kozak
Digital Library Specialist
Cornell University Library Information Technologies (CUL-IT)
218 Olin Library
Cornell University
Ithaca, NY 14853
607-255-8924



--
___
DSpace-tech mailing list
DSpace-tech@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/dspace-tech
List Etiquette: https://wiki.duraspace.org/display/DSPACE/Mailing+List+Etiquette