Re: [Dspace-tech] Problem with Search Result display in Mirage2 theme (additional information)
Hi,

This is an update to a problem with search results that I reported for my DSpace 5.2 Mirage2 XMLUI installation.

To make a long story short, I found that if I do a search on a term and that term appears in the bitstream name, then the extracted text’s bitstream name will appear in the snippet results of the search (as part of the fulltext). I don’t know if this is a misconfiguration on my part or if this is how the fulltext searching is expected to display in the snippets. Below is an example that will detail what I am talking about. If anyone has any thoughts that they would like to share on this, I will be grateful.

Example: I did a search on the term “Westbrooks” and in the search results, I saw:

Language Data and Rights
Westbrooks, Elaine (2006-03-31)
stream_source_info MWG_2006-03-31_Westbrooks.ppt.txt stream_content_type text/plain stream_size 9868 Content-Encoding UTF-8 stream_name MWG_2006-03-31_Westbrooks.ppt.txt Content-Type text/plain; charset=UTF-8 Language...
Data and Rights Elaine L. Westbrooks March 31, 2006 Overview Ethics Undocumented Rights : Problems Rights Expression (Solutions) Ethics (1) Language Documentation facets of ethical research Code of Conduct Informed consent Ethics (2) Language Archiving...

(bitstream name: stream_source_info MWG_2006-03-31_Westbrooks.ppt.txt – this is the name of the extracted text which contains the fulltext and which fulfills the search criteria)

Another search, let’s say for the phrase “Overview Ethics Undocumented Rights”, which appears within the extracted text, will result in:

Language Data and Rights
Westbrooks, Elaine (2006-03-31)
Data and Rights Elaine L. Westbrooks March 31, 2006 Overview Ethics Undocumented Rights : Problems Rights Expression (Solutions) Ethics (1) Language Documentation facets of ethical research Code of Conduct Informed consent Ethics (2) Language Archiving...
Intellectual property rights International & Protected groups Preserve data Access management Human Subjects Institutional Review Board Confidentiality, privacy Problems Institutions/archives are responsible Linguists knowledgeable of access rights Documented...

(the name of the extracted text file does not appear because it does not meet the search criteria)

George Kozak
Digital Library Specialist
Cornell University Library Information Technologies (CUL-IT)
218 Olin Library
Cornell University
Ithaca, NY 14853
607-255-8924

--
DSpace-tech mailing list
DSpace-tech@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/dspace-tech
List Etiquette: https://wiki.duraspace.org/display/DSPACE/Mailing+List+Etiquette
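The behaviour described in this message can be illustrated with a toy model. The sketch below is not how Solr or DSpace build snippets; it only assumes, as the examples in this thread suggest, that the indexed full text begins with a metadata header containing the extracted-text file name, followed by the document body. The names `HEADER`, `BODY`, and `first_match_region` are hypothetical, and the sample strings are abbreviated from the "Westbrooks" example.

```python
from typing import Optional

# Abbreviated from the "Westbrooks" example in this thread: the indexed full
# text appears to start with a metadata header that includes the file name.
HEADER = (
    "stream_source_info MWG_2006-03-31_Westbrooks.ppt.txt "
    "stream_content_type text/plain stream_size 9868 "
    "Content-Encoding UTF-8 "
    "stream_name MWG_2006-03-31_Westbrooks.ppt.txt "
    "Content-Type text/plain; charset=UTF-8 "
)
BODY = (
    "Data and Rights Elaine L. Westbrooks March 31, 2006 "
    "Overview Ethics Undocumented Rights : Problems"
)


def first_match_region(term: str, header: str = HEADER, body: str = BODY) -> Optional[str]:
    """Report whether the first case-insensitive match of `term` starts in the
    metadata header or in the document body; return None if nothing matches."""
    fulltext = header + body
    pos = fulltext.lower().find(term.lower())
    if pos == -1:
        return None
    return "header" if pos < len(header) else "body"


if __name__ == "__main__":
    for query in ("Westbrooks", "Overview Ethics Undocumented Rights"):
        print(query, "->", first_match_region(query))
```

Under these assumptions, a term that occurs in the file name is first found inside the header, which is consistent with the header text showing up in the snippet, while a phrase that occurs only in the body is not.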
Re: [Dspace-tech] Problem with Search Result display in Mirage2 theme (update)
Thanks, Brian: I’ll take a look.

George Kozak

From: Brian Freels-Stendel [mailto:bfre...@unm.edu]
Sent: Thursday, June 18, 2015 6:25 PM
To: George Stanley Kozak; 'dspace-tech@lists.sourceforge.net'
Subject: RE: [Dspace-tech] Problem with Search Result display in Mirage2 theme (update)

Good afternoon,

I'm not positive, but I think there's a chance it may be coming from dspace-xmlui-mirage2\src\main\webapp\xsl\aspect\discovery\discovery.xsl, lines 272-280.

B--
Re: [Dspace-tech] Problem with Search Result display in Mirage2 theme (update)
Good afternoon,

I'm not positive, but I think there's a chance it may be coming from dspace-xmlui-mirage2\src\main\webapp\xsl\aspect\discovery\discovery.xsl, lines 272-280.

B--

From: George Stanley Kozak [mailto:g...@cornell.edu]
Sent: Thursday, June 18, 2015 3:07 PM
To: 'dspace-tech@lists.sourceforge.net'
Subject: [Dspace-tech] Problem with Search Result display in Mirage2 theme (update)

Hi,

I previously reported a problem with search results that I was seeing with my DSpace 5.2 Mirage2 XMLUI installation. helix84 and Hilton Gibson pointed me to the search snippets and the solr index. I’ve done some poking around, and I can’t seem to find what’s causing this. So, I am going to see if anyone can help.

Here is the problem: In the search results display (if the item has no abstract), the search snippets display field names in the search results, such as “stream_source_info” and “stream_content_type” and “stream_size”.

Here’s an example (I did a search on the term “vineyard”):

Pellegrini Vineyards
Hearn, Russell (2008)
stream_source_info Hearn_PellegriniVineyards_WIWP_2008.pdf.txt stream_content_type text/plain stream_size 2036 Content-Encoding UTF-8 stream_name Hearn_PellegriniVineyards_WIWP_2008.pdf.txt Content-Type text/plain; charset=UTF-8...
Pellegrini Vineyards 4/30/08 Pellegrini Vineyards • FIRST VINTAGE: 1991 • OPERATED BY: Bob and Joyce Pellegrini • WINEMAKER: Russell Hearn • WINES: • East End Select- Chardonnay, Rosé, Merlot • Pellegrini Vineyards- Chardonnay, Merlot...

If I add an abstract to the item (let’s say, “This is a draft of the Vineyard Report from 2008”), then the search display changes to:

Pellegrini Vineyards
Hearn, Russell (2008)
This is a draft of the Vineyard Report from 2008

Does anyone have any ideas why these field names would appear in my search results? I rebuilt my solr indexes (dspace index-discovery -b) and that didn’t seem to help.

George Kozak
[Dspace-tech] Problem with Search Result display in Mirage2 theme (update)
Hi,

I previously reported a problem with search results that I was seeing with my DSpace 5.2 Mirage2 XMLUI installation. helix84 and Hilton Gibson pointed me to the search snippets and the solr index. I’ve done some poking around, and I can’t seem to find what’s causing this. So, I am going to see if anyone can help.

Here is the problem: In the search results display (if the item has no abstract), the search snippets display field names in the search results, such as “stream_source_info” and “stream_content_type” and “stream_size”.

Here’s an example (I did a search on the term “vineyard”):

Pellegrini Vineyards
Hearn, Russell (2008)
stream_source_info Hearn_PellegriniVineyards_WIWP_2008.pdf.txt stream_content_type text/plain stream_size 2036 Content-Encoding UTF-8 stream_name Hearn_PellegriniVineyards_WIWP_2008.pdf.txt Content-Type text/plain; charset=UTF-8...
Pellegrini Vineyards 4/30/08 Pellegrini Vineyards • FIRST VINTAGE: 1991 • OPERATED BY: Bob and Joyce Pellegrini • WINEMAKER: Russell Hearn • WINES: • East End Select- Chardonnay, Rosé, Merlot • Pellegrini Vineyards- Chardonnay, Merlot...

If I add an abstract to the item (let’s say, “This is a draft of the Vineyard Report from 2008”), then the search display changes to:

Pellegrini Vineyards
Hearn, Russell (2008)
This is a draft of the Vineyard Report from 2008

Does anyone have any ideas why these field names would appear in my search results? I rebuilt my solr indexes (dspace index-discovery -b) and that didn’t seem to help.

George Kozak
Re: [Dspace-tech] Problem with Search Result display in Mirage2 theme (resend)
Thanks, helix84!

George Kozak

From: ivan.ma...@gmail.com [mailto:ivan.ma...@gmail.com] On Behalf Of helix84
Sent: Wednesday, June 17, 2015 9:57 AM
To: George Stanley Kozak
Cc: dspace-tech@lists.sourceforge.net
Subject: Re: [Dspace-tech] Problem with Search Result display in Mirage2 theme (resend)

The text being displayed "stream_source_info ..." is the literal value coming from the "fulltext" or "fulltext_hl" field value in your Solr "search" core. I'm not sure why the literal value is being shown instead of just the parts of fulltext containing the search terms, but this should give you an idea where to start looking.

Regards,
~~helix84
Re: [Dspace-tech] Problem with Search Result display in Mirage2 theme (resend)
Hi George

It's always simple once you understand ;-)
That's how I try to fashion my documentation: simplify it.
It takes genius to simplify; it only takes a smattering of intelligence to complicate things.
I think somebody said that before ;-) But how very true.

Cheers

hg

Hilton Gibson
Ubuntu Linux Systems Administrator
Stellenbosch University Library
http://staff.lib.sun.ac.za/~hgibson/docs/cv/cv.html

On 17 June 2015 at 15:49, George Stanley Kozak wrote:

> Hilton:
>
> Ahh, I see…so this is tied to the snippet view. Now I understand. My
> original example had no abstract, so the snippet view showed me data from
> the bitstream that had the keyword I was looking for and when I placed an
> abstract with the keyword, that was shown instead.
>
> Thanks for pointing this out to me. Now I can explain it to others here
> at Cornell.
>
> George Kozak
Re: [Dspace-tech] Problem with Search Result display in Mirage2 theme (resend)
The text being displayed "stream_source_info ..." is the literal value coming from the "fulltext" or "fulltext_hl" field value in your Solr "search" core. I'm not sure why the literal value is being shown instead of just the parts of fulltext containing the search terms, but this should give you an idea where to start looking.

Regards,
~~helix84

Compulsory reading: DSpace Mailing List Etiquette
https://wiki.duraspace.org/display/DSPACE/Mailing+List+Etiquette
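One way to follow up on this pointer is to look at what the Solr "search" core returns for an affected item. The sketch below only builds a request URL for Solr's standard select handler using the common `q`, `fl`, `rows`, and `wt` parameters. The base URL, the `handle` field name, and the helper name `build_fulltext_query` are assumptions for illustration; whether "fulltext" or "fulltext_hl" is indexed or stored depends on the schema of the installation, so the field names may need to be swapped.

```python
from urllib.parse import urlencode


def build_fulltext_query(term: str,
                         base_url: str = "http://localhost:8080/solr/search",
                         search_field: str = "fulltext",
                         show_field: str = "fulltext_hl",
                         rows: int = 5) -> str:
    """Build a Solr select URL that searches `search_field` for `term` and
    asks for `show_field` to be returned with each hit.

    base_url and the `handle` field are assumptions, not values confirmed
    anywhere in this thread.
    """
    params = {
        "q": f"{search_field}:{term}",   # which field to search
        "fl": f"handle,{show_field}",    # which fields to return
        "rows": rows,                    # limit the number of hits
        "wt": "json",                    # response format
    }
    return f"{base_url}/select?{urlencode(params)}"


if __name__ == "__main__":
    # Paste the printed URL into a browser or fetch it with curl.
    print(build_fulltext_query("vineyard"))
```

If the returned field value begins with "stream_source_info ...", that would be consistent with the header being part of the indexed text itself rather than something the theme adds at display time.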
Re: [Dspace-tech] Problem with Search Result display in Mirage2 theme (resend)
Hilton:

Ahh, I see…so this is tied to the snippet view. Now I understand. My original example had no abstract, so the snippet view showed me data from the bitstream that had the keyword I was looking for, and when I placed an abstract with the keyword, that was shown instead.

Thanks for pointing this out to me. Now I can explain it to others here at Cornell.

George Kozak

From: Hilton Gibson [mailto:hilton.gib...@gmail.com]
Sent: Wednesday, June 17, 2015 9:23 AM
To: George Stanley Kozak
Cc: dspace-tech@lists.sourceforge.net
Subject: Re: [Dspace-tech] Problem with Search Result display in Mirage2 theme (resend)

Hi George

I understand the search as delivering snippets with the search term highlighted.
See screenshot below.

[screenshot: image002.jpg]

Perhaps this is a new behaviour with advanced search?

Cheers

hg

Hilton Gibson
Ubuntu Linux Systems Administrator
Stellenbosch University Library
http://staff.lib.sun.ac.za/~hgibson/docs/cv/cv.html

On 17 June 2015 at 15:15, George Stanley Kozak <g...@cornell.edu> wrote:

Hi…

I sent this to the list back when Open Repositories was going on, and I suspect that it was lost in the shuffle, because I haven’t heard anything. So, I am resending.

I am not sure if this has already been reported, but I found something strange with my DSpace 5.2 install which I have in development and am planning to move to production. I am using the XMLUI with Mirage2 theme.

I found that in the search results display, if there is no abstract, the system displays random information.

For instance:

Pellegrini Vineyards
Hearn, Russell (2008)
stream_source_info Hearn_PellegriniVineyards_WIWP_2008.pdf.txt stream_content_type text/plain stream_size 2036 Content-Encoding UTF-8 stream_name Hearn_PellegriniVineyards_WIWP_2008.pdf.txt Content-Type text/plain; charset=UTF-8...
Pellegrini Vineyards 4/30/08 Pellegrini Vineyards • FIRST VINTAGE: 1991 • OPERATED BY: Bob and Joyce Pellegrini • WINEMAKER: Russell Hearn • WINES: • East End Select- Chardonnay, Rosé, Merlot • Pellegrini Vineyards- Chardonnay, Merlot...

If I add an abstract to the item, then the search display changes to:

Pellegrini Vineyards
Hearn, Russell (2008)
Added abstract here…

Has anyone else seen this? Does anyone know what is going on or where I can go to fix this?

George Kozak
Re: [Dspace-tech] Problem with Search Result display in Mirage2 theme (resend)
Hi…

I sent this to the list back when Open Repositories was going on, and I suspect that it was lost in the shuffle, because I haven’t heard anything. So, I am resending.

I am not sure if this has already been reported, but I found something strange with my DSpace 5.2 install which I have in development and am planning to move to production. I am using the XMLUI with Mirage2 theme.

I found that in the search results display, if there is no abstract, the system displays random information.

For instance:

Pellegrini Vineyards
Hearn, Russell (2008)
stream_source_info Hearn_PellegriniVineyards_WIWP_2008.pdf.txt stream_content_type text/plain stream_size 2036 Content-Encoding UTF-8 stream_name Hearn_PellegriniVineyards_WIWP_2008.pdf.txt Content-Type text/plain; charset=UTF-8...
Pellegrini Vineyards 4/30/08 Pellegrini Vineyards • FIRST VINTAGE: 1991 • OPERATED BY: Bob and Joyce Pellegrini • WINEMAKER: Russell Hearn • WINES: • East End Select- Chardonnay, Rosé, Merlot • Pellegrini Vineyards- Chardonnay, Merlot...

If I add an abstract to the item, then the search display changes to:

Pellegrini Vineyards
Hearn, Russell (2008)
Added abstract here…

Has anyone else seen this? Does anyone know what is going on or where I can go to fix this?

George Kozak
[Dspace-tech] Problem with Search Result display in Mirage2 theme
Hi…

I am not sure if this has already been reported, but I found something strange with my DSpace 5.2 install which I have in development and am planning to move to production. I am using the XMLUI with Mirage2 theme.

I found that in the search results display, if there is no abstract, the system displays random information.

For instance:

Pellegrini Vineyards
Hearn, Russell (2008)
stream_source_info Hearn_PellegriniVineyards_WIWP_2008.pdf.txt stream_content_type text/plain stream_size 2036 Content-Encoding UTF-8 stream_name Hearn_PellegriniVineyards_WIWP_2008.pdf.txt Content-Type text/plain; charset=UTF-8...
Pellegrini Vineyards 4/30/08 Pellegrini Vineyards • FIRST VINTAGE: 1991 • OPERATED BY: Bob and Joyce Pellegrini • WINEMAKER: Russell Hearn • WINES: • East End Select- Chardonnay, Rosé, Merlot • Pellegrini Vineyards- Chardonnay, Merlot...

If I add an abstract to the item, then the search display changes to:

Pellegrini Vineyards
Hearn, Russell (2008)
Added abstract here…

Has anyone else seen this? Does anyone know what is going on or where I can go to fix this?

George Kozak