Re: [Dspace-tech] Diacritics and DSpace 4.0 and 3.1
I think it may have to do with my use of Apache and ajp_proxy: when I go directly to the Tomcat port, the text doesn't get garbled and I am able to browse by author successfully. Any ideas on what I might need to change for Apache?

Thanks!
Tom

___
DSpace-tech mailing list
DSpace-tech@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/dspace-tech
List Etiquette: https://wiki.duraspace.org/display/DSPACE/Mailing+List+Etiquette
Re: [Dspace-tech] Diacritics and DSpace 4.0 and 3.1
Does your Tomcat's *AJP* Connector include the URIEncoding=UTF-8 attribute?
Re: [Dspace-tech] Diacritics and DSpace 4.0 and 3.1
> Does your Tomcat's *AJP* Connector include the URIEncoding=UTF-8 attribute?

No, it didn't. Now it works! Should the AJP connector also have all the attributes that are set on the Tomcat port 8080 connector?

Thanks!
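For anyone checking their own setup, here is a minimal sketch of the check that resolved this thread: list every Connector in server.xml that lacks URIEncoding="UTF-8". The XML fragment is hypothetical; the HTTP connector is abbreviated from the one quoted in this thread, and the AJP connector's port (8009) and protocol (AJP/1.3) are assumed values, not taken from Tom's server. The helper name `connectors_missing_utf8` is made up for illustration.

```python
import xml.etree.ElementTree as ET
from typing import List

# Hypothetical server.xml fragment. The AJP port and protocol are assumptions.
SERVER_XML = """
<Server>
  <Service name="Catalina">
    <Connector port="8080" protocol="HTTP/1.1" URIEncoding="UTF-8"/>
    <Connector port="8009" protocol="AJP/1.3" redirectPort="8443"/>
  </Service>
</Server>
"""


def connectors_missing_utf8(server_xml: str) -> List[str]:
    """Return 'protocol:port' for each Connector without URIEncoding=UTF-8."""
    root = ET.fromstring(server_xml)
    missing = []
    for conn in root.iter("Connector"):
        # A missing attribute is treated the same as a non-UTF-8 value.
        if conn.get("URIEncoding", "").upper() != "UTF-8":
            protocol = conn.get("protocol", "HTTP/1.1")
            missing.append(f"{protocol}:{conn.get('port')}")
    return missing


for name in connectors_missing_utf8(SERVER_XML):
    print(f'Connector {name} has no URIEncoding="UTF-8"')
```

With the fragment above, only the AJP connector is reported, which matches the situation described here: requests that went straight to port 8080 worked, while requests proxied by Apache over AJP did not.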
Re: [Dspace-tech] Diacritics and DSpace 4.0 and 3.1
You'll probably want to. The attributes vary a little across Tomcat versions; look in the Tomcat docs for your version to see what they mean.

On Jan 24, 2014 9:56 PM, Thomas Misilo misi...@fit.edu wrote:
> No, it didn't. Now it works! Should I have all the attributes that are on the Tomcat port 8080 settings on the AJP connector as well?
Re: [Dspace-tech] Diacritics and DSpace 4.0 and 3.1
> What do you mean exactly by "browse by author still doesn't work"?

When I browse by author, click the letter of the last name, and then find the author I am looking for, the list says there is, for example, one article written by that author. See [1].

> Do you have wrong entries in the browse list? Do you get wrong results when you click on a specific author entry?

I get "Search produced no results." when I click an author's name.

> Are you using Solr as the browse DAO provider?

Not sure about this.

[1]: https://dspace-test.lib.fit.edu/search-filter?field=author&starts_with=baksay

From: Andrea Bollini [mailto:a.boll...@cineca.it]
Sent: Monday, January 20, 2014 12:56 AM
To: Thomas Misilo; dspace-tech@lists.sourceforge.net
Subject: R: [Dspace-tech] Diacritics and DSpace 4.0 and 3.1

Hi Tom,

What do you mean exactly by "browse by author still doesn't work"? Do you have wrong entries in the browse list? Do you get wrong results when you click on a specific author entry? Are you using Solr as the browse DAO provider?

Andrea
Re: [Dspace-tech] Diacritics and DSpace 4.0 and 3.1
On Tue, Jan 21, 2014 at 1:54 PM, Thomas Misilo misi...@fit.edu wrote:
>> Are you using solr as browse dao provider?
> Not sure on this?

What are your values of browseDAO.class and browseCreateDAO.class in dspace.cfg?

Regards,
~~helix84

Compulsory reading: DSpace Mailing List Etiquette
https://wiki.duraspace.org/display/DSPACE/Mailing+List+Etiquette
Re: [Dspace-tech] Diacritics and DSpace 4.0 and 3.1
-----Original Message-----
From: ivan.ma...@gmail.com [mailto:ivan.ma...@gmail.com] On Behalf Of helix84
Sent: Tuesday, January 21, 2014 8:03 AM
To: Thomas Misilo
Cc: Andrea Bollini; dspace-tech@lists.sourceforge.net
Subject: Re: [Dspace-tech] Diacritics and DSpace 4.0 and 3.1

> What are your values of browseDAO.class and browseCreateDAO.class in dspace.cfg?

Both are commented out in my dspace.cfg:

# Define the DAO class to use this must meet your storage choice for
# the browse system (RDBMS: PostgreSQL or Oracle, Solr).
# By default, since DSpace 4.0, the Solr implementation is used
#
# PostgreSQL:
# browseDAO.class = org.dspace.browse.BrowseDAOPostgres
# browseCreateDAO.class = org.dspace.browse.BrowseCreateDAOPostgres
#
# Oracle:
# browseDAO.class = org.dspace.browse.BrowseDAOOracle
# browseCreateDAO.class = org.dspace.browse.BrowseCreateDAOOracle
#
# Solr:
# browseDAO.class = org.dspace.browse.SolrBrowseDAO
# browseCreateDAO.class = org.dspace.browse.SolrBrowseCreateDAO
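To make the "commented out" answer concrete, here is a small sketch of how one could work out which browse DAO classes are in effect from a dspace.cfg excerpt. It relies on the statement in the quoted comment block that DSpace 4.0 uses the Solr implementation by default; the helper `effective_browse_dao` is hypothetical and is not DSpace's own configuration loader.

```python
from typing import Dict

# Assumption, taken from the quoted dspace.cfg comment: when the keys are
# commented out, DSpace 4.0 uses the Solr implementation.
SOLR_DEFAULTS = {
    "browseDAO.class": "org.dspace.browse.SolrBrowseDAO",
    "browseCreateDAO.class": "org.dspace.browse.SolrBrowseCreateDAO",
}

# Excerpt in the same state as the configuration shown in this message.
CFG_TEXT = """\
# PostgreSQL:
# browseDAO.class = org.dspace.browse.BrowseDAOPostgres
# browseCreateDAO.class = org.dspace.browse.BrowseCreateDAOPostgres
#
# Solr:
# browseDAO.class = org.dspace.browse.SolrBrowseDAO
# browseCreateDAO.class = org.dspace.browse.SolrBrowseCreateDAO
"""


def effective_browse_dao(cfg_text: str) -> Dict[str, str]:
    """Return the browse DAO settings, using the defaults for unset keys."""
    values = dict(SOLR_DEFAULTS)
    for line in cfg_text.splitlines():
        line = line.strip()
        # Skip blank lines, comments, and anything that is not key = value.
        if not line or line.startswith("#") or "=" not in line:
            continue
        key, _, value = line.partition("=")
        key = key.strip()
        if key in values:
            values[key] = value.strip()
    return values


print(effective_browse_dao(CFG_TEXT)["browseDAO.class"])
```

Under that assumption, a file where every line is commented out resolves to the Solr classes, so the answer to "are you using Solr as browse DAO provider?" would be yes.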
Re: [Dspace-tech] Diacritics and DSpace 4.0 and 3.1
Ok, wait... we are not talking about the browse system but about the navigation of the facets in the XMLUI:
https://dspace-test.lib.fit.edu/search-filter?field=author&starts_with=baksay

Anyway, unsurprisingly, the browse doesn't work either:
https://dspace-test.lib.fit.edu/browse?value=Baksay%2C+L%C3%A1szl%C3%B3+A.&type=author

The URL itself looks correct, so the issue must be on the rendering / receiving side. The URIEncoding of Tomcat/Apache also looks good, since searching for László produces the right results. So my guess is that there is a bug in the XML transformer or another view component.

Do you also run JSPUI on the same server?

Andrea

--
Andrea Bollini
Dipartimento Servizi e Soluzioni per l'Amministrazione Universitaria
Divisione Ricerca
Via dei Tizii, 6
00185 Roma, Italy
tel. +39 06 44 486 087 - mob. +39 348 82 77 525
http://www.cineca.it
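As an illustration of what "the receiving side" can do to a correct URL, the sketch below decodes the `value` parameter from the browse URL twice: once as UTF-8 and once as ISO-8859-1, which is the default URI decoding in Tomcat releases before 8 when URIEncoding is not set on a connector. It only demonstrates the decoding mismatch; it does not claim to reproduce what this particular server did.

```python
from urllib.parse import unquote_plus

# The "value" query parameter from the browse URL in this thread.
RAW = "Baksay%2C+L%C3%A1szl%C3%B3+A."

# Decoding the percent-escaped bytes as UTF-8 recovers the author name.
as_utf8 = unquote_plus(RAW, encoding="utf-8")

# Decoding the same bytes as ISO-8859-1 turns each two-byte UTF-8 sequence
# into two separate characters.
as_latin1 = unquote_plus(RAW, encoding="iso-8859-1")

print(as_utf8)    # Baksay, László A.
print(as_latin1)  # Baksay, LÃ¡szlÃ³ A.
```

A lookup for the second string would not match an index entry stored as the first, which is consistent with a browse list that shows the author but returns no items when the name is clicked.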
Re: [Dspace-tech] Diacritics and DSpace 4.0 and 3.1
-----Original Message-----
From: Andrea Bollini [mailto:a.boll...@cineca.it]
Sent: Tuesday, January 21, 2014 9:32 AM
To: Thomas Misilo; 'heli...@centrum.sk'
Cc: dspace-tech@lists.sourceforge.net
Subject: Re: [Dspace-tech] Diacritics and DSpace 4.0 and 3.1

> Do you run also jspui on the same server?

I just turned on JSPUI:
https://dspace-test.lib.fit.edu/jspui/simple-search?query=Baksay%2C+L%C3%A1szl%C3%B3+A
Re: [Dspace-tech] Diacritics and DSpace 4.0 and 3.1
Thanks Andrea,

It seems that clearing out my indexes and rebuilding them completely with dspace index-discovery -b got search working. However, browse by author still doesn't seem to be working. Changing the character type.

Here is my connector configuration:

<Connector port="8080" protocol="HTTP/1.1"
           maxThreads="150" minSpareThreads="25" maxSpareThreads="75"
           enableLookups="false" redirectPort="8443" acceptCount="100"
           connectionTimeout="2" disableUploadTimeout="true"
           URIEncoding="UTF-8"/>

Thanks,
Tom
Re: [Dspace-tech] Diacritics and DSpace 4.0 and 3.1
I just want to confirm that this improvement is enabled out of the box in DSpace 4. The enabling code is here:
https://github.com/DSpace/DSpace/blob/dspace-4.0/dspace/solr/search/conf/schema.xml#L241
It is mainly a matter of Solr schema configuration. After the update you must reindex your content. Since you have already done that, the only other thing that could make the results worse is the URIEncoding on the Tomcat connector.

Please note that it works correctly on the DSpace demo server:
http://demo.dspace.org/jspui/simple-search?query=Sanchez
http://demo.dspace.org/jspui/simple-search?query=S%C3%A1nchez

Andrea
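The effect described above, where a query with or without the accent finds the same records, can be sketched in a few lines. This is an illustrative stand-in written for this thread, not the filter that the DSpace Solr schema actually configures; the function name `fold_diacritics` is hypothetical.

```python
import unicodedata


def fold_diacritics(text: str) -> str:
    """Strip combining marks and case so accented and plain forms compare equal."""
    # NFD splits an accented letter into its base letter plus combining marks.
    decomposed = unicodedata.normalize("NFD", text)
    stripped = "".join(ch for ch in decomposed if not unicodedata.combining(ch))
    return stripped.casefold()


# Both demo-server queries reduce to the same index key.
print(fold_diacritics("Sanchez") == fold_diacritics("Sánchez"))  # True
print(fold_diacritics("László"))  # laszlo
```

Folding of this kind only helps if the query reaches the index with the right characters in the first place, which is why the connector's URIEncoding still matters after a reindex.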
Re: [Dspace-tech] Diacritics and DSpace 4.0 and 3.1
I looked at it, and I believe those configuration changes were included in DSpace 4.0 via the pull request https://github.com/DSpace/DSpace/pull/287, though I haven't had any luck. I made sure Java and Tomcat both have the UTF-8 flags and reindexed everything, and it still isn't working.

Thanks again for the ideas/help.
Tom

-----Original Message-----
From: Brian Freels-Stendel [mailto:bfre...@unm.edu]
Sent: Wednesday, January 15, 2014 1:45 PM
To: Smith, Ina ism...@sun.ac.za; Thomas Misilo
Cc: dspace-tech@lists.sourceforge.net
Subject: RE: [Dspace-tech] Diacritics and DSpace 4.0 and 3.1

Good morning,

If you're using Discovery, this ticket may help: https://jira.duraspace.org/browse/DS-1152. For the previous default search, this email thread may help: http://sourceforge.net/mailarchive/message.php?msg_id=29655187.

I've been wondering if these solutions might not be in the default set-up. Are there other types of character encodings it would be a problem with?

B--

-----Original Message-----
From: Smith, Ina ism...@sun.ac.za [mailto:ism...@sun.ac.za]
Sent: Wednesday, January 15, 2014 11:01 AM
To: Thomas Misilo; 'heli...@centrum.sk'
Cc: dspace-tech@lists.sourceforge.net
Subject: Re: [Dspace-tech] Diacritics and DSpace 4.0 and 3.1

We've experienced the same problem (although UTF8/Unicode was activated within DSpace), and decided to do away with diacritics completely (in titles and surnames). End-users so far have not complained. I suspect they seldom use diacritics when entering search terms. It would be interesting to know how Google addresses this issue.

Kind regards
Ina Smith (Stellenbosch University, South Africa)

From: Thomas Misilo [misi...@fit.edu]
Sent: 15 January 2014 06:11 PM
To: 'heli...@centrum.sk'
Cc: dspace-tech@lists.sourceforge.net
Subject: Re: [Dspace-tech] Diacritics and DSpace 4.0 and 3.1

helix84,

I did not have that attribute, but I added it to both servers and restarted Tomcat. However, I get the same results on both instances.

Thanks!
Re: [Dspace-tech] Diacritics and DSpace 4.0 and 3.1
Good afternoon,

Maybe taking a step backward: do you have an AJP connector set up in Tomcat's server.xml? If so, does it also have URIEncoding=UTF-8? I don't remember if that point has been addressed.

B--
[Dspace-tech] Diacritics and DSpace 4.0 and 3.1
Hi,

I am having a problem: if I click an author's name from the browse-by-author list or from Discovery, it cannot find the items, even though it says there are 5 of them. Here [1] is the broken search result, and [2] is the browse by author. The author in question is Baksay, László A. [5], though it happens with any author whose name contains diacritics.

[1]: https://repository.lib.fit.edu/browse?value=Baksay%2C+L%C3%A1szl%C3%B3+A.&type=author
[2]: https://repository.lib.fit.edu/browse?rpp=20&order=ASC&sort_by=-1&etal=-1&type=author&starts_with=B

The repository is 3.1 right now, but I exported one of the items and imported it into DSpace 4.0, and I have the same issue. Thanks for any ideas and help.

Tom
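For anyone trying to reproduce the symptom outside DSpace, here is a minimal Python sketch of what happens to the percent-encoded author value from the browse URL when it is decoded with the wrong character set. It is not DSpace or Tomcat code, and it is only an assumption that a Latin-1 decode is happening somewhere between the browser and the repository; the sketch just shows what that failure would look like.

```python
from urllib.parse import unquote_plus

# The "value" parameter from the browse URL: UTF-8 bytes, percent-encoded.
encoded = "Baksay%2C+L%C3%A1szl%C3%B3+A."

# Decoding the bytes as UTF-8 recovers the author name as stored.
as_utf8 = unquote_plus(encoded, encoding="utf-8")

# Decoding the same bytes as ISO-8859-1 turns each accented letter into two
# characters, so an exact-match lookup on the author name finds nothing.
as_latin1 = unquote_plus(encoded, encoding="latin-1")

print(as_utf8)
print(as_latin1)
print(as_utf8 == as_latin1)
```

If the name that reaches the application looks like the second printed value, the URL itself is fine and the decoding step is the place to look.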
Re: [Dspace-tech] Diacritics and DSpace 4.0 and 3.1
Quick check: does your Tomcat's Connector include the URIEncoding=UTF-8 attribute? Regards, ~~helix84 Compulsory reading: DSpace Mailing List Etiquette https://wiki.duraspace.org/display/DSPACE/Mailing+List+Etiquette
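The quick check above can also be scripted. The following is a rough Python sketch, not a DSpace or Tomcat tool: it parses a server.xml and lists every Connector element that lacks URIEncoding="UTF-8". The sample XML, its port numbers, and the function name connectors_missing_utf8 are all made up for illustration; in practice you would read your own server.xml instead of the embedded string.

```python
import xml.etree.ElementTree as ET

# Hypothetical, cut-down server.xml used only for illustration.
SAMPLE_SERVER_XML = """
<Server port="8005" shutdown="SHUTDOWN">
  <Service name="Catalina">
    <Connector port="8080" protocol="HTTP/1.1" URIEncoding="UTF-8" />
    <Connector port="8009" protocol="AJP/1.3" />
  </Service>
</Server>
"""


def connectors_missing_utf8(server_xml: str) -> list:
    """Return the ports of all Connector elements without URIEncoding=UTF-8."""
    root = ET.fromstring(server_xml)
    missing = []
    for connector in root.iter("Connector"):
        # Compare case-insensitively so "utf-8" is accepted as well.
        if connector.get("URIEncoding", "").upper() != "UTF-8":
            missing.append(connector.get("port", "?"))
    return missing


print(connectors_missing_utf8(SAMPLE_SERVER_XML))
```

The sample deliberately leaves the attribute off the AJP connector, since each Connector is configured separately and it is easy to update only the HTTP one.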
Re: [Dspace-tech] Diacritics and DSpace 4.0 and 3.1
helix84, I did not have that attribute, but I added it to both servers and restarted Tomcat. However, I still get the same results on both instances. Thanks!
Re: [Dspace-tech] Diacritics and DSpace 4.0 and 3.1
We've experienced the same problem (although UTF8/Unicode was activated within DSpace), and decided to do away with diacritics completely (in titles and surnames). End-users so far have not complained. I suspect they seldom use diacritics when entering search terms. It would be interesting to know how Google addresses this issue. Kind regards Ina Smith (Stellenbosch University, South Africa)
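As a side note on doing away with diacritics: the same effect can be had at comparison time without editing the stored metadata. The Python sketch below is only an illustration of the general folding technique, not something DSpace does out of the box as far as this thread shows, and the function name fold_diacritics is made up. It decomposes each character and drops the combining marks, so a search typed without accents can still match the accented name.

```python
import unicodedata


def fold_diacritics(text: str) -> str:
    """Strip combining marks so accented and unaccented spellings compare equal."""
    # NFD splits a letter such as "á" into "a" plus a combining acute accent.
    decomposed = unicodedata.normalize("NFD", text)
    # Keep only the characters that are not combining marks.
    return "".join(ch for ch in decomposed if not unicodedata.combining(ch))


stored = "Baksay, László A."
typed = "Baksay, Laszlo A."

print(fold_diacritics(stored))
print(fold_diacritics(stored) == fold_diacritics(typed))
```

Folding both the indexed value and the query this way keeps the original spelling for display while letting users search without diacritics. Letters that are not built from a base letter plus a mark (for example "ø") pass through unchanged, so this is not a complete transliteration.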
Re: [Dspace-tech] Diacritics and DSpace 4.0 and 3.1
Good morning, If you're using Discovery, this ticket may help: https://jira.duraspace.org/browse/DS-1152. For the previous default search, this email thread may help: http://sourceforge.net/mailarchive/message.php?msg_id=29655187. I've been wondering if these solutions might not be in the default set-up. Are there other types of character encodings it would be a problem with? B--