Some abstracts in our repository contain what I assume are UTF-8
characters, like '𝜌'. They display correctly in the HTML interface, but
cause an exception when retrieving the record via OAI-PMH:

java.io.IOException:
com.lyncode.xoai.dataprovider.exceptions.WritingXmlException: Error
trying to output '<oai_dc:dc
xmlns:oai_dc="http://www.openarchives.org/OAI/2.0/oai_dc/"
xmlns:doc="http://www.lyncode.com/xoai"
xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance"
xmlns:dc="http://purl.org/dc/elements/1.1/"
xsi:schemaLocation="http://www.openarchives.org/OAI/2.0/oai_dc/
http://www.openarchives.org/OAI/2.0/oai_dc.xsd">
...
<dc:description>Ionic Liquids (ILs) are relatively newly formed types
of solvents. As part of ongoing research, research groups and
industries are focusing on solvents classified as ionic liquids, which
have a low melting point,and they have been given great attention
focusing on their thermophysical properties and applications. In order
to utilize or industrially exploit these types of solvents, the
understanding of intermolecular interactions and properties of pure
liquids and their mixtures is important. Thermophysical properties of
ionic liquid mixtures, especially density, viscosity and speed of
sound are measured as a function of temperature. Accurate analysis on
thermophysical properties of ionic liquids is more of paramount
interest as they indicate the transformation of ionic liquids from
small laboratory level to large-scale industrial implementation.In
this study, new data for the binary mixtures containing
{trihexyltetradecylphosphonium chloride ([P+14, 6, 6, 6] [Cl-]) IL and
propanoic acid (PA)} and {1-ethyl-3-methylimidazolium
tetrafluoroborate ([Emim][BF4]) + benzaldehyde or ethyl acetoacetate}
were investigated under atmospheric pressure (p equivalent to 0.1 MPa)
and at temperatures (293.15 to 313.15) K. Densities (&#55349;&#57100;)
...
'
        org.dspace.xoai.services.impl.cache.DSpaceXOAICacheService.store(DSpaceXOAICacheService.java:114)
        org.dspace.xoai.controller.DSpaceOAIDataProvider.contextAction(DSpaceOAIDataProvider.java:104)
        sun.reflect.GeneratedMethodAccessor633.invoke(Unknown Source)
        sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
        java.lang.reflect.Method.invoke(Method.java:498)
        org.springframework.web.method.support.InvocableHandlerMethod.invoke(InvocableHandlerMethod.java:215)
        org.springframework.web.method.support.InvocableHandlerMethod.invokeForRequest(InvocableHandlerMethod.java:132)
        org.springframework.web.servlet.mvc.method.annotation.ServletInvocableHandlerMethod.invokeAndHandle(ServletInvocableHandlerMethod.java:104)
        org.springframework.web.servlet.mvc.method.annotation.RequestMappingHandlerAdapter.invokeHandleMethod(RequestMappingHandlerAdapter.java:743)
        org.springframework.web.servlet.mvc.method.annotation.RequestMappingHandlerAdapter.handleInternal(RequestMappingHandlerAdapter.java:672)
        org.springframework.web.servlet.mvc.method.AbstractHandlerMethodAdapter.handle(AbstractHandlerMethodAdapter.java:82)
        org.springframework.web.servlet.DispatcherServlet.doDispatch(DispatcherServlet.java:933)
        org.springframework.web.servlet.DispatcherServlet.doService(DispatcherServlet.java:867)
        org.springframework.web.servlet.FrameworkServlet.processRequest(FrameworkServlet.java:951)
        org.springframework.web.servlet.FrameworkServlet.doGet(FrameworkServlet.java:842)
        javax.servlet.http.HttpServlet.service(HttpServlet.java:622)
        org.springframework.web.servlet.FrameworkServlet.service(FrameworkServlet.java:827)
        javax.servlet.http.HttpServlet.service(HttpServlet.java:729)
        org.apache.tomcat.websocket.server.WsFilter.doFilter(WsFilter.java:52)

In the error message, the '𝜌' from the document's abstract has been encoded
as '&#55349;&#57100;', and that encoding seems to be what causes the exception.
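
To illustrate what I think is going on, here is a small standalone Java
sketch. It is not the xoai code; the class and method names
(SurrogateEscapeDemo, escapePerChar, escapePerCodePoint) are made up for
this example, and the idea that the writer escapes one UTF-16 code unit at
a time is only my guess from the error message. '𝜌' is U+1D70C, which Java
stores as the surrogate pair 0xD835 0xDF0C (decimal 55349 and 57100).
Escaping each half separately gives the two references seen above, and
XML 1.0 does not allow character references in the surrogate range
(0xD800 to 0xDFFF), which may be why writing the record fails.

```java
final class SurrogateEscapeDemo {

    // Hypothetical: escapes each non-ASCII UTF-16 code unit on its own.
    // For characters outside the BMP this emits two surrogate references,
    // which are not legal XML 1.0 character references.
    static String escapePerChar(String s) {
        StringBuilder out = new StringBuilder();
        for (int i = 0; i < s.length(); i++) {
            char c = s.charAt(i);
            if (c > 0x7F) {
                out.append("&#").append((int) c).append(';');
            } else {
                out.append(c);
            }
        }
        return out.toString();
    }

    // Hypothetical: escapes each non-ASCII Unicode code point as a single
    // reference, so a surrogate pair becomes one number.
    static String escapePerCodePoint(String s) {
        StringBuilder out = new StringBuilder();
        for (int i = 0; i < s.length(); ) {
            int cp = s.codePointAt(i);
            if (cp > 0x7F) {
                out.append("&#").append(cp).append(';');
            } else {
                out.append((char) cp);
            }
            i += Character.charCount(cp); // advance by 1 or 2 code units
        }
        return out.toString();
    }

    public static void main(String[] args) {
        String rho = "\uD835\uDF0C"; // U+1D70C written as its surrogate pair
        System.out.println(rho.length());                        // 2
        System.out.println(rho.codePointCount(0, rho.length())); // 1
        System.out.println(escapePerChar(rho));      // &#55349;&#57100;
        System.out.println(escapePerCodePoint(rho)); // &#120588;
    }
}
```

The per-code-point form, '&#120588;', is a single legal reference to the
same character, so I would expect an XML parser to accept it.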

If I remove the abstract and wait a day (presumably for re-indexing), I can
access the record via OAI-PMH.

Is there a way to include these symbols that will not cause the exception?

I do have 'URIEncoding="UTF-8"' in my port 8080 connector definition in
server.xml.

DSpace version:  CRIS-5.10.0-SNAPSHOT
  SCM revision:  8390fec2945050541427ef1249dbbbd56b1ccdc4
    SCM branch:  fix-sword
            OS:  Linux(amd64) version 4.4.0-198-generic
  Applications:
     Discovery:  enabled.
           JRE:  Private Build version 1.8.0_282
   Ant version:  Apache Ant(TM) version 1.9.6 compiled on July 20 2018
 Maven version:  3.3.9
