[ https://issues.apache.org/jira/browse/UIMA-2097?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Marshall Schor reopened UIMA-2097: ---------------------------------- > Treatment of URLs with blanks is incorrect in some places > --------------------------------------------------------- > > Key: UIMA-2097 > URL: https://issues.apache.org/jira/browse/UIMA-2097 > Project: UIMA > Issue Type: Bug > Components: Core Java Framework > Affects Versions: 2.3.1SDK > Reporter: Marshall Schor > Assignee: Marshall Schor > Priority: Minor > Fix For: 2.3.2SDK > > > User reported Document Analyzer failing to write correct names in the output > directory, when the input directory had a blank in it (on Windows). Traced > this to failing URL handling. > Proper URL handling seems to need to observe these principles: > 1) URL may have "blanks" and other "invalid-in-URI chars" > 2) URL may have %20 style encoding of blanks and other need-to-be-escaped > characters > Creating files from these: need to use File(a-uri-form). > Creating URIs from URLs - if the URL has unescaped blanks, etc., the form > new URI(aUrl) fails; the aUrl.toURI() fails (i.e., throws an exception due to > illegal chars for URI). > To make the URI creation put in the escape chars if they're not there, you > need to use the multi-arg form of new URI (see Javadocs). > Fix the patches in UIMA-1879 and UIMA-1748. -- This message is automatically generated by JIRA. For more information on JIRA, see: http://www.atlassian.com/software/jira