[ http://issues.apache.org/jira/browse/SANDBOX-153?page=comments#action_12424003 ] Markus Rogg commented on SANDBOX-153: -------------------------------------
"Delimiter is whitespace" JUnit test (CSVParserTest) public void testDelimiterIsWhitespace() throws IOException { String code = "one\ttwo\t\tfour \t five\t six"; TestCSVParser parser = new TestCSVParser(new StringReader(code)); parser.setStrategy(CSVStrategy.TDF_STRATEGY); System.out.println("---------\n" + code + "\n-------------"); assertEquals(CSVParser.TT_TOKEN + ";one;", parser.testNextToken()); assertEquals(CSVParser.TT_TOKEN + ";two;", parser.testNextToken()); assertEquals(CSVParser.TT_TOKEN + ";;", parser.testNextToken()); assertEquals(CSVParser.TT_TOKEN + ";four;", parser.testNextToken()); assertEquals(CSVParser.TT_TOKEN + ";five;", parser.testNextToken()); assertEquals(CSVParser.TT_EOF + ";six;", parser.testNextToken()); } > Delimiter should be never recognized as whitespace > -------------------------------------------------- > > Key: SANDBOX-153 > URL: http://issues.apache.org/jira/browse/SANDBOX-153 > Project: Commons Sandbox > Issue Type: Bug > Components: CSV > Reporter: Markus Rogg > > The CSV-Parser ignores whitespaces at the beginning of a token. If the > delimiter is a tabspace and data has no encapsulator the parser loses the > empty tokens. The parser should never recognize a delimiter as a whitespace. > A possible solution for the class CSVParser is to change the method > isWhitespace(int) : > private boolean isWhitespace(int c) { > return Character.isWhitespace((char) c) && (c != strategy.getDelimiter()); > } -- This message is automatically generated by JIRA. - If you think it was sent incorrectly contact one of the administrators: http://issues.apache.org/jira/secure/Administrators.jspa - For more information on JIRA, see: http://www.atlassian.com/software/jira --------------------------------------------------------------------- To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-mail: [EMAIL PROTECTED]