Fredrik Kjellberg created PDFBOX-1582: -----------------------------------------
Summary: Issues with available() and skip() on RandomAccessFileInputStream Key: PDFBOX-1582 URL: https://issues.apache.org/jira/browse/PDFBOX-1582 Project: PDFBox Issue Type: Test Components: Parsing Affects Versions: 1.8.1 Reporter: Fredrik Kjellberg Priority: Minor I'm trying to track down a strange bug when parsing PDF files on the IBM JDK that sometimes is giving me stack traces from RandomAccessFile classes. I started by writing unit tests for the PDFBox classes to verify their behavior and found a few issues. Can someone more familiar with the PDFBox code base please check the unit test I wrote and give advise on how it is supposed to work? I've added a TODO for each line where I'm in doubt what should be returned. This unit test is for RandomAccessFileInputStream where I've found a few issues. The first is what available() is supposed to return if the input stream tries to go beyond the EOF of the underlying file? When reading single bytes it count down while still returning -1 and when reading a buffer, it is returning what it think is left. The JDK documentation states that available() may not return the absolute truth, so perhaps returning what it think is left is okay, but it shouldn't count down in single reads beyond EOF? Maybe it should be set to zero once a read beyond the EOF is detected? Another issue is with skip() where the JDK documentation states that it should return the actual number of bytes skipped. When skipping beyond the EOF of the file, it does not return the actual number of skipped bytes. Also the underlying file is not updated with the new position. Is this correct behavior? -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators For more information on JIRA, see: http://www.atlassian.com/software/jira