Not able to read field values from a PDF File if the field contains special
characters.
---------------------------------------------------------------------------------------
Key: PDFBOX-1123
URL: https://issues.apache.org/jira/browse/PDFBOX-1123
Project: PDFBox
Issue Type: Bug
Reporter: Rubesh MX
Priority: Critical
Hi, I am trying to read the field names in a PDF file, it is working with most
of the files, but in some files we are not able to read the field Id/name, the
reason being we have some field names as -
topmostSubform[0].Page1[0].c1_04_0_[0]
topmostSubform[0].Page1[0].c1_09_0_
topmostSubform[0].Page2[0].Table_Line4a[0].#subform[1].p2-t69[0]
Here all the field names starts with topmostSubform[0]. so when we try to get
the field names like PDField.getpartialname() - the field name is getting
truncated at '.' and we get only - topmostSubform[0] and since all the field
names starts with the same name the total count of fields are coming as 1.
Since there are some special characters like '.'; '_'; '#' this is causing the
issue. Could you please suggest on this? This is very critical.
--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira