[ 
https://issues.apache.org/jira/browse/PDFBOX-1123?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Rubesh MX updated PDFBOX-1123:
------------------------------

    Attachment: fspl.pdf

The file which contains the special characters in the field names is attached, 
as I had mentioned earlier when I am trying to read the field names it gets 
truncated at '.' and I am not able to read all the field names as well.

> Not able to read field values from a PDF File if the field contains special 
> characters.
> ---------------------------------------------------------------------------------------
>
>                 Key: PDFBOX-1123
>                 URL: https://issues.apache.org/jira/browse/PDFBOX-1123
>             Project: PDFBox
>          Issue Type: Bug
>            Reporter: Rubesh MX
>            Priority: Critical
>              Labels: Bug
>         Attachments: fspl.pdf
>
>   Original Estimate: 12h
>  Remaining Estimate: 12h
>
> Hi, I am trying to read the field names in a PDF file, it is working with 
> most of the files, but in some files we are not able to read the field 
> Id/name, the reason being we have some field names as -
> topmostSubform[0].Page1[0].c1_04_0_[0]
> topmostSubform[0].Page1[0].c1_09_0_
> topmostSubform[0].Page2[0].Table_Line4a[0].#subform[1].p2-t69[0]
> Here all the field names starts with topmostSubform[0]. so when we try to get 
> the field names like PDField.getpartialname() - the field name is getting 
> truncated at '.' and we get only - topmostSubform[0] and since all the field 
> names starts with the same name the total count of fields are coming as 1. 
> Since there are some special characters like '.'; '_'; '#' this is causing 
> the issue. Could you please suggest on this? This is very critical.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

Reply via email to