[
https://issues.apache.org/jira/browse/PDFBOX-1123?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13118013#comment-13118013
]
Rubesh MX commented on PDFBOX-1123:
-----------------------------------
Hi Maruan, First, A big thanks for replying quickly. I am slightly confused
now. I need some clarification on your comment -
"You need to call .getKids() on a each field node which will give you all the
kids, inspect if you are at a field or at another node and move on until you
get to the final field"
So you mean to say that after I have the collection - var obj =
pdCat.getAcroForm().getFields().toArray(); rather than saying PDField pdd =
(PDField)stx; and then pdd.getPartialName(); I should be doing pdd.getKids()?
But I am not getting the right thing when I try this - I have not fully
understood what you were explaining, sorry.
Could you please clarify?
> Not able to read field values from a PDF File if the field contains special
> characters.
> ---------------------------------------------------------------------------------------
>
> Key: PDFBOX-1123
> URL: https://issues.apache.org/jira/browse/PDFBOX-1123
> Project: PDFBox
> Issue Type: Bug
> Reporter: Rubesh MX
> Priority: Minor
> Labels: acroform
> Attachments: fspl.pdf
>
> Original Estimate: 12h
> Remaining Estimate: 12h
>
> Hi, I am trying to read the field names in a PDF file, it is working with
> most of the files, but in some files we are not able to read the field
> Id/name, the reason being we have some field names as -
> topmostSubform[0].Page1[0].c1_04_0_[0]
> topmostSubform[0].Page1[0].c1_09_0_
> topmostSubform[0].Page2[0].Table_Line4a[0].#subform[1].p2-t69[0]
> Here all the field names starts with topmostSubform[0]. so when we try to get
> the field names like PDField.getpartialname() - the field name is getting
> truncated at '.' and we get only - topmostSubform[0] and since all the field
> names starts with the same name the total count of fields are coming as 1.
> Since there are some special characters like '.'; '_'; '#' this is causing
> the issue. Could you please suggest on this? This is very critical.
--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators:
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira